Skip to main content

2017 | OriginalPaper | Buchkapitel

A Tool for Analyzing Clinical Datasets as Blackbox

verfasst von : Nafees Qamar, Yilong Yang, Andras Nadas, Zhiming Liu, Janos Sztipanovits

Erschienen in: Software Engineering in Health Care

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We present a technique for the automatic identification of clinically-relevant patterns in medical datasets. To preserve patient privacy, we propose and implement the idea of treating medical dataset as a black box for both internal and external users of data. The proposed approach directly handles clinical data queries on a given medical dataset, unlike the conventional approach of relying on the data de-identification process. Our integrated toolkit combines software engineering technologies such as Java EE and RESTful web services, which allows exchanging medical data in an unidentifiable XML format and restricts users to computed information. Existing techniques could make it possible for an adversary to succeed in data re-identification attempts by applying advanced computational techniques; therefore, we disallow the use of retrospective processing of data. We validate our approach on an endoscopic reporting application based on openEHR and MST standards. The implemented prototype system can be used to query datasets by clinical researchers, governmental or non-governmental organizations in monitoring health care services to improve quality of care.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
De-identification process is defined as a technology to delete or remove the identifiable information such as name, and SSN from the released information, and suppress or generalize quasi-identifiers, such as zip code date of birth, to ensure that medical data is not re-identifiable (the reverse process of de-identification.).
 
Literatur
1.
Zurück zum Zitat Benitez, K., Malin, B.: Evaluating re-identification risks with respect to the hipaa privacy rule. JAMIA 17(2), 169–177 (2010) Benitez, K., Malin, B.: Evaluating re-identification risks with respect to the hipaa privacy rule. JAMIA 17(2), 169–177 (2010)
2.
Zurück zum Zitat Choi, C., Münch, R., Bunk, B., Barthelmes, J., Ebeling, C., Schomburg, D., Schobert, M., Jahn, D.: Combination of a data warehouse concept with web services for the establishment of the pseudomonas systems biology database systomonas. J. Integr. Bioinform. 4(1), 12–21 (2007) Choi, C., Münch, R., Bunk, B., Barthelmes, J., Ebeling, C., Schomburg, D., Schobert, M., Jahn, D.: Combination of a data warehouse concept with web services for the establishment of the pseudomonas systems biology database systomonas. J. Integr. Bioinform. 4(1), 12–21 (2007)
3.
Zurück zum Zitat Capitani di Vimercati, S., Foresti, S., Livraga, G., Samarati, P.: Protecting privacy in data release. In: Aldini, A., Gorrieri, R. (eds.) FOSAD 2011. LNCS, vol. 6858, pp. 1–34. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23082-0_1 CrossRef Capitani di Vimercati, S., Foresti, S., Livraga, G., Samarati, P.: Protecting privacy in data release. In: Aldini, A., Gorrieri, R. (eds.) FOSAD 2011. LNCS, vol. 6858, pp. 1–34. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-23082-0_​1 CrossRef
4.
Zurück zum Zitat Dwork, C.: Differential privacy. In: Bugliesi, M., Preneel, B., Sassone, V., Wegener, I. (eds.) ICALP 2006. LNCS, vol. 4052, pp. 1–12. Springer, Heidelberg (2006). doi:10.1007/11787006_1 CrossRef Dwork, C.: Differential privacy. In: Bugliesi, M., Preneel, B., Sassone, V., Wegener, I. (eds.) ICALP 2006. LNCS, vol. 4052, pp. 1–12. Springer, Heidelberg (2006). doi:10.​1007/​11787006_​1 CrossRef
5.
Zurück zum Zitat El Emam, K., Fineberg, A.: An overview of techniques for de-identifying personal health information. Access to Information and Privacy Division of Health Canada (2009) El Emam, K., Fineberg, A.: An overview of techniques for de-identifying personal health information. Access to Information and Privacy Division of Health Canada (2009)
6.
Zurück zum Zitat Ferraiolo, D.F., Sandhu, R.S., Gavrila, S.I., Kuhn, D.R., Chandramouli, R.: Proposed NIST standard for role-based access control. ACM Trans. Inf. Syst. Secur. 4(3), 224–274 (2001)CrossRef Ferraiolo, D.F., Sandhu, R.S., Gavrila, S.I., Kuhn, D.R., Chandramouli, R.: Proposed NIST standard for role-based access control. ACM Trans. Inf. Syst. Secur. 4(3), 224–274 (2001)CrossRef
7.
Zurück zum Zitat Garde, S., Hovenga, E.J.S., Buck, J., Knaup, P.: Ubiquitous information for ubiquitous computing: expressing clinical data sets with openEHR archetypes. In: MIE, pp. 215–220 (2006) Garde, S., Hovenga, E.J.S., Buck, J., Knaup, P.: Ubiquitous information for ubiquitous computing: expressing clinical data sets with openEHR archetypes. In: MIE, pp. 215–220 (2006)
8.
Zurück zum Zitat Kreger, H.: Web services conceptual architecture (WSCA 1.0). Technical report, IBM Software Group, May 2001 Kreger, H.: Web services conceptual architecture (WSCA 1.0). Technical report, IBM Software Group, May 2001
9.
Zurück zum Zitat Liu, Z., Qamar, N., Qian, J.: A quantitative analysis of the performance and scalability of de-identification tools for medical data. In: Gibbons, J., MacCaull, W. (eds.) FHIES 2013. LNCS, vol. 8315, pp. 274–289. Springer, Heidelberg (2014). doi:10.1007/978-3-642-53956-5_18 CrossRef Liu, Z., Qamar, N., Qian, J.: A quantitative analysis of the performance and scalability of de-identification tools for medical data. In: Gibbons, J., MacCaull, W. (eds.) FHIES 2013. LNCS, vol. 8315, pp. 274–289. Springer, Heidelberg (2014). doi:10.​1007/​978-3-642-53956-5_​18 CrossRef
10.
Zurück zum Zitat McDonald, C.J., Blevins, L., Dexter, P.R., Schadow, G., Hook, J., Abernathy, G., Dugan, T., Martin, A., Phillips, D.R., Davis, M.: Demonstration of the Indianapolis SPIN query tool for de-identified access to content of the Indiana network for patient care’s (a real RHIO) database. In: American Medical Informatics Association Annual Symposium (AMIA 2006), Washington, DC, USA, 11–15 November 2006 (2006) McDonald, C.J., Blevins, L., Dexter, P.R., Schadow, G., Hook, J., Abernathy, G., Dugan, T., Martin, A., Phillips, D.R., Davis, M.: Demonstration of the Indianapolis SPIN query tool for de-identified access to content of the Indiana network for patient care’s (a real RHIO) database. In: American Medical Informatics Association Annual Symposium (AMIA 2006), Washington, DC, USA, 11–15 November 2006 (2006)
12.
Zurück zum Zitat Oster, S., Langella, S., Hastings, S., Ervin, D., Madduri, R.K., Phillips, J., Kurç, T.M., Siebenlist, F., Covitz, P.A., Shanbhag, K., Foster, I.T., Saltz, J.H.: Model formulation: cagrid 1.0: an enterprise grid infrastructure for biomedical research. JAMIA 15(2), 138–149 (2008) Oster, S., Langella, S., Hastings, S., Ervin, D., Madduri, R.K., Phillips, J., Kurç, T.M., Siebenlist, F., Covitz, P.A., Shanbhag, K., Foster, I.T., Saltz, J.H.: Model formulation: cagrid 1.0: an enterprise grid infrastructure for biomedical research. JAMIA 15(2), 138–149 (2008)
13.
Zurück zum Zitat Ping, X.-O., Chung, Y., Tseng, Y.-J., Liang, J.-D., Yang, P.-M., Huang, G.-T., Lai, F.: A web-based data-querying tool based on ontology-driven methodology and flowchart-based model. JMIR Med. Inform. 1(1), e2 (2013) Ping, X.-O., Chung, Y., Tseng, Y.-J., Liang, J.-D., Yang, P.-M., Huang, G.-T., Lai, F.: A web-based data-querying tool based on ontology-driven methodology and flowchart-based model. JMIR Med. Inform. 1(1), e2 (2013)
14.
Zurück zum Zitat Prather, J.C., Lobach, D.F., Goodwin, L.K., Hales, J.W., Hage, M.L., Hammond, W.E.: Medical data mining: knowledge discovery in a clinical data warehouse. In: American Medical Informatics Association Annual Symposium (AMIA 1997), Nashville, TN, USA, 25–29 October 1997 (1997) Prather, J.C., Lobach, D.F., Goodwin, L.K., Hales, J.W., Hage, M.L., Hammond, W.E.: Medical data mining: knowledge discovery in a clinical data warehouse. In: American Medical Informatics Association Annual Symposium (AMIA 1997), Nashville, TN, USA, 25–29 October 1997 (1997)
15.
Zurück zum Zitat Price, M., Weber, J., McCallum, G.: Scoop - the social collaboratory for outcome oriented primary care. In: Proceedings of IEEE International Conference on Computer Based Medical Systems 27–29 May 2014 (2014) Price, M., Weber, J., McCallum, G.: Scoop - the social collaboratory for outcome oriented primary care. In: Proceedings of IEEE International Conference on Computer Based Medical Systems 27–29 May 2014 (2014)
16.
Zurück zum Zitat Qamar, N., Faber, J., Ledru, Y., Liu, Z.: Automated reviewing of healthcare security policies. In: Weber, J., Perseil, I. (eds.) FHIES 2012. LNCS, vol. 7789, pp. 176–193. Springer, Heidelberg (2013). doi:10.1007/978-3-642-39088-3_12 CrossRef Qamar, N., Faber, J., Ledru, Y., Liu, Z.: Automated reviewing of healthcare security policies. In: Weber, J., Perseil, I. (eds.) FHIES 2012. LNCS, vol. 7789, pp. 176–193. Springer, Heidelberg (2013). doi:10.​1007/​978-3-642-39088-3_​12 CrossRef
18.
Zurück zum Zitat Samarati, P.: Protecting respondents’ identities in microdata release. IEEE Trans. Knowl. Data Eng. 13(6), 1010–1027 (2001)CrossRef Samarati, P.: Protecting respondents’ identities in microdata release. IEEE Trans. Knowl. Data Eng. 13(6), 1010–1027 (2001)CrossRef
19.
Zurück zum Zitat Sweeney, L.: Simple demographics often identify people uniquely, pp. 50–59. Carnegie Mellon University, Pittsburgh, Data Privacy Working Paper 3 (2000) Sweeney, L.: Simple demographics often identify people uniquely, pp. 50–59. Carnegie Mellon University, Pittsburgh, Data Privacy Working Paper 3 (2000)
20.
Zurück zum Zitat Templ, M.: Statistical disclosure control for microdata using the R-package sdcMicro. Trans. Data Priv. 1(2), 67–85 (2008)MathSciNet Templ, M.: Statistical disclosure control for microdata using the R-package sdcMicro. Trans. Data Priv. 1(2), 67–85 (2008)MathSciNet
21.
Zurück zum Zitat Xiao, X., Wang, G., Gehrke, J.: Interactive anonymization of sensitive data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2009), pp. 1051–1054 (2009) Xiao, X., Wang, G., Gehrke, J.: Interactive anonymization of sensitive data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2009), pp. 1051–1054 (2009)
Metadaten
Titel
A Tool for Analyzing Clinical Datasets as Blackbox
verfasst von
Nafees Qamar
Yilong Yang
Andras Nadas
Zhiming Liu
Janos Sztipanovits
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-63194-3_15