Published in: Health and Technology 1/2013

1 March 2013 | Original Paper

A rule-based transformation system for converting semi-structured medical documents

Authors: Johannes Heurix, Antonio Rella, Stefan Fenz, Thomas Neubauer


Abstract

The electronic health record (EHR) enables the cost-efficient handling and processing of health information. By storing medical documents in digitized form, the EHR ensures fast and reliable availability of health information. However, because health care providers do not all use the same EHR format and health document structures, the efficiency of data sharing is severely reduced. In this paper we present a rule-based transformation system that converts semi-structured (annotated) text into standardized formats such as HL7 CDA. Our approach identifies relevant information in the input document by analyzing both its structure and its content, and inserts the required elements into corresponding reusable CDA templates, which are selected according to the requirements of the specific CDA document type. The research results enable the efficient transformation of various EHR structures into a standardized format and thereby contribute to more efficient data sharing across health care providers.
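
To make the general idea concrete, the following Python sketch shows one way a rule-based extraction step could feed a reusable template chosen by document type. It is an illustration under stated assumptions, not the authors' implementation: the rule patterns, the field names, the `discharge_summary` template key, and the functions `extract` and `transform` are all hypothetical, and the output is a simplified CDA-like section fragment rather than a schema-valid HL7 CDA document.

```python
import re
from string import Template
from xml.sax.saxutils import escape

# Hypothetical extraction rules: field name -> pattern over the input text.
# A real system would also inspect document structure, not only content.
RULES = {
    "patient_name": re.compile(r"Patient:\s*(?P<value>[^\n]+)"),
    "diagnosis": re.compile(r"Diagnosis:\s*(?P<value>[^\n]+)"),
}

# Hypothetical reusable templates, keyed by an assumed document type.
# The markup is a simplified, CDA-like section fragment.
TEMPLATES = {
    "discharge_summary": Template(
        "<section><title>Diagnosis</title>"
        "<text><paragraph>$diagnosis</paragraph></text></section>"
    ),
}


def extract(text):
    """Apply every rule and collect XML-escaped values for the fields found."""
    fields = {}
    for name, pattern in RULES.items():
        match = pattern.search(text)
        if match:
            fields[name] = escape(match.group("value").strip())
    return fields


def transform(text, doc_type):
    """Select the template for doc_type and fill it with extracted fields.

    safe_substitute leaves a placeholder such as $diagnosis untouched
    when no rule produced a value for it.
    """
    return TEMPLATES[doc_type].safe_substitute(extract(text))


if __name__ == "__main__":
    source = "Patient: Jane Doe\nDiagnosis: Asthma & rhinitis\n"
    print(transform(source, "discharge_summary"))
```

Keeping the rules and the templates in separate tables mirrors the separation the abstract describes: extraction rules can change per source format while the target templates stay reusable across inputs.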


Footnotes
1. ‘a’ stands for annotation, to distinguish these tags from any pre-existing tags in the source input string.
2. As this is only used in CDA body templates, we simply use <paragraph> here.
Metadata
Title
A rule-based transformation system for converting semi-structured medical documents
Authors
Johannes Heurix
Antonio Rella
Stefan Fenz
Thomas Neubauer
Publication date
1 March 2013
Publisher
Springer-Verlag
Published in
Health and Technology, Issue 1/2013
Print ISSN: 2190-7188
Electronic ISSN: 2190-7196
DOI
https://doi.org/10.1007/s12553-013-0040-0
