2017 | Original Paper | Book Chapter

A Public Health Surveillance Platform Exploiting Free-Text Sources via Natural Language Processing and Linked Data: Application in Adverse Drug Reaction Signal Detection Using PubMed and Twitter

Authors: Pantelis Natsiavas, Nicos Maglaveras, Vassilis Koutkias

Published in: Knowledge Representation for Health Care

Publisher: Springer International Publishing

Abstract

This paper presents a platform enabling the systematic exploitation of diverse, free-text data sources for public health surveillance applications. The platform relies on Natural Language Processing (NLP) and a micro-services architecture, and uses Linked Data as its data representation formalism. To perform NLP in an extensible and modular fashion, the platform employs the Apache Unstructured Information Management Architecture (UIMA) and semantically annotates the results through a newly developed UIMA Semantic Common Analysis Structure Consumer (SCC). The SCC output is a graph represented in the Resource Description Framework (RDF), based on the W3C Web Annotation Data Model (WADM) and SNOMED-CT. We also demonstrate the platform through an exemplar application scenario: the detection of adverse drug reaction (ADR) signals using data retrieved from PubMed and Twitter.
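To make the idea of a WADM-based semantic annotation more concrete, the following is a minimal sketch of what a single annotation linking a text mention to a SNOMED-CT concept might look like when serialized as JSON-LD. It is not the authors' SCC implementation: the function name `make_annotation`, the example source URI, the sample sentence, the choice of the `identifying` motivation, and the use of a `TextPositionSelector` are all assumptions made for illustration. Only the WADM JSON-LD context and the `http://snomed.info/id/` URI pattern come from the respective standards.

```python
import json

# JSON-LD context published by the W3C for the Web Annotation Data Model.
WADM_CONTEXT = "http://www.w3.org/ns/anno.jsonld"
# URI prefix commonly used to identify SNOMED CT concepts.
SNOMED_URI_PREFIX = "http://snomed.info/id/"


def make_annotation(source_uri, text, mention, sctid):
    """Build a WADM-style annotation (as a dict) that links the first
    occurrence of `mention` in `text` to a SNOMED CT concept.

    Hypothetical helper for illustration only.
    """
    start = text.find(mention)
    if start < 0:
        raise ValueError(f"mention {mention!r} not found in text")
    end = start + len(mention)  # end offset is exclusive in WADM
    return {
        "@context": WADM_CONTEXT,
        "type": "Annotation",
        "motivation": "identifying",  # assumed motivation
        "body": SNOMED_URI_PREFIX + sctid,
        "target": {
            "source": source_uri,
            "selector": {
                "type": "TextPositionSelector",
                "start": start,
                "end": end,
            },
        },
    }


if __name__ == "__main__":
    # Made-up sample text and source URI; the concept ID is used for
    # illustration and should be verified against a SNOMED CT release.
    sample = "Got a headache after taking drug X"
    annotation = make_annotation(
        "urn:example:tweet-1", sample, "headache", "25064002"
    )
    print(json.dumps(annotation, indent=2))
```

A JSON-LD document of this shape can be loaded into an RDF store by any JSON-LD-aware toolkit, which is one way such annotations could end up as part of an RDF graph like the one the abstract describes.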

Metadata
Title
A Public Health Surveillance Platform Exploiting Free-Text Sources via Natural Language Processing and Linked Data: Application in Adverse Drug Reaction Signal Detection Using PubMed and Twitter
Authors
Pantelis Natsiavas
Nicos Maglaveras
Vassilis Koutkias
Copyright year
2017
DOI
https://doi.org/10.1007/978-3-319-55014-5_4