Skip to main content
Top

2018 | OriginalPaper | Chapter

MedFact: Towards Improving Veracity of Medical Information in Social Media Using Applied Machine Learning

Authors : Hamman Samuel, Osmar Zaïane

Published in: Advances in Artificial Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Since the advent of Web 2.0 and social media, anyone with an Internet connection can create content online, even if it is uncertain or fake information, which has attracted significant attention recently. In this study, we address the challenge of uncertain online health information by automating systematic approaches borrowed from evidence-based medicine. Our proposed algorithm, MedFact, enables recommendation of trusted medical information within health-related social media discussions and empowers online users to make informed decisions about the credibility of online health information. MedFact automatically extracts relevant keywords from online discussions and queries trusted medical literature with the aim of embedding related factual information into the discussion. Our retrieval model takes into account layperson terminology and hierarchy of evidence. Consequently, MedFact is a departure from current consensus-based approaches for determining credibility using “wisdom of the crowd”, binary “Like” votes and ratings, popular in social media. Moving away from subjective metrics, MedFact introduces objective metrics. We also present preliminary work towards a granular veracity score by using supervised machine learning to compare statements within uncertain social media text and trusted medical text. We evaluate our proposed algorithm on various data sets from existing health social media involving both patient and medic discussions, with promising results and suggestions for ongoing improvements and future research.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The GenSim Python API includes the TextRank algorithm [21] implementation
 
2
SNOMED CT data set available from U.S. National Library of Medicine (NLM)
 
3
CHV data set available from the Consumer Health Vocabulary Initiative
 
4
SEW historical data set available via PIKES home page
 
5
The TRIP database is accessible programmatically via web services that were most kindly made available to the authors by Jon Brassey, the TRIP database creator
 
6
POS tagging is done using the Penn Treebank tags set, all steps in this particular pipeline are programmed with the NLTK Python library http://​nltk.​org.
 
7
Sentiment analysis is performed using the TextBlob Python library
 
8
The spaCy Python library is used for generating dependency trees https://​spacy.​io.
 
9
We implement a shallow CNN with the ConText tool
 
10
Health Stack Exchange’s beta web site https://​health.​stackexchange.​com.
 
11
Data set curated from the Stack Exchange Data Dump from the Internet Archive
 
12
QuackWatch web site http://​quackwatch.​org.
 
13
DocCheck web site http://​doccheck.​com.
 
Literature
1.
go back to reference Kata, A.: Anti-vaccine activists, web 2.0, and the postmodern paradigm-an overview of tactics and tropes used online by the anti-vaccination movement. Vaccine 30(25), 3778–3789 (2012)CrossRef Kata, A.: Anti-vaccine activists, web 2.0, and the postmodern paradigm-an overview of tactics and tropes used online by the anti-vaccination movement. Vaccine 30(25), 3778–3789 (2012)CrossRef
2.
go back to reference Rippen, H., Risk, A.: e-Health code of ethics (May 24). J. Med. Internet Res. 2(2) (2000) Rippen, H., Risk, A.: e-Health code of ethics (May 24). J. Med. Internet Res. 2(2) (2000)
3.
go back to reference Greenhalgh, T.: How to Read a Paper: The Basics of Evidence-Based Medicine. Wiley, Chichester (2010) Greenhalgh, T.: How to Read a Paper: The Basics of Evidence-Based Medicine. Wiley, Chichester (2010)
4.
go back to reference Ackley, B.J.: Evidence-Based Nursing Care Guidelines: Medical-Surgical Interventions. Elsevier Health Sciences, St. Louis (2008) Ackley, B.J.: Evidence-Based Nursing Care Guidelines: Medical-Surgical Interventions. Elsevier Health Sciences, St. Louis (2008)
5.
go back to reference Child, J.: Trust-the fundamental bond in global collaboration. Organ. Dyn. 29(4), 274–288 (2001)CrossRef Child, J.: Trust-the fundamental bond in global collaboration. Organ. Dyn. 29(4), 274–288 (2001)CrossRef
6.
go back to reference Varlamis, I., Eirinaki, M., Louta, M.: A study on social network metrics and their application in trust networks. In: Proceedings of the IEEE International Conference on Advances in Social Networks Analysis and Mining, pp. 168–175 (2010) Varlamis, I., Eirinaki, M., Louta, M.: A study on social network metrics and their application in trust networks. In: Proceedings of the IEEE International Conference on Advances in Social Networks Analysis and Mining, pp. 168–175 (2010)
7.
8.
go back to reference Grant, S., Betts, B.: Encouraging user behaviour with achievements: an empirical study. In: IEEE International Working Conference on Mining Software Repositories (MSR), pp. 65–68 (2013) Grant, S., Betts, B.: Encouraging user behaviour with achievements: an empirical study. In: IEEE International Working Conference on Mining Software Repositories (MSR), pp. 65–68 (2013)
9.
go back to reference Aljazzaf, Z.M.: Trust-Based Service Selection. Ph.D. thesis. University of Western Ontario (2011) Aljazzaf, Z.M.: Trust-Based Service Selection. Ph.D. thesis. University of Western Ontario (2011)
10.
go back to reference Park, M.: HealthTrust: Assessing the Trustworthiness of Healthcare Information on the Internet. Ph.D. thesis. University of Kansas (2013) Park, M.: HealthTrust: Assessing the Trustworthiness of Healthcare Information on the Internet. Ph.D. thesis. University of Kansas (2013)
11.
go back to reference Aphinyanaphongs, Y., Aliferis, C., et al.: Text categorization models for identifying unproven cancer treatments on the web. In: World Congress on Medical Informatics (MedInfo), p. 968. IOS Press (2007) Aphinyanaphongs, Y., Aliferis, C., et al.: Text categorization models for identifying unproven cancer treatments on the web. In: World Congress on Medical Informatics (MedInfo), p. 968. IOS Press (2007)
12.
go back to reference Oliphant, T.: “I am making my decision on the basis of my experience”: constructing authoritative knowledge about treatments for depression. Can. J. Inf. Libr. Sci. 33(3–4), 215–232 (2009) Oliphant, T.: “I am making my decision on the basis of my experience”: constructing authoritative knowledge about treatments for depression. Can. J. Inf. Libr. Sci. 33(3–4), 215–232 (2009)
13.
go back to reference Stephens, G.J., Silbert, L.J., Hasson, U.: Speaker-listener neural coupling underlies successful communication. Proc. Natl. Acad. Sci. 107(32), 14425–14430 (2010)CrossRef Stephens, G.J., Silbert, L.J., Hasson, U.: Speaker-listener neural coupling underlies successful communication. Proc. Natl. Acad. Sci. 107(32), 14425–14430 (2010)CrossRef
14.
go back to reference Nyhan, B., Reifler, J., Richey, S., Freed, G.L.: Effective messages in vaccine promotion: a randomized trial. Pediatrics 133(4) (2014) Nyhan, B., Reifler, J., Richey, S., Freed, G.L.: Effective messages in vaccine promotion: a randomized trial. Pediatrics 133(4) (2014)
15.
go back to reference Nyhan, B., Reifler, J.: When corrections fail: the persistence of political misperceptions. Polit. Behav. 32(2), 303–330 (2010)CrossRef Nyhan, B., Reifler, J.: When corrections fail: the persistence of political misperceptions. Polit. Behav. 32(2), 303–330 (2010)CrossRef
16.
go back to reference Plous, S.: The Psychology of Judgment and Decision Making. McGraw-Hill, New York (1993) Plous, S.: The Psychology of Judgment and Decision Making. McGraw-Hill, New York (1993)
17.
go back to reference Dunning, D.: The dunning-kruger effect: on being ignorant of one’s own ignorance. Adv. Exp. Soc. Psychol. 44, 247 (2011)CrossRef Dunning, D.: The dunning-kruger effect: on being ignorant of one’s own ignorance. Adv. Exp. Soc. Psychol. 44, 247 (2011)CrossRef
18.
go back to reference Proctor, R., Schiebinger, L.L.: Agnotology: The Making and Unmaking of Ignorance. Stanford University Press, Stanford (2008) Proctor, R., Schiebinger, L.L.: Agnotology: The Making and Unmaking of Ignorance. Stanford University Press, Stanford (2008)
19.
go back to reference Henderson, J.: Expert and lay knowledge: a sociological perspective. Nutr. Diet. 67(1), 4–5 (2010)CrossRef Henderson, J.: Expert and lay knowledge: a sociological perspective. Nutr. Diet. 67(1), 4–5 (2010)CrossRef
20.
go back to reference Straus, S.E., Richardson, S.W., Glasziou, P., Haynes, B.R.: Evidence-Based Medicine: How to Practice and Teach EBM. Elsevier/Churchill Livingstone, New York (2005) Straus, S.E., Richardson, S.W., Glasziou, P., Haynes, B.R.: Evidence-Based Medicine: How to Practice and Teach EBM. Elsevier/Churchill Livingstone, New York (2005)
21.
go back to reference Mihalcea, R., Tarau, P.: TextRank: bringing order into text. In: EMNLP, vol. 4, pp. 404–411 (2004) Mihalcea, R., Tarau, P.: TextRank: bringing order into text. In: EMNLP, vol. 4, pp. 404–411 (2004)
22.
go back to reference Cornet, R., de Keizer, N.: Forty years of SNOMED: a literature review. BMC Med. Inform. Decis. Mak. 8(1), S2 (2008)CrossRef Cornet, R., de Keizer, N.: Forty years of SNOMED: a literature review. BMC Med. Inform. Decis. Mak. 8(1), S2 (2008)CrossRef
23.
go back to reference Smith, C., Stavri, P.: Consumer health vocabulary. In: Consumer Health Informatics, pp. 122–128 (2005) Smith, C., Stavri, P.: Consumer health vocabulary. In: Consumer Health Informatics, pp. 122–128 (2005)
24.
go back to reference Corcoglioniti, F., Rospocher, M., Aprosio, A.P.: Extracting knowledge from text with PIKES. In: International Semantic Web Conference (2015) Corcoglioniti, F., Rospocher, M., Aprosio, A.P.: Extracting knowledge from text with PIKES. In: International Semantic Web Conference (2015)
25.
go back to reference Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. arXiv (2013) Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. arXiv (2013)
26.
go back to reference Brassey, J.: TRIP database: identifying high quality medical literature from a range of sources. New Rev. Inf. Netw. 11(2), 229–234 (2005)CrossRef Brassey, J.: TRIP database: identifying high quality medical literature from a range of sources. New Rev. Inf. Netw. 11(2), 229–234 (2005)CrossRef
27.
go back to reference Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)CrossRefMATH Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)CrossRefMATH
28.
go back to reference Pang, B., Lee, L., et al.: Opinion mining and sentiment analysis. Found. Trends\({\textregistered }\) Inf. Retr. 2(1–2), 1–135 (2008) Pang, B., Lee, L., et al.: Opinion mining and sentiment analysis. Found. Trends\({\textregistered }\) Inf. Retr. 2(1–2), 1–135 (2008)
29.
go back to reference De Marneffe, M.C., Manning, C.D.: Stanford Typed Dependencies Manual. Technical report, Stanford University (2008) De Marneffe, M.C., Manning, C.D.: Stanford Typed Dependencies Manual. Technical report, Stanford University (2008)
30.
go back to reference Johnson, R., Zhang, T.: Effective use of word order for text categorization with convolutional neural networks. In: North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT) (2015) Johnson, R., Zhang, T.: Effective use of word order for text categorization with convolutional neural networks. In: North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT) (2015)
31.
go back to reference Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977)CrossRefMATH Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977)CrossRefMATH
Metadata
Title
MedFact: Towards Improving Veracity of Medical Information in Social Media Using Applied Machine Learning
Authors
Hamman Samuel
Osmar Zaïane
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-89656-4_9

Premium Partner