Skip to main content
Top
Published in: Journal of Intelligent Information Systems 2/2021

18-05-2021

Selecting the most helpful answers in online health question answering communities

Authors: Cheng Ying Lin, Yi-Hung Wu, Arbee L. P. Chen

Published in: Journal of Intelligent Information Systems | Issue 2/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The online question answering (QA) community has been popular in recent years. In this paper, we focus on the online health question answering (HQA) community. The HQA community provides a platform for health consumers to inquire about health information. There are two ways to use this platform. One is to post a question and wait for answers to be provided by authenticated doctors. The other is to search for relevant questions with answers. For the latter, health consumers may prefer an accepted answer marked by the previous health consumer. However, there is a large proportion of questions without an accepted answer and it is inconvenient for people who want to search for relevant questions. To address this issue, we aim to select high-quality answers from the answers without marked accepted answers. We propose a deep learning approach to achieve this goal. To train the model for the prediction of answer quality, we first view the accepted answer as the positive answer and propose a method to label the negative answer. Next, we capture the semantic information on the question and the answer by the deep learning structure. We then combine the information to predict the quality score of the answer. We collect data from one of the biggest Chinese HQA community and divide them into groups by the medical departments for detailed analysis. Finally, we conduct experiments to show the effectiveness of categorization and the labeling method. The results show that our approach outperforms other studies and we further research into the differences among the results of different categories.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Azzam, A., Tazi, N., & Hossny, A.H. (2017). A question routing technique using deep neural network. Database Systems for Advanced Applications, 1, 35–49. Azzam, A., Tazi, N., & Hossny, A.H. (2017). A question routing technique using deep neural network. Database Systems for Advanced Applications, 1, 35–49.
go back to reference Bagheri, A., Sammani, A., van der Heijden, P.G.M., Asselbergs, F.W., & Oberski, D.L. (2020). ETM: Enrichment by topic modeling for automated clinical sentence classification to detect patients’ disease history. Journal of Intelligent Information Systems (JIIS), 55, 329–349.CrossRef Bagheri, A., Sammani, A., van der Heijden, P.G.M., Asselbergs, F.W., & Oberski, D.L. (2020). ETM: Enrichment by topic modeling for automated clinical sentence classification to detect patients’ disease history. Journal of Intelligent Information Systems (JIIS), 55, 329–349.CrossRef
go back to reference Blei, D. M. , Ng, A.Y., & Jordan, M.I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.MATH Blei, D. M. , Ng, A.Y., & Jordan, M.I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.MATH
go back to reference Blooma, M.J., Yeow-Kuan Chua, A., & Hoe-Lian Goh, D. (2008). A predictive framework for retrieving the best answer. Symposium on Applied Computing, 1107-1111. Blooma, M.J., Yeow-Kuan Chua, A., & Hoe-Lian Goh, D. (2008). A predictive framework for retrieving the best answer. Symposium on Applied Computing, 1107-1111.
go back to reference Cai, R., Zhu, B., Ji, L., Hao, T., Yan, J., & Liu, W. (2017). An CNN-LSTM attention approach to understanding user query intent from online health communities. ICDM Workshops, 430–437. Cai, R., Zhu, B., Ji, L., Hao, T., Yan, J., & Liu, W. (2017). An CNN-LSTM attention approach to understanding user query intent from online health communities. ICDM Workshops, 430–437.
go back to reference Chung, J., Gulcehre, C., Cho, K., & Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555. Chung, J., Gulcehre, C., Cho, K., & Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.​3555.
go back to reference Feng, M., Xiang, B., Glass, M.R., Wang, L., & Zhou, B. (2015). Applying deep learning to answer selection: A study and an open task. IEEE Automatic Speech Recognition and Understanding Workshop, 813–820. Feng, M., Xiang, B., Glass, M.R., Wang, L., & Zhou, B. (2015). Applying deep learning to answer selection: A study and an open task. IEEE Automatic Speech Recognition and Understanding Workshop, 813–820.
go back to reference Flesch, R. (1948). A new readability yardstick. Journal of Applied Psychology, 32(3), 221–233.CrossRef Flesch, R. (1948). A new readability yardstick. Journal of Applied Psychology, 32(3), 221–233.CrossRef
go back to reference He, J., Fu, M., & Tu, M. (2019). Applying deep matching networks to Chinese medical question answering: a study and a dataset. BMC Medical Informatics & Decision Making, 19-S(2), 91–100. He, J., Fu, M., & Tu, M. (2019). Applying deep matching networks to Chinese medical question answering: a study and a dataset. BMC Medical Informatics & Decision Making, 19-S(2), 91–100.
go back to reference Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9, 1735–1780.CrossRef Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9, 1735–1780.CrossRef
go back to reference Hu, Z., Zhang, Z., Yang, H., Chen, Q., Zhu, R., & Zuo, D. (2018). Predicting the quality of online health expert question-answering services with temporal features in a deep learning framework. Neurocomputing, 275, 2769–2782.CrossRef Hu, Z., Zhang, Z., Yang, H., Chen, Q., Zhu, R., & Zuo, D. (2018). Predicting the quality of online health expert question-answering services with temporal features in a deep learning framework. Neurocomputing, 275, 2769–2782.CrossRef
go back to reference Jeon, J., Croft, W.B., Lee, J. H., & Park, S. (2006). A framework to predict the quality of answers with non-textual features. International ACM SIGIR Conference on Research and Development in Information Retrieval, 228–235. Jeon, J., Croft, W.B., Lee, J. H., & Park, S. (2006). A framework to predict the quality of answers with non-textual features. International ACM SIGIR Conference on Research and Development in Information Retrieval, 228–235.
go back to reference Kalchbrenner, N., Grefenstette, E., & Blunsom, P. (2014). A convolutional neural network for modelling sentences. Meeting of the Association for Computational Linguistics, 1, 655–665. Kalchbrenner, N., Grefenstette, E., & Blunsom, P. (2014). A convolutional neural network for modelling sentences. Meeting of the Association for Computational Linguistics, 1, 655–665.
go back to reference Kim, Y. (2014). Convolutional neural networks for sentence classification. Empirical Methods in Natural Language Processing, 1746–1751. Kim, Y. (2014). Convolutional neural networks for sentence classification. Empirical Methods in Natural Language Processing, 1746–1751.
go back to reference Kingma, D.P., & Ba, J. (2015). Adam: A method for stochastic optimization. International Conference on Learning Representations. Kingma, D.P., & Ba, J. (2015). Adam: A method for stochastic optimization. International Conference on Learning Representations.
go back to reference Liu, P., Qiu, X., & Huang, X. (2016). Recurrent neural network for text classification with multi-task learning. International Joint Conferences on Artificial Intelligence, 2873–2879. Liu, P., Qiu, X., & Huang, X. (2016). Recurrent neural network for text classification with multi-task learning. International Joint Conferences on Artificial Intelligence, 2873–2879.
go back to reference Lowe, R., Pow, N., Serban, I.V., & Pineau, J. (2015). The ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems. Special Interest Group on Discourse and Dialogue, 285–294. Lowe, R., Pow, N., Serban, I.V., & Pineau, J. (2015). The ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems. Special Interest Group on Discourse and Dialogue, 285–294.
go back to reference McLaughlin, G.H. (1969). SMOG grading - a new readability formula. Journal of Reading, 12(8), 639–646. McLaughlin, G.H. (1969). SMOG grading - a new readability formula. Journal of Reading, 12(8), 639–646.
go back to reference Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. International Conference on Learning Representations. Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. International Conference on Learning Representations.
go back to reference Mueller, J., & Thyagarajan, A. (2016). Siamese recurrent architectures for learning sentence similarity. Association for the Advancement of Artificial Intelligence, 2786–2792. Mueller, J., & Thyagarajan, A. (2016). Siamese recurrent architectures for learning sentence similarity. Association for the Advancement of Artificial Intelligence, 2786–2792.
go back to reference Riahi, F., Zolaktaf, Z., Shafiei, M.M., & Milios, E.E. (2012). Finding expert users in community question answering. International World Wide Web conferences, 791–798. Riahi, F., Zolaktaf, Z., Shafiei, M.M., & Milios, E.E. (2012). Finding expert users in community question answering. International World Wide Web conferences, 791–798.
go back to reference Rumelhart, D.E., Hinton, G.E., & Williams, R.J. (1986). Learning representations by back-propagating errors. Nature, 323, 533–536.CrossRef Rumelhart, D.E., Hinton, G.E., & Williams, R.J. (1986). Learning representations by back-propagating errors. Nature, 323, 533–536.CrossRef
go back to reference Sahu, T.P., Nagwani, N.K., & Verma, S. (2016). Selecting best answer: An empirical analysis on community question answering sites. IEEE Access, 4, 4797–4808.CrossRef Sahu, T.P., Nagwani, N.K., & Verma, S. (2016). Selecting best answer: An empirical analysis on community question answering sites. IEEE Access, 4, 4797–4808.CrossRef
go back to reference Shah, C., & Pomerantz, J. (2010). Evaluating and predicting answer quality in community QA. International ACM SIGIR Conference on Research and Development in Information Retrieval, 411–418. Shah, C., & Pomerantz, J. (2010). Evaluating and predicting answer quality in community QA. International ACM SIGIR Conference on Research and Development in Information Retrieval, 411–418.
go back to reference Shao, B., & Yan, J. (2017). Recommending answerers for stack overflow with LDA model. Chinese Conference on Computer Supported Cooperative Work and Social Computing, 80–86. Shao, B., & Yan, J. (2017). Recommending answerers for stack overflow with LDA model. Chinese Conference on Computer Supported Cooperative Work and Social Computing, 80–86.
go back to reference Tai, L.K., Setyonugroho, W., & Chen, A.L.P. (2020). Finding discriminatory features from electronic health records for depression prediction. Journal of Intelligent Information Systems (JIIS), 55, 371–396.CrossRef Tai, L.K., Setyonugroho, W., & Chen, A.L.P. (2020). Finding discriminatory features from electronic health records for depression prediction. Journal of Intelligent Information Systems (JIIS), 55, 371–396.CrossRef
go back to reference Tan, M., Xiang, B., & Zhou, B. (2016). STM-based deep learning models for non-factoid answer selection. International Conference on Learning Representations. Tan, M., Xiang, B., & Zhou, B. (2016). STM-based deep learning models for non-factoid answer selection. International Conference on Learning Representations.
go back to reference Tang, D., Qin, B., Feng, X., & Liu, T. (2016). Effective LSTMs for target-dependent sentiment classification. International Conference on Computational Linguistics, 3298–3307. Tang, D., Qin, B., Feng, X., & Liu, T. (2016). Effective LSTMs for target-dependent sentiment classification. International Conference on Computational Linguistics, 3298–3307.
go back to reference Tian, Y., Ma, W., Xia, F., & Song, Y. (2019). ChiMed: A Chinese medical corpus for question answering. BioNLP, 250–260. Tian, Y., Ma, W., Xia, F., & Song, Y. (2019). ChiMed: A Chinese medical corpus for question answering. BioNLP, 250–260.
go back to reference Toba, H., Ming, Z., Adriani, M., & Chua, T.-S. (2014). Discovering high quality answers in community question answering archives using a hierarchy of classifiers. Information Sciences, 261, 101–115.MathSciNetCrossRef Toba, H., Ming, Z., Adriani, M., & Chua, T.-S. (2014). Discovering high quality answers in community question answering archives using a hierarchy of classifiers. Information Sciences, 261, 101–115.MathSciNetCrossRef
go back to reference Tran, T.N.T., Felfernig, A., Trattner, C., & Holzinger, A. (2020). Recommender systems in the healthcare domain: state-of-the-art and research issues. Journal of Intelligent Information Systems (JIIS). Tran, T.N.T., Felfernig, A., Trattner, C., & Holzinger, A. (2020). Recommender systems in the healthcare domain: state-of-the-art and research issues. Journal of Intelligent Information Systems (JIIS).
go back to reference Wang, Z., Hamza, W., & Florian, R. (2017). Bilateral multi-perspective matching for natural language sentences. International Joint Conferences on Artificial Intelligence, 4144–4150. Wang, Z., Hamza, W., & Florian, R. (2017). Bilateral multi-perspective matching for natural language sentences. International Joint Conferences on Artificial Intelligence, 4144–4150.
go back to reference Wang, X., Jiang, W., & Luo, Z. (2016). Combination of convolutional and recurrent neural network for sentiment analysis of short texts. International Conference on Computational Linguistics, 2428–2437. Wang, X., Jiang, W., & Luo, Z. (2016). Combination of convolutional and recurrent neural network for sentiment analysis of short texts. International Conference on Computational Linguistics, 2428–2437.
go back to reference Wang, M., Smith, N.A., & Mitamura, T. (2007). What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA. The Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 22–32. Wang, M., Smith, N.A., & Mitamura, T. (2007). What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA. The Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 22–32.
go back to reference Wang, P., Xu, B., Xu, J., Tian, G., Liu, C.-L., & Hao, H. (2016). Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing, 174, 806–814.CrossRef Wang, P., Xu, B., Xu, J., Tian, G., Liu, C.-L., & Hao, H. (2016). Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing, 174, 806–814.CrossRef
go back to reference Ye, D., Zhang, S., Wang, H., Cheng, J., Zhang, X., Ding, Z., & Li, P. (2018). Multi-Level composite neural networks for medical question answer matching. IEEE International Conference on Data Science in Cyberspace, 139–145. Ye, D., Zhang, S., Wang, H., Cheng, J., Zhang, X., Ding, Z., & Li, P. (2018). Multi-Level composite neural networks for medical question answer matching. IEEE International Conference on Data Science in Cyberspace, 139–145.
go back to reference Yoon, S., Shin, J., & Jung, K. (2018). Learning to rank question-answer pairs using hierarchical recurrent encoder with latent topic clustering. Annual Conference of the North American Chapter of the Association for Computational Linguistics, 1575–1584. Yoon, S., Shin, J., & Jung, K. (2018). Learning to rank question-answer pairs using hierarchical recurrent encoder with latent topic clustering. Annual Conference of the North American Chapter of the Association for Computational Linguistics, 1575–1584.
go back to reference Zhang, T., Cho, J.H.D., & Zhai, C. (2014). Understanding user intents in online health forums. ACM International Conference on Bioinformatics and Computational Biology, 220–229. Zhang, T., Cho, J.H.D., & Zhai, C. (2014). Understanding user intents in online health forums. ACM International Conference on Bioinformatics and Computational Biology, 220–229.
go back to reference Zhang, C., Du, N., Fan, W., Li, Y., Lu, C.-T., & Yu, P.S. (2017). Bringing semantic structures to user intent detection in online medical queries. IEEE International Conference on Big Data, 1019–1026. Zhang, C., Du, N., Fan, W., Li, Y., Lu, C.-T., & Yu, P.S. (2017). Bringing semantic structures to user intent detection in online medical queries. IEEE International Conference on Big Data, 1019–1026.
go back to reference Zhang, C., Fan, W., Du, N., & Yu, P.S. (2016). Mining user intentions from medical queries. International World Wide Web Conferences, 1373–1384. Zhang, C., Fan, W., Du, N., & Yu, P.S. (2016). Mining user intentions from medical queries. International World Wide Web Conferences, 1373–1384.
go back to reference Zhang, R., Lee, H., & Radev, D.R. (2016). Dependency sensitive convolutional neural networks for modeling sentences and documents. North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1512–1521. Zhang, R., Lee, H., & Radev, D.R. (2016). Dependency sensitive convolutional neural networks for modeling sentences and documents. North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1512–1521.
go back to reference Zhang, S., Zhang, X., Wang, H., Cheng, J., Li, P., & Ding, Z. (2017). Chinese medical question answer matching using end-to-end character-level multi-scale CNNs. Applied Sciences, 7(8). Zhang, S., Zhang, X., Wang, H., Cheng, J., Li, P., & Ding, Z. (2017). Chinese medical question answer matching using end-to-end character-level multi-scale CNNs. Applied Sciences, 7(8).
go back to reference Zhou, T.C., Lyu, M.R., & King, I. (2012). A classification-based approach to question routing in community question answering. International World Wide Web Conference, 783–790. Zhou, T.C., Lyu, M.R., & King, I. (2012). A classification-based approach to question routing in community question answering. International World Wide Web Conference, 783–790.
Metadata
Title
Selecting the most helpful answers in online health question answering communities
Authors
Cheng Ying Lin
Yi-Hung Wu
Arbee L. P. Chen
Publication date
18-05-2021
Publisher
Springer US
Published in
Journal of Intelligent Information Systems / Issue 2/2021
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-021-00640-1

Other articles of this Issue 2/2021

Journal of Intelligent Information Systems 2/2021 Go to the issue

Premium Partner