Skip to main content
Top

2018 | OriginalPaper | Chapter

Domain Supervised Deep Learning Framework for Detecting Chinese Diabetes-Related Topics

Authors : Xinhuan Chen, Yong Zhang, Kangzhi Zhao, Qingcheng Hu, Chunxiao Xing

Published in: Database Systems for Advanced Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

As millions of people are diagnosed with diabetes every year in China, many diabetes-related websites in Chinese provide news and articles. However, most of the online articles are uncategorized or lack a clear or unified topic, users often cannot find their topics of interest effectively and efficiently. The problem of health text classification on Chinese websites cannot be easily addressed by applying existing approaches, which have been used for English documents, in a straightforward manner. To address this problem and meet users’ demand for diabetes-related information needs, we propose a Chinese domain lexicon, adopt some professional diabetes topic explanations as domain knowledge and incorporate them into deep learning approach to form our topic classification framework. Our experiments using real datasets showed that the framework significantly achieved a higher effectiveness and accuracy in categorizing diabetes-related topics than most of the state-of-the-art benchmark approaches. Our experimental analysis also revealed that some health websites provided some incorrect or misleading category information.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Adeva, J.G., Atxa, J.P., Carrillo, M.U., Zengotitabengoa, E.A.: Automatic text classification to support systematic reviews in medicine. Expert Syst. Appl. 41(4), 1498–1508 (2014)CrossRef Adeva, J.G., Atxa, J.P., Carrillo, M.U., Zengotitabengoa, E.A.: Automatic text classification to support systematic reviews in medicine. Expert Syst. Appl. 41(4), 1498–1508 (2014)CrossRef
2.
go back to reference Bodenreider, O.: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32(Suppl. 1), 267–270 (2004)CrossRef Bodenreider, O.: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32(Suppl. 1), 267–270 (2004)CrossRef
3.
go back to reference Bollegala, D., Mu, T., Goulermas, J.Y.: Cross-domain sentiment classification using sentiment sensitive embeddings. TKDE 28(2), 398–410 (2016) Bollegala, D., Mu, T., Goulermas, J.Y.: Cross-domain sentiment classification using sentiment sensitive embeddings. TKDE 28(2), 398–410 (2016)
4.
go back to reference Charalampous, K., Gasteratos, A.: A tensor-based deep learning framework. Image Vis. Comput. 32(11), 916–929 (2014)CrossRef Charalampous, K., Gasteratos, A.: A tensor-based deep learning framework. Image Vis. Comput. 32(11), 916–929 (2014)CrossRef
5.
go back to reference Cheeseman, P., Kelly, J., Self, M., Stutz, J., Taylor, W., Freeman, D.: Autoclass: a Bayesian classification system. In: Readings in Knowledge Acquisition and Learning, pp. 431–441 (1993) Cheeseman, P., Kelly, J., Self, M., Stutz, J., Taylor, W., Freeman, D.: Autoclass: a Bayesian classification system. In: Readings in Knowledge Acquisition and Learning, pp. 431–441 (1993)
7.
go back to reference Hinton, G., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRef Hinton, G., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRef
8.
go back to reference Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006)MathSciNetCrossRef Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006)MathSciNetCrossRef
10.
go back to reference Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: ICML, pp. 1188–1196 (2014) Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: ICML, pp. 1188–1196 (2014)
11.
go back to reference Liu, B., Huang, M., Sun, J., Zhu, X.: Incorporating domain and sentiment supervision in representation learning for domain adaptation. In: IJCAI, pp. 1277–1283 (2015) Liu, B., Huang, M., Sun, J., Zhu, X.: Incorporating domain and sentiment supervision in representation learning for domain adaptation. In: IJCAI, pp. 1277–1283 (2015)
13.
go back to reference Liu, X., Chen, H.: AZDrugMiner: an information extraction system for mining patient-reported adverse drug events in online patient forums. In: Zeng, D., Yang, C.C., Tseng, V.S., Xing, C., Chen, H., Wang, F.-Y., Zheng, X. (eds.) ICSH 2013. LNCS, vol. 8040, pp. 134–150. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39844-5_16CrossRef Liu, X., Chen, H.: AZDrugMiner: an information extraction system for mining patient-reported adverse drug events in online patient forums. In: Zeng, D., Yang, C.C., Tseng, V.S., Xing, C., Chen, H., Wang, F.-Y., Zheng, X. (eds.) ICSH 2013. LNCS, vol. 8040, pp. 134–150. Springer, Heidelberg (2013). https://​doi.​org/​10.​1007/​978-3-642-39844-5_​16CrossRef
14.
go back to reference Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS, pp. 3111–3119 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS, pp. 3111–3119 (2013)
15.
go back to reference Miotto, R., Wang, F., Wang, S., Jiang, X., Dudley, J.T.: Deep learning for healthcare: review, opportunities and challenges. Brief. Bioinform. 1, 11 (2017). bbx044 Miotto, R., Wang, F., Wang, S., Jiang, X., Dudley, J.T.: Deep learning for healthcare: review, opportunities and challenges. Brief. Bioinform. 1, 11 (2017). bbx044
16.
go back to reference Nie, L., Wang, M., Zhang, L., Yan, S., Zhang, B., Chua, T.S.: Disease inference from health-related questions via sparse deep learning. TKDE 27(8), 2107–2119 (2015) Nie, L., Wang, M., Zhang, L., Yan, S., Zhang, B., Chua, T.S.: Disease inference from health-related questions via sparse deep learning. TKDE 27(8), 2107–2119 (2015)
19.
go back to reference Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986) Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)
20.
go back to reference Rosenblatt, F.: The perceptron, a perceiving and recognizing automaton Project Para. Cornell Aeronautical Laboratory (1957) Rosenblatt, F.: The perceptron, a perceiving and recognizing automaton Project Para. Cornell Aeronautical Laboratory (1957)
21.
go back to reference Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approx. Reason. 50(7), 969–978 (2009)CrossRef Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approx. Reason. 50(7), 969–978 (2009)CrossRef
22.
go back to reference Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval (1986) Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval (1986)
23.
go back to reference Sarikaya, R., Hinton, G.E., Deoras, A.: Application of deep belief networks for natural language understanding. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 22(4), 778–784 (2014)CrossRef Sarikaya, R., Hinton, G.E., Deoras, A.: Application of deep belief networks for natural language understanding. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 22(4), 778–784 (2014)CrossRef
24.
go back to reference Schmitt, B.H., Pan, Y., Tavassoli, N.T.: Language and consumer memory: the impact of linguistic differences between Chinese and English. J. Consum. Res. 21, 419–431 (1994)CrossRef Schmitt, B.H., Pan, Y., Tavassoli, N.T.: Language and consumer memory: the impact of linguistic differences between Chinese and English. J. Consum. Res. 21, 419–431 (1994)CrossRef
26.
go back to reference Simon, G.J., Caraballo, P.J., Therneau, T.M., Cha, S.S., Castro, M.R., Li, P.W.: Extending association rule summarization techniques to assess risk of diabetes mellitus. TKDE 27(1), 130–141 (2015) Simon, G.J., Caraballo, P.J., Therneau, T.M., Cha, S.S., Castro, M.R., Li, P.W.: Extending association rule summarization techniques to assess risk of diabetes mellitus. TKDE 27(1), 130–141 (2015)
27.
go back to reference Sinha, A.P., Zhao, H.: Incorporating domain knowledge into data mining classifiers: an application in indirect lending. Decis. Support Syst. 46(1), 287–299 (2008)CrossRef Sinha, A.P., Zhao, H.: Incorporating domain knowledge into data mining classifiers: an application in indirect lending. Decis. Support Syst. 46(1), 287–299 (2008)CrossRef
28.
go back to reference Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: ICML, pp. 1096–1103. ACM (2008) Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: ICML, pp. 1096–1103. ACM (2008)
29.
go back to reference Wang, H., Shi, X., Yeung, D.Y.: Relational stacked denoising autoencoder for tag recommendation. In: AAAI, pp. 3052–3058 (2015) Wang, H., Shi, X., Yeung, D.Y.: Relational stacked denoising autoencoder for tag recommendation. In: AAAI, pp. 3052–3058 (2015)
30.
go back to reference Wang, J., Wang, Z., Zhang, D., Yan, J.: Combining knowledge with deep convolutional neural networks for short text classification. In: IJCAI, pp. 2915–2921 (2017) Wang, J., Wang, Z., Zhang, D., Yan, J.: Combining knowledge with deep convolutional neural networks for short text classification. In: IJCAI, pp. 2915–2921 (2017)
31.
go back to reference Wu, Z., Jiang, Y.G., Wang, J., Pu, J., Xue, X.: Exploring inter-feature and inter-class relationships with deep neural networks for video classification. In: MM, pp. 167–176. ACM (2014) Wu, Z., Jiang, Y.G., Wang, J., Pu, J., Xue, X.: Exploring inter-feature and inter-class relationships with deep neural networks for video classification. In: MM, pp. 167–176. ACM (2014)
32.
go back to reference Xu, W., Sun, H., Deng, C., Tan, Y.: Variational autoencoder for semi-supervised text classification. In: AAAI, pp. 3358–3364 (2017) Xu, W., Sun, H., Deng, C., Tan, Y.: Variational autoencoder for semi-supervised text classification. In: AAAI, pp. 3358–3364 (2017)
33.
go back to reference Yang, H., Kundakcioglu, E., Li, J., Wu, T., Mitchell, J.R., Hara, A.K., Pavlicek, W., Hu, L.S., Silva, A.C., Zwart, C.M., et al.: Healthcare intelligence: turning data into knowledge. IEEE Intell. Syst. 29(3), 54–68 (2014)CrossRef Yang, H., Kundakcioglu, E., Li, J., Wu, T., Mitchell, J.R., Hara, A.K., Pavlicek, W., Hu, L.S., Silva, A.C., Zwart, C.M., et al.: Healthcare intelligence: turning data into knowledge. IEEE Intell. Syst. 29(3), 54–68 (2014)CrossRef
34.
35.
go back to reference Yin, X.: Diabetology. Shanghai Scientific and Technical Publishers (2003) Yin, X.: Diabetology. Shanghai Scientific and Technical Publishers (2003)
Metadata
Title
Domain Supervised Deep Learning Framework for Detecting Chinese Diabetes-Related Topics
Authors
Xinhuan Chen
Yong Zhang
Kangzhi Zhao
Qingcheng Hu
Chunxiao Xing
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-91458-9_4

Premium Partner