Published in: World Wide Web 3/2022

08-12-2021

Bayesian networks and chained classifiers based on SVM for traditional chinese medical prescription generation

Authors: Yingpei Wu, Chaohan Pei, Chunyang Ruan, Ruofei Wang, Yun Yang, Yanchun Zhang


Abstract

Traditional Chinese Medicine (TCM) is playing an increasingly prominent role in lung cancer treatment, as it can prolong patients’ survival, improve their quality of life, and reduce the adverse effects of radiotherapy and chemotherapy. However, the effectiveness of TCM treatment depends largely on the personal experience of individual doctors, and TCM prescriptions need to be better standardized. In this study, we use TCM clinical prescriptions to train a standardized TCM prescription generation model that provides an auxiliary prescription reference for physicians. In our initial experiments, we found two severe problems in the dataset. The first is strong correlation between herbs: for instance, some herbs often appear together to treat specific symptoms. The second is severe class imbalance within each label: a few herbs appear in most prescriptions, while most herbs occur rarely across the dataset. To handle the correlation between herb labels, we adopt the Bayes Classifier Chain (BCC) algorithm, using a Cost-Sensitive SVM (CS-SVM) as the base classifier to address the class imbalance within each label. We further adapt the BCC method to the characteristics of the TCM prescription dataset. In our BCC classifier, the Directed Acyclic Graph (DAG) construction method is highly interpretable in the TCM prescription setting. After combining multi-label learning algorithms with several SVM algorithms and comparing their performance in detail, we find that BCC+CS-SVM deals best with class imbalance within labels in multi-label classification problems.
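The two ingredients named in the abstract, per-label cost weighting and a DAG over herb labels, can be illustrated with a small sketch. This is not the authors' implementation: the toy prescription matrix `Y`, the "balanced" weighting heuristic, and the mutual-information spanning tree used to build the DAG are all assumptions chosen for illustration, and SVM training itself is omitted.

```python
import math
from itertools import combinations

# Hypothetical toy data: rows are prescriptions, columns are herbs (1 = prescribed).
# Herb 0 appears in most prescriptions; herbs 1 and 2 are rare and always co-occur.
Y = [
    [1, 1, 1],
    [1, 0, 0],
    [1, 0, 0],
    [1, 0, 0],
    [0, 1, 1],
    [1, 0, 0],
    [1, 0, 0],
    [1, 0, 0],
]

def balanced_class_weights(column):
    """Misclassification costs inversely proportional to class frequency: n / (2 * count)."""
    n = len(column)
    pos = sum(column)
    neg = n - pos
    return {0: n / (2 * neg) if neg else 0.0, 1: n / (2 * pos) if pos else 0.0}

def mutual_information(a, b):
    """Empirical mutual information (in nats) between two binary label columns."""
    n = len(a)
    mi = 0.0
    for va in (0, 1):
        for vb in (0, 1):
            joint = sum(1 for x, y in zip(a, b) if x == va and y == vb) / n
            pa = sum(1 for x in a if x == va) / n
            pb = sum(1 for y in b if y == vb) / n
            if joint > 0:
                mi += joint * math.log(joint / (pa * pb))
    return mi

def label_tree_parents(Y, root=0):
    """Maximum-weight spanning tree over labels (Prim's algorithm), directed away
    from the root. Returns {label: parent or None}; insertion order is topological."""
    n_labels = len(Y[0])
    cols = [[row[j] for row in Y] for j in range(n_labels)]
    mi = {(i, j): mutual_information(cols[i], cols[j])
          for i, j in combinations(range(n_labels), 2)}
    parents = {root: None}
    while len(parents) < n_labels:
        # Pick the strongest edge from a label already in the tree to one outside it.
        i, j = max(
            ((i, j) for i in parents for j in range(n_labels) if j not in parents),
            key=lambda e: mi[(min(e), max(e))],
        )
        parents[j] = i
    return parents

columns = [[row[j] for row in Y] for j in range(len(Y[0]))]
weights = [balanced_class_weights(col) for col in columns]
parents = label_tree_parents(Y)

print(weights[1])  # rare herb: the positive class gets the larger cost
print(parents)     # each herb's parent label in the DAG
```

In a chain built on this structure, the classifier for each herb would receive the symptom features plus the value of its parent herb as an extra input, and each per-herb SVM would be trained with that herb's cost weights (for example, through the `class_weight` parameter of scikit-learn's `SVC`).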


Metadata
Title
Bayesian networks and chained classifiers based on SVM for traditional chinese medical prescription generation
Authors
Yingpei Wu
Chaohan Pei
Chunyang Ruan
Ruofei Wang
Yun Yang
Yanchun Zhang
Publication date
08-12-2021
Publisher
Springer US
Published in
World Wide Web / Issue 3/2022
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-021-00981-5
