Skip to main content
Erschienen in: Wireless Personal Communications 3/2022

12.01.2022

Performance Assessment of Machine Learning Classifiers Using Selective Feature Approaches for Cervical Cancer Detection

verfasst von: Nitin Kumar Chauhan, Krishna Singh

Erschienen in: Wireless Personal Communications | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Worldwide, cervical cancer is the leading cause of death among women from cancer. The symptoms of this gynecological disease are difficult to recognize at early stage, especially in those countries that don’t have facility of screening programs. In diagnosis of cervical cancer, machine learning methods can be used to detect the malignous cancer cells at initial stage. The foremost apprehension in disease diagnosis involves data imbalance issue and non-uniform scaling in dataset. In this article, a prevalent oversampling approach Synthetic Minority Oversampling Technique along with fivefold cross-validation is being used on unscaled and scaled data to handle these issues. A promising comparison is been made among the performance of most prevalent machine learning (ML) classifiers such as Naive Bayes, Logistic Regression, K-Nearest Neighbor, Support Vector Machine (SVM), Linear Discriminant analysis, Multi-Layer Perceptron, Decision Tree (DT) and Random Forest (RF) on unscaled data and scaled data obtained by Min–Max scaling, Standard scaling and Normalization. RF, SVM and DT are the top three ML algorithms obtained in cervical cancer diagnosis for which optimization possibilities are explored with feature selection methods as Univariate feature selection and Recursive feature elimination (RFE). Overall performance of Random Forest predictor with RFE (RF-RFE) is superior to all others being implemented.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
System Configuration.
Window 10 OS.
Core i3, Integrated Graphics.
4 GB RAM, 1 TB Hard disk.
 
Literatur
1.
Zurück zum Zitat Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., & Bray, F. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 71(3), 209–249. https://doi.org/10.3322/caac.21660CrossRef Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., & Bray, F. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 71(3), 209–249. https://​doi.​org/​10.​3322/​caac.​21660CrossRef
3.
6.
Zurück zum Zitat Dewi, Y. N., Riana, D., & Mantoro, T. (2017). Improving Naïve Bayes performance in single image pap smear using weighted principal component analysis (WPCA). In 2017 International conference on computing, engineering, and design (ICCED) (pp. 1–5). IEEE. https://doi.org/10.1109/CED.2017.8308130 Dewi, Y. N., Riana, D., & Mantoro, T. (2017). Improving Naïve Bayes performance in single image pap smear using weighted principal component analysis (WPCA). In 2017 International conference on computing, engineering, and design (ICCED) (pp. 1–5). IEEE. https://​doi.​org/​10.​1109/​CED.​2017.​8308130
8.
Zurück zum Zitat Ashraf, F. B., & Momo, N. S. (2019). Comparative analysis on prediction models with various data preprocessings in the prognosis of cervical cancer. In 2019 10th International conference on computing, communication and networking technologies (ICCCNT) (pp. 1–6). IEEE. https://doi.org/10.1109/ICCCNT45670.2019.8944850 Ashraf, F. B., & Momo, N. S. (2019). Comparative analysis on prediction models with various data preprocessings in the prognosis of cervical cancer. In 2019 10th International conference on computing, communication and networking technologies (ICCCNT) (pp. 1–6). IEEE. https://​doi.​org/​10.​1109/​ICCCNT45670.​2019.​8944850
9.
Zurück zum Zitat Ahishakiye, E., Wario, R., Mwangi, W., & Taremwa, D. (2020). Prediction of cervical cancer basing on risk factors using ensemble learning. In 2020 IST-Africa conference (IST-Africa) (pp. 1–12). IEEE. ISSN 2576-8581. Ahishakiye, E., Wario, R., Mwangi, W., & Taremwa, D. (2020). Prediction of cervical cancer basing on risk factors using ensemble learning. In 2020 IST-Africa conference (IST-Africa) (pp. 1–12). IEEE. ISSN 2576-8581.
11.
14.
Zurück zum Zitat Ahmed, M., Kabir, M. M. J., Kabir, M., & Hasan, M. M. (2019). Identification of the risk factors of cervical cancer applying feature selection approaches. In 2019 3rd international conference on electrical, computer & telecommunication engineering (ICECTE) (pp. 201–204). IEEE. https://doi.org/10.1109/ICECTE48615.2019.9303554 Ahmed, M., Kabir, M. M. J., Kabir, M., & Hasan, M. M. (2019). Identification of the risk factors of cervical cancer applying feature selection approaches. In 2019 3rd international conference on electrical, computer & telecommunication engineering (ICECTE) (pp. 201–204). IEEE. https://​doi.​org/​10.​1109/​ICECTE48615.​2019.​9303554
15.
Zurück zum Zitat Omone, O. M., Gbenimachor, A. U., Kovács, L., & Kozlovszky, M. (2021). Knowledge estimation with HPV and cervical cancer risk factors using logistic regression. In 2021 IEEE 15th international symposium on applied computational intelligence and informatics (SACI) (pp. 000381–000386). IEEE. https://doi.org/10.1109/SACI51354.2021.9465585 Omone, O. M., Gbenimachor, A. U., Kovács, L., & Kozlovszky, M. (2021). Knowledge estimation with HPV and cervical cancer risk factors using logistic regression. In 2021 IEEE 15th international symposium on applied computational intelligence and informatics (SACI) (pp. 000381–000386). IEEE. https://​doi.​org/​10.​1109/​SACI51354.​2021.​9465585
17.
Zurück zum Zitat Saha, R., Bajger, M., & Lee, G. (2019). Prior guided segmentation and nuclei feature based abnormality detection in cervical cells. In 2019 IEEE 19th international conference on bioinformatics and bioengineering (BIBE) (pp. 742–746). IEEE. https://doi.org/10.1109/BIBE.2019.00139 Saha, R., Bajger, M., & Lee, G. (2019). Prior guided segmentation and nuclei feature based abnormality detection in cervical cells. In 2019 IEEE 19th international conference on bioinformatics and bioengineering (BIBE) (pp. 742–746). IEEE. https://​doi.​org/​10.​1109/​BIBE.​2019.​00139
18.
Zurück zum Zitat Islam, M. J., Wu, Q. J., Ahmadi, M., & Sid-Ahmed, M. A. (2007). Investigating the performance of naive-Bayes classifiers and k-nearest neighbor classifiers. In 2007 International conference on convergence information technology (ICCIT 2007) (pp. 1541–1546). IEEE. https://doi.org/10.1109/ICCIT.2007.252 Islam, M. J., Wu, Q. J., Ahmadi, M., & Sid-Ahmed, M. A. (2007). Investigating the performance of naive-Bayes classifiers and k-nearest neighbor classifiers. In 2007 International conference on convergence information technology (ICCIT 2007) (pp. 1541–1546). IEEE. https://​doi.​org/​10.​1109/​ICCIT.​2007.​252
19.
Zurück zum Zitat Kotsiantis, S. B. (2007). Supervised machine learning: A review of classification techniques. Informatica, 31(3), 249–268.MathSciNetMATH Kotsiantis, S. B. (2007). Supervised machine learning: A review of classification techniques. Informatica, 31(3), 249–268.MathSciNetMATH
24.
25.
27.
Zurück zum Zitat Biau, G. (2012). Analysis of a random forests model. The Journal of Machine Learning Research, 13, 1063–1095.MathSciNetMATH Biau, G. (2012). Analysis of a random forests model. The Journal of Machine Learning Research, 13, 1063–1095.MathSciNetMATH
33.
Zurück zum Zitat Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2018). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.MathSciNetMATH Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2018). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.MathSciNetMATH
34.
Zurück zum Zitat Hajian-Tilaki, K. (2013). Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian Journal of Internal Medicine, 4(2), 627–635. Hajian-Tilaki, K. (2013). Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian Journal of Internal Medicine, 4(2), 627–635.
35.
Zurück zum Zitat Powers, D. M. (2020). Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv preprint arXiv:2010.16061 Powers, D. M. (2020). Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv preprint arXiv:​2010.​16061
36.
Zurück zum Zitat Jović, A., Brkić, K., & Bogunović, N. (2015). A review of feature selection methods with applications. In 2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO) (pp. 1200–1205). IEEE. https://doi.org/10.1109/MIPRO.2015.7160458 Jović, A., Brkić, K., & Bogunović, N. (2015). A review of feature selection methods with applications. In 2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO) (pp. 1200–1205). IEEE. https://​doi.​org/​10.​1109/​MIPRO.​2015.​7160458
Metadaten
Titel
Performance Assessment of Machine Learning Classifiers Using Selective Feature Approaches for Cervical Cancer Detection
verfasst von
Nitin Kumar Chauhan
Krishna Singh
Publikationsdatum
12.01.2022
Verlag
Springer US
Erschienen in
Wireless Personal Communications / Ausgabe 3/2022
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-022-09467-7

Weitere Artikel der Ausgabe 3/2022

Wireless Personal Communications 3/2022 Zur Ausgabe

Neuer Inhalt