Skip to main content
Top
Published in: Health and Technology 4/2022

01-06-2022 | Original Paper

A hybrid approach for lung cancer diagnosis using optimized random forest classification and K-means visualization algorithm

Authors: Ananya Bhattacharjee, R. Murugan, Tripti Goel

Published in: Health and Technology | Issue 4/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Lung cancer detection has become one of the most challenging oncology problems. It is an arduous task for radiologists to detect nodules based on the naked eye vision. The main goal of this paper is to present a well-defined approach for malignant nodule detection from computed tomography scans and a visualization tool to show how the extracted features are responsible for the malignant cluster. Inspired by hyperparameter optimization and visualization technique, we uniquely deployed a hybrid approach based on an optimized random forest classifier and a K-means visualization tool that tried to best tune the model’s hyperparameters to provide the optimal results and visualize the malignant and non-malignant clusters, respectively. Out of the four experiments performed for the hyperparameter optimization, the best model classified malignant and non-malignant cases effectively and achieved a 10-Fold cross-validation accuracy of 92.14% on the LIDC-IDRI dataset. Moreover, the least inertia score and the highest silhouette score obtained by the best visualization configuration were 16.21 and 0.815, respectively. The proposed hybrid approach appeared to be apt for lung cancer diagnosis. The integration of the visualization approach provided the ability to localize the malignant cluster and hence drew inference out of it.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Erratum: Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries, CA. Cancer J Clin. 2020;70:313–313. https://doi.org/10.3322/caac.21609. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Erratum: Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries, CA. Cancer J Clin. 2020;70:313–313. https://​doi.​org/​10.​3322/​caac.​21609.
7.
go back to reference Dehkharghanian T, Rahnamayan S, Riasatian A, Bidgoli AA, Kalra S, Zaveri M, Babaie M, Sajadi MSS, Gonzalelz R, Diamandis P, Pantanowitz L, Huang T, Tizhoosh HR. Selection, Visualization, and Interpretation of Deep Features in Lung Adenocarcinoma and Squamous Cell Carcinoma. Am J Pathol. 2021;191:2172–83. https://doi.org/10.1016/j.ajpath.2021.08.013.CrossRef Dehkharghanian T, Rahnamayan S, Riasatian A, Bidgoli AA, Kalra S, Zaveri M, Babaie M, Sajadi MSS, Gonzalelz R, Diamandis P, Pantanowitz L, Huang T, Tizhoosh HR. Selection, Visualization, and Interpretation of Deep Features in Lung Adenocarcinoma and Squamous Cell Carcinoma. Am J Pathol. 2021;191:2172–83. https://​doi.​org/​10.​1016/​j.​ajpath.​2021.​08.​013.CrossRef
25.
go back to reference Armato S, McLennan G, Bidaut L, Gray M, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman E, Kazerooni E, MacMahon H, Beeke E, Yankelevitz D, Biancardi AM, Bland PH, Brown MS, Engelmann R, Laderach G, Max D, Pais R, Qing D, Roberts R, Smith A, Starkey A, Batrah P, Caligiuri P, Farooqi A, Gladish G, Jude C, Munden R, Petkovska I, Quint L, Schwartz L, Sundaram B, Dodd L, Fenimore C, Gur D, Petrick N, Freymann J, Kirby J, Hughes B, Sallamm C, Heath M, Kuhn M, Dharaiya E, Burns R, Fryd D, Salganicoff M, Anand V, Shreter U, Vastagh S, Croft B. The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): A completed reference database of lung nodules on CT scans. Med Phys. 2011;38:915–31. https://doi.org/10.1118/1.3528204.CrossRef Armato S, McLennan G, Bidaut L, Gray M, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman E, Kazerooni E, MacMahon H, Beeke E, Yankelevitz D, Biancardi AM, Bland PH, Brown MS, Engelmann R, Laderach G, Max D, Pais R, Qing D, Roberts R, Smith A, Starkey A, Batrah P, Caligiuri P, Farooqi A, Gladish G, Jude C, Munden R, Petkovska I, Quint L, Schwartz L, Sundaram B, Dodd L, Fenimore C, Gur D, Petrick N, Freymann J, Kirby J, Hughes B, Sallamm C, Heath M, Kuhn M, Dharaiya E, Burns R, Fryd D, Salganicoff M, Anand V, Shreter U, Vastagh S, Croft B. The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): A completed reference database of lung nodules on CT scans. Med Phys. 2011;38:915–31. https://​doi.​org/​10.​1118/​1.​3528204.CrossRef
28.
go back to reference Li P. Robust logitboost and adaptive base class (ABC) logitboost. In: 2010 Twenty-Sixth Conference on Uncertainity in Artificial Intelligence (UAI’10). AUAI Press. 2010. pp. 302–311. Li P. Robust logitboost and adaptive base class (ABC) logitboost.  In: 2010 Twenty-Sixth Conference on Uncertainity in Artificial Intelligence (UAI’10). AUAI Press. 2010. pp. 302–311.
33.
go back to reference El-askary NS, Salem MA, Roushdy MI. Feature Extraction and Analysis for Lung Nodule Classification using Random Forest. In: 2019 Eighth International Conference on Software and Information Engineering (ICSIE '19). 2019. pp. 248–252. El-askary NS, Salem MA, Roushdy MI. Feature Extraction and Analysis for Lung Nodule Classification using Random Forest. In: 2019 Eighth International Conference on Software and Information Engineering (ICSIE '19). 2019. pp. 248–252.
Metadata
Title
A hybrid approach for lung cancer diagnosis using optimized random forest classification and K-means visualization algorithm
Authors
Ananya Bhattacharjee
R. Murugan
Tripti Goel
Publication date
01-06-2022
Publisher
Springer Berlin Heidelberg
Published in
Health and Technology / Issue 4/2022
Print ISSN: 2190-7188
Electronic ISSN: 2190-7196
DOI
https://doi.org/10.1007/s12553-022-00679-2

Other articles of this Issue 4/2022

Health and Technology 4/2022 Go to the issue

Premium Partner