Skip to main content
Erschienen in: Neural Computing and Applications 3/2020

10.01.2019 | Intelligent Biomedical Data Analysis and Processing

Automatic detection of lung cancer from biomedical data set using discrete AdaBoost optimized ensemble learning generalized neural networks

verfasst von: P. Mohamed Shakeel, Amr Tolba, Zafer Al-Makhadmeh, Mustafa Musa Jaber

Erschienen in: Neural Computing and Applications | Ausgabe 3/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Today, most of the people are affected by lung cancer, mainly because of the genetic changes of the tissues in the lungs. Other factors such as smoking, alcohol, and exposure to dangerous gases can also be considered the contributory causes of lung cancer. Due to the serious consequences of lung cancer, the medical associations have been striving to diagnose cancer in its early stage of growth by applying the computer-aided diagnosis process. Although the CAD system at healthcare centers is able to diagnose lung cancer during its early stage of growth, the accuracy of cancer detection is difficult to achieve, mainly because of the overfitting of lung cancer features and the dimensionality of the feature set. Thus, this paper introduces the effective and optimized neural computing and soft computing techniques to minimize the difficulties and issues in the feature set. Initially, lung biomedical data were collected from the ELVIRA Biomedical Data Set Repository. The noise present in the data was eliminated by applying the bin smoothing normalization process. The minimum repetition and Wolf heuristic features were subsequently selected to minimize the dimensionality and complexity of the features. The selected lung features were analyzed using discrete AdaBoost optimized ensemble learning generalized neural networks, which successfully analyzed the biomedical lung data and classified the normal and abnormal features with great effectiveness. The efficiency of the system was then evaluated using MATLAB experimental setup in terms of error rate, precision, recall, G-mean, F-measure, and prediction rate.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat World Cancer Report (2014) World Health Organization, Chapter 5.1. ISBN 92-832-0429-8 World Cancer Report (2014) World Health Organization, Chapter 5.1. ISBN 92-832-0429-8
3.
Zurück zum Zitat Lung Cancer—Patient Version. NCI. Archived from the original on 9 March 2016. Retrieved 5 Mar 2016 Lung Cancer—Patient Version. NCI. Archived from the original on 9 March 2016. Retrieved 5 Mar 2016
4.
Zurück zum Zitat Horn L, Lovly CM, Johnson DH (2015) Chapter 107: neoplasms of the lung. In: Kasper DL, Hauser SL, Jameson JL, Fauci AS, Longo DL, Loscalzo J (eds) Harrison’s principles of internal medicine, 19th edn. McGraw-Hill, New York. ISBN 978-0-07-180216-1 Horn L, Lovly CM, Johnson DH (2015) Chapter 107: neoplasms of the lung. In: Kasper DL, Hauser SL, Jameson JL, Fauci AS, Longo DL, Loscalzo J (eds) Harrison’s principles of internal medicine, 19th edn. McGraw-Hill, New York. ISBN 978-0-07-180216-1
5.
Zurück zum Zitat Alberg AJ, Brock MV, Samet JM (2016) Chapter 52: epidemiology of lung cancer. In: Murray & Nadel’s textbook of respiratory medicine, 6th edn. Saunders Elsevier, Amsterdam. ISBN 978-1-4557-3383-5 Alberg AJ, Brock MV, Samet JM (2016) Chapter 52: epidemiology of lung cancer. In: Murray & Nadel’s textbook of respiratory medicine, 6th edn. Saunders Elsevier, Amsterdam. ISBN 978-1-4557-3383-5
6.
Zurück zum Zitat Collins LG, Haines C, Perkel R, Enck RE (2007) Lung cancer: diagnosis and management. Am Fam Phys 75(1):56–63. PMID 17225705. Archived from the original on 29 September 2007 Collins LG, Haines C, Perkel R, Enck RE (2007) Lung cancer: diagnosis and management. Am Fam Phys 75(1):56–63. PMID 17225705. Archived from the original on 29 September 2007
8.
Zurück zum Zitat Lee SLA, Kouzani AZ, Hu EJ (2010) Random forest based lung nodule classification aided by clustering. Comput Med Imaging Graph 34:535–542CrossRef Lee SLA, Kouzani AZ, Hu EJ (2010) Random forest based lung nodule classification aided by clustering. Comput Med Imaging Graph 34:535–542CrossRef
9.
Zurück zum Zitat Jinsa K, Gunavathi K (2014) Lung cancer classification using neural networks for CT images. Comput Methods Programs Biomed 113:202–209CrossRef Jinsa K, Gunavathi K (2014) Lung cancer classification using neural networks for CT images. Comput Methods Programs Biomed 113:202–209CrossRef
10.
Zurück zum Zitat Diaz JM, Pinon RC, Solano G (2014) Lung cancer classification using genetic algorithm to optimize prediction models, IISA 2014. In: The 5th international conference on information, intelligence, systems and applications in IEEE Diaz JM, Pinon RC, Solano G (2014) Lung cancer classification using genetic algorithm to optimize prediction models, IISA 2014. In: The 5th international conference on information, intelligence, systems and applications in IEEE
11.
Zurück zum Zitat Fang L, Zhao H, Wang P, Yu M, Yan J, Cheng W, Chen P (2015) Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data. Biomed Signal Process Control 21:82–89CrossRef Fang L, Zhao H, Wang P, Yu M, Yan J, Cheng W, Chen P (2015) Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data. Biomed Signal Process Control 21:82–89CrossRef
12.
Zurück zum Zitat Seelan LJ, Padma Suresh L, Krishna Veni SH (2016) Automatic extraction of Lung lesion by using optimized toboggan based approach with feature normalization and transfer learning methods. In: International conference on emerging technological trends (ICETT) in IEEE Seelan LJ, Padma Suresh L, Krishna Veni SH (2016) Automatic extraction of Lung lesion by using optimized toboggan based approach with feature normalization and transfer learning methods. In: International conference on emerging technological trends (ICETT) in IEEE
17.
Zurück zum Zitat Emarya E, Zawbaabc HM, Hassanien AE (2016) Binary grey wolf optimization approaches for feature selection. Neurocomputing 172:371–381CrossRef Emarya E, Zawbaabc HM, Hassanien AE (2016) Binary grey wolf optimization approaches for feature selection. Neurocomputing 172:371–381CrossRef
18.
Zurück zum Zitat Sun T, Wanga J, Li X, Lv P, Liu F, Luo Y, Gao Q, Zhu H, Guo X (2013) Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set. Comput Methods Progr Biomed 111(2):519–524CrossRef Sun T, Wanga J, Li X, Lv P, Liu F, Luo Y, Gao Q, Zhu H, Guo X (2013) Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set. Comput Methods Progr Biomed 111(2):519–524CrossRef
19.
Zurück zum Zitat Bhattacharjee A et al (2001) Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. PNAS 98(24):13790–13795CrossRef Bhattacharjee A et al (2001) Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. PNAS 98(24):13790–13795CrossRef
20.
Zurück zum Zitat Pandey KK, Pradhan N (2014) An analytical and comparative study of various data preprocessing method in data mining. Int J Emerg Technol Adv Eng 4(10):174–180 Pandey KK, Pradhan N (2014) An analytical and comparative study of various data preprocessing method in data mining. Int J Emerg Technol Adv Eng 4(10):174–180
21.
Zurück zum Zitat Dodge Y (2003) The Oxford dictionary of statistical terms. Oxford University Press, Oxford. ISBN 0-19-920613-9 (entry for normalization of scores) Dodge Y (2003) The Oxford dictionary of statistical terms. Oxford University Press, Oxford. ISBN 0-19-920613-9 (entry for normalization of scores)
22.
Zurück zum Zitat Claypo N, Jaiyen S (2014) Opinion mining for Thai restaurant reviews using neural networks and mRMR feature selection. In: Computer science and engineering conference (ICSEC) 2014 international, pp 394–397 Claypo N, Jaiyen S (2014) Opinion mining for Thai restaurant reviews using neural networks and mRMR feature selection. In: Computer science and engineering conference (ICSEC) 2014 international, pp 394–397
23.
Zurück zum Zitat Nguyen H, Franke K, Petrovic S (2010) Towards a generic feature-selection measure for intrusion detection. In: Proceeding of the international conference on pattern recognition (ICPR), Istanbul Nguyen H, Franke K, Petrovic S (2010) Towards a generic feature-selection measure for intrusion detection. In: Proceeding of the international conference on pattern recognition (ICPR), Istanbul
24.
Zurück zum Zitat Einicke GA (2018) Maximum-entropy rate selection of features for classifying changes in knee and ankle dynamics during running. IEEE J Biomed Health Inform 28(4):1097–1103CrossRef Einicke GA (2018) Maximum-entropy rate selection of features for classifying changes in knee and ankle dynamics during running. IEEE J Biomed Health Inform 28(4):1097–1103CrossRef
26.
Zurück zum Zitat Korayem L, Khorsid M, Kassem SS (2015) Using grey Wolf algorithm to solve the capacitated vehicle routing problem. In: Proceedings of the 3rd international conference on manufacturing, optimization, industrial and material engineering (MOIME’15). Institute of Physics Publishing, BaliCrossRef Korayem L, Khorsid M, Kassem SS (2015) Using grey Wolf algorithm to solve the capacitated vehicle routing problem. In: Proceedings of the 3rd international conference on manufacturing, optimization, industrial and material engineering (MOIME’15). Institute of Physics Publishing, BaliCrossRef
27.
Zurück zum Zitat Mohamed A-AA, El-Gaafary AAM, Mohamed YS, Hemeida AM (2015) Design static VAR compensator controller using artificial neural network optimized by modify Grey Wolf optimization. In: Proceedings of the international joint conference on neural networks (IJCNN’15), Anchorage Mohamed A-AA, El-Gaafary AAM, Mohamed YS, Hemeida AM (2015) Design static VAR compensator controller using artificial neural network optimized by modify Grey Wolf optimization. In: Proceedings of the international joint conference on neural networks (IJCNN’15), Anchorage
29.
Zurück zum Zitat Rojas R (2009) AdaBoost and the super bowl of classifiers a tutorial introduction to adaptive boosting. Technical Report, Freie University, Berlin Rojas R (2009) AdaBoost and the super bowl of classifiers a tutorial introduction to adaptive boosting. Technical Report, Freie University, Berlin
31.
Zurück zum Zitat Mgbe CO, Mom JM, Igwue GA (2015) Performance evaluation of generalized regression neural network path loss prediction model in macrocellular environment. Perform Eval 2(2):204–208 Mgbe CO, Mom JM, Igwue GA (2015) Performance evaluation of generalized regression neural network path loss prediction model in macrocellular environment. Perform Eval 2(2):204–208
32.
Zurück zum Zitat Shakeel PM, Baskar S, Dhulipala VRS, Mishra S, Jaber MM (2018) Maintaining security and privacy in health care system using learning based deep-Q-networks. J Med Syst 42:186CrossRef Shakeel PM, Baskar S, Dhulipala VRS, Mishra S, Jaber MM (2018) Maintaining security and privacy in health care system using learning based deep-Q-networks. J Med Syst 42:186CrossRef
34.
Zurück zum Zitat Gordon GJ et al (2002) Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer Res 62:4963–4967 Gordon GJ et al (2002) Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer Res 62:4963–4967
35.
Zurück zum Zitat Beer DG et al (2002) Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med 8(8):816–823CrossRef Beer DG et al (2002) Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med 8(8):816–823CrossRef
36.
Zurück zum Zitat Wigle DA et al (2002) Molecular profiling of non-small cell lung cancer and correlation with disease-free survival. Cancer Res 62:3005–3008 Wigle DA et al (2002) Molecular profiling of non-small cell lung cancer and correlation with disease-free survival. Cancer Res 62:3005–3008
38.
Zurück zum Zitat Ayshwarya SS (2018) Lung cancer prediction using feed forward back propagation neural networks with optimal features. Int J Appl Eng Res 13(1):318–325. ISSN 0973-4562 Ayshwarya SS (2018) Lung cancer prediction using feed forward back propagation neural networks with optimal features. Int J Appl Eng Res 13(1):318–325. ISSN 0973-4562
39.
Zurück zum Zitat Zhao Z, Feng J, Jing K, Shi E (2017) A hybrid ACOR algorithm for pattern classification neural network training. In: International conference on computing intelligence and information system (CIIS) in IEEE Zhao Z, Feng J, Jing K, Shi E (2017) A hybrid ACOR algorithm for pattern classification neural network training. In: International conference on computing intelligence and information system (CIIS) in IEEE
41.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks (PDF). Adv Neural Inf Process Syst 1:1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks (PDF). Adv Neural Inf Process Syst 1:1097–1105
42.
Zurück zum Zitat Nunes TM, Coelho AL, Lima CA, Papa JP, de Albuquerque VHC (2014) EEG signal classification for epilepsy diagnosis via optimum path forest—a systematic assessment. Neurocomputing 136:103–123CrossRef Nunes TM, Coelho AL, Lima CA, Papa JP, de Albuquerque VHC (2014) EEG signal classification for epilepsy diagnosis via optimum path forest—a systematic assessment. Neurocomputing 136:103–123CrossRef
43.
Zurück zum Zitat Rebouças Filho PP, Cortez PC, da Silva Barros AC, De Albuquerque VHC (2014) Novel adaptive balloon active contour method based on internal force for image segmentation—a systematic evaluation on synthetic and real images. Expert Syst Appl 41(17):7707–7721CrossRef Rebouças Filho PP, Cortez PC, da Silva Barros AC, De Albuquerque VHC (2014) Novel adaptive balloon active contour method based on internal force for image segmentation—a systematic evaluation on synthetic and real images. Expert Syst Appl 41(17):7707–7721CrossRef
Metadaten
Titel
Automatic detection of lung cancer from biomedical data set using discrete AdaBoost optimized ensemble learning generalized neural networks
verfasst von
P. Mohamed Shakeel
Amr Tolba
Zafer Al-Makhadmeh
Mustafa Musa Jaber
Publikationsdatum
10.01.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 3/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-03972-2

Weitere Artikel der Ausgabe 3/2020

Neural Computing and Applications 3/2020 Zur Ausgabe

Intelligent Biomedical Data Analysis and Processing

IoT-based 3D convolution for video salient object detection

Intelligent Biomedical Data Analysis and Processing

Diagnosis of the Hypopnea syndrome in the early stage