Skip to main content
Erschienen in: Wireless Personal Communications 3/2023

30.03.2023

RoughSet based Feature Selection for Prediction of Breast Cancer

verfasst von: Hanumanthu Bhukya, M Sadanandam

Erschienen in: Wireless Personal Communications | Ausgabe 3/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Breast cancer is the most deadly cancer and has highest mortality rate in women all over the world. Early prediction of breast cancer can improve the survival rate of the patient. Consequently, high accuracy in cancer prediction is important to avoid any mis-diagnosis. Machine learning algorithms can contribute in early prediction and diagnosis of breast cancer. In this study, we have used rough set based feature selector to extract relevant features from the breast cancer feature set and classify them using machine learning algorithm like Decision Tree, Naive Bayes, Support Vector Machine, K-Nearest Neighbor, Logistic Regression, Random Forest, Adaboost. The main aim is to predict cancerous breast nodules, using rough set driven feature selection and machine learning classification algorithms. The results were evaluated pertaining to accuracy, sensitivity and specificity and positive predictive value. It is observed that random forest outperformed all other classifiers and achieved the highest accuracy using the proposed approach (95.23%).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kumari, V., Ahmed, A., Kanumuri, T., Shakher, C., & Sheoran, G. (2020). Early detection of cancerous tissues in human breast utilizing near field microwave holography. International Journal of Imaging Systems and Technology, 30, 391–400. https://doi.org/10.1002/ima.22384CrossRef Kumari, V., Ahmed, A., Kanumuri, T., Shakher, C., & Sheoran, G. (2020). Early detection of cancerous tissues in human breast utilizing near field microwave holography. International Journal of Imaging Systems and Technology, 30, 391–400. https://​doi.​org/​10.​1002/​ima.​22384CrossRef
2.
Zurück zum Zitat Martinez-del-Rincon, J., Santofimia, M. J., del Toro, X., et al. (2017). Nonlinear classifiers applied to EEG analysis for epilepsy seizure detection. Expert Systems with Applications, 86, 99–112.CrossRef Martinez-del-Rincon, J., Santofimia, M. J., del Toro, X., et al. (2017). Nonlinear classifiers applied to EEG analysis for epilepsy seizure detection. Expert Systems with Applications, 86, 99–112.CrossRef
3.
Zurück zum Zitat Labrèche, F., Goldberg, M.S., Hashim, D., Weiderpass, E. (2020). Breast cancer. In Occupational Cancers, Springer, Berlin/Heidelberg, Germany, pp. 417–438 Labrèche, F., Goldberg, M.S., Hashim, D., Weiderpass, E. (2020). Breast cancer. In Occupational Cancers, Springer, Berlin/Heidelberg, Germany, pp. 417–438
4.
Zurück zum Zitat Kumar, V., Misha, B.K., Mazzara, M., Thanh, D.N., Verma, A. (2019) Prediction of malignant and benign breast cancer: A data mining approach in healthcare applications. In Advances in Data Science and Management, Springer, Berlin/Heidelberg, Germany, , pp. 435–442 Kumar, V., Misha, B.K., Mazzara, M., Thanh, D.N., Verma, A. (2019) Prediction of malignant and benign breast cancer: A data mining approach in healthcare applications. In Advances in Data Science and Management, Springer, Berlin/Heidelberg, Germany, , pp. 435–442
5.
Zurück zum Zitat Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., & Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal For Clinicians, 68(6), 394–424. Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., & Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal For Clinicians, 68(6), 394–424.
7.
Zurück zum Zitat Parsian, A., Ramezani, M., & Ghadimi, N. (2017). A hybrid neural network gray wolf optimization algorithm for melanoma detection. Biomedical Research, 28(8), 3408–3411. Parsian, A., Ramezani, M., & Ghadimi, N. (2017). A hybrid neural network gray wolf optimization algorithm for melanoma detection. Biomedical Research, 28(8), 3408–3411.
8.
Zurück zum Zitat Luque, C., Luna, J. M., Luque, M., & Ventura, S. (2019). An advanced review on text mining in medicine. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 9(3), e1302. Luque, C., Luna, J. M., Luque, M., & Ventura, S. (2019). An advanced review on text mining in medicine. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 9(3), e1302.
9.
Zurück zum Zitat Hassan, M., & Hamada, M. (2018). Genetic algorithm approaches for improving prediction accuracy of multi-criteria recommender systems. International Journal of Computational Intelligence Systems, 11(1), 146–162.CrossRef Hassan, M., & Hamada, M. (2018). Genetic algorithm approaches for improving prediction accuracy of multi-criteria recommender systems. International Journal of Computational Intelligence Systems, 11(1), 146–162.CrossRef
10.
Zurück zum Zitat Tanimu, J.J., Hamada, M., Hassan, M., Yusuf, S.I. (2021) A contemporary machine learning method for accurate prediction of cervical cancer. In Proceedings of the 3rd ETLT 2021. ACM International Conference on Information and Communication Technology, Aizu, Japan, p. 04004 Tanimu, J.J., Hamada, M., Hassan, M., Yusuf, S.I. (2021) A contemporary machine learning method for accurate prediction of cervical cancer. In Proceedings of the 3rd ETLT 2021. ACM International Conference on Information and Communication Technology, Aizu, Japan, p. 04004
11.
Zurück zum Zitat Abba, A.H., Hassan, M., (2018) Design and implementation of a CSV validation system. In Proceedings of the 3rd international Conference on Applications in information Technology, Wakamatsu, Japan, pp. 111–116 Abba, A.H., Hassan, M., (2018) Design and implementation of a CSV validation system. In Proceedings of the 3rd international Conference on Applications in information Technology, Wakamatsu, Japan, pp. 111–116
12.
Zurück zum Zitat Osianwo, F. Y., Akinsola, J. E. T., Awodele, O., Hinimikaiye, J. O., Olakanmi, O., & Akiniobi, J. (2017). Supervised machine learning algorithm: Classification and comparisiom. International Journal of Computer Trends and Technology, 3, 128–138. Osianwo, F. Y., Akinsola, J. E. T., Awodele, O., Hinimikaiye, J. O., Olakanmi, O., & Akiniobi, J. (2017). Supervised machine learning algorithm: Classification and comparisiom. International Journal of Computer Trends and Technology, 3, 128–138.
13.
Zurück zum Zitat Bazazeh, D., Shubair, R. (2017) Comparative study of machine learning algorithms for breast cancer detection and diagnosis. In Proceedings of the 2017 International Conference on Electronic Devices, Systems, and Applications, Kuching, Malaysia, pp. 2–5 Bazazeh, D., Shubair, R. (2017) Comparative study of machine learning algorithms for breast cancer detection and diagnosis. In Proceedings of the 2017 International Conference on Electronic Devices, Systems, and Applications, Kuching, Malaysia, pp. 2–5
14.
Zurück zum Zitat Boeri, C., Chiappa, C., Galli, F., de Berardinis, V., Bardelli, L., Carcano, G., & Rovera, F. (2020). Machine learning techniques in breast cancer prognosis prediction: A primary evaluation. Cancer Medicine, 9, 3234–3243.CrossRef Boeri, C., Chiappa, C., Galli, F., de Berardinis, V., Bardelli, L., Carcano, G., & Rovera, F. (2020). Machine learning techniques in breast cancer prognosis prediction: A primary evaluation. Cancer Medicine, 9, 3234–3243.CrossRef
15.
Zurück zum Zitat Sakri, S. B., Rashid, N. B. A., & Zain, Z. M. (2018). Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access, 6, 29637–29647.CrossRef Sakri, S. B., Rashid, N. B. A., & Zain, Z. M. (2018). Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access, 6, 29637–29647.CrossRef
16.
Zurück zum Zitat Ni, Q., Stevic, I., Pan, C., et al. (2018). Different signatures of miR-16, miR-30b and miR-93 in exosomes from breast cancer and DCIS patients. Science and Reports, 8(1), 12974.CrossRef Ni, Q., Stevic, I., Pan, C., et al. (2018). Different signatures of miR-16, miR-30b and miR-93 in exosomes from breast cancer and DCIS patients. Science and Reports, 8(1), 12974.CrossRef
17.
Zurück zum Zitat Ricciardi, C., Valente, S. A., Edmund, K., Cantoni, V., Green, R., Fiorillo, A., Picone, I., Santini, S., & Cesarelli, M. (2020). Linear discriminant analysis and principal component analysis to predict coronary artery disease. Health Informatics Journal, 26, 2181–2192.CrossRef Ricciardi, C., Valente, S. A., Edmund, K., Cantoni, V., Green, R., Fiorillo, A., Picone, I., Santini, S., & Cesarelli, M. (2020). Linear discriminant analysis and principal component analysis to predict coronary artery disease. Health Informatics Journal, 26, 2181–2192.CrossRef
18.
Zurück zum Zitat Bader Alazzam, M., Mansour, H., Hammam, M. M., et al. (2021). machine learning of medical applications involving complicated proteins and genetic measurements. Computational Intelligence and Neuroscience, 2021, 1–6.CrossRef Bader Alazzam, M., Mansour, H., Hammam, M. M., et al. (2021). machine learning of medical applications involving complicated proteins and genetic measurements. Computational Intelligence and Neuroscience, 2021, 1–6.CrossRef
19.
Zurück zum Zitat Dhanya, R., Paul, I. R., Sindhu Akula, S., Sivakumar, M., & Nair J. J. (2019) A comparative study for breast cancer prediction using machine learning and feature selection. In 2019 International Conference on Intelligent Computing and Control Systems (ICCS), pp. 1049–1055 Dhanya, R., Paul, I. R., Sindhu Akula, S., Sivakumar, M., & Nair J. J. (2019) A comparative study for breast cancer prediction using machine learning and feature selection. In 2019 International Conference on Intelligent Computing and Control Systems (ICCS), pp. 1049–1055
20.
Zurück zum Zitat Islam, M. M., Iqbal, H., Haque, M. R., & Hasan, M. K. (2017) Prediction of breast cancer using support vector machine and K-Nearest neighbors. In 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), pp. 226–229 Islam, M. M., Iqbal, H., Haque, M. R., & Hasan, M. K. (2017) Prediction of breast cancer using support vector machine and K-Nearest neighbors. In 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), pp. 226–229
22.
Zurück zum Zitat Bazazeh, D., & Shubair, R. (2016) Comparative study of machine learning algorithms for breast cancer detection and diagnosis. In 2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA), pp. 1–4 Bazazeh, D., & Shubair, R. (2016) Comparative study of machine learning algorithms for breast cancer detection and diagnosis. In 2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA), pp. 1–4
23.
Zurück zum Zitat Jain, R., & Mazumdar, J. (2003). A genetic algorithm based nearest neighbor classification to breast cancer diagnosis. Australasian Physical and Engineering Sciences in Medicine, 26, 6.CrossRef Jain, R., & Mazumdar, J. (2003). A genetic algorithm based nearest neighbor classification to breast cancer diagnosis. Australasian Physical and Engineering Sciences in Medicine, 26, 6.CrossRef
24.
Zurück zum Zitat Aličković, E., & Subasi, A. (2015). Breast cancer diagnosis using GA feature selection and Rotation Forest. Neural Computing and Applications, 28, 753–763.CrossRef Aličković, E., & Subasi, A. (2015). Breast cancer diagnosis using GA feature selection and Rotation Forest. Neural Computing and Applications, 28, 753–763.CrossRef
26.
Zurück zum Zitat daoudyvan, A., & Maalmi, K. (2020). Breast cancer classification with reduced feature set using association rules and support vector machine. Network Modeling Analysis in Health Informatics and Bioinformatics, 9, 34.CrossRef daoudyvan, A., & Maalmi, K. (2020). Breast cancer classification with reduced feature set using association rules and support vector machine. Network Modeling Analysis in Health Informatics and Bioinformatics, 9, 34.CrossRef
28.
Zurück zum Zitat El Rahman, S. A. (2021). Predicting breast cancer survivability based on machine learning and features selection algorithms: a comparative study. Journal of Ambient Intelligence and Humanized Computing, 12, 8585–8623.CrossRef El Rahman, S. A. (2021). Predicting breast cancer survivability based on machine learning and features selection algorithms: a comparative study. Journal of Ambient Intelligence and Humanized Computing, 12, 8585–8623.CrossRef
29.
Zurück zum Zitat Kamel, S. R., YaghoubZadeh, R., & Kheirabadi, M. (2019). Improving the performance of support-vector machine by selecting the best features by Gray Wolf algorithm to increase the accuracy of diagnosis of breast cancer. Journal of Big Data, 6, 90.CrossRef Kamel, S. R., YaghoubZadeh, R., & Kheirabadi, M. (2019). Improving the performance of support-vector machine by selecting the best features by Gray Wolf algorithm to increase the accuracy of diagnosis of breast cancer. Journal of Big Data, 6, 90.CrossRef
31.
Zurück zum Zitat Sharma, A., & Mishra, P. K. (2021). Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. International Journal of Information Technology, 14(4), 1949–1960.CrossRef Sharma, A., & Mishra, P. K. (2021). Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. International Journal of Information Technology, 14(4), 1949–1960.CrossRef
32.
Zurück zum Zitat Hu, Q., Whitney, H. M., & Giger, M. L. (2020). A deep learning methodology for improved breast cancer diagnosis using multiparametric MRI. Science and Reports, 10(1), 1–11. Hu, Q., Whitney, H. M., & Giger, M. L. (2020). A deep learning methodology for improved breast cancer diagnosis using multiparametric MRI. Science and Reports, 10(1), 1–11.
Metadaten
Titel
RoughSet based Feature Selection for Prediction of Breast Cancer
verfasst von
Hanumanthu Bhukya
M Sadanandam
Publikationsdatum
30.03.2023
Verlag
Springer US
Erschienen in
Wireless Personal Communications / Ausgabe 3/2023
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-023-10378-4

Weitere Artikel der Ausgabe 3/2023

Wireless Personal Communications 3/2023 Zur Ausgabe

Neuer Inhalt