Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 10/2021

23-01-2021 | Research Article-Electrical Engineering

Performance Analysis of Machine Learning Algorithms for Thyroid Disease

Authors: Hafiz Abbad Ur Rehman, Chyi-Yeu Lin, Zohaib Mushtaq, Shun-Feng Su

Published in: Arabian Journal for Science and Engineering | Issue 10/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Thyroid disease arises from an anomalous growth of thyroid tissue at the verge of the thyroid gland. Thyroid disorderliness normally ensues when this gland releases abnormal amounts of hormones where hypothyroidism (inactive thyroid gland) and hyperthyroidism (hyperactive thyroid gland) are the two main types of thyroid disorder. This study proposes the use of efficient classifiers by using machine learning algorithms in terms of accuracy and other performance evaluation metrics to detect and diagnose thyroid disease. This research presents an extensive analysis of different classifiers which are K-nearest neighbor (KNN), Naïve Bayes, support vector machine, decision tree and logistic regression implemented with or without feature selection techniques. Thyroid data were taken from DHQ Teaching Hospital, Dera Ghazi Khan, Pakistan. Thyroid dataset was unique and different from other existing studies because it included three additional features which were pulse rate, body mass index and blood pressure. Experiment was based on three iterations; the first iteration of the experiment did not employ feature selection while the second and third were with L1-, L2-based feature selection technique. Evaluation and analysis of the experiment have been done which consisted of many factors such as accuracy, precision and receiver operating curve with area under curve. The result indicated that classifiers which involved L1-based feature selection achieved an overall higher accuracy (Naive Bayes 100%, logistic regression 100% and KNN 97.84%) compared to without feature selection and L2-based feature selection technique.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Miller, K.D., et al.: Cancer treatment and survivorship statistics, 2016. CA Cancer J. Clin. 66(4), 271–289 (2016)CrossRef Miller, K.D., et al.: Cancer treatment and survivorship statistics, 2016. CA Cancer J. Clin. 66(4), 271–289 (2016)CrossRef
2.
go back to reference Shroff, S.; Pise, S.; Chalekar, P.; Panicker, S.S.: Thyroid disease diagnosis: a survey. In: IEEE 9th International Conference on Intelligent Systems and Control, 2015 (ISCO 2015), pp. 1–6. IEEE (2015) Shroff, S.; Pise, S.; Chalekar, P.; Panicker, S.S.: Thyroid disease diagnosis: a survey. In: IEEE 9th International Conference on Intelligent Systems and Control, 2015 (ISCO 2015), pp. 1–6. IEEE (2015)
6.
go back to reference Pal, R.; Anand, T.; Dubey, S.K.: Evaluation and performance analysis of classification techniques for thyroid detection. Int. J. Bus. Inf. Syst. 28(2), 163–177 (2018) Pal, R.; Anand, T.; Dubey, S.K.: Evaluation and performance analysis of classification techniques for thyroid detection. Int. J. Bus. Inf. Syst. 28(2), 163–177 (2018)
8.
go back to reference Acharya, U.R.; Choriappa, P.; Fujita, H., et al.: Thyroid lesion classification in 242 patient population using Gabor transform features from high resolution ultrasound images. Knowl. Based Syst. 107, 235–245 (2016)CrossRef Acharya, U.R.; Choriappa, P.; Fujita, H., et al.: Thyroid lesion classification in 242 patient population using Gabor transform features from high resolution ultrasound images. Knowl. Based Syst. 107, 235–245 (2016)CrossRef
9.
go back to reference Chandel, K.; Kunwar, V.; Sabitha, S.; Choudhury, T.; Mukherjee, S.: A comparative study on thyroid disease detection using K-nearest neighbor and Naive Bayes classification techniques. CSI Trans. 4(2–4), 313–319 (2016)CrossRef Chandel, K.; Kunwar, V.; Sabitha, S.; Choudhury, T.; Mukherjee, S.: A comparative study on thyroid disease detection using K-nearest neighbor and Naive Bayes classification techniques. CSI Trans. 4(2–4), 313–319 (2016)CrossRef
10.
go back to reference Bekar, E.T.; Ulutagay, G.; Kantarcı, S.: Classification of thyroid disease by using data mining models: a comparison of decision tree algorithms. Oxf. J. Intell. Decis. Data Sci. 2016(2), 13–28 (2016)CrossRef Bekar, E.T.; Ulutagay, G.; Kantarcı, S.: Classification of thyroid disease by using data mining models: a comparison of decision tree algorithms. Oxf. J. Intell. Decis. Data Sci. 2016(2), 13–28 (2016)CrossRef
11.
go back to reference Prasad, V.; Rao, T.S.; Babu, M.S.P.: Thyroid disease diagnosis via hybrid architecture composing rough data sets theory and machine learning algorithms. Soft Comput. 20(3), 1179–1189 (2016)CrossRef Prasad, V.; Rao, T.S.; Babu, M.S.P.: Thyroid disease diagnosis via hybrid architecture composing rough data sets theory and machine learning algorithms. Soft Comput. 20(3), 1179–1189 (2016)CrossRef
12.
go back to reference Mushtaq, Z.; Yaqub, A.; Sani, S.; Khalid, A.: Effective K-nearest neighbor classifications for Wisconsin breast cancer data sets. J. Chin. Inst. Eng. 43(1), 1–13 (2019) Mushtaq, Z.; Yaqub, A.; Sani, S.; Khalid, A.: Effective K-nearest neighbor classifications for Wisconsin breast cancer data sets. J. Chin. Inst. Eng. 43(1), 1–13 (2019)
13.
go back to reference Tomar, D.; Agarwal, S.: A survey on data mining approaches for healthcare. Int. J. Bio-Sci. Bio-Technol. 5(5), 241–266 (2013)CrossRef Tomar, D.; Agarwal, S.: A survey on data mining approaches for healthcare. Int. J. Bio-Sci. Bio-Technol. 5(5), 241–266 (2013)CrossRef
14.
go back to reference Jahantigh, F.F.: Kidney diseases diagnosis by using fuzzy logic. In: 2015 International Conference on Industrial Engineering and Operations Management, 2015 (IEOM2015), pp. 2369–2375. IEEE (2015) Jahantigh, F.F.: Kidney diseases diagnosis by using fuzzy logic. In: 2015 International Conference on Industrial Engineering and Operations Management, 2015 (IEOM2015), pp. 2369–2375. IEEE (2015)
15.
go back to reference Durairaj, M.; Ranjani, V.A.: Data mining applications in healthcare sector: a study. Int. J. Sci. Technol. Res. 2(10), 29–35 (2013) Durairaj, M.; Ranjani, V.A.: Data mining applications in healthcare sector: a study. Int. J. Sci. Technol. Res. 2(10), 29–35 (2013)
16.
go back to reference Liu, D.Y.; Chen, H.-L.; Yang, B.; Lv, X.-E.; Li, L.-N.; Liu, J.: Design of an enhanced fuzzy k-nearest neighbor classifier based computer aided diagnostic system for thyroid disease. J. Med. Syst. 36(5), 3243–3254 (2012)CrossRef Liu, D.Y.; Chen, H.-L.; Yang, B.; Lv, X.-E.; Li, L.-N.; Liu, J.: Design of an enhanced fuzzy k-nearest neighbor classifier based computer aided diagnostic system for thyroid disease. J. Med. Syst. 36(5), 3243–3254 (2012)CrossRef
17.
go back to reference Acharya, U.R.; Vinitha Sree, V.S.; Molinari, F.; Garberoglio, R.; Witkowska, A.; Suri, J.S.: Automated benign and malignant thyroid lesion characterization and classification in 3D contrast-enhanced ultrasound. In: Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012 (EMBS2012), pp. 452–455. IEEE (2012) Acharya, U.R.; Vinitha Sree, V.S.; Molinari, F.; Garberoglio, R.; Witkowska, A.; Suri, J.S.: Automated benign and malignant thyroid lesion characterization and classification in 3D contrast-enhanced ultrasound. In: Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012 (EMBS2012), pp. 452–455. IEEE (2012)
18.
go back to reference Kousarrizi, M.R.N.; Seiti, F.; Teshnehlab, M.: An experimental comparative study on thyroid disease diagnosis based on feature subset selection and classification. Int. J. Electr. Comput. Sci. 12(1), 13–19 (2012) Kousarrizi, M.R.N.; Seiti, F.; Teshnehlab, M.: An experimental comparative study on thyroid disease diagnosis based on feature subset selection and classification. Int. J. Electr. Comput. Sci. 12(1), 13–19 (2012)
19.
go back to reference Chen, H.L.; Yang, B.; Wang, G.; Liu, J.: A three-stage expert system based on support vector machines for thyroid disease diagnosis. J. Med. Syst. 36(3), 1953–1963 (2012)CrossRef Chen, H.L.; Yang, B.; Wang, G.; Liu, J.: A three-stage expert system based on support vector machines for thyroid disease diagnosis. J. Med. Syst. 36(3), 1953–1963 (2012)CrossRef
20.
go back to reference Dogantekin, E.; Dogantekin, A.; Avci, D.: An expert system based on generalized discriminant analysis and wavelet support vector machine for diagnosis of thyroid diseases. Expert Syst. Appl. 38(1), 146–150 (2011)CrossRef Dogantekin, E.; Dogantekin, A.; Avci, D.: An expert system based on generalized discriminant analysis and wavelet support vector machine for diagnosis of thyroid diseases. Expert Syst. Appl. 38(1), 146–150 (2011)CrossRef
21.
go back to reference Keleş, A.; Keles, A.: ESTDD: expert system for thyroid diseases diagnosis. Expert Syst. Appl. 34(1), 242–246 (2008)CrossRef Keleş, A.; Keles, A.: ESTDD: expert system for thyroid diseases diagnosis. Expert Syst. Appl. 34(1), 242–246 (2008)CrossRef
22.
go back to reference Ozyilmaz, L.; Yildirim, T.: Diagnosis of thyroid disease using artificial neural network methods. In: 9th International Conference on Neural Information Processing, 2002 (ICONIP2002), pp. 2033–2036, IEEE (2002) Ozyilmaz, L.; Yildirim, T.: Diagnosis of thyroid disease using artificial neural network methods. In: 9th International Conference on Neural Information Processing, 2002 (ICONIP2002), pp. 2033–2036, IEEE (2002)
24.
go back to reference Alcalá-Fdez, J.; Sánchez, J.L.; Garc, S.; Jesus, M.J.D., et al.: KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Mult. Valued Log. Soft Comput. 17, 255–287 (2011) Alcalá-Fdez, J.; Sánchez, J.L.; Garc, S.; Jesus, M.J.D., et al.: KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Mult. Valued Log. Soft Comput. 17, 255–287 (2011)
25.
go back to reference Pedregosa, F.; Weiss, R.; Brucher, M.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12(2011), 2825–2830 (2011)MathSciNetMATH Pedregosa, F.; Weiss, R.; Brucher, M.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12(2011), 2825–2830 (2011)MathSciNetMATH
26.
go back to reference Li, C.; Zhang, S.; Zhang, H.; Pang, L.; Lam, K.; Hui, C.; Zhang, S.: Using the K-nearest neighbor algorithm for the classification of lymph node metastasis in gastric cancer. Comput. Math. Methods Med. (2012) Li, C.; Zhang, S.; Zhang, H.; Pang, L.; Lam, K.; Hui, C.; Zhang, S.: Using the K-nearest neighbor algorithm for the classification of lymph node metastasis in gastric cancer. Comput. Math. Methods Med. (2012)
27.
go back to reference Chalekar, P.; Shroff, S.; Pise, S.; Panicker, S.S.: Use of K-nearest neighbor in thyroid disease classification. Int. J. Curr. Eng. Sci. Res. 1(2), 2394–2697 (2014) Chalekar, P.; Shroff, S.; Pise, S.; Panicker, S.S.: Use of K-nearest neighbor in thyroid disease classification. Int. J. Curr. Eng. Sci. Res. 1(2), 2394–2697 (2014)
28.
go back to reference Mushtaq, Z.; Yaqub, A.; Hassan, A.; Su, S.F.: Performance analysis of supervised classifiers using PCA based techniques on breast cancer. In: International Conference on Engineering and Emerging Technologies, 2019 (ICEET2019), pp. 1–6, IEEE (2019) Mushtaq, Z.; Yaqub, A.; Hassan, A.; Su, S.F.: Performance analysis of supervised classifiers using PCA based techniques on breast cancer. In: International Conference on Engineering and Emerging Technologies, 2019 (ICEET2019), pp. 1–6, IEEE (2019)
29.
go back to reference Aboudi, N.; Guetari, R.; Khlifa, N.: Multi-objectives optimisation of features selection for the classification of thyroid nodules in ultrasound images. IET Image Process. 14(9), 1901–1908 (2020)CrossRef Aboudi, N.; Guetari, R.; Khlifa, N.: Multi-objectives optimisation of features selection for the classification of thyroid nodules in ultrasound images. IET Image Process. 14(9), 1901–1908 (2020)CrossRef
30.
go back to reference Deepika, M.; Kalaiselvi, K.: A empirical study on disease diagnosis using data mining techniques. In: International Conference on Inventive Communication and Computational Technologies, 2018 (ICICCT2018), pp. 615–620, IEEE (2019) Deepika, M.; Kalaiselvi, K.: A empirical study on disease diagnosis using data mining techniques. In: International Conference on Inventive Communication and Computational Technologies, 2018 (ICICCT2018), pp. 615–620, IEEE (2019)
31.
go back to reference Zhou, Z.-H.: Ensemble Methods: Foundations and Algorithms—Zhi-Hua Zhou—Google Books. CRC Press, Boca Raton (2012)CrossRef Zhou, Z.-H.: Ensemble Methods: Foundations and Algorithms—Zhi-Hua Zhou—Google Books. CRC Press, Boca Raton (2012)CrossRef
32.
go back to reference Lavanya, D.; Rani, K.U.: Performance evaluation of decision tree classifiers on medical datasets. Int. J. Comput. Appl. 26(4), 1–4 (2011) Lavanya, D.; Rani, K.U.: Performance evaluation of decision tree classifiers on medical datasets. Int. J. Comput. Appl. 26(4), 1–4 (2011)
33.
go back to reference Yang, Y.; Chen, G.; Reniers, G.: Vulnerability assessment of atmospheric storage tanks to floods based on logistic regression. Reliab. Eng. Syst. Saf. 196, 106721 (2019)CrossRef Yang, Y.; Chen, G.; Reniers, G.: Vulnerability assessment of atmospheric storage tanks to floods based on logistic regression. Reliab. Eng. Syst. Saf. 196, 106721 (2019)CrossRef
34.
go back to reference Sahu, B.; Mohanty, S.; Rout, S.: A hybrid approach for breast cancer classification and diagnosis. ICST Trans. Scalable Inf. Syst. 6(20), 2–8 (2019) Sahu, B.; Mohanty, S.; Rout, S.: A hybrid approach for breast cancer classification and diagnosis. ICST Trans. Scalable Inf. Syst. 6(20), 2–8 (2019)
35.
go back to reference Islam, M.M.; Iqbal, H.; Haque, M.R.; Hasan, M.K.: Prediction of breast cancer using support vector machine and K-Nearest neighbors. In: 5th IEEE Region 10 Humanitarian Technology Conference. 2017, pp. 226–229, IEEE (2017) Islam, M.M.; Iqbal, H.; Haque, M.R.; Hasan, M.K.: Prediction of breast cancer using support vector machine and K-Nearest neighbors. In: 5th IEEE Region 10 Humanitarian Technology Conference. 2017, pp. 226–229, IEEE (2017)
Metadata
Title
Performance Analysis of Machine Learning Algorithms for Thyroid Disease
Authors
Hafiz Abbad Ur Rehman
Chyi-Yeu Lin
Zohaib Mushtaq
Shun-Feng Su
Publication date
23-01-2021
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 10/2021
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-020-05206-x

Other articles of this Issue 10/2021

Arabian Journal for Science and Engineering 10/2021 Go to the issue

Research Article-Electrical Engineering

Personalized Advanced Time Blood Glucose Level Prediction

Premium Partners