2019 | OriginalPaper | Book chapter

Majority Voting Algorithm for Diagnosing of Imbalanced Malaria Disease

Abstract

Vector-borne diseases such as malaria are among the most pressing issues in the medical domain. Accurately identifying and classifying patients from a given set of samples is challenging when the dataset is imbalanced. Many conventional machine learning and data mining algorithms perform poorly on skewed data because they are trained well only on the majority-class samples. We propose an ensemble method, majority voting, built from a set of machine learning algorithms: the C4.5 decision tree, Naive Bayesian, and K-Nearest Neighbor (KNN) classifiers. Each sample is classified according to the majority vote of the classifiers. Experimental results show that the voting ensemble achieves a classification accuracy of 95.2% on imbalanced malaria disease data and 92.1% on balanced malaria disease data. The voting ensemble also reports 100% precision, recall, and F1-score on the imbalanced malaria disease data sets, versus 96% precision, recall, and F1-score on the balanced data.
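The voting step described in the abstract can be illustrated with a minimal sketch. It assumes hard (label-count) voting and uses made-up predictions standing in for the decision tree, Naive Bayesian, and KNN outputs; the function names, the labels, and the tie-breaking rule are illustrative assumptions, not details taken from the chapter.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by most classifiers for a single sample.

    Ties go to the label that appears first in `predictions`
    (an assumption; the abstract does not specify tie-breaking).
    """
    counts = Counter(predictions)
    top = max(counts.values())
    for label in predictions:
        if counts[label] == top:
            return label


def vote_all(per_classifier_predictions):
    """Combine one label list per classifier into one voted label per sample."""
    return [majority_vote(sample) for sample in zip(*per_classifier_predictions)]


# Hypothetical outputs of three classifiers on four samples
tree_pred = ["malaria", "healthy", "malaria", "healthy"]
nb_pred = ["malaria", "malaria", "healthy", "healthy"]
knn_pred = ["healthy", "malaria", "malaria", "healthy"]

voted = vote_all([tree_pred, nb_pred, knn_pred])
print(voted)  # ['malaria', 'malaria', 'malaria', 'healthy']
```

With three classifiers and two classes a tie cannot occur, so the tie-breaking rule only matters if the sketch is reused with an even number of voters or more classes.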


Metadata
Title
Majority Voting Algorithm for Diagnosing of Imbalanced Malaria Disease
Authors
T. Sajana
M. R. Narasingarao
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-00665-5_4