Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 4/2021

28-01-2021 | Research Article-Computer Engineering and Computer Science

An Improved Hybrid Approach for Handling Class Imbalance Problem

Authors: Abeer S. Desuky, Sadiq Hussain

Published in: Arabian Journal for Science and Engineering | Issue 4/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Class imbalance issue that presents in many real-world datasets exhibit favouritism toward the majority class and showcases poor performance for the minority class. Such misclassifications may incur dubious outcome in case of disease diagnosis and other critical applications. Hence, it is a hot topic for the researchers to tackle the class imbalance issue. We present a novel hybrid approach for handling such datasets. We utilize simulated annealing algorithm for undersampling and apply support vector machine, decision tree, k-nearest neighbor and discriminant analysis for the classification task. We validate our technique in 51 real-world datasets and compare it with other recent works. Our technique yields better efficacy than the existing techniques and hence it can be applied in imbalance datasets to mitigate the misclassification.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Abdar, M.; Acharya, U.R.; Sarrafzadegan, N.; Makarenkov, V.: NE-nu-SVC: a new nested ensemble clinical decision support system for effective diagnosis of coronary artery disease. IEEE Access 7, 167605–167620 (2019)CrossRef Abdar, M.; Acharya, U.R.; Sarrafzadegan, N.; Makarenkov, V.: NE-nu-SVC: a new nested ensemble clinical decision support system for effective diagnosis of coronary artery disease. IEEE Access 7, 167605–167620 (2019)CrossRef
2.
go back to reference Abdar, M.; Nasarian, E.; Zhou, X.; Bargshady, G.; Wijayaningrum, V.N.; Hussain, S.: Performance improvement of decision trees for diagnosis of coronary artery disease using multi filtering approach. In: 2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS) (pp. 26–30). IEEE (2019) Abdar, M.; Nasarian, E.; Zhou, X.; Bargshady, G.; Wijayaningrum, V.N.; Hussain, S.: Performance improvement of decision trees for diagnosis of coronary artery disease using multi filtering approach. In: 2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS) (pp. 26–30). IEEE (2019)
3.
go back to reference Zomorodi-moghadam, M.; Abdar, M.; Davarzani, Z.; Zhou, X.; Pławiak, P.; Acharya, U.R.: Hybrid particle swarm optimization for rule discovery in the diagnosis of coronary artery disease. Expert Syst. 38, e12485 (2019) Zomorodi-moghadam, M.; Abdar, M.; Davarzani, Z.; Zhou, X.; Pławiak, P.; Acharya, U.R.: Hybrid particle swarm optimization for rule discovery in the diagnosis of coronary artery disease. Expert Syst. 38, e12485 (2019)
4.
go back to reference Nasarian, E.; Abdar, M.; Fahami, M.A.; Alizadehsani, R.; Hussain, S.; Basiri, M.E.; Zomorodi-Moghadam, M.; Zhou, X.; Pławiak, P.; Acharya, U.R.; Tan, R.S.: Association between work-related features and coronary artery disease: a heterogeneous hybrid feature selection integrated with balancing approach. Pattern Recogn. Lett. 133, 33–40 (2020)CrossRef Nasarian, E.; Abdar, M.; Fahami, M.A.; Alizadehsani, R.; Hussain, S.; Basiri, M.E.; Zomorodi-Moghadam, M.; Zhou, X.; Pławiak, P.; Acharya, U.R.; Tan, R.S.: Association between work-related features and coronary artery disease: a heterogeneous hybrid feature selection integrated with balancing approach. Pattern Recogn. Lett. 133, 33–40 (2020)CrossRef
5.
go back to reference Samami, M.; Akbari, E.; Abdar, M.; Plawiak, P.; Nematzadeh, H.; Basiri, M.E.; Makarenkov, V.: A mixed solution-based high agreement filtering method for class noise detection in binary classification. Phys. A Stat. Mech. Appl. 553, 124219 (2020)CrossRef Samami, M.; Akbari, E.; Abdar, M.; Plawiak, P.; Nematzadeh, H.; Basiri, M.E.; Makarenkov, V.: A mixed solution-based high agreement filtering method for class noise detection in binary classification. Phys. A Stat. Mech. Appl. 553, 124219 (2020)CrossRef
6.
go back to reference Tuncer, T.; Dogan, S.; Abdar, M.; Ehsan Basiri, M.; Pławiak, P.: Face recognition with triangular fuzzy set-based local cross patterns in wavelet domain. Symmetry 11(6), 787 (2019)CrossRef Tuncer, T.; Dogan, S.; Abdar, M.; Ehsan Basiri, M.; Pławiak, P.: Face recognition with triangular fuzzy set-based local cross patterns in wavelet domain. Symmetry 11(6), 787 (2019)CrossRef
7.
go back to reference Abdar, M.; Basiri, M.E.; Yin, J.; Habibnezhad, M.; Chi, G.; Nemati, S.; Asadi, S.: Energy choices in Alaska: mining people’s perception and attitudes from geotagged tweets. Renew. Sustain. Energy Rev. 124, 109781 (2020)CrossRef Abdar, M.; Basiri, M.E.; Yin, J.; Habibnezhad, M.; Chi, G.; Nemati, S.; Asadi, S.: Energy choices in Alaska: mining people’s perception and attitudes from geotagged tweets. Renew. Sustain. Energy Rev. 124, 109781 (2020)CrossRef
8.
go back to reference Basiri, M.E.; Abdar, M.; Cifci, M.A.; Nemati, S.; Acharya, U.R.: A novel method for sentiment classification of drug reviews using fusion of deep and machine learning techniques. Knowl.-Based Syst. 198, 105949 (2020)CrossRef Basiri, M.E.; Abdar, M.; Cifci, M.A.; Nemati, S.; Acharya, U.R.: A novel method for sentiment classification of drug reviews using fusion of deep and machine learning techniques. Knowl.-Based Syst. 198, 105949 (2020)CrossRef
9.
go back to reference Pławiak, P.; Abdar, M.: Novel methodology for cardiac arrhythmias classification based on long-duration ECG signal fragments analysis. In: Biomedical Signal Processing (pp. 225–272). Springer, Singapore (2020) Pławiak, P.; Abdar, M.: Novel methodology for cardiac arrhythmias classification based on long-duration ECG signal fragments analysis. In: Biomedical Signal Processing (pp. 225–272). Springer, Singapore (2020)
10.
go back to reference Stoean, C.; Stoean, R.; Atencia, M.; Abdar, M.; Velázquez-Pérez, L.; Khosravi, A.; Nahavandi, S.; Acharya, U.R.; Joya, G.: Automated detection of presymptomatic conditions in Spinocerebellar Ataxia type 2 using Monte Carlo dropout and deep neural network techniques with electrooculogram signals. Sensors 20(11), 3032 (2020)CrossRef Stoean, C.; Stoean, R.; Atencia, M.; Abdar, M.; Velázquez-Pérez, L.; Khosravi, A.; Nahavandi, S.; Acharya, U.R.; Joya, G.: Automated detection of presymptomatic conditions in Spinocerebellar Ataxia type 2 using Monte Carlo dropout and deep neural network techniques with electrooculogram signals. Sensors 20(11), 3032 (2020)CrossRef
11.
go back to reference Hussain, S.; Muhammad, L.J.; Ishaq, F.S.; Yakubu, A.; Mohammed, I.A.: Performance evaluation of various data mining algorithms on road traffic accident dataset. In: Information and Communication Technology for Intelligent Systems (pp. 67–78). Springer, Singapore (2019) Hussain, S.; Muhammad, L.J.; Ishaq, F.S.; Yakubu, A.; Mohammed, I.A.: Performance evaluation of various data mining algorithms on road traffic accident dataset. In: Information and Communication Technology for Intelligent Systems (pp. 67–78). Springer, Singapore (2019)
12.
go back to reference Hussain, S.; Muhammad, L.J.; Yakubu, A.: Mining social media and DBpedia data using gephi and R. J. Appl. Comput. Sci. Math. 12(1), 14–20 (2018)CrossRef Hussain, S.; Muhammad, L.J.; Yakubu, A.: Mining social media and DBpedia data using gephi and R. J. Appl. Comput. Sci. Math. 12(1), 14–20 (2018)CrossRef
13.
go back to reference Devi, D.; Namasudra, S.; Kadry, S.: A boosting-aided adaptive cluster-based undersampling approach for treatment of class imbalance problem. Int. J. Data Warehousing Min. (IJDWM) 16(3), 60–86 (2020)CrossRef Devi, D.; Namasudra, S.; Kadry, S.: A boosting-aided adaptive cluster-based undersampling approach for treatment of class imbalance problem. Int. J. Data Warehousing Min. (IJDWM) 16(3), 60–86 (2020)CrossRef
14.
go back to reference Mohammed, R.; Rawashdeh, J.; Abdullah, M.: Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: 2020 11th International Conference on Information and Communication Systems (ICICS) (pp. 243–248). IEEE (2020) Mohammed, R.; Rawashdeh, J.; Abdullah, M.: Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: 2020 11th International Conference on Information and Communication Systems (ICICS) (pp. 243–248). IEEE (2020)
15.
go back to reference Liu, B.; Tsoumakas, G.: Dealing with class imbalance in classifier chains via random undersampling. Knowl.-Based Syst. 192, 105292 (2020)CrossRef Liu, B.; Tsoumakas, G.: Dealing with class imbalance in classifier chains via random undersampling. Knowl.-Based Syst. 192, 105292 (2020)CrossRef
16.
go back to reference Vuttipittayamongkol, P.; Elyan, E.: Neighbourhood-based undersampling approach for handling imbalanced and overlapped data. Inf. Sci. 509, 47–70 (2020)CrossRef Vuttipittayamongkol, P.; Elyan, E.: Neighbourhood-based undersampling approach for handling imbalanced and overlapped data. Inf. Sci. 509, 47–70 (2020)CrossRef
17.
go back to reference Vuttipittayamongkol, P.; Elyan, E.: Improved overlap-based undersampling for imbalanced dataset classification with application to Epilepsy and Parkinson’s disease. Int. J. Neural Syst. 30, 2050043 (2020)CrossRef Vuttipittayamongkol, P.; Elyan, E.: Improved overlap-based undersampling for imbalanced dataset classification with application to Epilepsy and Parkinson’s disease. Int. J. Neural Syst. 30, 2050043 (2020)CrossRef
18.
go back to reference Sarkar, S.; Khatedi, N.; Pramanik, A.; Maiti, J.: An ensemble learning-based undersampling technique for handling class-imbalance problem. In: Proceedings of ICETIT 2019 (pp. 586–595). Springer, Cham (2020) Sarkar, S.; Khatedi, N.; Pramanik, A.; Maiti, J.: An ensemble learning-based undersampling technique for handling class-imbalance problem. In: Proceedings of ICETIT 2019 (pp. 586–595). Springer, Cham (2020)
19.
go back to reference Bejaoui, A.; Elkhalil, K.; Kammoun, A.; Alouni, M.S.; Alnaffouri, T.: Improved design of quadratic discriminant analysis classifier in unbalanced settings. arXiv preprint arXiv:2006.06355 (2020) Bejaoui, A.; Elkhalil, K.; Kammoun, A.; Alouni, M.S.; Alnaffouri, T.: Improved design of quadratic discriminant analysis classifier in unbalanced settings. arXiv preprint arXiv:2006.06355 (2020)
20.
go back to reference Jian, C.; Gao, J.; Ao, Y.: A new sampling method for classifying imbalanced data based on support vector machine ensemble. Neurocomputing 193, 115–122 (2016)CrossRef Jian, C.; Gao, J.; Ao, Y.: A new sampling method for classifying imbalanced data based on support vector machine ensemble. Neurocomputing 193, 115–122 (2016)CrossRef
21.
go back to reference Dubey, H.; Pudi, V.: Class based weighted k-nearest neighbor over imbalance dataset. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining (pp. 305–316). Springer, Berlin, Heidelberg (2013) Dubey, H.; Pudi, V.: Class based weighted k-nearest neighbor over imbalance dataset. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining (pp. 305–316). Springer, Berlin, Heidelberg (2013)
22.
go back to reference Liu, W.; Chawla, S.; Cieslak, D.A.; Chawla, N.V.: A robust decision tree algorithm for imbalanced data sets. In Proceedings of the 2010 SIAM International Conference on Data Mining (pp. 766–777). Society for Industrial and Applied Mathematics (2010) Liu, W.; Chawla, S.; Cieslak, D.A.; Chawla, N.V.: A robust decision tree algorithm for imbalanced data sets. In Proceedings of the 2010 SIAM International Conference on Data Mining (pp. 766–777). Society for Industrial and Applied Mathematics (2010)
23.
go back to reference Tóth, J.; Tomán, H.; Hajdu, A.: Efficient sampling-based energy function evaluation for ensemble optimization using simulated annealing. Pattern Recognit. 107, 107510 (2020)CrossRef Tóth, J.; Tomán, H.; Hajdu, A.: Efficient sampling-based energy function evaluation for ensemble optimization using simulated annealing. Pattern Recognit. 107, 107510 (2020)CrossRef
24.
go back to reference Yang, D.; Wang, X.; Tian, X.; Zhang, Y.: Improving monarch butterfly optimization through simulated annealing strategy. J. Ambient Intell. Hum. Comput., 1–1, 2020 Yang, D.; Wang, X.; Tian, X.; Zhang, Y.: Improving monarch butterfly optimization through simulated annealing strategy. J. Ambient Intell. Hum. Comput., 1–1, 2020
25.
go back to reference Camelo, P.H.C.; de Carvalho, R.L.: Multilayer perceptron optimization through simulated annealing and fast simulated annealing. Acad. J. Comput., Eng. Appl. Math. 1(2), 28–31 (2020)CrossRef Camelo, P.H.C.; de Carvalho, R.L.: Multilayer perceptron optimization through simulated annealing and fast simulated annealing. Acad. J. Comput., Eng. Appl. Math. 1(2), 28–31 (2020)CrossRef
26.
go back to reference Tsai, C.F.; Lin, W.C.; Hu, Y.H.; Yao, G.T.: Under-sampling class imbalanced datasets by combining clustering analysis and instance selection. Inf. Sci. 477, 47–54 (2019)CrossRef Tsai, C.F.; Lin, W.C.; Hu, Y.H.; Yao, G.T.: Under-sampling class imbalanced datasets by combining clustering analysis and instance selection. Inf. Sci. 477, 47–54 (2019)CrossRef
27.
go back to reference Zhu, M.; Xia, J.; Jin, X.; Yan, M.; Cai, G.; Yan, J.; Ning, G.: Class weights random forest algorithm for processing class imbalanced medical data. IEEE Access 6, 4641–4652 (2018)CrossRef Zhu, M.; Xia, J.; Jin, X.; Yan, M.; Cai, G.; Yan, J.; Ning, G.: Class weights random forest algorithm for processing class imbalanced medical data. IEEE Access 6, 4641–4652 (2018)CrossRef
28.
go back to reference Li, J.; Fong, S.; Yuan, M.; Wong, R.K.: Adaptive multi-objective swarm crossover optimization for imbalanced data classification. In: International Conference on Advanced Data Mining and Applications (pp. 374–390). Springer, Cham (2016) Li, J.; Fong, S.; Yuan, M.; Wong, R.K.: Adaptive multi-objective swarm crossover optimization for imbalanced data classification. In: International Conference on Advanced Data Mining and Applications (pp. 374–390). Springer, Cham (2016)
29.
go back to reference Li, M.; Xiong, A.; Wang, L.; Deng, S.; Ye, J.: Aco resampling: enhancing the performance of oversampling methods for class imbalance classification. Knowl.-Based Syst. 196, 105818 (2020)CrossRef Li, M.; Xiong, A.; Wang, L.; Deng, S.; Ye, J.: Aco resampling: enhancing the performance of oversampling methods for class imbalance classification. Knowl.-Based Syst. 196, 105818 (2020)CrossRef
30.
go back to reference Huda, S.; Yearwood, J.; Jelinek, H.F.; Hassan, M.M.; Fortino, G.; Buckland, M.: A hybrid feature selection with ensemble classification for imbalanced healthcare data: a case study for brain tumor diagnosis. IEEE Access 4, 9145–9154 (2016)CrossRef Huda, S.; Yearwood, J.; Jelinek, H.F.; Hassan, M.M.; Fortino, G.; Buckland, M.: A hybrid feature selection with ensemble classification for imbalanced healthcare data: a case study for brain tumor diagnosis. IEEE Access 4, 9145–9154 (2016)CrossRef
31.
go back to reference Febriantono, M.A.; Pramono, S.H.; Rahmadwati, R.; Naghdy, G.: Classification of multiclass imbalanced data using cost-sensitive decision tree C50. IAES Int. J. Artif. Intell. 9(1), 65 (2020) Febriantono, M.A.; Pramono, S.H.; Rahmadwati, R.; Naghdy, G.: Classification of multiclass imbalanced data using cost-sensitive decision tree C50. IAES Int. J. Artif. Intell. 9(1), 65 (2020)
32.
go back to reference Babu, M.C.; Pushpa, S.: Genetic algorithm-based PCA classification for imbalanced dataset. In: Intelligent Computing in Engineering (pp. 541–552). Springer, Singapore (2020) Babu, M.C.; Pushpa, S.: Genetic algorithm-based PCA classification for imbalanced dataset. In: Intelligent Computing in Engineering (pp. 541–552). Springer, Singapore (2020)
33.
go back to reference Ri, J.; Kim, H.: G-mean based extreme learning machine for imbalance learning. Dig. Signal Process. 98, 102637 (2020)CrossRef Ri, J.; Kim, H.: G-mean based extreme learning machine for imbalance learning. Dig. Signal Process. 98, 102637 (2020)CrossRef
34.
go back to reference Susan, S.; Kumar, A.: Hybrid of intelligent minority oversampling and PSO-based intelligent majority undersampling for learning from imbalanced datasets. In: International Conference on Intelligent Systems Design and Applications (pp. 760–769). Springer, Cham (2018) Susan, S.; Kumar, A.: Hybrid of intelligent minority oversampling and PSO-based intelligent majority undersampling for learning from imbalanced datasets. In: International Conference on Intelligent Systems Design and Applications (pp. 760–769). Springer, Cham (2018)
35.
go back to reference El-Shafeiy, E.; Abohany, A.: Medical imbalanced data classification based on random forests. In: Joint European-US Workshop on Applications of Invariance in Computer Vision (pp. 81–91). Springer, Cham (2020) El-Shafeiy, E.; Abohany, A.: Medical imbalanced data classification based on random forests. In: Joint European-US Workshop on Applications of Invariance in Computer Vision (pp. 81–91). Springer, Cham (2020)
36.
go back to reference Yang, Y.; Huang, S.; Huang, W.; Chang, X.: Privacy-preserving cost-sensitive learning. In: IEEE Transactions on Neural Networks and Learning Systems (2020) Yang, Y.; Huang, S.; Huang, W.; Chang, X.: Privacy-preserving cost-sensitive learning. In: IEEE Transactions on Neural Networks and Learning Systems (2020)
37.
go back to reference Wang, D.; Zhang, X.; Chen, H.; Zhou, Y.: A sintering state recognition framework to integrate prior knowledge and hidden information considering class imbalance. In: IEEE Transactions on Industrial Electronics (2020) Wang, D.; Zhang, X.; Chen, H.; Zhou, Y.: A sintering state recognition framework to integrate prior knowledge and hidden information considering class imbalance. In: IEEE Transactions on Industrial Electronics (2020)
38.
go back to reference Delahaye, D.; Chaimatanan, S.; Mongeau, M.: Simulated annealing: from basics to applications. In Handbook of Metaheuristics (pp. 1–35). Springer, Cham (2019) Delahaye, D.; Chaimatanan, S.; Mongeau, M.: Simulated annealing: from basics to applications. In Handbook of Metaheuristics (pp. 1–35). Springer, Cham (2019)
39.
go back to reference Kirkpatrick, S.; Gelatt, C.D.; Vecchi, M.P.: Optimization by simulated annealing. Science 220(4598), 671–680 (1983)MathSciNetCrossRef Kirkpatrick, S.; Gelatt, C.D.; Vecchi, M.P.: Optimization by simulated annealing. Science 220(4598), 671–680 (1983)MathSciNetCrossRef
40.
go back to reference Jeong, I.S.; Kim, H.K.; Kim, T.H.; Lee, D.H.; Kim, K.J.; Kang, S.H.: A feature selection approach based on simulated annealing for detecting various denial of service attacks. Softw. Netw. 2018(1), 173–190 (2018) Jeong, I.S.; Kim, H.K.; Kim, T.H.; Lee, D.H.; Kim, K.J.; Kang, S.H.: A feature selection approach based on simulated annealing for detecting various denial of service attacks. Softw. Netw. 2018(1), 173–190 (2018)
41.
go back to reference Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)CrossRef Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)CrossRef
42.
go back to reference Lin, W.C.; Tsai, C.F.; Hu, Y.H.; Jhang, J.S.: Clustering-based undersampling in class-imbalanced data. Inf. Sci. 409, 17–26 (2017)CrossRef Lin, W.C.; Tsai, C.F.; Hu, Y.H.; Jhang, J.S.: Clustering-based undersampling in class-imbalanced data. Inf. Sci. 409, 17–26 (2017)CrossRef
43.
go back to reference Vuttipittayamongkol, P.; Elyan, E.; Petrovski, A.; Jayne, C.: Overlap-based undersampling for improving imbalanced data classification. In: International Conference on Intelligent Data Engineering and Automated Learning (pp. 689–697). Springer, Cham (2018) Vuttipittayamongkol, P.; Elyan, E.; Petrovski, A.; Jayne, C.: Overlap-based undersampling for improving imbalanced data classification. In: International Conference on Intelligent Data Engineering and Automated Learning (pp. 689–697). Springer, Cham (2018)
44.
go back to reference Czarnowski, I.; Jędrzejowicz, P.: An approach to imbalanced data classification based on instance selection and over-sampling. In: International Conference on Computational Collective Intelligence (pp. 601–610). Springer, Cham (2019) Czarnowski, I.; Jędrzejowicz, P.: An approach to imbalanced data classification based on instance selection and over-sampling. In: International Conference on Computational Collective Intelligence (pp. 601–610). Springer, Cham (2019)
45.
go back to reference Wilson, D.L.: Asymptotic properties of nearest neighbor rules using edited data. IEEE Trans. Syst., Man, Cybern. 3, 408–421 (1972)MathSciNetCrossRef Wilson, D.L.: Asymptotic properties of nearest neighbor rules using edited data. IEEE Trans. Syst., Man, Cybern. 3, 408–421 (1972)MathSciNetCrossRef
46.
go back to reference Czarnowski, I.; Jędrzejowicz, P.: Cluster-based instance selection for the imbalanced data classification. In: International Conference on Computational Collective Intelligence (pp. 191–200). Springer, Cham (2018) Czarnowski, I.; Jędrzejowicz, P.: Cluster-based instance selection for the imbalanced data classification. In: International Conference on Computational Collective Intelligence (pp. 191–200). Springer, Cham (2018)
47.
go back to reference Quinlan, J.: C4. 5: Programs for Machine Learning. Elsevier (2014) Quinlan, J.: C4. 5: Programs for Machine Learning. Elsevier (2014)
48.
go back to reference Alcalá-Fdez, J.; Fernández, A.; Luengo, J.; Derrac, J.; García, S.; Sánchez, L.; Herrera, F.: Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Multiple-Valued Log. Soft Comput. 17, 255–287 (2011) Alcalá-Fdez, J.; Fernández, A.; Luengo, J.; Derrac, J.; García, S.; Sánchez, L.; Herrera, F.: Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Multiple-Valued Log. Soft Comput. 17, 255–287 (2011)
49.
go back to reference Soltanzadeh, P.; Hashemzadeh, M.: RCSMOTE: range-controlled synthetic minority over-sampling technique for handling the class imbalance problem. Inf. Sci. 542(2021), 92–111 (2021)MathSciNetCrossRef Soltanzadeh, P.; Hashemzadeh, M.: RCSMOTE: range-controlled synthetic minority over-sampling technique for handling the class imbalance problem. Inf. Sci. 542(2021), 92–111 (2021)MathSciNetCrossRef
50.
go back to reference Han, H.; Wang, W.Y.; Mao, B.H.: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: International Conference on Intelligent Computing (pp. 878–887). Springer, Berlin, Heidelberg (2005) Han, H.; Wang, W.Y.; Mao, B.H.: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: International Conference on Intelligent Computing (pp. 878–887). Springer, Berlin, Heidelberg (2005)
51.
go back to reference Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH
Metadata
Title
An Improved Hybrid Approach for Handling Class Imbalance Problem
Authors
Abeer S. Desuky
Sadiq Hussain
Publication date
28-01-2021
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 4/2021
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-021-05347-7

Other articles of this Issue 4/2021

Arabian Journal for Science and Engineering 4/2021 Go to the issue

Research Article-Computer Engineering and Computer Science

Low-Rank Approximation of Circulant Matrix to a Noisy Matrix

Research Article-Computer Engineering and Computer Science

Machine Learning Application for Predicting Autistic Traits in Toddlers

Research Article-Computer Engineering and Computer Science

Performance Evaluation of Lightweight Encryption Algorithms for IoT-Based Applications

Premium Partners