Skip to main content
Top
Published in: Cluster Computing 3/2019

25-09-2017

Intelligent churn prediction for telecom using GP-AdaBoost learning and PSO undersampling

Authors: Adnan Idris, Aksam Iftikhar, Zia ur Rehman

Published in: Cluster Computing | Special Issue 3/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays, telecom industry faces fierce competition in satisfying its customers. This competition thus requires an efficient churn prediction system to identify customers who are ready to quit. Such churn customers are then retained through addressing relevant reasons identified by the churn prediction system. Therefore, now the role of churn prediction system is not only restricted to accurately predict churners but also to interpret customer churn behavior. In this paper, searching capabilities of genetic programming (GP) and classification capabilities of AdaBoost are integrated in order to evolve a high-performance churn prediction system having better churn identification abilities. For this, frequently selected features in various GP expressions evaluated through AdaBoost based learning, are marked and analyzed. Moreover, the issue of imbalance present in telecom datasets is also addressed through particle swarm optimization (PSO) based undersampling method, which provides unbiased distribution of training set to GP-AdaBoost based prediction system. Particle swarm optimization based undersampling method in combination with GP-AdaBoost results a churn prediction system (ChP-GPAB), which offers better learning of churners and also identifies underlying factors responsible for churn behavior of customers. Two standard telecom data sets are used for evaluation and comparison of the proposed ChP-GPAB system. The results show that the proposed ChP-GPAB system yields 0.91 AUC and 0.86 AUC on Cell2Cell and Orange datasets, in addition to identifying the reasons of churning.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference ICT: The World in: ICT Facts and Figures, Geneva (2014) ICT: The World in: ICT Facts and Figures, Geneva (2014)
2.
go back to reference Shin, Y.H., David, C.Y., Hsiu, Y.W.: Applying data mining to telecom churn management. Expert Syst. Appl. 37, 3665–3675 (2006) Shin, Y.H., David, C.Y., Hsiu, Y.W.: Applying data mining to telecom churn management. Expert Syst. Appl. 37, 3665–3675 (2006)
3.
go back to reference Bock, K.W.D., Van den Poel, D.: An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction. Expert Syst. Appl. 38, 12293–12301 (2011)CrossRef Bock, K.W.D., Van den Poel, D.: An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction. Expert Syst. Appl. 38, 12293–12301 (2011)CrossRef
4.
go back to reference Huang, Y., Kechadi, T.: An effective hybrid learning system for telecommunication churn prediction. Expert Syst. Appl. 40, 5635–5647 (2013)CrossRef Huang, Y., Kechadi, T.: An effective hybrid learning system for telecommunication churn prediction. Expert Syst. Appl. 40, 5635–5647 (2013)CrossRef
5.
go back to reference Pendharkar, P.C.: Genetic algorithm based neural network approaches for predicting churn in cellular wireless network services. Expert Syst. Appl. 36, 6714–6720 (2009)CrossRef Pendharkar, P.C.: Genetic algorithm based neural network approaches for predicting churn in cellular wireless network services. Expert Syst. Appl. 36, 6714–6720 (2009)CrossRef
6.
go back to reference Burez, J., Van den Poel, D.: Handling class imbalance in customer churn prediction. Expert Syst. Appl. 36, 4626–4636 (2009)CrossRef Burez, J., Van den Poel, D.: Handling class imbalance in customer churn prediction. Expert Syst. Appl. 36, 4626–4636 (2009)CrossRef
7.
go back to reference Mikel, G., Alberto, F., Edurne, B., Humberto, B., Francisco, H.: A review on ensembles for the class imbalance problem: bagging-boosting- and hybrid-based approaches. IEEE Trans. Syst. Man Cybern. C 42, 463–484 (2012)CrossRef Mikel, G., Alberto, F., Edurne, B., Humberto, B., Francisco, H.: A review on ensembles for the class imbalance problem: bagging-boosting- and hybrid-based approaches. IEEE Trans. Syst. Man Cybern. C 42, 463–484 (2012)CrossRef
8.
go back to reference Verbeke, W., Dejaeger, K., Martens, D., Hur, J., Baesens, B.: New insights into churn prediction in the telecommunication sector: a profit driven data mining approach. Eur. J. Oper. Res. 218, 211–229 (2012)CrossRef Verbeke, W., Dejaeger, K., Martens, D., Hur, J., Baesens, B.: New insights into churn prediction in the telecommunication sector: a profit driven data mining approach. Eur. J. Oper. Res. 218, 211–229 (2012)CrossRef
9.
go back to reference Adnan, I., Muhammad, R., Asifullah, K.: Churn prediction in telecom using random forest and PSO based data balancing in combination with various feature selection strategies. Comput. Electr. Eng. 38, 1808–1819 (2012)CrossRef Adnan, I., Muhammad, R., Asifullah, K.: Churn prediction in telecom using random forest and PSO based data balancing in combination with various feature selection strategies. Comput. Electr. Eng. 38, 1808–1819 (2012)CrossRef
10.
go back to reference Huang, B.Q., Kechadi, T.M., Buckley, B., Kiernan, G., Keogh, E., Rashid, T.: A new feature set with new window techniques for customer churn prediction in land-line telecommunications. Expert Syst. Appl. 37, 3657–3665 (2010)CrossRef Huang, B.Q., Kechadi, T.M., Buckley, B., Kiernan, G., Keogh, E., Rashid, T.: A new feature set with new window techniques for customer churn prediction in land-line telecommunications. Expert Syst. Appl. 37, 3657–3665 (2010)CrossRef
11.
go back to reference Huang, B., Buckley, B., Kechadi, T.M.: Multi-objective feature selection by using NSGA-II for customer churn prediction in telecommunications. Expert Syst. Appl. 37, 3638–3646 (2010)CrossRef Huang, B., Buckley, B., Kechadi, T.M.: Multi-objective feature selection by using NSGA-II for customer churn prediction in telecommunications. Expert Syst. Appl. 37, 3638–3646 (2010)CrossRef
12.
go back to reference Kisioglu, P., Topcu, Y.I.: Applying Bayesian belief network approach to customer churn analysis: a case study on the telecom industry of Turkey. Expert Syst. Appl. 38, 7151–7157 (2011)CrossRef Kisioglu, P., Topcu, Y.I.: Applying Bayesian belief network approach to customer churn analysis: a case study on the telecom industry of Turkey. Expert Syst. Appl. 38, 7151–7157 (2011)CrossRef
13.
go back to reference Xu, H., Zhang, Z., Zhang, Y.: Churn prediction in telecom using a hybrid two-phase feature selection method. In: Third International Symposium on Intelligent Information Technology Application, 2009. IITA 2009, pp. 576–579 (2009) Xu, H., Zhang, Z., Zhang, Y.: Churn prediction in telecom using a hybrid two-phase feature selection method. In: Third International Symposium on Intelligent Information Technology Application, 2009. IITA 2009, pp. 576–579 (2009)
14.
go back to reference Owczarczuk, M.: Churn models for prepaid customers in the cellular telecommunication industry using large data marts. Expert Syst. Appl. 37, 4710–4712 (2010)CrossRef Owczarczuk, M.: Churn models for prepaid customers in the cellular telecommunication industry using large data marts. Expert Syst. Appl. 37, 4710–4712 (2010)CrossRef
15.
go back to reference Verbeke, W., Martens, D., Mues, C., Baesens, B.: Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert Syst. Appl. 38, 2354–2364 (2011)CrossRef Verbeke, W., Martens, D., Mues, C., Baesens, B.: Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert Syst. Appl. 38, 2354–2364 (2011)CrossRef
16.
go back to reference De Bock, K.W., Van den Poel, D.: Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models. Expert Syst. Appl. 39, 6816–6826 (2012)CrossRef De Bock, K.W., Van den Poel, D.: Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models. Expert Syst. Appl. 39, 6816–6826 (2012)CrossRef
17.
go back to reference Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. C 40, 121–144 (2010)CrossRef Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. C 40, 121–144 (2010)CrossRef
18.
go back to reference Khan, G.M., Arshad, R., Mahmud, S.A., Ullah, F.: Intelligent bandwidth estimation for variable bit rate traffic. IEEE Trans. Evol. Comput. 19, 151–155 (2015)CrossRef Khan, G.M., Arshad, R., Mahmud, S.A., Ullah, F.: Intelligent bandwidth estimation for variable bit rate traffic. IEEE Trans. Evol. Comput. 19, 151–155 (2015)CrossRef
19.
go back to reference Bhowan, U., Johnston, M., Mengjie, Z., Xin, Y.: Reusing genetic programming for ensemble selection in classification of unbalanced data. IEEE Trans. Evol. Comput. 18, 893–908 (2014)CrossRef Bhowan, U., Johnston, M., Mengjie, Z., Xin, Y.: Reusing genetic programming for ensemble selection in classification of unbalanced data. IEEE Trans. Evol. Comput. 18, 893–908 (2014)CrossRef
20.
go back to reference Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33, 1–39 (2009)CrossRef Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33, 1–39 (2009)CrossRef
21.
go back to reference Sorokina, D.: Application of additive groves ensemble with multiple counts feature evaluation to KDD Cup ’09 small data set. In: Presented at the JMLR Workshop and Conference Proceedings, Paris (2009) Sorokina, D.: Application of additive groves ensemble with multiple counts feature evaluation to KDD Cup ’09 small data set. In: Presented at the JMLR Workshop and Conference Proceedings, Paris (2009)
22.
go back to reference Adnan, I., Asifullah, K., Lee, Y.S.: Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification. Appl. Intell. 39, 659–672 (2013)CrossRef Adnan, I., Asifullah, K., Lee, Y.S.: Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification. Appl. Intell. 39, 659–672 (2013)CrossRef
23.
go back to reference Amin, A., Anwar, S., Adnan, A., Nawaz, M., Howard, N., Qadir, J., et al.: Comparing oversampling techniques to handle the class imbalance problem: a customer churn prediction case study. IEEE Access 4, 7940–7957 (2016)CrossRef Amin, A., Anwar, S., Adnan, A., Nawaz, M., Howard, N., Qadir, J., et al.: Comparing oversampling techniques to handle the class imbalance problem: a customer churn prediction case study. IEEE Access 4, 7940–7957 (2016)CrossRef
24.
go back to reference Yang, P., Xu, L., Zhou, B.B., Zhang, Z., Zomaya, A.Y.: A particle swarm based hybrid system for imbalanced medical data sampling. BMC Genom. 10, S34–S34 (2009)CrossRef Yang, P., Xu, L., Zhou, B.B., Zhang, Z., Zomaya, A.Y.: A particle swarm based hybrid system for imbalanced medical data sampling. BMC Genom. 10, S34–S34 (2009)CrossRef
25.
go back to reference Au, W.H., Chan, K.C.C., Xin, Y.: A novel evolutionary data mining algorithm with applications to churn prediction. IEEE Trans. Evol. Comput. 7, 532–545 (2003)CrossRef Au, W.H., Chan, K.C.C., Xin, Y.: A novel evolutionary data mining algorithm with applications to churn prediction. IEEE Trans. Evol. Comput. 7, 532–545 (2003)CrossRef
26.
go back to reference Hadden, J., Tiwari, A., Roy, R., Ruta, D.: Computer assisted customer churn management: state-of-the-art and future trends. Comput. Oper. Res. 34, 2902–2917 (2007)CrossRef Hadden, J., Tiwari, A., Roy, R., Ruta, D.: Computer assisted customer churn management: state-of-the-art and future trends. Comput. Oper. Res. 34, 2902–2917 (2007)CrossRef
27.
go back to reference Wang, P., Emmerich, M., Li, R., Tang, K., Back, T., Yao, X.: Convex Hull-based multiobjective genetic programming for maximizing receiver operating characteristic performance. IEEE Trans. Evol. Comput. 19, 12 (2015)CrossRef Wang, P., Emmerich, M., Li, R., Tang, K., Back, T., Yao, X.: Convex Hull-based multiobjective genetic programming for maximizing receiver operating characteristic performance. IEEE Trans. Evol. Comput. 19, 12 (2015)CrossRef
28.
go back to reference Langdon, W.B., Harman, M.: Optimizing existing software with genetic programming. IEEE Trans. Evol. Comput. 19, 118–135 (2014)CrossRef Langdon, W.B., Harman, M.: Optimizing existing software with genetic programming. IEEE Trans. Evol. Comput. 19, 118–135 (2014)CrossRef
29.
go back to reference Lemmens, A., Croux, C.: Bagging and boosting classification trees to predict churn. J. Mark. Res. 43, 276–286 (2006)CrossRef Lemmens, A., Croux, C.: Bagging and boosting classification trees to predict churn. J. Mark. Res. 43, 276–286 (2006)CrossRef
30.
go back to reference Bose, I., Chen, X.: Hybrid models using unsupervised clustering for prediction of customer churn. J. Org. Comput. Electron. Commer. 19, 133–151 (2009)CrossRef Bose, I., Chen, X.: Hybrid models using unsupervised clustering for prediction of customer churn. J. Org. Comput. Electron. Commer. 19, 133–151 (2009)CrossRef
31.
go back to reference Lima, E., Mues, C., Baesens, B.: Monitoring and backtesting churn models. Expert Syst. Appl. 38, 975–982 (2011)CrossRef Lima, E., Mues, C., Baesens, B.: Monitoring and backtesting churn models. Expert Syst. Appl. 38, 975–982 (2011)CrossRef
32.
go back to reference Chen, Z.-Y., Fan, Z.-P., Sun, M.: A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data. Eur. J. Oper. Res. 223, 461–472 (2012)MathSciNetCrossRef Chen, Z.-Y., Fan, Z.-P., Sun, M.: A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data. Eur. J. Oper. Res. 223, 461–472 (2012)MathSciNetCrossRef
33.
go back to reference Keramati, A., Jafari-Marandi, R., Aliannejadi, M., Ahmadian, I., Mozaffari, M., Abbasi, U.: Improved churn prediction in telecommunication industry using data mining techniques. Appl. Softw. Comput. 24, 994–1012 (2014)CrossRef Keramati, A., Jafari-Marandi, R., Aliannejadi, M., Ahmadian, I., Mozaffari, M., Abbasi, U.: Improved churn prediction in telecommunication industry using data mining techniques. Appl. Softw. Comput. 24, 994–1012 (2014)CrossRef
34.
go back to reference Kyoungok, K., Chi-Hyuk, J., Jaewook, L.: Improved churn prediction in telecommunication industry by analyzing a large network. Expert Syst. Appl. 41, 6575–6584 (2014)CrossRef Kyoungok, K., Chi-Hyuk, J., Jaewook, L.: Improved churn prediction in telecommunication industry by analyzing a large network. Expert Syst. Appl. 41, 6575–6584 (2014)CrossRef
35.
go back to reference Ning, L., Hua, L., Jie, L., Guangquan, Z.: A customer churn prediction model in telecom industry using boosting. IEEE Trans. Industr. Inform. 10, 1659–1665 (2012) Ning, L., Hua, L., Jie, L., Guangquan, Z.: A customer churn prediction model in telecom industry using boosting. IEEE Trans. Industr. Inform. 10, 1659–1665 (2012)
36.
go back to reference Amin, A., Anwar, S., Adnan, A., Nawaz, M., Alawfi, K., Hussain, A., et al.: Customer churn prediction in the telecommunication sector using a rough set approach. Neurocomputing 237, 242–254 (2017)CrossRef Amin, A., Anwar, S., Adnan, A., Nawaz, M., Alawfi, K., Hussain, A., et al.: Customer churn prediction in the telecommunication sector using a rough set approach. Neurocomputing 237, 242–254 (2017)CrossRef
37.
go back to reference Amin, A., Khan, C., Ali, I., Anwar, S.: Customer churn prediction in telecommunication industry: with and without counter-example. In: Gelbukh, A., Espinoza, F.C., Galicia-Haro, S.N. (eds.) Proceedings on Nature-Inspired Computation and Machine Learning: 13th Mexican International Conference on Artificial Intelligence, MICAI 2014, Tuxtla Gutiérrez, Mexico, November 16–22, 2014, Part II, pp. 206–218. Springer International Publishing, Cham (2014)CrossRef Amin, A., Khan, C., Ali, I., Anwar, S.: Customer churn prediction in telecommunication industry: with and without counter-example. In: Gelbukh, A., Espinoza, F.C., Galicia-Haro, S.N. (eds.) Proceedings on Nature-Inspired Computation and Machine Learning: 13th Mexican International Conference on Artificial Intelligence, MICAI 2014, Tuxtla Gutiérrez, Mexico, November 16–22, 2014, Part II, pp. 206–218. Springer International Publishing, Cham (2014)CrossRef
38.
go back to reference Amin, A., Rahim, F., Ramzan, M., Anwar, S.: A prudent based approach for customer churn prediction. In: International Conference: Beyond Databases, Architectures and Structures, pp. 320–332 (2015)CrossRef Amin, A., Rahim, F., Ramzan, M., Anwar, S.: A prudent based approach for customer churn prediction. In: International Conference: Beyond Databases, Architectures and Structures, pp. 320–332 (2015)CrossRef
39.
go back to reference Miller, H., Clarke, S., Lane, S., Lonie, A., Lazaridiz, D., Petrovski, S., et al.: Predicting customer behaviour: the University of Melbourne’s KDD Cup report. In: Presented at the JMLR Workshop and Conference Proceedings, Paris (2009) Miller, H., Clarke, S., Lane, S., Lonie, A., Lazaridiz, D., Petrovski, S., et al.: Predicting customer behaviour: the University of Melbourne’s KDD Cup report. In: Presented at the JMLR Workshop and Conference Proceedings, Paris (2009)
40.
go back to reference Huang, B., Kechadi, M.T., Buckley, B.: Customer churn prediction in telecommunications. Expert Syst. Appl. 39, 1414–1425 (2012)CrossRef Huang, B., Kechadi, M.T., Buckley, B.: Customer churn prediction in telecommunications. Expert Syst. Appl. 39, 1414–1425 (2012)CrossRef
41.
go back to reference Idris, A., Khan, A., Lee, Y.S.: Genetic programming and adaboosting based churn prediction for telecom. In: 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 1328–1332 (2012) Idris, A., Khan, A., Lee, Y.S.: Genetic programming and adaboosting based churn prediction for telecom. In: 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 1328–1332 (2012)
42.
go back to reference Busa-Fekete, R., Kegl, B.: Accelerating AdaBoost using UCB. In: Presented at the JMLR Workshop and Conference Proceedings, Paris (2009) Busa-Fekete, R., Kegl, B.: Accelerating AdaBoost using UCB. In: Presented at the JMLR Workshop and Conference Proceedings, Paris (2009)
Metadata
Title
Intelligent churn prediction for telecom using GP-AdaBoost learning and PSO undersampling
Authors
Adnan Idris
Aksam Iftikhar
Zia ur Rehman
Publication date
25-09-2017
Publisher
Springer US
Published in
Cluster Computing / Issue Special Issue 3/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-017-1154-3

Other articles of this Special Issue 3/2019

Cluster Computing 3/2019 Go to the issue

Premium Partner