×

A survey of evolution in predictive models and impacting factors in customer churn. (English) Zbl 1373.62030

Summary: The information-based prediction models using machine learning techniques have gained massive popularity during the last few decades. Such models have been applied in a number of domains such as medical diagnosis, crime prediction, movies rating, etc. Similar is the trend in telecom industry where prediction models have been applied to predict the dissatisfied customers who are likely to change the service provider. Due to immense financial cost of customer churn in telecom, the companies from all over the world have analyzed various factors (such as call cost, call quality, customer service response time, etc.) using several learners such as decision trees, support vector machines, neural networks, probabilistic models such as Bayes, etc. This paper presents a detailed survey of models from 2000 to 2015 describing the datasets used in churn prediction, impacting features in those datasets and classifiers that are used to implement prediction model. A total of 48 studies related to churn prediction in telecom industry are discussed using 23 datasets (3 public and 20 private). Our survey aims to highlight the evolution of techniques from simple features/learners to more complex learners and feature engineering or sampling techniques. We also give an overview of the current challenges in churn prediction and suggest solutions to resolve them. This paper will allow researchers such as data analysts in general and telecom operators in particular to choose best suited techniques and features to prepare their churn prediction models.

MSC:

62B10 Statistical aspects of information-theoretic topics
68T05 Learning and adaptive systems in artificial intelligence
62P30 Applications of statistics in engineering and industry; control charts

Software:

NSGA-II; SMOTE
Full Text: DOI

References:

[1] Amin, A., Shehzad, S., Khan, C., Ali, I. and Anwar, S. (2015). Churn prediction in telecommunication industry using rough set approach. New Trends in Computational Collective Intelligence, Springer, London, pp. 83-95.
[2] Backiel, A., Verbinnen, Y., Baesens, B. and Claeskens, G. (2015). Combining local and social network classifiers to improve churn prediction. Proc. 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, ACM, pp. 651-658.
[3] Basiri, J., Taghiyareh, F. and Moshiri, B. (2010). A hybrid approach to predict churn. Services Computing Conference (APSCC), 2010 IEEE Asia-Pacific, IEEE, pp. 485-491.
[4] Baumann, A., Lessmann, S., Coussement, K. and De Bock, K. W. (2015). Maximize what matters: Predicting customer churn with decision-centric ensemble selection. In: Proceedings of the 23rd European Conference on Information Systems (ECIS), Münster, Germany, : AIS, Paper 15.
[5] Binti Oseman, K., Haris, N. A. and Bin Abu Bakar, F. (2010). Data mining in churn analysis model for telecommunication industry. Journal of Statistical Modeling and Analytics, 1: 19-27.
[6] Branduşoiu, I. and Toderean, G. (2013). Churn prediction in the telecommunications sector using support vector machines. Ann. Oradea Univ. Fascicle Manag. Technol. Eng., 1: 19-22.
[7] Brândusoiu, I. B. and Toderean, G. (2013). A neural networks approach for churn prediction modeling in mobile telecommunications industry. University of Pitesti Scientific Bulletin Series: Electronics and Computers Science, 13: 2013.
[8] Breiman, L. (1996). Bagging predictors. Mach. Learn., 24: 123-140. · Zbl 0858.68080
[9] Broomhead, D. S. and Lowe, D. (1988). Radial basis functions, multi-variable functional interpolation and adaptive networks. Technical Report, DTIC Document. · Zbl 0657.68085
[10] Burez, J. and Van den Poel, D. (2009). Handling class imbalance in customer churn prediction. Expert Syst. Appl., 36: 4626-4636.
[11] Chawla, N. V., Bowyer, K. W., Hall, L. O. and Kegelmeyer, W. P. (2002). Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res., 16: 321-357. · Zbl 0994.68128
[12] Cohen, W. W. (1995). Fast effective rule induction. Proc. Twelfth Int. Conf. Machine Learning, pp. 115-123.
[13] Colgate, M., Stewart, K. and Kinsella, R. (1996). Customer defection: A study of the student market in ireland. Int. J. Bank Mark., 14: 23-29.
[14] Colorni, A., Dorigo, M., Maniezzo, V.et al.. (1991). Distributed optimization by ant colonies. Proc. First European Conference on Artificial Life, Paris, France, , pp. 134-142.
[15] Cortes, C. and Vapnik, V. (1995). Support-vector networks. Mach. Learn., 20: 273-297. · Zbl 0831.68098
[16] Dahiya, K. and Bhatia, S. (2015). Customer churn analysis in telecom industry. 4th Int. Conf. Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions), 2015, IEEE, pp. 1-6.
[17] Deb, K., Pratap, A., Agarwal, S. and Meyarivan, T. (2002). A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE Trans. Evol. Comput., 6: 182-197.
[18] Dietterich, T. G. (2000). Ensemble methods in machine learning. International Workshop on Multiple Classifier Systems, Springer, pp. 1-15.
[19] Droftina, U., Kosir, A. and Stular, M. (2011). Subscriber churn analysis with selected mobile provider. Conference Paper September 2011. DOI: 10.13140/2.1.2942.5924 Conference: 20th International Electrotechnical and Computer Science Conference ERK 2011, At Portoroz, Slovenia. · Zbl 1414.91304
[20] Drummond, C. and Holte, R. C. (2005). Severe class imbalance: Why better algorithms arent the answer. European Conference on Machine Learning, Springer, pp. 539-546.
[21] Ferreira, J., Vellasco, M. B., Pacheco, M. A. C., Carlos, R. and Barbosa, H. (2004). Data mining techniques on the evaluation of wireless churn. ESANN, pp. 483-488.
[22] Freund, Y. and Schapire, R. E. (1995). A desicion-theoretic generalization of on-line learning and an application to boosting. European Conference on Computational Learning Theory, Springer, pp. 23-37.
[23] Gopal, R. K. and Meher, S. K. (2008). Customer churn time prediction in mobile telecommunication industry using ordinal regression. Pacific-Asia Conference on Knowledge Discovery and Data Mining, Springer, pp. 884-889.
[24] Hadden, J., Tiwari, A., Roy, R. and Ruta, D. (2006). Churn prediction: Does technology matter. Int. J. Intell. Technol. 1: 104-110.
[25] Ho, T. K., Hull, J. J. and Srihari, S. N. (1994). Decision combination in multiple classifier systems. IEEE Trans. Pattern Anal. Mach. Intell., 16: 66-75.
[26] Huang, B., Buckley, B. and Kechadi, T.-M. (2010a). Multi-objective feature selection by using nsga-ii for customer churn prediction in telecommunications. Expert Syst. Appl., 37: 3638-3646.
[27] Huang, Y., Huang, B. and Kechadi, M. (2010b). A new filter feature selection approach for customer churn prediction in telecommunications, IEEE Int. Conf. Industrial Engineering and Engineering Management (IEEM), 2010, IEEE, pp. 338-342.
[28] Hung, S.-Y., Yen, D. C. and Wang, H.-Y. (2006). Applying data mining to telecom churn management. Expert Syst. Appl., 31: 515-524.
[29] Ismail, M. R., Awang, M. K., Rahman, M. N. A. and Makhtar, M. (2015). A multi-layer perceptron approach for customer churn prediction. Int. J. Multimed. Ubiquit. Eng., 10: 213-222.
[30] Kirui, C., Hong, L., Cheruiyot, W. and Kirui, H. (2013). Predicting customer churn in mobile telephony industry using probabilistic classifiers in data mining. IJCSI Int. J. Comput. Sci. Issues, 10: 694-784.
[31] Kisioglu, P. and Topcu, Y. I. (2011). Applying bayesian belief network approach to customer churn analysis: A case study on the telecom industry of turkey. Expert Syst. Appl., 38: 7151-7157.
[32] Kraljević, G. and Gotovac, S. (2010). Modeling data mining applications for prediction of prepaid churn in telecommunication services. AUTOMATIKA: časopis za automatiku, mjerenje, elektroniku, računarstvo i komunikacije, 51: 275-283.
[33] Lemmens, A. and Croux, C. (2006). Bagging and boosting classification trees to predict churn. J. Mark. Res., 43: 276-286.
[34] Li, P., Li, S., Bi, T. and Liu, Y. (2014). Telecom customer churn prediction method based on cluster stratified sampling logistic regression. Int. Conf. Software Intelligence Technologies and Applications & Int. Conf. Frontiers of Internet of Things 2014, IET, pp. 282-287.
[35] Liu, X.-Y., Wu, J. and Zhou, Z.-H. (2009). Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man, Cybern. B (Cybernetics), 39: 539-550.
[36] Liu, Y. and Yao, X. (1999). Ensemble learning via negative correlation. Neural Netw., 12: 1399-1404.
[37] Liu, Y. and Zhuang, Y. (2015). Research model of churn prediction based on customer segmentation and misclassification cost in the context of big data. J. Comput. Commun., 3: 87.
[38] Maldonado, S., Flores, Á., Verbraken, T., Baesens, B. and Weber, R. (2015). Profit-based feature selection using support vector machines-general framework and an application for customer retention. Appl. Soft Comput., 35: 740-748.
[39] Martens, D., Baesens, B. and Van Gestel, T. (2009). Decompositional rule extraction from support vector machines by active learning. IEEE Trans. Knowl. Data Eng., 21: 178-191.
[40] Mohammadi, G., Tavakkoli-Moghaddam, R. and Mohammadi, M. (2013). Hierarchical neural regression models for customer churn prediction. J. Eng., 2013: 9, Article ID 543940.
[41] Mollineda, R., Alejo, R. and Sotoca, J. (2007). The class imbalance problem in pattern classification and learning. II Congreso Español de Informática (CEDI 2007). ISBN, pp. 978-84.
[42] Mozer, M. C., Wolniewicz, R., Grimes, D. B., Johnson, E. and Kaushansky, H. (2000). Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry. IEEE Trans. Neural Netw., 11: 690-696.
[43] Nath, S. V. and Behara, R. S. (2003). Customer churn analysis in the wireless industry: A data mining approach. Proc. Annual Meeting of the Decision Sciences Institute, pp. 505-510.
[44] Olle, G. D. O. and Cai, S. (2014). A hybrid churn prediction model in mobile telecommunication industry. Int. J. e-Educ. e-Bus. e-Manag. e-Learn., 4: 55.
[45] Owczarczuk, M. (2010). Churn models for prepaid customers in the cellular telecommunication industry using large data marts. Expert Syst. Appl., 37: 4710-4712.
[46] Pawlak, Z. (1982). Rough sets. Int. J. Comput. Inf. Sci., 11: 341-356. · Zbl 0501.68053
[47] Peng, L., Xiaoyang, Y., Boyu, S. and Jiuling, H. (2013). Telecom customer churn prediction based on imbalanced data re-sampling method. Int. Conf. Measurement, Information and Control (ICMIC), 2013, IEEE, pp. 229-233.
[48] Phua, C., Cao, H., Gomes, J. B. and Nguyen, M. N. (2012). Predicting near-future churners and win-backs in the telecommunications industry. arXiv:1210.6891.
[49] Quinlan, J. R. (2014). C4. 5: Programs for Machine Learning, Elsevier, 2014.
[50] Qureshi, S. A., Rehman, A. S., Qamar, A. M., Kamal, A. and Rehman, A. (2013). Telecommunication subscribers’ churn prediction model using machine learning. Eighth Int. Conf. Digital Information Management (ICDIM), 2013, IEEE, pp. 131-136.
[51] Richter, Y., Yom-Tov, E. and Slonim, N. (2010). Predicting customer churn in mobile networks through analysis of social groups. SDM, SIAM, pp. 732-741.
[52] Rodan, A., Fayyoumi, A., Faris, H., Alsakran, J. and AIK-adi, O. (2015). Negative correlation learning for customer churn prediction: A comparison study. Sci. World J., 2015: 1-7.
[53] Rosenblatt, F. (1961). Principles of neurodynamics. Perceptrons and the theory of brain mechanisms. Report, DTIC Document. · Zbl 0143.43504
[54] Sato, T., Huang, B., Lefait, G., Kechadi, M. T. and Buckley, B. (2009). Kernel-based principal components analysis on large telecommunication data. Proc. Eighth Australasian Data Mining Conference-Volume 101, Australian Computer Society, Inc., pp. 109-115.
[55] Schölkopf, B., Platt, J. C., Shawe-Taylor, J., Smola, A. J. and Williamson, R. C. (2001). Estimating the support of a high-dimensional distribution. Neural Comput., 13: 1443-1471. · Zbl 1009.62029
[56] Shaaban, E., Helmy, Y., Khedr, A. and Nasr, M. (2012). A proposed churn prediction model. IJERA, 2: 693-697.
[57] Sharma, A. and Kumar Panigrahi, P. (2011). A neural network based approach for predicting customer churn in cellular network services. Int. J. Comput. Appl., 27: 26-31.
[58] Tsai, C.-F. and Chen, M.-Y. (2010). Variable selection by association rules for customer churn prediction of multimedia on demand. Expert Syst. Appl., 37: 2006-2015.
[59] Tsai, C.-F. and Lu, Y.-H. (2009). Customer churn prediction by hybrid neural networks. Expert Syst. Appl., 36: 12547-12553.
[60] Ultsch, A. (1999). Data mining and knowledge discovery with emergent self-organizing feature maps for multivariate time series. Kohonen Maps, 46: 33-46.
[61] Umayaparvathi, V. and Iyakutti, K. (2012). Applications of data mining techniques in telecom churn prediction. Int. J. Comput. Appl., 42: 5-9.
[62] Vafeiadis, T., Diamantaras, K. I., Sarigiannidis, G. and Chatzisavvas, K. C. (2015). A comparison of machine learning techniques for customer churn prediction. Simul. Model. Pract. Theory, 55: 1-9.
[63] Verbeke, W., Dejaeger, K., Martens, D., Hur, J. and Baesens, B. (2012). New insights into churn prediction in the telecommunication sector: A profit driven data mining approach. Euro. J. Oper. Res., 218: 211-229.
[64] Verbeke, W., Martens, D., Mues, C. and Baesens, B. (2011). Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert Syst. Appl., 38: 2354-2364.
[65] Woods, K., Bowyer, K. and Kegelmeyer, W. P. Jr (1996). Combination of multiple classifiers using local accuracy estimates. Proceedings CVPR’96, 1996 IEEE Computer Society Conf. Computer Vision and Pattern Recognition, 1996, IEEE, pp. 391-396.
[66] Xia, G.-E. and Jin, W.-D. (2008). Model of customer churn prediction on support vector machine. Syst. Eng. Theory Pract., 28: 71-77.
[67] Xiao, J., Xiao, Y., Huang, A., Liu, D. and Wang, S. (2015). Feature-selection-based dynamic transfer ensemble model for customer churn prediction. Knowl. Inf. Syst., 43: 29-51.
[68] Zhang, Y., Liang, R., Li, Y., Zheng, Y. and Berry, M. (2011). Behavior-based telecommunication churn prediction with neural network approach. Int. Symp. Computer Science and Society (ISCCS), 2011, IEEE, pp. 307-310.
[69] Zhao, Y., Li, B., Li, X., Liu, W. and Ren, S. (2005). Customer churn prediction using improved one-class support vector machine. Int. Conf. Advanced Data Mining and Applications, Springer, pp. 300-306.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.