Skip to main content

Towards a Large Scale Practical Churn Model for Prepaid Mobile Markets

  • Conference paper
  • First Online:
Advances in Data Mining. Applications and Theoretical Aspects (ICDM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10357))

Included in the following conference series:

Abstract

Communication Service Providers (CSPs) are increasingly focusing on effective churn management strategies due to fierce competition, price wars and high subscriber acquisition cost. Though there are many existing studies utilizing data mining techniques for building churn prediction models, addressing aspects such as large data volumes and high dimensionality of subscriber data is challenging in practice. In this paper, we present a large scale churn management system, utilizing dimensionality reduction and a decision-tree ensemble learning. We consider subscriber behavior trends as well as aggregated key performance indicators (KPIs) as features and use a combination of feature selection strategies for dimensionality reduction. We report favorable results from evaluating our model on a real-world dataset consisting of approximately 5 million active subscribers from a popular Asian CSP. The model output is presented as an actionable lift chart which will help marketers in deciding the target subscriber base for retention campaigns. We also present the practical aspects involved in productionizing our model using the Apache Oozie workflow engine.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kang, C., Pei-ji, S.: Customer churn prediction based on svm-rfe. In: International Seminar on Business and Information Management, ISBIM 2008, vol. 1, pp.306–309. IEEE (2008)

    Google Scholar 

  2. Hadden, J., Tiwari, A., Roy, R., Ruta, D.: Computer assisted customer churn management: State-of-the-art and future trends. Computers & Operations Research 34, 2902–2917 (2007)

    Article  MATH  Google Scholar 

  3. Wei, C.P., Chiu, I.T.: Turning telecommunications call details to churn prediction: a data mining approach. Expert Systems with Applications 23, 103–112 (2002)

    Article  Google Scholar 

  4. Kim, H.S., Yoon, C.H.: Determinants of subscriber churn and customer loyalty in the korean mobile telephony market. Telecommunications Policy 28, 751–765 (2004)

    Article  Google Scholar 

  5. Commission, F.C., et al.: Annual report and analysis of competitive market conditions with respect to mobile wireless, including commercial mobile services, Washington, DC (2010)

    Google Scholar 

  6. Hung, S.Y., Yen, D.C., Wang, H.Y.: Applying data mining to telecom churn management. Expert Systems with Applications 31, 515–524 (2006)

    Article  Google Scholar 

  7. Huang, B., Kechadi, M.T., Buckley, B.: Customer churn prediction in telecommunications. Expert Systems with Applications 39, 1414–1425 (2012)

    Article  Google Scholar 

  8. Han, Q., Ferreira, P.: Determinants of subscriber churn in wireless networks: Role of peer influence (2015)

    Google Scholar 

  9. Kraljević, G., Gotovac, S.: Modeling data mining applications for prediction of prepaid churn in telecommunication services. AUTOMATIKA: časopis za automatiku, mjerenje, elektroniku, računarstvo i komunikacije 51 275–283(2010)

    Google Scholar 

  10. Radosavljevik, D., Van Der Putten, P., Larsen, K.K.: The impact of experimental setup on prepaid churn modeling: data, population and outcome definition. In: Industrial Conference on Data Mining-Workshops, pp. 14–27. Citeseer(2010)

    Google Scholar 

  11. Kusuma, P.D., Radosavljevik, D., Takes, F.W., van der Putten, P.: Combining customer attribute and social network mining for prepaid mobile churn prediction. In: Proceedings of the 23rd Annual Belgian Dutch Conference on Machine Learning (Benelearn), pp. 50–58(2013)

    Google Scholar 

  12. Owczarczuk, M.: Churn models for prepaid customers in the cellular telecommunication industry using large data marts. Expert Systems with Applications 37, 4710–4712 (2010)

    Article  Google Scholar 

  13. Quinlan, J.R.: Simplifying decision trees. International Journal of Man-Machine Studies 27, 221–234 (1987)

    Article  Google Scholar 

  14. Bishop, C.M., et al.: Neural networks for pattern recognition (1995)

    Google Scholar 

  15. Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20, 273–297 (1995)

    MATH  Google Scholar 

  16. Powers, D.M.: Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation (2011)

    Google Scholar 

  17. Umayaparvathi, V., Iyakutti, K.: Applications of data mining techniques in telecom churn prediction. International Journal of Computer Applications 42, 5–9 (2012)

    Article  Google Scholar 

  18. Au, W.H., Chan, K.C., Yao, X.: A novel evolutionary data mining algorithm with applications to churn prediction. IEEE Transactions on Evolutionary Computation 7, 532–545 (2003)

    Article  Google Scholar 

  19. Idris, A., Khan, A., Lee, Y.S.: Genetic programming and adaboosting based churn prediction for telecom. In: 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 1328–1332. IEEE (2012)

    Google Scholar 

  20. Foundation, T.A.S.: Apache pig: (2015). https://pig.apache.org/

  21. Kent, J.T.: Information gain and a general measure of correlation. Biometrika 70, 163–173 (1983)

    Article  MathSciNet  MATH  Google Scholar 

  22. Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelligence 97, 273–324 (1997)

    Article  MATH  Google Scholar 

  23. McCullagh, P., Nelder, J.A.: Generalized linear models, vol. 37. CRC Press (1989)

    Google Scholar 

  24. Breiman, L.: Random forests. Machine Learning 45, 5–32 (2001)

    Article  MATH  Google Scholar 

  25. Skurichina, M., Duin, R.P.: Bagging, boosting and the random subspace method for linear classifiers. Pattern Analysis & Applications 5, 121–135 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  26. Wikipedia: Hyperparameter optimization – wikipedia, the free encyclopedia (2017) (online accessed, April 7, 2017)

    Google Scholar 

  27. Wikipedia: Mobile telephone numbering in india – wikipedia, the free encyclopedia (2017) (online accessed, April 7, 2017)

    Google Scholar 

  28. Uregina: Cumulative gains and lift charts (2015) (online accessed, April 7, 2017)

    Google Scholar 

  29. Chu, C., Kim, S.K., Lin, Y.A., Yu, Y., Bradski, G., Ng, A.Y., Olukotun, K.: Map-reduce for machine learning on multicore. Advances in Neural Information Processing Systems 19, 281 (2007)

    Google Scholar 

  30. Spark, A.: Apache spark-lightning-fast cluster computing (2014)

    Google Scholar 

  31. Islam, M., Huang, A.K., Battisha, M., Chiang, M., Srinivasan, S., Peters, C., Neumann, A., Abdelnur, A.: Oozie: towards a scalable workflow management system for hadoop. In: Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, 4. ACM (2012)

    Google Scholar 

  32. Meng, X., Bradley, J., Yuvaz, B., Sparks, E., Venkataraman, S., Liu, D., Freeman, J., Tsai, D., Amde, M., Owen, S., et al.: Mllib: Machine learning in apache spark. JMLR 17, 1–7 (2016)

    MathSciNet  MATH  Google Scholar 

  33. Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisciplinary Reviews: Computational Statistics 2, 433–459 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amit Kumar Meher .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Meher, A.K., Wilson, J., Prashanth, R. (2017). Towards a Large Scale Practical Churn Model for Prepaid Mobile Markets. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2017. Lecture Notes in Computer Science(), vol 10357. Springer, Cham. https://doi.org/10.1007/978-3-319-62701-4_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62701-4_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62700-7

  • Online ISBN: 978-3-319-62701-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics