Skip to main content

Customer Value Prediction in Direct Marketing Using Hybrid Support Vector Machine Rule Extraction Method

  • Conference paper
  • First Online:
New Trends in Databases and Information Systems (ADBIS 2019)

Abstract

Data mining techniques can aid companies in evaluation of customers that generate highest amount of revenue in a direct marketing campaign. Most commonly, customer value is evaluated by a uniform segmentation of customers (20% for each segment) based on buying behavior using recency, frequency and monetary (RFM) attributes, whereby for direct campaigns the segments with the highest score of these attributes are subjectively selected. In this paper, the method of k-means clustering, according to RFM attributes is proposed, based on which the customer value can be more objectively determined. The most valuable customers, as a rule, are the smallest group compared to other clusters, so the problem of class imbalance occurs. In order to overcome this problem, a hybrid Support Vector Machine Rule Extraction (SVM-RE) method is proposed for predicting which customer belongs to a cluster, based on data on consumer characteristics and offered products. The SVM classifier is known as a good predictor in case of class imbalance, but does not generate an interpretable model. Therefore, the Decision Tree (DT) method generates rules, based on the prediction result of the SVM classifier. The results of the empirical case study showed, that using this hybrid method with good classification performance, customer value level can be predicted, i.e. targeting existing and new buyers for direct marketing campaigns can be efficiently done, regardless of the class imbalance problem. It’s also shown that using the hybrid SVM-RE method, it is possible to obtain significantly better prediction accuracy than using the DT method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Jonker, J., Piersma, N., Van den Poel, D.: Joint optimization of customer segmentation and marketing policy to maximize long-term profitability. Expert Syst. Appl. 27, 159–168 (2004)

    Article  Google Scholar 

  2. Kaymak, U.: Fuzzy target selection using RFM variables. In: Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569)

    Google Scholar 

  3. Hughes, A.: Strategic Database Marketing. McGraw-Hill, New York (2005)

    Google Scholar 

  4. McCarty, J., Hastak, M.: Segmentation approaches in data-mining: a comparison of RFM, CHAID, and logistic regression. J. Bus. Res. 60, 656–662 (2007)

    Article  Google Scholar 

  5. Olson, D., Cao, Q., Gu, C., Lee, D.: Comparison of customer response models. Serv. Bus. 3, 117–130 (2009)

    Article  Google Scholar 

  6. Olson, D., Chae, B.: Direct marketing decision support through predictive customer response modeling. Decis. Support Syst. 54, 443–451 (2012)

    Article  Google Scholar 

  7. Cui, G., Wong, M., Wan, X.: Targeting high value customers while under resource constraint: partial order constrained optimization with genetic algorithm. J. Interact. Market. 29, 27–37 (2015)

    Article  Google Scholar 

  8. Kim, D., Lee, H., Cho, S.: Response modeling with support vector regression. Expert Syst. Appl. 34, 1102–1108 (2008)

    Article  Google Scholar 

  9. Otter, P.W., Scheer, H.V.D., Wansbeek, T.: Optimal selection of households for direct marketing by joint modeling of the probability and quantity of response. s.n. University of Groningen, CCSO Centre for Economic Research, Working Papers (2006)

    Google Scholar 

  10. Malthouse, E.: Ridge regression and direct marketing scoring models. J. Interact. Market. 13, 10–23 (1999)

    Article  Google Scholar 

  11. Wu, J., Lin, Z.: Research on customer segmentation model by clustering. In: Proceedings of the 7th International Conference on Electronic Commerce - ICEC 2005 (2005)

    Google Scholar 

  12. Drozdenki, R., Drake, P.: Optimal Database Marketing. Sage Publications, Thousand Oaks (2002)

    Google Scholar 

  13. Hosseini, S., Maleki, A., Gholamian, M.: Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty. Expert Syst. Appl. 37, 5259–5264 (2010)

    Article  Google Scholar 

  14. Sarvari, P., Ustundag, A., Takci, H.: Performance evaluation of different customer segmentation approaches based on RFM and demographics analysis. Kybernetes 45, 1129–1157 (2016)

    Article  Google Scholar 

  15. Khalili-Damghani, K., Abdi, F., Abolmakarem, S.: Hybrid soft computing approach based on clustering, rule mining, and decision tree analysis for customer segmentation problem: real case of customer-centric industries. Appl. Soft Comput. 73, 816–828 (2018)

    Article  Google Scholar 

  16. Kim, G., Chae, B., Olson, D.: A support vector machine (SVM) approach to imbalanced datasets of customer responses: comparison with other customer response models. Serv. Bus. 7, 167–182 (2012)

    Article  Google Scholar 

  17. Miguéis, V.L., Camanho, A.S., Borges, J.: Predicting direct marketing response in banking: comparison of class imbalance methods. Serv. Bus. 11, 831–849 (2017)

    Article  Google Scholar 

  18. Farquad, M., Bose, I.: Preprocessing unbalanced data using support vector machine. Decis. Support Syst. 53, 226–233 (2012)

    Article  Google Scholar 

  19. Barakat, N., Bradley, A.P.: Rule extraction from support vector machines: a review. Neurocomputing. 74, 178–190 (2010)

    Article  Google Scholar 

  20. Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (2010)

    MATH  Google Scholar 

  21. Sanderson, M., Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008). Nat. Lang. Eng. 16(1), 100–103 (2010)

    Google Scholar 

  22. Martens, D., Huysmans, J., Setiono, R., Vanthienen, J., Baesens, B.: Rule extraction from support vector machines: an overview of issues and application in credit scoring. In: Diederich, J. (ed.) Rule Extraction from Support Vector Machines. Studies in Computational Intelligence, vol. 80, pp. 33–63. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-75390-2_2

    Chapter  MATH  Google Scholar 

  23. Diederich, J.: Rule extraction from support vector machines: an introduction. In: Diederich, J. (ed.) Rule Extraction from Support Vector Machines. Studies in Computational Intelligence, vol. 80, pp. 3–31. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-75390-2_1

    Chapter  MATH  Google Scholar 

  24. Martens, D., Baesens, B., Gestel, T.V., Vanthienen, J.: Comprehensible credit scoring models using rule extraction from support vector machines. Eur. J. Oper. Res. 183, 1466–1476 (2007)

    Google Scholar 

  25. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1, 81–106 (1986)

    Google Scholar 

  26. Quinlan, J.R.: C4.5 - Programs for Machine Learning. Kaufmann, San Mateo (1992)

    Google Scholar 

  27. Breiman, L.: Classification and Regression Trees. Wadsworth International Group, Belmont (1984)

    MATH  Google Scholar 

  28. Kašćelan, L., Kašćelan, V., Jovanović, M.: Hybrid support vector machine rule extraction method for discovering the preferences of stock market investors: evidence from Montenegro. Intell. Autom. Soft Comput. 21, 503–522 (2014)

    Article  Google Scholar 

  29. Hughes, A.M.: Strategic Database Marketing: The Masterplan for Starting and Managing a Profitable, Customer-Based Marketing Program. Irwin, Chicago (1994)

    Google Scholar 

  30. Wang, C.-H.: Apply robust segmentation to the service industry using kernel induced fuzzy clustering techniques. Expert Syst. Appl. 37, 8395–8400 (2010)

    Article  Google Scholar 

  31. Marshall, P.: The 80/20 Rule of Sales: How to Find Your Best Customers. https://www.entrepreneur.com/article/229294

  32. Hsieh, N.-C.: An integrated data mining and behavioral scoring model for analyzing bank customers. Expert Syst. Appl. 27, 623–633 (2004)

    Article  Google Scholar 

  33. Tsai, C.-Y., Chiu, C.-C.: A purchase-based market segmentation methodology. Expert Syst. Appl. 27, 265–276 (2004)

    Article  Google Scholar 

  34. Cheng, C.-H., Chen, Y.-S.: Classifying the segmentation of customer value via RFM model and RS theory. Expert Syst. Appl. 36, 4176–4184 (2009)

    Article  Google Scholar 

  35. MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Conference on Mathematical Statistics and Probability, vol. 1, pp. 281–297 (1967)

    Google Scholar 

  36. Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 1, 224–227 (1979)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Suncica Rogic .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rogic, S., Kascelan, L. (2019). Customer Value Prediction in Direct Marketing Using Hybrid Support Vector Machine Rule Extraction Method. In: Welzer, T., et al. New Trends in Databases and Information Systems. ADBIS 2019. Communications in Computer and Information Science, vol 1064. Springer, Cham. https://doi.org/10.1007/978-3-030-30278-8_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-30278-8_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30277-1

  • Online ISBN: 978-3-030-30278-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics