Abstract
Data mining techniques can aid companies in evaluation of customers that generate highest amount of revenue in a direct marketing campaign. Most commonly, customer value is evaluated by a uniform segmentation of customers (20% for each segment) based on buying behavior using recency, frequency and monetary (RFM) attributes, whereby for direct campaigns the segments with the highest score of these attributes are subjectively selected. In this paper, the method of k-means clustering, according to RFM attributes is proposed, based on which the customer value can be more objectively determined. The most valuable customers, as a rule, are the smallest group compared to other clusters, so the problem of class imbalance occurs. In order to overcome this problem, a hybrid Support Vector Machine Rule Extraction (SVM-RE) method is proposed for predicting which customer belongs to a cluster, based on data on consumer characteristics and offered products. The SVM classifier is known as a good predictor in case of class imbalance, but does not generate an interpretable model. Therefore, the Decision Tree (DT) method generates rules, based on the prediction result of the SVM classifier. The results of the empirical case study showed, that using this hybrid method with good classification performance, customer value level can be predicted, i.e. targeting existing and new buyers for direct marketing campaigns can be efficiently done, regardless of the class imbalance problem. It’s also shown that using the hybrid SVM-RE method, it is possible to obtain significantly better prediction accuracy than using the DT method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Jonker, J., Piersma, N., Van den Poel, D.: Joint optimization of customer segmentation and marketing policy to maximize long-term profitability. Expert Syst. Appl. 27, 159–168 (2004)
Kaymak, U.: Fuzzy target selection using RFM variables. In: Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569)
Hughes, A.: Strategic Database Marketing. McGraw-Hill, New York (2005)
McCarty, J., Hastak, M.: Segmentation approaches in data-mining: a comparison of RFM, CHAID, and logistic regression. J. Bus. Res. 60, 656–662 (2007)
Olson, D., Cao, Q., Gu, C., Lee, D.: Comparison of customer response models. Serv. Bus. 3, 117–130 (2009)
Olson, D., Chae, B.: Direct marketing decision support through predictive customer response modeling. Decis. Support Syst. 54, 443–451 (2012)
Cui, G., Wong, M., Wan, X.: Targeting high value customers while under resource constraint: partial order constrained optimization with genetic algorithm. J. Interact. Market. 29, 27–37 (2015)
Kim, D., Lee, H., Cho, S.: Response modeling with support vector regression. Expert Syst. Appl. 34, 1102–1108 (2008)
Otter, P.W., Scheer, H.V.D., Wansbeek, T.: Optimal selection of households for direct marketing by joint modeling of the probability and quantity of response. s.n. University of Groningen, CCSO Centre for Economic Research, Working Papers (2006)
Malthouse, E.: Ridge regression and direct marketing scoring models. J. Interact. Market. 13, 10–23 (1999)
Wu, J., Lin, Z.: Research on customer segmentation model by clustering. In: Proceedings of the 7th International Conference on Electronic Commerce - ICEC 2005 (2005)
Drozdenki, R., Drake, P.: Optimal Database Marketing. Sage Publications, Thousand Oaks (2002)
Hosseini, S., Maleki, A., Gholamian, M.: Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty. Expert Syst. Appl. 37, 5259–5264 (2010)
Sarvari, P., Ustundag, A., Takci, H.: Performance evaluation of different customer segmentation approaches based on RFM and demographics analysis. Kybernetes 45, 1129–1157 (2016)
Khalili-Damghani, K., Abdi, F., Abolmakarem, S.: Hybrid soft computing approach based on clustering, rule mining, and decision tree analysis for customer segmentation problem: real case of customer-centric industries. Appl. Soft Comput. 73, 816–828 (2018)
Kim, G., Chae, B., Olson, D.: A support vector machine (SVM) approach to imbalanced datasets of customer responses: comparison with other customer response models. Serv. Bus. 7, 167–182 (2012)
Miguéis, V.L., Camanho, A.S., Borges, J.: Predicting direct marketing response in banking: comparison of class imbalance methods. Serv. Bus. 11, 831–849 (2017)
Farquad, M., Bose, I.: Preprocessing unbalanced data using support vector machine. Decis. Support Syst. 53, 226–233 (2012)
Barakat, N., Bradley, A.P.: Rule extraction from support vector machines: a review. Neurocomputing. 74, 178–190 (2010)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (2010)
Sanderson, M., Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008). Nat. Lang. Eng. 16(1), 100–103 (2010)
Martens, D., Huysmans, J., Setiono, R., Vanthienen, J., Baesens, B.: Rule extraction from support vector machines: an overview of issues and application in credit scoring. In: Diederich, J. (ed.) Rule Extraction from Support Vector Machines. Studies in Computational Intelligence, vol. 80, pp. 33–63. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-75390-2_2
Diederich, J.: Rule extraction from support vector machines: an introduction. In: Diederich, J. (ed.) Rule Extraction from Support Vector Machines. Studies in Computational Intelligence, vol. 80, pp. 3–31. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-75390-2_1
Martens, D., Baesens, B., Gestel, T.V., Vanthienen, J.: Comprehensible credit scoring models using rule extraction from support vector machines. Eur. J. Oper. Res. 183, 1466–1476 (2007)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1, 81–106 (1986)
Quinlan, J.R.: C4.5 - Programs for Machine Learning. Kaufmann, San Mateo (1992)
Breiman, L.: Classification and Regression Trees. Wadsworth International Group, Belmont (1984)
Kašćelan, L., Kašćelan, V., Jovanović, M.: Hybrid support vector machine rule extraction method for discovering the preferences of stock market investors: evidence from Montenegro. Intell. Autom. Soft Comput. 21, 503–522 (2014)
Hughes, A.M.: Strategic Database Marketing: The Masterplan for Starting and Managing a Profitable, Customer-Based Marketing Program. Irwin, Chicago (1994)
Wang, C.-H.: Apply robust segmentation to the service industry using kernel induced fuzzy clustering techniques. Expert Syst. Appl. 37, 8395–8400 (2010)
Marshall, P.: The 80/20 Rule of Sales: How to Find Your Best Customers. https://www.entrepreneur.com/article/229294
Hsieh, N.-C.: An integrated data mining and behavioral scoring model for analyzing bank customers. Expert Syst. Appl. 27, 623–633 (2004)
Tsai, C.-Y., Chiu, C.-C.: A purchase-based market segmentation methodology. Expert Syst. Appl. 27, 265–276 (2004)
Cheng, C.-H., Chen, Y.-S.: Classifying the segmentation of customer value via RFM model and RS theory. Expert Syst. Appl. 36, 4176–4184 (2009)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Conference on Mathematical Statistics and Probability, vol. 1, pp. 281–297 (1967)
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 1, 224–227 (1979)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Rogic, S., Kascelan, L. (2019). Customer Value Prediction in Direct Marketing Using Hybrid Support Vector Machine Rule Extraction Method. In: Welzer, T., et al. New Trends in Databases and Information Systems. ADBIS 2019. Communications in Computer and Information Science, vol 1064. Springer, Cham. https://doi.org/10.1007/978-3-030-30278-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-30278-8_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30277-1
Online ISBN: 978-3-030-30278-8
eBook Packages: Computer ScienceComputer Science (R0)