Abstract
Measuring indirect importance of various attributes is a very common task in marketing analysis for which researchers use correlation and regression techniques. We have listed and illustrated some common problems with widely used latent importance measures. A more theoretically sound approach – the Shapley Value decomposition – was applied to a rich data set of US internet stores. The use of store-level data instead of respondent-level data allowed us to reveal the factors, which are powerful in explaining, why some stores have higher rates of willingness to make repeat purchases than the others. By confronting the indirect importance and performance measures for three different internet stores, we have revealed strengths, weaknesses, attributes that the company should bring customers' attention to and attributes that do not require immediate improvement.
Similar content being viewed by others
INTRODUCTION
Nowadays, internet forums and special web services give customers an opportunity to express their opinion about the quality of goods and services sold online. The data containing information on satisfaction with various attributes of internet stores are very valuable for researchers and marketers.1, 2 E-tailers gradually realize that they can reduce costs by increasing loyalty, because it is cheaper to retain existing customers than to attract new ones. Therefore, revealing which attributes have a stronger influence on loyalty and satisfaction of internet shoppers can help e-tailers to become more competitive by concentrating on the most important areas.
Researchers studied the determinants of customer loyalty and satisfaction in various markets.3, 4, 5 Unlike traditional retailing, e-commerce only starts to attract the attention of researchers.6, 7, 8, 9, 10 Marketing scholars try to determine the characteristics that influence customer satisfaction and loyalty,11, 12 as well as characteristics, influencing the effectiveness of marketing activity and internet stores’ success.13, 14 Compared with offline stores, internet stores offer more opportunities for interactive and personalized marketing.15 It is easier for online buyers to compare alternatives, especially in the case of functional products and services. The fact that operations are available at the click of the mouse raises concerns about customer loyalty: if customers can easily compare their purchasing opportunities, they may be satisfied with the service, but may easily switch between stores in search of a cheaper store.
Satisfaction and loyalty measures are not perfect substitutes:16, 17 a consumer can be loyal without being fully satisfied (for example, in the case of few alternative stores) or she can be satisfied, but not loyal (for example, in the case of a large number of alternatives). In our study, we adhere to the definition of loyalty given by Srinivasan et al,18 according to which customer loyalty of a particular online store is expressed in their willingness to make repeat purchases.
Most customer satisfaction and loyalty studies analyze a single company, using proprietary survey data. Instead, we gather publicly available data on a sample of internet stores, which give us an opportunity to answer the following questions:
-
1
Why some internet stores are more successful than the others in terms of repeat purchase intentions, that is, which attributes have the highest indirect importance for online shoppers?
Fontenot et al19 argue that the declared importance (stated by respondents directly) gives a weaker differentiation of attribute importance than indirect importance, obtained by quantifying the influence of attribute ratings on some overall satisfaction or loyalty measure. If a researcher comes to the conclusion that attributes have almost indistinguishable importance for respondents, such a result has little practical value and is probably caused by the deficiencies of the approach to measuring importance. In this article, we give details of the Shapley Value approach to measuring indirect attribute importance. This method is said to be used by some market research companies and in the field of labor economics, but has not been widely used in academic marketing papers.
-
1
How can an internet store combine data on its performance for every attribute and the importance of an attribute to conduct ‘performance-importance’ strategic quadrant-analysis of its customers’ satisfaction?
DATA DESCRIPTION
The data were collected in September 2010 from www.Bizrate.com. The data set consists of average grades given by customers to 804 non-food internet stores. All grades are on a 10-point scale with 10 being excellent. We use a relatively strict requirement for the sample size, based on which the average grade is calculated – no less than 250 respondents during the last 3 months. This ensures that the average grade reflects the opinion of a reasonable number of people, not just a handful of customers.
A descriptive analysis of the sample is presented in Table 1. The coefficient of variation (the ratio of the standard deviation to the mean, multiplied by 100) indicates that grades for all attributes are rather homogeneous. Shipping charges ratings are the least homogeneous (the coefficient of variation is 11.4 per cent). It is worth mentioning that ‘Overall rating’ and ‘Would shop here again’ have very similar distribution and they are strongly correlated (Pearson correlation coefficient=0.937). That is why further we use only ‘Would shop here again’ as the dependent variable.
Before conducting the multivariate analysis, we did z-standardization of all attribute ratings (as a result, all the variables have mean=0 and standard deviation=1): Z ij =((R ij −R¯ j )/s j ), where R ij – store i rating on attribute j, R̄ j – average rating of all stores on attribute j, s j – standard deviation of attribute j rating.
MEASURING INDIRECT IMPORTANCE
It is useful to measure the importance of every attribute, that is, the strength of its influence on some overall measure of store performance, such as the overall satisfaction or the loyalty. There are a number of widely used methods, such as correlation and regression analysis, which provide researchers with some coefficients.
Some of the problems with correlation and regression coefficients are:
-
There are no serious theoretical grounds to consider them to be the measures of importance.
-
They do not add up to 100 per cent, that is, you cannot decompose the overall satisfaction into the effects of every attribute.
-
Very often these coefficients are very similar for several attributes, which leads to low diagnostic power of these measures of indirect importance.
Instead of these widespread approaches, we use the Shapley Value approach to the decomposition of R2 (the coefficient of determination) in a regression of repeat purchase intention on attribute ratings. In cooperative game theory, the Shapley Value allows dividing the payoff among players so that the Nash equilibrium is achieved.20 In a regression, explanatory variables correspond to players and the coefficient of determination – to the total payoff of the coalition consisting of all the players.21 The Shapley Value for a particular attribute equals to the average gain in R2, which is observed when this variable is included in the model, across all possible sequences of independent variables inclusion into the model. Therefore the Shapley Value approach is theoretically sound. In addition, the sum of Shapley Values for all attributes equals R2. That is why this approach is appropriate for the decomposition of R2 into components.
We start with ordinary least squares (OLS) regression with robust standard errors to obtain standardized regression coefficients (Table 2).
The model explains almost 79 per cent of the dependent variable's variance. Most standardized regression coefficients are close to each other and insignificantly different from zero, which makes it difficult to evaluate the contribution of each attribute to customer loyalty. One more problem is that coefficients at 4 of 13 variables are negative (however, they are insignificantly different from zero). This is probably caused by multicollinearity problem, not by the fact that these factors really have negative or zero influence on customer loyalty. Researchers often have to recode such negative coefficients to zero if they want to estimate the percentage contribution of each attribute to R2, which is a questionable approach. That is why we calculate the percentage contributions of each attribute to R2 using the Shapley Value decomposition,22 comparing it with a more common measure – Pearson correlation (Table 3).
The coefficient of variation based on all percentage contributions and the coefficient of variation based on percentage contribution of the five most important attributes clearly indicate that the Shapley Value approach has a higher diagnostic power, that is, it distinguishes better between important and unimportant attributes.
It should be noted that the availability of the product you wanted contributes 11.5 per cent to R2, whereas according to the ordinary regression it was one of the least significant factors. This difference supports the idea that care should be taken when interpreting standardized regression coefficients as importance measures.
STORES SEGMENTATION
Before applying cluster analysis, we conduct principal components analysis with varimax rotation so as to be able to segment stores based on several meaningful aspects of internet shopping. Only standardized input variables (that is, all variables except ‘Overall rating’ and ‘Would shop here again’) were included in the analysis.
According to the rule of thumb (include factors for which eigenvalues exceed 1), three factors were extracted. The first three factors explain over 77 per cent of the total variance (29.9 per cent, 25.0 per cent and 22.4 per cent, correspondingly). The correlations between the initial variables and the factor scores, as well as the interpretation of the principal components, are presented in Table 4.
The generated factor scores can be used in regression analysis to evaluate the influence of each factor on the overall satisfaction and the repeat purchase intention, but we will use them in cluster analysis to identify groups of stores with similar strengths and weaknesses.
To reveal several groups of stores based on their strengths and weaknesses, we conducted hierarchical cluster analysis based on the factors that were previously extracted using principal component analysis. As cluster analysis is an exploratory technique, we compared several methods and decided that the best solution is the three-cluster solution obtained using the centroid method, which gave segments that differ significantly with respect to mean factor scores (Table 5).
-
Cluster 1 consists of internet stores with the highest ratings for ‘Economic considerations’, relatively high ratings for ‘Internet store usability and design’ and low scores for ‘Post-order service and satisfaction with claim’.
-
Cluster 2 consists of internet stores with the highest ratings for ‘Internet store usability and design’, average ratings for ‘Post-order service and satisfaction with claims’ and the lowest ratings for ‘Economic considerations’.
-
Cluster 3 consists of internet stores with the highest ratings for ‘Post-order service and satisfaction with claims’, average ratings for ‘Economic considerations’ and the lowest ratings for ‘Internet store usability and design’.
Cluster analysis has shown that the stores’ strengths and weaknesses differ. Some stores have high website usability and offer good prices and other charges, but relatively low post-order service, despite the fact that post-order services contributes the most to customer loyalty.
STRATEGIC QUADRANT-ANALYSIS
In order to reveal strengths and weaknesses of a store and to give recommendations on high-priority improvements, we drew a chart, where one axis represents attribute importance and the other – attribute performance. The chart is divided into four quadrants with two lines corresponding to the mean values of importance and performance (Figure 1).
In our study, attribute performance is measured as the mean ‘Would shop here again’ rating and importance – as the percentage contribution to R2 is based on the Shapley Value approach.
The interpretation of each quadrant is as follows:
-
Important attributes (‘high importance – high performance’) are attributes, which are a company's assets, which should be maintained at a high level.
-
Desired attributes (‘high importance – low performance’) are attributes that should be improved first.
-
Expected attributes (‘low importance – high performance’) are attributes, which are a company's assets, but are perceived as those which go without saying.
-
Unimportant attributes (‘low importance – low performance’) are attributes, the improvement of which is not of high priority, despite relatively low marks.
We will illustrate the use of strategic quadrant analysis for the assessment of several stores, representing the three segments, previously extracted using cluster analysis. The mean factor scores of each cluster (that is, cluster centers) were given in Table 5. In order to find stores typical of each cluster, we calculated the Euclidian distances from each store i to the cluster centers using the following formulas:
where disti1 – the distance from the observation i to the center of cluster 1; disti2 – the distance from the observation i to the center of cluster 2; disti3 – the distance from the observation i to the center of cluster 3; faci1 – factor 1 (‘Post-order service and satisfaction with claims’) score for the observation i; faci2 – factor 2 (‘Internet store usability and design’) score for the observation i; and faci3 – factor 3 (‘Economic considerations’) score for the observation i.
Then we have chosen three stores, which are closest to each cluster center, so that each store represents one cluster, and drew strategic quadrant analysis charts for each of these internet stores. The strategic quadrant analysis maps for each store are presented in Figures 2, 3 and 4, respectively.
The summary of quadrant analysis is presented in Table 6. Such attributes as ‘On-time delivery’, ‘Availability of product you wanted’ are important, whereas such attributes as ‘Variety of shipping options’ and ‘Shipping charges’ are unimportant for all stores. ‘Charges stated clearly before order submission’ always belongs to the group of expected attributes.
CONCLUSION
We have listed and illustrated some common problems with correlation and regression coefficients, which are widely used as indirect importance measures in marketing analysis. Unlike them, the Shapley Value approach, which came to econometrics from game theory, is a theoretically sound method for calculating the individual contribution of service attributes to customer loyalty. This method was applied to a large sample of US internet stores. The use of store-level data instead of respondent-level data allowed us to explain why some stores have higher rates of willingness to make repeat purchases than the others.
According to Shapley Value coefficients of importance, the most important attributes (with percentage contribution exceeding 10 per cent) are clearly ‘Customer support’, ‘Product met expectations’, ‘On-time delivery’, ‘Order tracking’ and ‘Availability of product you wanted’.
It is important to highlight how a firm can use the information about the latent importance of various attributes. By confronting it with performance measures (such as loyalty or satisfaction), companies can reveal their strengths (high importance – high performance), weaknesses (high importance – low performance), attributes that the company should bring customers’ attention to (low importance – high performance) and attributes, which do not require immediate improvement (low importance – low performance).
We have conducted factor analysis on store attributes ratings followed by cluster analysis. As a result, three segments were formed, which have different strong points: segment 1 has good Post-order service and satisfaction with claims, segment 2 has the best internet store usability and design, and segment 3 consists of stores that are good from an economic point of view. Then, we illustrated the usefulness of strategic quadrant analysis by conducting it for three companies that represent three different segments of stores.
It is worth mentioning that our study is limited to modeling repeat purchase intention. Perhaps that is the reason why economic considerations appeared to be not very important: customers evaluate the prices, shipping charges and the variety of shipping options beforehand and rarely regret buying at some particular store. Therefore, in order to retain the existing customers, internet retailers do not have to worry a lot about prices and charges, but the ‘economic considerations’ factor can be important for attracting new customers. Learning the ways to increase the probability of initial purchase is one of the possible issues for further research.
References
Dholakia, R.R. and Zhao, M. (2010) Effects of online store attributes on customer satisfaction and repurchase intentions. International Journal of Retail & Distribution Management 38 (7): 482–496.
Gustafsson, A. and Johnson, M.D. (2004) Determining attribute importance in a service satisfaction model. Journal of Service Research 7 (2): 124–141.
Anderson, E. and Fornell, C. (2000) Foundations of the American customer satisfaction index. Total Quality Management & Business Excellence 11 (7): 869–882.
Sharma, S., Niedrich, R.W. and Dobbins, G. (1999) A framework for monitoring customer satisfaction: An empirical illustration. Industrial Marketing Management 28 (3): 231–243.
Posselt, T. and Gerstner, E. (2005) Pre-sale vs. Post-sale e-satisfaction: Impact on repurchase intention and overall satisfaction. Journal of Interactive Marketing 19 (4): 35–47.
Burke, R.R. (2002) Technology and the customer interface: What consumers want in the physical and virtual store. Journal of the Academy of Marketing Science 30 (4): 411–432.
Eroglu, S.A., Machleit, K.A. and Davis, L.M. (2003) Empirical testing of a model of online store atmospherics and shopper responses. Psychology & Marketing 20 (2): 139–150.
Jin, B. and Park, J.Y. (2005) The moderating effect of online purchase experience on the evaluation of online store attributes and the subsequent impact on market response outcomes. Advances in Consumer Research 33: 203–211.
Kim, K. and Kim, E.B. (2006) Suggestions to enhance the cyber store customer satisfaction. Journal of American Academy of Business 9 (1): 233–240.
Park, C.-H. and Kim, Y.-G. (2003) Identifying key factors affecting consumer purchase behavior in an online shopping context. International Journal of Retail & Distribution Management 31 (1): 16–29.
Zeithaml, V.A., Parasuraman, A. and Malhotra, A. (2002) Service quality delivery through websites: A critical review of extant knowledge. Journal of Academy of Marketing Science 30 (4): 362–375.
Szymanski, D.M. and Hise, R.T. (2000) E-satisfaction: An initial examination. Journal of Retailing 76 (3): 309–322.
Lii, Y.-S., Lim, H.J. and Tseng, L.P. (2004) The effects of web operational factors on marketing performance. Journal of American Academy of Business 5 (1): 486–494.
Weathers, D. and Makienko, I. (2006) Assessing the relationships between e-tail success and product and website factors. Journal of Interactive Marketing 20 (2): 41–53.
Wind, J. and Rangaswamy, A. (2001) Customerization: The next revolution in mass customization. Journal of Interactive Marketing 15 (1): 13–32.
Bloemer, J.M.M. and Kasper, H.D.P. (1995) The complex relationship between consumer satisfaction and brand loyalty. Journal of Economic Psychology 16 (2): 311–329.
Oliver, R.L. (1999) Whence consumer loyalty? Journal of Marketing 63 (Special Issue): 33–44.
Srinivasan, S.S., Anderson, R. and Ponnavolu, K. (2002) Customer loyalty in e-commence: An exploration of its antecedents and consequences. Journal of Retailing 78 (1): 41–50.
Fontenot, G., Henke, L., Carson, K. and Carson, P.P. (2007) Techniques for determining importance: Balancing scientific method and subjectivity. Journal of Targeting, Measurement and Analysis for Marketing 15 (3): 170–180.
Shapley, L.S. (1953) A value for n-person games. In: H.W. Kuhn and A.W. Tucker (eds.) Contributions to the Theory of Games, Vol. II (Annals of Mathematical Studies 28) Princeton, NJ: Princeton University Press, pp. 307–317.
Lipovetsky, S. and Conklin, M. (2001) Analysis of regression in game theory approach. Applied Stochastic Models in Business and Industry 17 (4): 319–330.
The Shapley Value has not become a standard procedure in most statistical packages yet. We use Stata module shapley developed by Stanislav Kolenikov. This program performs exact additive decomposition of a sample statistic (in our case – R2) by effects specified in the factor list. To perform the Shapley decomposition, the effects are eliminated one by one, and marginal effects from each exclusion are weighted in such a way that all exclusion trajectories have equal weights.
Author information
Authors and Affiliations
Corresponding author
Additional information
This study was carried out within ‘The National Research University Higher School of Economics’ Academic Fund Program in 2012–2013, research grant No. 11-01-0193’.
Rights and permissions
About this article
Cite this article
Pokryshevskaya, E., Antipov, E. The strategic analysis of online customers’ repeat purchase intentions. J Target Meas Anal Mark 20, 203–211 (2012). https://doi.org/10.1057/jt.2012.16
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1057/jt.2012.16