The strategic analysis of online customers’ repeat purchase intentions

Pokryshevskaya, Elena B; Antipov, Evgeny A

doi:10.1057/jt.2012.16

The strategic analysis of online customers’ repeat purchase intentions

Original Article
Published: 22 October 2012

Volume 20, pages 203–211, (2012)
Cite this article

Download PDF

Journal of Targeting, Measurement and Analysis for Marketing

The strategic analysis of online customers’ repeat purchase intentions

Download PDF

Elena B Pokryshevskaya &
Evgeny A Antipov¹

4818 Accesses
7 Citations
Explore all metrics

Abstract

Measuring indirect importance of various attributes is a very common task in marketing analysis for which researchers use correlation and regression techniques. We have listed and illustrated some common problems with widely used latent importance measures. A more theoretically sound approach – the Shapley Value decomposition – was applied to a rich data set of US internet stores. The use of store-level data instead of respondent-level data allowed us to reveal the factors, which are powerful in explaining, why some stores have higher rates of willingness to make repeat purchases than the others. By confronting the indirect importance and performance measures for three different internet stores, we have revealed strengths, weaknesses, attributes that the company should bring customers' attention to and attributes that do not require immediate improvement.

Online influencer marketing

Article 08 January 2022

Fine F. Leung, Flora F. Gu & Robert W. Palmatier

Customer experience: fundamental premises and implications for research

Article Open access 13 January 2020

Larissa Becker & Elina Jaakkola

Impulse buying: a meta-analytic review

Article Open access 09 July 2019

Gopalkrishnan R. Iyer, Markus Blut, … Dhruv Grewal

INTRODUCTION

Nowadays, internet forums and special web services give customers an opportunity to express their opinion about the quality of goods and services sold online. The data containing information on satisfaction with various attributes of internet stores are very valuable for researchers and marketers.^{1, 2} E-tailers gradually realize that they can reduce costs by increasing loyalty, because it is cheaper to retain existing customers than to attract new ones. Therefore, revealing which attributes have a stronger influence on loyalty and satisfaction of internet shoppers can help e-tailers to become more competitive by concentrating on the most important areas.

Researchers studied the determinants of customer loyalty and satisfaction in various markets.^{3, 4, 5} Unlike traditional retailing, e-commerce only starts to attract the attention of researchers.^{6, 7, 8, 9, 10} Marketing scholars try to determine the characteristics that influence customer satisfaction and loyalty,^{11, 12} as well as characteristics, influencing the effectiveness of marketing activity and internet stores’ success.^{13, 14} Compared with offline stores, internet stores offer more opportunities for interactive and personalized marketing.¹⁵ It is easier for online buyers to compare alternatives, especially in the case of functional products and services. The fact that operations are available at the click of the mouse raises concerns about customer loyalty: if customers can easily compare their purchasing opportunities, they may be satisfied with the service, but may easily switch between stores in search of a cheaper store.

Satisfaction and loyalty measures are not perfect substitutes:^{16, 17} a consumer can be loyal without being fully satisfied (for example, in the case of few alternative stores) or she can be satisfied, but not loyal (for example, in the case of a large number of alternatives). In our study, we adhere to the definition of loyalty given by Srinivasan et al,¹⁸ according to which customer loyalty of a particular online store is expressed in their willingness to make repeat purchases.

Most customer satisfaction and loyalty studies analyze a single company, using proprietary survey data. Instead, we gather publicly available data on a sample of internet stores, which give us an opportunity to answer the following questions:

1
Why some internet stores are more successful than the others in terms of repeat purchase intentions, that is, which attributes have the highest indirect importance for online shoppers?

Fontenot et al¹⁹ argue that the declared importance (stated by respondents directly) gives a weaker differentiation of attribute importance than indirect importance, obtained by quantifying the influence of attribute ratings on some overall satisfaction or loyalty measure. If a researcher comes to the conclusion that attributes have almost indistinguishable importance for respondents, such a result has little practical value and is probably caused by the deficiencies of the approach to measuring importance. In this article, we give details of the Shapley Value approach to measuring indirect attribute importance. This method is said to be used by some market research companies and in the field of labor economics, but has not been widely used in academic marketing papers.

1
How can an internet store combine data on its performance for every attribute and the importance of an attribute to conduct ‘performance-importance’ strategic quadrant-analysis of its customers’ satisfaction?

DATA DESCRIPTION

The data were collected in September 2010 from www.Bizrate.com. The data set consists of average grades given by customers to 804 non-food internet stores. All grades are on a 10-point scale with 10 being excellent. We use a relatively strict requirement for the sample size, based on which the average grade is calculated – no less than 250 respondents during the last 3 months. This ensures that the average grade reflects the opinion of a reasonable number of people, not just a handful of customers.

A descriptive analysis of the sample is presented in Table 1. The coefficient of variation (the ratio of the standard deviation to the mean, multiplied by 100) indicates that grades for all attributes are rather homogeneous. Shipping charges ratings are the least homogeneous (the coefficient of variation is 11.4 per cent). It is worth mentioning that ‘Overall rating’ and ‘Would shop here again’ have very similar distribution and they are strongly correlated (Pearson correlation coefficient=0.937). That is why further we use only ‘Would shop here again’ as the dependent variable.

Table 1 Descriptive statistics

Full size table

Before conducting the multivariate analysis, we did z-standardization of all attribute ratings (as a result, all the variables have mean=0 and standard deviation=1): Z_ij=((R_ij−R¯_j)/s_j), where R_ij – store i rating on attribute j, R̄_j – average rating of all stores on attribute j, s_j – standard deviation of attribute j rating.

MEASURING INDIRECT IMPORTANCE

It is useful to measure the importance of every attribute, that is, the strength of its influence on some overall measure of store performance, such as the overall satisfaction or the loyalty. There are a number of widely used methods, such as correlation and regression analysis, which provide researchers with some coefficients.

Some of the problems with correlation and regression coefficients are:

There are no serious theoretical grounds to consider them to be the measures of importance.
They do not add up to 100 per cent, that is, you cannot decompose the overall satisfaction into the effects of every attribute.
Very often these coefficients are very similar for several attributes, which leads to low diagnostic power of these measures of indirect importance.

Instead of these widespread approaches, we use the Shapley Value approach to the decomposition of R² (the coefficient of determination) in a regression of repeat purchase intention on attribute ratings. In cooperative game theory, the Shapley Value allows dividing the payoff among players so that the Nash equilibrium is achieved.²⁰ In a regression, explanatory variables correspond to players and the coefficient of determination – to the total payoff of the coalition consisting of all the players.²¹ The Shapley Value for a particular attribute equals to the average gain in R², which is observed when this variable is included in the model, across all possible sequences of independent variables inclusion into the model. Therefore the Shapley Value approach is theoretically sound. In addition, the sum of Shapley Values for all attributes equals R². That is why this approach is appropriate for the decomposition of R² into components.

We start with ordinary least squares (OLS) regression with robust standard errors to obtain standardized regression coefficients (Table 2).

Table 2 Standardized regression coefficients

Full size table

The model explains almost 79 per cent of the dependent variable's variance. Most standardized regression coefficients are close to each other and insignificantly different from zero, which makes it difficult to evaluate the contribution of each attribute to customer loyalty. One more problem is that coefficients at 4 of 13 variables are negative (however, they are insignificantly different from zero). This is probably caused by multicollinearity problem, not by the fact that these factors really have negative or zero influence on customer loyalty. Researchers often have to recode such negative coefficients to zero if they want to estimate the percentage contribution of each attribute to R², which is a questionable approach. That is why we calculate the percentage contributions of each attribute to R² using the Shapley Value decomposition,²² comparing it with a more common measure – Pearson correlation (Table 3).

Table 3 Attribute importance based on Pearson correlations and Shapley Values

Full size table

The coefficient of variation based on all percentage contributions and the coefficient of variation based on percentage contribution of the five most important attributes clearly indicate that the Shapley Value approach has a higher diagnostic power, that is, it distinguishes better between important and unimportant attributes.

It should be noted that the availability of the product you wanted contributes 11.5 per cent to R², whereas according to the ordinary regression it was one of the least significant factors. This difference supports the idea that care should be taken when interpreting standardized regression coefficients as importance measures.

STORES SEGMENTATION

Before applying cluster analysis, we conduct principal components analysis with varimax rotation so as to be able to segment stores based on several meaningful aspects of internet shopping. Only standardized input variables (that is, all variables except ‘Overall rating’ and ‘Would shop here again’) were included in the analysis.

According to the rule of thumb (include factors for which eigenvalues exceed 1), three factors were extracted. The first three factors explain over 77 per cent of the total variance (29.9 per cent, 25.0 per cent and 22.4 per cent, correspondingly). The correlations between the initial variables and the factor scores, as well as the interpretation of the principal components, are presented in Table 4.

Table 4 Rotated component matrix

Full size table

The generated factor scores can be used in regression analysis to evaluate the influence of each factor on the overall satisfaction and the repeat purchase intention, but we will use them in cluster analysis to identify groups of stores with similar strengths and weaknesses.

To reveal several groups of stores based on their strengths and weaknesses, we conducted hierarchical cluster analysis based on the factors that were previously extracted using principal component analysis. As cluster analysis is an exploratory technique, we compared several methods and decided that the best solution is the three-cluster solution obtained using the centroid method, which gave segments that differ significantly with respect to mean factor scores (Table 5).

Cluster 1 consists of internet stores with the highest ratings for ‘Economic considerations’, relatively high ratings for ‘Internet store usability and design’ and low scores for ‘Post-order service and satisfaction with claim’.
Cluster 2 consists of internet stores with the highest ratings for ‘Internet store usability and design’, average ratings for ‘Post-order service and satisfaction with claims’ and the lowest ratings for ‘Economic considerations’.
Cluster 3 consists of internet stores with the highest ratings for ‘Post-order service and satisfaction with claims’, average ratings for ‘Economic considerations’ and the lowest ratings for ‘Internet store usability and design’.

Table 5 Mean factor scores within clusters

Full size table

Cluster analysis has shown that the stores’ strengths and weaknesses differ. Some stores have high website usability and offer good prices and other charges, but relatively low post-order service, despite the fact that post-order services contributes the most to customer loyalty.

STRATEGIC QUADRANT-ANALYSIS

In order to reveal strengths and weaknesses of a store and to give recommendations on high-priority improvements, we drew a chart, where one axis represents attribute importance and the other – attribute performance. The chart is divided into four quadrants with two lines corresponding to the mean values of importance and performance (Figure 1).

In our study, attribute performance is measured as the mean ‘Would shop here again’ rating and importance – as the percentage contribution to R² is based on the Shapley Value approach.

The interpretation of each quadrant is as follows:

Important attributes (‘high importance – high performance’) are attributes, which are a company's assets, which should be maintained at a high level.
Desired attributes (‘high importance – low performance’) are attributes that should be improved first.
Expected attributes (‘low importance – high performance’) are attributes, which are a company's assets, but are perceived as those which go without saying.
Unimportant attributes (‘low importance – low performance’) are attributes, the improvement of which is not of high priority, despite relatively low marks.

We will illustrate the use of strategic quadrant analysis for the assessment of several stores, representing the three segments, previously extracted using cluster analysis. The mean factor scores of each cluster (that is, cluster centers) were given in Table 5. In order to find stores typical of each cluster, we calculated the Euclidian distances from each store i to the cluster centers using the following formulas:

where dist_i1 – the distance from the observation i to the center of cluster 1; dist_i2 – the distance from the observation i to the center of cluster 2; dist_i3 – the distance from the observation i to the center of cluster 3; fac_i1 – factor 1 (‘Post-order service and satisfaction with claims’) score for the observation i; fac_i2 – factor 2 (‘Internet store usability and design’) score for the observation i; and fac_i3 – factor 3 (‘Economic considerations’) score for the observation i.

Then we have chosen three stores, which are closest to each cluster center, so that each store represents one cluster, and drew strategic quadrant analysis charts for each of these internet stores. The strategic quadrant analysis maps for each store are presented in Figures 2, 3 and 4, respectively.

The summary of quadrant analysis is presented in Table 6. Such attributes as ‘On-time delivery’, ‘Availability of product you wanted’ are important, whereas such attributes as ‘Variety of shipping options’ and ‘Shipping charges’ are unimportant for all stores. ‘Charges stated clearly before order submission’ always belongs to the group of expected attributes.

Table 6 Quadrant-analysis summary

Full size table

CONCLUSION

We have listed and illustrated some common problems with correlation and regression coefficients, which are widely used as indirect importance measures in marketing analysis. Unlike them, the Shapley Value approach, which came to econometrics from game theory, is a theoretically sound method for calculating the individual contribution of service attributes to customer loyalty. This method was applied to a large sample of US internet stores. The use of store-level data instead of respondent-level data allowed us to explain why some stores have higher rates of willingness to make repeat purchases than the others.

According to Shapley Value coefficients of importance, the most important attributes (with percentage contribution exceeding 10 per cent) are clearly ‘Customer support’, ‘Product met expectations’, ‘On-time delivery’, ‘Order tracking’ and ‘Availability of product you wanted’.

It is important to highlight how a firm can use the information about the latent importance of various attributes. By confronting it with performance measures (such as loyalty or satisfaction), companies can reveal their strengths (high importance – high performance), weaknesses (high importance – low performance), attributes that the company should bring customers’ attention to (low importance – high performance) and attributes, which do not require immediate improvement (low importance – low performance).

We have conducted factor analysis on store attributes ratings followed by cluster analysis. As a result, three segments were formed, which have different strong points: segment 1 has good Post-order service and satisfaction with claims, segment 2 has the best internet store usability and design, and segment 3 consists of stores that are good from an economic point of view. Then, we illustrated the usefulness of strategic quadrant analysis by conducting it for three companies that represent three different segments of stores.

It is worth mentioning that our study is limited to modeling repeat purchase intention. Perhaps that is the reason why economic considerations appeared to be not very important: customers evaluate the prices, shipping charges and the variety of shipping options beforehand and rarely regret buying at some particular store. Therefore, in order to retain the existing customers, internet retailers do not have to worry a lot about prices and charges, but the ‘economic considerations’ factor can be important for attracting new customers. Learning the ways to increase the probability of initial purchase is one of the possible issues for further research.

References

Dholakia, R.R. and Zhao, M. (2010) Effects of online store attributes on customer satisfaction and repurchase intentions. International Journal of Retail & Distribution Management 38 (7): 482–496.
Article Google Scholar
Gustafsson, A. and Johnson, M.D. (2004) Determining attribute importance in a service satisfaction model. Journal of Service Research 7 (2): 124–141.
Article Google Scholar
Anderson, E. and Fornell, C. (2000) Foundations of the American customer satisfaction index. Total Quality Management & Business Excellence 11 (7): 869–882.
Article Google Scholar
Sharma, S., Niedrich, R.W. and Dobbins, G. (1999) A framework for monitoring customer satisfaction: An empirical illustration. Industrial Marketing Management 28 (3): 231–243.
Article Google Scholar
Posselt, T. and Gerstner, E. (2005) Pre-sale vs. Post-sale e-satisfaction: Impact on repurchase intention and overall satisfaction. Journal of Interactive Marketing 19 (4): 35–47.
Article Google Scholar
Burke, R.R. (2002) Technology and the customer interface: What consumers want in the physical and virtual store. Journal of the Academy of Marketing Science 30 (4): 411–432.
Article Google Scholar
Eroglu, S.A., Machleit, K.A. and Davis, L.M. (2003) Empirical testing of a model of online store atmospherics and shopper responses. Psychology & Marketing 20 (2): 139–150.
Article Google Scholar
Jin, B. and Park, J.Y. (2005) The moderating effect of online purchase experience on the evaluation of online store attributes and the subsequent impact on market response outcomes. Advances in Consumer Research 33: 203–211.
Google Scholar
Kim, K. and Kim, E.B. (2006) Suggestions to enhance the cyber store customer satisfaction. Journal of American Academy of Business 9 (1): 233–240.
Google Scholar
Park, C.-H. and Kim, Y.-G. (2003) Identifying key factors affecting consumer purchase behavior in an online shopping context. International Journal of Retail & Distribution Management 31 (1): 16–29.
Article Google Scholar
Zeithaml, V.A., Parasuraman, A. and Malhotra, A. (2002) Service quality delivery through websites: A critical review of extant knowledge. Journal of Academy of Marketing Science 30 (4): 362–375.
Article Google Scholar
Szymanski, D.M. and Hise, R.T. (2000) E-satisfaction: An initial examination. Journal of Retailing 76 (3): 309–322.
Article Google Scholar
Lii, Y.-S., Lim, H.J. and Tseng, L.P. (2004) The effects of web operational factors on marketing performance. Journal of American Academy of Business 5 (1): 486–494.
Article Google Scholar
Weathers, D. and Makienko, I. (2006) Assessing the relationships between e-tail success and product and website factors. Journal of Interactive Marketing 20 (2): 41–53.
Article Google Scholar
Wind, J. and Rangaswamy, A. (2001) Customerization: The next revolution in mass customization. Journal of Interactive Marketing 15 (1): 13–32.
Article Google Scholar
Bloemer, J.M.M. and Kasper, H.D.P. (1995) The complex relationship between consumer satisfaction and brand loyalty. Journal of Economic Psychology 16 (2): 311–329.
Article Google Scholar
Oliver, R.L. (1999) Whence consumer loyalty? Journal of Marketing 63 (Special Issue): 33–44.
Article Google Scholar
Srinivasan, S.S., Anderson, R. and Ponnavolu, K. (2002) Customer loyalty in e-commence: An exploration of its antecedents and consequences. Journal of Retailing 78 (1): 41–50.
Article Google Scholar
Fontenot, G., Henke, L., Carson, K. and Carson, P.P. (2007) Techniques for determining importance: Balancing scientific method and subjectivity. Journal of Targeting, Measurement and Analysis for Marketing 15 (3): 170–180.
Article Google Scholar
Shapley, L.S. (1953) A value for n-person games. In: H.W. Kuhn and A.W. Tucker (eds.) Contributions to the Theory of Games, Vol. II (Annals of Mathematical Studies 28) Princeton, NJ: Princeton University Press, pp. 307–317.
Google Scholar
Lipovetsky, S. and Conklin, M. (2001) Analysis of regression in game theory approach. Applied Stochastic Models in Business and Industry 17 (4): 319–330.
Article Google Scholar
The Shapley Value has not become a standard procedure in most statistical packages yet. We use Stata module shapley developed by Stanislav Kolenikov. This program performs exact additive decomposition of a sample statistic (in our case – R²) by effects specified in the factor list. To perform the Shapley decomposition, the effects are eliminated one by one, and marginal effects from each exclusion are weighted in such a way that all exclusion trajectories have equal weights.

Download references

Author information

Authors and Affiliations

Prospekt Aviakonstruktorov, d. 20, kv. 90, Saint-Petersburg, 197373, Russia
Evgeny A Antipov

Authors

Elena B Pokryshevskaya
View author publications
You can also search for this author in PubMed Google Scholar
Evgeny A Antipov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Evgeny A Antipov.

Additional information

This study was carried out within ‘The National Research University Higher School of Economics’ Academic Fund Program in 2012–2013, research grant No. 11-01-0193’.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pokryshevskaya, E., Antipov, E. The strategic analysis of online customers’ repeat purchase intentions. J Target Meas Anal Mark 20, 203–211 (2012). https://doi.org/10.1057/jt.2012.16

Download citation

Received: 14 September 2012
Revised: 14 September 2012
Published: 22 October 2012
Issue Date: 01 September 2012
DOI: https://doi.org/10.1057/jt.2012.16

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The strategic analysis of online customers’ repeat purchase intentions

Abstract