Abstract
We extend the usual specification of the multivariate probit model frequently used to analyze multi-category purchase incidence data by introducing interaction effects between marketing variables. Models are estimated by a Markov Chain Monte Carlo simulation method using 24,047 shopping visits made by a random sample of 1500 households in one specific grocery store over a one year period. Our data refer to a total of 25 food and non-food product categories and include socio-demographic household attributes in addition to purchases and marketing variables. Information criteria agree on the superiority of the extended specification. Estimation results demonstrate that many interaction effects are erroneously attributed to the main effects of marketing variables if one applies the usual specification instead. We derive managerial implications with respect to sales revenue by stochastic simulation. If managers base decisions on the usual specification in spite of its worse statistical performance, they run the risk to overestimate sales revenue increases due to sales promotion activities.
Similar content being viewed by others
Notes
We thank one reviewer for drawing our attention to this paper.
We thank one reviewer for commenting on this point.
Duvvuri et al. (2007) do not give any information on how model performance changes due to error correlations.
We thank one reviewer who asked us to elaborate on these as a rule asymmetric effects and on the interpretation of error correlations.
We thank a reviewer for commenting on this point.
References
Ainslie A, Rossi PE (1998) Similarities in choice behavior across product categories. Market Sci 17:91–106
Albert J, Chib S (1993) Bayesian analysis of binary and polychotomous response data. J Am Stat Assoc 88:669–679
Allenby GM, Ginter JL (1995) The effects of in-store displays and feature advertising on consideration sets. Int J Res Market 12:67–80
Allenby GM, Lenk PJ (1995) Reassessing brand loyalty, price sensitivity, and merchandising effects on consumer brand choice. J Bus and Ec Stat 13:281–290
Andrews RL, Currim IS (2002) Identifying segments with identical choice behaviors across product categories: an intercategory logit mixture model. Int J Res Market 19:65–79
Banerjee S, Yandell BS, Yi NJ (2008) Bayesian quantitative trait loci mapping for multiple traits. Gen 4:2275–2289
Barnard J, McCulloch R, Meng XL (2000) Modeling covariance matrices in terms of standard deviations and correlations with application to shrinkage. Stat Sin 10:1281–1311
Bell DR, Chiang J, Padmanabhan V (1999) The decomposition of promotional response: An empirical generalization. Market Sci 18:504–526
Bell DR, Corsten D, Knox G (2011) From point of purchase to path to purchase: how preshopping factors drive unplanned buying. J Market 75:3145
Boztug Y, Hildebrandt L (2008) Modeling joint purchases with a multivariate MNL approach. Schmalenbach Bus Rev 60:400–422
Boztug Y, Reutterer T (2008) A combined approach for segment-specific market basket analysis. Eur J Oper Res 187:294–312
Boztug Y, Silberhorn N (2006) Modellierungsansätze in der Warenkorbanalyse im Überblick. J Betr Wirtsch 56:105–128
Bronnenberg BJ, Kruger MW, Mela CF (2008) The IRI marketing data set. Market Sci 27:745–748
Cameron AC, Trivedi PK (2005) Microeconometrics. Cambridge University Press, New York
Chib S, Greenberg E (1998) Bayesian analysis of multivariate probit models. Biom 85:347–361
Chib S, Seetharaman PB, Strijnev A (2002) Analysis of multi-category purchase incidence decisions using IRI market basket data. In: Franses PH, Montgomery AL (eds) Econometric models in marketing. JAI, Amsterdam, pp 57–92
DeNavas-Walt C, Cleveland RW (2002) Money Income in the United States: 2001. Government Printing Office, Washington, DC
Dippold K, Hruschka H (2013) A model of heterogeneous multicategory choice for market basket analysis. Rev Market Sci 11:1–31
Duvvuri SD, Ansari V, Gupta S (2007) Consumers’ price sensitivities across complementary categories. Manag Sci 53:1933–1945
Greene (2003) Econometric analysis, 5th edn. Prentice Hall, N.J
Gupta S (1988) Impact of sales promotions on when, what, and how much to buy. J Market Res 25:342–355
Kruger MW, Pagni D (2008) IRI academic data set description. Information Resources Incorporated, Chicago
Liu X, Daniels MJ (2006) A new algorithm for simulating a correlation matrix based on parameter expansion and reparameterization. J Comp Graph Stat 15:897–914
Manchanda P, Ansari A, Gupta S (1999) The “shopping basket” : a model for multi-category purchase incidence decisions. Market Sci 18:95–114
Nielsen (2010) Six trade promotion tips. Why less can be more. http://www.nielsen.com/us/en/newswire/2010/six-trade-promotion-tips-why-less-can-be-more. Accessed 19 March 2014
PE (1998) Similarities in choice behavior across product categories. Market Sci 17:91–106
Papatla P (1995) A dynamic model of the advertising-price sensitivity relationship for heterogeneous consumers. Journal Bus Res 33:261–271
Russell GJ, Petersen A (2000) Analysis of cross category dependence in market basket selection. J Retail 76:369–392
Seetharaman PB, Siddhartha C, Ainslie A, Boatwright P, Chan T, Gupta S, Mehta N, Rao V, Strijnev A (2005) Models of multi-category choice behavior. Market Lett 16:239–254
Song I, Chintagunta PK (2007) A discrete-continuous model for multi-category purchase behavior of households. J Market Res 44:595–612
SymphonyIRI Group (2013) Merchandising trends: supporting the value proposition. Chicago, IL
Zellner A (1971) An introduction to Bayesian inference in econometrics. Wiley, New York
Acknowledgments
We thank two reviewers and Thomas Reutterer for comments on an earlier version which besides being insightful really helped to improve the article.
Author information
Authors and Affiliations
Corresponding author
Additional information
An erratum to this article is available at http://dx.doi.org/10.1007/s11573-016-0841-5.
Appendix: Overview of MCMC simulation
Appendix: Overview of MCMC simulation
Estimation of the MVP model is based on the Bayesian analysis of Zellner (1971) for the seemingly unrelated regression (SUR) model and the sampling of latent variables due to Albert and Chib (1993) as well as Chib and Greenberg (1998). Sampling of the correlation matrix of latent residuals in steps 3–5 draws upon Liu and Daniels (2006). Sampling of individual coefficients works similar to the approach developed by Banerjee et al. (2008) for the SUR model.
Stochastic utilities are multivariate normally distributed with mean vector \(V_{it}\) containing deterministic utilities \(V_{1it}, \ldots V_{Jit},\) and covariance matrix \(\Sigma\). Therefore the conditional distribution of the stochastic utility of category j given the remaining \(J-1\) categories is univariate normal with conditional mean \(V_{jit|-j}\) and conditional variance \(\sigma _{-j}\) as parameters:
\(V_{-kit}\) corresponds to vector \(V_{it}\) without element k. \(\sigma _{-j}\) denotes the vector containing the residual covariances of category j with the other \(J-1\) categories. \(\Sigma _{-j}\) corresponds to \(\Sigma\) without row and column j.
\(\mathcal{N}(\mu , \sigma )\) denotes the univariate normal distribution with expectation \(\mu\) and variance \(\sigma\). \(\mathcal{TN}^{+}(\mu, \sigma )\) and \(\mathcal{TN}^{-}(\mu, \sigma )\) denote univariate normal distributions truncated to positive and negative values, respectively. \(\mathcal{IW} (\nu,A)\) denotes the inverse Wishart distribution with degrees of freedom \(\nu\) and symmetric, nonsingular matrix A.
Coefficients including category constants are collected in a (P, J) matrix \(\beta\) with \(P=1+S+M\) and \(P=1+S+M+0.5 \, M \, (M-1)\) for the specifications main and inter, respectively. S and M denote the number of socio-demographic and marketing variables. T symbolizes the total number of baskets across all households. (T, P) matrix X holds the P predictors, (T, J) matrix V the stochastic utilities of all baskets and categories.
Latent residuals are defined as differences between stochastic and deterministic utilities \(e_{jit} = U_{jit} - V_{jit}\). The residual vector of household i for basket t is \(e_{it} = (e_{1it}, e_{2i1t},\ldots, e_{Jit})^{'}\).
Initially all coefficients are set to zero and \(\Sigma\) to a diagonal correlation matrix. Each coefficient has the prior \(\mathcal N(0,\underline{v})\) with \(\underline{v}= 4\). Estimation steps of each iteration are as follows:
-
1.
For all categories \(j=1,\ldots,J\), households \(i=1,\ldots,I\) and their baskets \(t=1,\ldots,T_i\)
$$U_{jit} \sim \left\{ \begin{array}{ll} {\mathcal{TN}}^{+}(V_{jit|-j}, \sigma _{-k}) & \text{ if } \quad y_{jit}=1 \\ {\mathcal{TN}}^{-}(V_{jit|-j}, \sigma _{-k}) & \text{ else } \end{array} \right.$$ -
2.
Compute matrix of cross products of latent residuals across households \(A= \sum _{i=1}^{I} \sum _{t=1}^{T_i} e_{it} e_{it}^{'}\).
-
3.
Sample covariance matrix \(\Sigma\) from \(\mathcal IW (T-J-1,A)\).
-
4.
Compute proposal correlation matrix \(\tilde{R} = D \Sigma D\) where D is a (J, J) diagonal matrix with elements \(d_{j,j} = \sigma _{k,k}^{-1/2}\) (\(\sigma _{k,k}\) denotes the element of \(\Sigma\) in row and column k).
-
5.
Accept \(\tilde{R}\) with probability \(min(1, \exp {(0.5 (J+1) (\log {|\tilde{R}|}-\log {|\Sigma _{0}|})}))\) with \(\Sigma _{0}\) as correlation matrix of the previous iteration.
-
6.
Set \(\Sigma = R\) and compute its inverse \(H=\Sigma ^{-1}\)
-
7.
Sample coefficients \(\beta _{pj}\) for predictors \(p = 1,\dots,P\) and categories \(j =1, \dots,J\):
$$\begin{aligned} \beta _{pj} \sim \mathcal N (m,v) \quad \text{ with } \quad v = 1/(1/\underline{v} + h_{jj} x_p' x_p) \quad \text{ and } \quad m = h_j' (V - X \beta _0)' \, x_p \, v \\ \end{aligned}$$\(h_{jj}\) is the element in row j and column j of H. \(h_j\) and \(x_p\) denote columns j and p of H and X, respectively. \(\beta _0\) equals \(\beta\) except for a zero value in row p and column j.
Rights and permissions
About this article
Cite this article
Hruschka, H. Analyzing the dependences of multi-category purchases on interactions of marketing variables. J Bus Econ 87, 295–313 (2017). https://doi.org/10.1007/s11573-016-0820-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11573-016-0820-x