Skip to main content
Log in

Analyzing the dependences of multi-category purchases on interactions of marketing variables

  • Original Paper
  • Published:
Journal of Business Economics Aims and scope Submit manuscript

An Erratum to this article was published on 25 November 2016

Abstract

We extend the usual specification of the multivariate probit model frequently used to analyze multi-category purchase incidence data by introducing interaction effects between marketing variables. Models are estimated by a Markov Chain Monte Carlo simulation method using 24,047 shopping visits made by a random sample of 1500 households in one specific grocery store over a one year period. Our data refer to a total of 25 food and non-food product categories and include socio-demographic household attributes in addition to purchases and marketing variables. Information criteria agree on the superiority of the extended specification. Estimation results demonstrate that many interaction effects are erroneously attributed to the main effects of marketing variables if one applies the usual specification instead. We derive managerial implications with respect to sales revenue by stochastic simulation. If managers base decisions on the usual specification in spite of its worse statistical performance, they run the risk to overestimate sales revenue increases due to sales promotion activities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. We thank one reviewer for drawing our attention to this paper.

  2. We thank one reviewer for commenting on this point.

  3. Duvvuri et al. (2007) do not give any information on how model performance changes due to error correlations.

  4. We thank one reviewer who asked us to elaborate on these as a rule asymmetric effects and on the interpretation of error correlations.

  5. We thank a reviewer for commenting on this point.

References

  • Ainslie A, Rossi PE (1998) Similarities in choice behavior across product categories. Market Sci 17:91–106

  • Albert J, Chib S (1993) Bayesian analysis of binary and polychotomous response data. J Am Stat Assoc 88:669–679

  • Allenby GM, Ginter JL (1995) The effects of in-store displays and feature advertising on consideration sets. Int J Res Market 12:67–80

    Article  Google Scholar 

  • Allenby GM, Lenk PJ (1995) Reassessing brand loyalty, price sensitivity, and merchandising effects on consumer brand choice. J Bus and Ec Stat 13:281–290

    Google Scholar 

  • Andrews RL, Currim IS (2002) Identifying segments with identical choice behaviors across product categories: an intercategory logit mixture model. Int J Res Market 19:65–79

    Article  Google Scholar 

  • Banerjee S, Yandell BS, Yi NJ (2008) Bayesian quantitative trait loci mapping for multiple traits. Gen 4:2275–2289

    Google Scholar 

  • Barnard J, McCulloch R, Meng XL (2000) Modeling covariance matrices in terms of standard deviations and correlations with application to shrinkage. Stat Sin 10:1281–1311

    Google Scholar 

  • Bell DR, Chiang J, Padmanabhan V (1999) The decomposition of promotional response: An empirical generalization. Market Sci 18:504–526

    Article  Google Scholar 

  • Bell DR, Corsten D, Knox G (2011) From point of purchase to path to purchase: how preshopping factors drive unplanned buying. J Market 75:3145

    Article  Google Scholar 

  • Boztug Y, Hildebrandt L (2008) Modeling joint purchases with a multivariate MNL approach. Schmalenbach Bus Rev 60:400–422

  • Boztug Y, Reutterer T (2008) A combined approach for segment-specific market basket analysis. Eur J Oper Res 187:294–312

  • Boztug Y, Silberhorn N (2006) Modellierungsansätze in der Warenkorbanalyse im Überblick. J Betr Wirtsch 56:105–128

  • Bronnenberg BJ, Kruger MW, Mela CF (2008) The IRI marketing data set. Market Sci 27:745–748

    Article  Google Scholar 

  • Cameron AC, Trivedi PK (2005) Microeconometrics. Cambridge University Press, New York

    Book  Google Scholar 

  • Chib S, Greenberg E (1998) Bayesian analysis of multivariate probit models. Biom 85:347–361

    Google Scholar 

  • Chib S, Seetharaman PB, Strijnev A (2002) Analysis of multi-category purchase incidence decisions using IRI market basket data. In: Franses PH, Montgomery AL (eds) Econometric models in marketing. JAI, Amsterdam, pp 57–92

    Google Scholar 

  • DeNavas-Walt C, Cleveland RW (2002) Money Income in the United States: 2001. Government Printing Office, Washington, DC

    Google Scholar 

  • Dippold K, Hruschka H (2013) A model of heterogeneous multicategory choice for market basket analysis. Rev Market Sci 11:1–31

    Article  Google Scholar 

  • Duvvuri SD, Ansari V, Gupta S (2007) Consumers’ price sensitivities across complementary categories. Manag Sci 53:1933–1945

    Article  Google Scholar 

  • Greene (2003) Econometric analysis, 5th edn. Prentice Hall, N.J

  • Gupta S (1988) Impact of sales promotions on when, what, and how much to buy. J Market Res 25:342–355

    Article  Google Scholar 

  • Kruger MW, Pagni D (2008) IRI academic data set description. Information Resources Incorporated, Chicago

    Google Scholar 

  • Liu X, Daniels MJ (2006) A new algorithm for simulating a correlation matrix based on parameter expansion and reparameterization. J Comp Graph Stat 15:897–914

    Article  Google Scholar 

  • Manchanda P, Ansari A, Gupta S (1999) The “shopping basket” : a model for multi-category purchase incidence decisions. Market Sci 18:95–114

  • Nielsen (2010) Six trade promotion tips. Why less can be more. http://www.nielsen.com/us/en/newswire/2010/six-trade-promotion-tips-why-less-can-be-more. Accessed 19 March 2014

  • PE (1998) Similarities in choice behavior across product categories. Market Sci 17:91–106

  • Papatla P (1995) A dynamic model of the advertising-price sensitivity relationship for heterogeneous consumers. Journal Bus Res 33:261–271

    Article  Google Scholar 

  • Russell GJ, Petersen A (2000) Analysis of cross category dependence in market basket selection. J Retail 76:369–392

    Article  Google Scholar 

  • Seetharaman PB, Siddhartha C, Ainslie A, Boatwright P, Chan T, Gupta S, Mehta N, Rao V, Strijnev A (2005) Models of multi-category choice behavior. Market Lett 16:239–254

    Article  Google Scholar 

  • Song I, Chintagunta PK (2007) A discrete-continuous model for multi-category purchase behavior of households. J Market Res 44:595–612

    Article  Google Scholar 

  • SymphonyIRI Group (2013) Merchandising trends: supporting the value proposition. Chicago, IL

  • Zellner A (1971) An introduction to Bayesian inference in econometrics. Wiley, New York

    Google Scholar 

Download references

Acknowledgments

We thank two reviewers and Thomas Reutterer for comments on an earlier version which besides being insightful really helped to improve the article.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Harald Hruschka.

Additional information

An erratum to this article is available at http://dx.doi.org/10.1007/s11573-016-0841-5.

Appendix: Overview of MCMC simulation

Appendix: Overview of MCMC simulation

Estimation of the MVP model is based on the Bayesian analysis of Zellner (1971) for the seemingly unrelated regression (SUR) model and the sampling of latent variables due to Albert and Chib (1993) as well as Chib and Greenberg (1998). Sampling of the correlation matrix of latent residuals in steps 3–5 draws upon Liu and Daniels (2006). Sampling of individual coefficients works similar to the approach developed by Banerjee et al. (2008) for the SUR model.

Stochastic utilities are multivariate normally distributed with mean vector \(V_{it}\) containing deterministic utilities \(V_{1it}, \ldots V_{Jit},\) and covariance matrix \(\Sigma\). Therefore the conditional distribution of the stochastic utility of category j given the remaining \(J-1\) categories is univariate normal with conditional mean \(V_{jit|-j}\) and conditional variance \(\sigma _{-j}\) as parameters:

$$\begin{aligned} V_{jit|-j}= & V_{jit} + \sigma_{s-k} {\mathop \Sigma \limits_{s-k}^{-1}} (V_{-kit} - \mu _{-jit}),\nonumber \\ \sigma _{-j}= & {} \sigma _{j,j} - \sigma _{-j} {\mathop \Sigma \limits _{-j}^{-1}} \sigma _{-j}^{'} \end{aligned}$$
(7)

\(V_{-kit}\) corresponds to vector \(V_{it}\) without element k. \(\sigma _{-j}\) denotes the vector containing the residual covariances of category j with the other \(J-1\) categories. \(\Sigma _{-j}\) corresponds to \(\Sigma\) without row and column j.

\(\mathcal{N}(\mu , \sigma )\) denotes the univariate normal distribution with expectation \(\mu\) and variance \(\sigma\). \(\mathcal{TN}^{+}(\mu, \sigma )\) and \(\mathcal{TN}^{-}(\mu, \sigma )\) denote univariate normal distributions truncated to positive and negative values, respectively. \(\mathcal{IW} (\nu,A)\) denotes the inverse Wishart distribution with degrees of freedom \(\nu\) and symmetric, nonsingular matrix A.

Coefficients including category constants are collected in a (PJ) matrix \(\beta\) with \(P=1+S+M\) and \(P=1+S+M+0.5 \, M \, (M-1)\) for the specifications main and inter, respectively. S and M denote the number of socio-demographic and marketing variables. T symbolizes the total number of baskets across all households. (TP) matrix X holds the P predictors, (TJ) matrix V the stochastic utilities of all baskets and categories.

Latent residuals are defined as differences between stochastic and deterministic utilities \(e_{jit} = U_{jit} - V_{jit}\). The residual vector of household i for basket t is \(e_{it} = (e_{1it}, e_{2i1t},\ldots, e_{Jit})^{'}\).

Initially all coefficients are set to zero and \(\Sigma\) to a diagonal correlation matrix. Each coefficient has the prior \(\mathcal N(0,\underline{v})\) with \(\underline{v}= 4\). Estimation steps of each iteration are as follows:

  1. 1.

    For all categories \(j=1,\ldots,J\), households \(i=1,\ldots,I\) and their baskets \(t=1,\ldots,T_i\)

    $$U_{jit} \sim \left\{ \begin{array}{ll} {\mathcal{TN}}^{+}(V_{jit|-j}, \sigma _{-k}) & \text{ if } \quad y_{jit}=1 \\ {\mathcal{TN}}^{-}(V_{jit|-j}, \sigma _{-k}) & \text{ else } \end{array} \right.$$
  2. 2.

    Compute matrix of cross products of latent residuals across households \(A= \sum _{i=1}^{I} \sum _{t=1}^{T_i} e_{it} e_{it}^{'}\).

  3. 3.

    Sample covariance matrix \(\Sigma\) from \(\mathcal IW (T-J-1,A)\).

  4. 4.

    Compute proposal correlation matrix \(\tilde{R} = D \Sigma D\) where D is a (JJ) diagonal matrix with elements \(d_{j,j} = \sigma _{k,k}^{-1/2}\) (\(\sigma _{k,k}\) denotes the element of \(\Sigma\) in row and column k).

  5. 5.

    Accept \(\tilde{R}\) with probability \(min(1, \exp {(0.5 (J+1) (\log {|\tilde{R}|}-\log {|\Sigma _{0}|})}))\) with \(\Sigma _{0}\) as correlation matrix of the previous iteration.

  6. 6.

    Set \(\Sigma = R\) and compute its inverse \(H=\Sigma ^{-1}\)

  7. 7.

    Sample coefficients \(\beta _{pj}\) for predictors \(p = 1,\dots,P\) and categories \(j =1, \dots,J\):

    $$\begin{aligned} \beta _{pj} \sim \mathcal N (m,v) \quad \text{ with } \quad v = 1/(1/\underline{v} + h_{jj} x_p' x_p) \quad \text{ and } \quad m = h_j' (V - X \beta _0)' \, x_p \, v \\ \end{aligned}$$

    \(h_{jj}\) is the element in row j and column j of H. \(h_j\) and \(x_p\) denote columns j and p of H and X, respectively. \(\beta _0\) equals \(\beta\) except for a zero value in row p and column j.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hruschka, H. Analyzing the dependences of multi-category purchases on interactions of marketing variables. J Bus Econ 87, 295–313 (2017). https://doi.org/10.1007/s11573-016-0820-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11573-016-0820-x

Keywords

JEL Classification

Navigation