Skip to main content
Erschienen in: Review of Managerial Science 4/2013

01.10.2013 | Original Paper

A parsimonious multivariate poisson model for market basket analysis

verfasst von: Katrin Dippold, Harald Hruschka

Erschienen in: Review of Managerial Science | Ausgabe 4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Intensified research on multivariate Poisson models offers new opportunities for the analysis of purchase quantities in market basket data. The investigation of positive or negative correlations in quantity decisions among product categories facilitates a deeper understanding of consumer purchase behavior. The applied multivariate log-normal Poisson model introduces interdependencies between categories with multivariate normal-distributed latent effects by means of a covariance matrix. As the size of this covariance matrix depends on the number of categories in the model, its ation may become tedious. Furthermore, we assume that quantity decisions do not interact for all pairs of categories. That is why we propose to use covariance selection to derive a parsimonious representation of the correlation structure. For two market basket data sets, we show that the vast majority of off-diagonal elements in the covariance matrix are irrelevant. For a data set with product categories, the model with a partly restricted covariance matrix achieves a better fit to the holdout data than the model with full covariance matrix. For a data set with subcategories of the broader category beverage, the proposed model with restricted covariance outperforms the model with full covariance matrix even on the calibration data. We conclude that interactions of quantity decisions are overall the exception, even for complements-in-use.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Note that the answer to how many items within a product category are bought implies the purchase incidence question.
 
2
Winkelmann (2008) shows that excess zeros can be implemented in the model with any multiplicative heterogeneity.
 
3
We thank one anonymous reviewer for suggesting to discuss this model.
 
4
These market baskets are not used for model calibration but retained to cross-validate the estimates.
 
5
All estimates or analyses in this paper based on SymphonyIRI Group, Inc. data are by aaaaaa and bbbbbbb and not SymphonyIRI Group, Inc.
 
6
Razors, photo supplies, and sugar substitutes are omitted as they are bought in <1 % of baskets. Cigarettes are dropped because of lacking marketing information.
 
7
As mentioned before, the explanation for positive interactions may be complementarity in usage or purchase. A negative effect can be explained by a budget effect (Niraj et al. 2008).
 
8
Note that Niraj et al. (2008) do not account for family size at all, whereas Nikoloulopoulos and Karlis (2010) estimate coefficients for the number of household members in four age cohorts.
 
Literatur
Zurück zum Zitat Aitchison J, Ho CH (1989) The multivariate poisson-log normal distribution. Biometrika 76(4):643–653CrossRef Aitchison J, Ho CH (1989) The multivariate poisson-log normal distribution. Biometrika 76(4):643–653CrossRef
Zurück zum Zitat Alfò M, Trovato G (2004) Semiparametric mixture models for multivariate count data with application. Econom J 7(2):426–454CrossRef Alfò M, Trovato G (2004) Semiparametric mixture models for multivariate count data with application. Econom J 7(2):426–454CrossRef
Zurück zum Zitat Armstrong H, Carter CK, Wong KFK, Kohn R (2009) Bayesian covariance matrix estimation using a mixture of decomposable graphical models. Stat Comput 19(3):303–316CrossRef Armstrong H, Carter CK, Wong KFK, Kohn R (2009) Bayesian covariance matrix estimation using a mixture of decomposable graphical models. Stat Comput 19(3):303–316CrossRef
Zurück zum Zitat Bishop YM, Fienberg SE, Holl PW (1975) Discrete multivariate analysis: theory and practice. MIT Press, Cambridge Bishop YM, Fienberg SE, Holl PW (1975) Discrete multivariate analysis: theory and practice. MIT Press, Cambridge
Zurück zum Zitat Boztuğ Y, Reutterer T (2008) A combined approach for segment-specific market basket analysis. Eur J Oper Res 187(1):294–312CrossRef Boztuğ Y, Reutterer T (2008) A combined approach for segment-specific market basket analysis. Eur J Oper Res 187(1):294–312CrossRef
Zurück zum Zitat Boztuğ Y, Silberhorn N (2006) Modellierungsansätze in der Warenkorbanalyse im Überblick. J Betriebswirtschaft 56(2):105–128CrossRef Boztuğ Y, Silberhorn N (2006) Modellierungsansätze in der Warenkorbanalyse im Überblick. J Betriebswirtschaft 56(2):105–128CrossRef
Zurück zum Zitat Brijs T, Karlis D, Swinnen G, Vanhoof K, Wets G, Manchanda P (2004) A multivariate poisson mixture model for marketing applications. Stat Neerl 58(3):322–348CrossRef Brijs T, Karlis D, Swinnen G, Vanhoof K, Wets G, Manchanda P (2004) A multivariate poisson mixture model for marketing applications. Stat Neerl 58(3):322–348CrossRef
Zurück zum Zitat Bronnenberg BJ, Kruger MW, Mela CF (2008) The IRI marketing data set. Mark Sci 27(4):745–748CrossRef Bronnenberg BJ, Kruger MW, Mela CF (2008) The IRI marketing data set. Mark Sci 27(4):745–748CrossRef
Zurück zum Zitat Cameron AC, Trivedi PK (2005) Microeconometrics: methods and applications. Cambridge University Press, New York CrossRef Cameron AC, Trivedi PK (2005) Microeconometrics: methods and applications. Cambridge University Press, New York CrossRef
Zurück zum Zitat Chen Z, Dunson DB (2003) Random effects selection in linear mixed models. Biometrics 59:762–769CrossRef Chen Z, Dunson DB (2003) Random effects selection in linear mixed models. Biometrics 59:762–769CrossRef
Zurück zum Zitat Chib S, Greenberg E, Winkelmann R (1998) Posterior simulation and Bayes factors in panel count data models. J Econom 86(1):33–54CrossRef Chib S, Greenberg E, Winkelmann R (1998) Posterior simulation and Bayes factors in panel count data models. J Econom 86(1):33–54CrossRef
Zurück zum Zitat Chib S, Seetharaman PB, Strijnev A (2002) Analysis of multi-category purchase incidence using IRI market basket data. In: Franses PH, Montgomery AL (eds) Econometric models in marketing 16. JAI, Amsterdam, pp 57–92 Chib S, Seetharaman PB, Strijnev A (2002) Analysis of multi-category purchase incidence using IRI market basket data. In: Franses PH, Montgomery AL (eds) Econometric models in marketing 16. JAI, Amsterdam, pp 57–92
Zurück zum Zitat Chib S, Winkelmann R (2001) Markov chain Monte Carlo analysis of correlated count data. J Bus Econ Stat 19(4):428–435CrossRef Chib S, Winkelmann R (2001) Markov chain Monte Carlo analysis of correlated count data. J Bus Econ Stat 19(4):428–435CrossRef
Zurück zum Zitat Czado C, Gneiting T, Held L (2009) Predictive model assessment for count data. Biometrics 65(4):1254–1261CrossRef Czado C, Gneiting T, Held L (2009) Predictive model assessment for count data. Biometrics 65(4):1254–1261CrossRef
Zurück zum Zitat Dippold K, Hruschka H (2012) Variable selection for market basket analysis. CompStat (forthcoming) Dippold K, Hruschka H (2012) Variable selection for market basket analysis. CompStat (forthcoming)
Zurück zum Zitat Doyle P, Saunders J (1990) Multiproduct advertising budgeting. Mark Sci 9(2):97–113CrossRef Doyle P, Saunders J (1990) Multiproduct advertising budgeting. Mark Sci 9(2):97–113CrossRef
Zurück zum Zitat Duvvuri SD, Ansari A, Gupta S (2007) Consumers’ price sensitivities across complementary categories. Manag Sci 53(12):1933–1945CrossRef Duvvuri SD, Ansari A, Gupta S (2007) Consumers’ price sensitivities across complementary categories. Manag Sci 53(12):1933–1945CrossRef
Zurück zum Zitat Erdem T (1998) An empirical analysis of umbrella branding. J Mark Res 35(3):339–351CrossRef Erdem T (1998) An empirical analysis of umbrella branding. J Mark Res 35(3):339–351CrossRef
Zurück zum Zitat Frühwirth–Schnatter S (2006) Finite mixture Markov switching models. Springer, New York Frühwirth–Schnatter S (2006) Finite mixture Markov switching models. Springer, New York
Zurück zum Zitat Frühwirth-Schnatter S, Tüchler R (2008) Bayesian parsimonious covariance estimation for hierarchical linear mixed models. Stat Comput 18(1):1–13CrossRef Frühwirth-Schnatter S, Tüchler R (2008) Bayesian parsimonious covariance estimation for hierarchical linear mixed models. Stat Comput 18(1):1–13CrossRef
Zurück zum Zitat Griffiths RC, Milne RK (1987) A class of infinitely divisible multivariate negative binomial distributions. J Multivariate Anal 22(1):13–23CrossRef Griffiths RC, Milne RK (1987) A class of infinitely divisible multivariate negative binomial distributions. J Multivariate Anal 22(1):13–23CrossRef
Zurück zum Zitat Hansen KT, Singh VP, Chintagunta P (2006) Understanding store-brand purchase behavior across categories. Mark Sci 25(1):75–90CrossRef Hansen KT, Singh VP, Chintagunta P (2006) Understanding store-brand purchase behavior across categories. Mark Sci 25(1):75–90CrossRef
Zurück zum Zitat Hruschka H, Lukanowicz M, Buchta C (1999) Cross-category sales promotion effects. J Retail Consum Serv 6(2):99–105CrossRef Hruschka H, Lukanowicz M, Buchta C (1999) Cross-category sales promotion effects. J Retail Consum Serv 6(2):99–105CrossRef
Zurück zum Zitat Karlis D (2003) An EM algorithm for multivariate poisson distribution and related models. J Appl Stat 30(1):63–77CrossRef Karlis D (2003) An EM algorithm for multivariate poisson distribution and related models. J Appl Stat 30(1):63–77CrossRef
Zurück zum Zitat Karlis D, Meligkotsidou L (2005) Multivariate poisson regression with covariance structure. Stat Comput 15(4):255–265CrossRef Karlis D, Meligkotsidou L (2005) Multivariate poisson regression with covariance structure. Stat Comput 15(4):255–265CrossRef
Zurück zum Zitat Kök AG, Fisher M, Vaidyanathan R (2008) Assortment planning: review of literature and industry practice. In: Agrawal N, Smith SA (eds) Retail supply chain management, Springer, Berlin, pp 99–154CrossRef Kök AG, Fisher M, Vaidyanathan R (2008) Assortment planning: review of literature and industry practice. In: Agrawal N, Smith SA (eds) Retail supply chain management, Springer, Berlin, pp 99–154CrossRef
Zurück zum Zitat Kohn R, Smith M, Chan D (2001) Nonparametric regression using linear combinations of basis functions. Stat Comput 11(4):313–322CrossRef Kohn R, Smith M, Chan D (2001) Nonparametric regression using linear combinations of basis functions. Stat Comput 11(4):313–322CrossRef
Zurück zum Zitat Kruger MW, Pagni D (2008) IRI academic data set description version 131. Information Resources Incorporated, Chicago Kruger MW, Pagni D (2008) IRI academic data set description version 131. Information Resources Incorporated, Chicago
Zurück zum Zitat Ma J, Kockelman KM (2006) Bayesian multivariate poisson regression for models of injury count by severity. Transp Res Record 1950(4):24–34CrossRef Ma J, Kockelman KM (2006) Bayesian multivariate poisson regression for models of injury count by severity. Transp Res Record 1950(4):24–34CrossRef
Zurück zum Zitat Manchanda P, Ansari A, Gupta S (1999) The “shopping basket”: a model for multicategory purchase incidence decisions. Mark Sci 18(2):95–114CrossRef Manchanda P, Ansari A, Gupta S (1999) The “shopping basket”: a model for multicategory purchase incidence decisions. Mark Sci 18(2):95–114CrossRef
Zurück zum Zitat McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman & Hall, London McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman & Hall, London
Zurück zum Zitat Müller-Hagedorn L, Natter M (2011) Handels-marketing. 5 Auflage, Kohlhammer, Stuttgart Müller-Hagedorn L, Natter M (2011) Handels-marketing. 5 Auflage, Kohlhammer, Stuttgart
Zurück zum Zitat Munkin MK, Trivedi PK (1999) Simulated maximum likelihood estimation of multivariate mixed-poisson regression models with application. Econom J 2:29–48CrossRef Munkin MK, Trivedi PK (1999) Simulated maximum likelihood estimation of multivariate mixed-poisson regression models with application. Econom J 2:29–48CrossRef
Zurück zum Zitat Nikoloulopoulos AK, Karlis D (2009) Finite normal mixture copulas for multivariate discrete data modeling. J Stat Plan Inference 139(11):3878–3890CrossRef Nikoloulopoulos AK, Karlis D (2009) Finite normal mixture copulas for multivariate discrete data modeling. J Stat Plan Inference 139(11):3878–3890CrossRef
Zurück zum Zitat Nikoloulopoulos AK, Karlis D (2010) Modeling multivariate count data using copulas. Commun Stat Simul Comput 39(1):172–187CrossRef Nikoloulopoulos AK, Karlis D (2010) Modeling multivariate count data using copulas. Commun Stat Simul Comput 39(1):172–187CrossRef
Zurück zum Zitat Niraj R, Padmanabhan V, Seetharaman PB (2008) Research note: a cross-category model of households’ incidence quantity decisions. Mark Sci 27(2):225–235CrossRef Niraj R, Padmanabhan V, Seetharaman PB (2008) Research note: a cross-category model of households’ incidence quantity decisions. Mark Sci 27(2):225–235CrossRef
Zurück zum Zitat Papastefanou G (2001) The ZUMA data file version of the GfK consumerScan household panel. In: Papastefanou G, Schmidt P, Börsch-Supan A, Lüdkte H, Oltersdorf U (eds) SSocial economic analyses of consumer panel data. Zentrum für Umfragen Meinungen und Analysen (ZUMA), Mannheim, pp 206–212 Papastefanou G (2001) The ZUMA data file version of the GfK consumerScan household panel. In: Papastefanou G, Schmidt P, Börsch-Supan A, Lüdkte H, Oltersdorf U (eds) SSocial economic analyses of consumer panel data. Zentrum für Umfragen Meinungen und Analysen (ZUMA), Mannheim, pp 206–212
Zurück zum Zitat Park ES, Lord D (2007) Multivariate poisson-lognormal models for jointly modeling crash frequency by severity. Transp Res Rec J Transp Res Board 2019:1–6CrossRef Park ES, Lord D (2007) Multivariate poisson-lognormal models for jointly modeling crash frequency by severity. Transp Res Rec J Transp Res Board 2019:1–6CrossRef
Zurück zum Zitat Russell GJ, Petersen A (2000) Analysis of cross category dependence in market basket selection. J Retail 76(3):319–332CrossRef Russell GJ, Petersen A (2000) Analysis of cross category dependence in market basket selection. J Retail 76(3):319–332CrossRef
Zurück zum Zitat Russell GJ, Ratneshwar S, Shocker AD, Bell D, Bodapati A, Degeratu A, Hildebrandt L, Kim N, Ramaswami S, Shankar VH (1999) Multiple-category decision-making: review and synthesis. Mark Lett 10(3):319–332CrossRef Russell GJ, Ratneshwar S, Shocker AD, Bell D, Bodapati A, Degeratu A, Hildebrandt L, Kim N, Ramaswami S, Shankar VH (1999) Multiple-category decision-making: review and synthesis. Mark Lett 10(3):319–332CrossRef
Zurück zum Zitat Seetharaman PB, Chib S, Ainslie A, Boatwright P, Chan T, Gupta S, Mehta N, Rao V, Strijnev A (2005) Models of multicategory choice behavior. Mark Lett 16(3-4):239–254CrossRef Seetharaman PB, Chib S, Ainslie A, Boatwright P, Chan T, Gupta S, Mehta N, Rao V, Strijnev A (2005) Models of multicategory choice behavior. Mark Lett 16(3-4):239–254CrossRef
Zurück zum Zitat Singh VP, Hansen KT, Gupta S (2005) Modeling preferences for common attributes in multi-category brand choice. J Mark Res 42(2):195–209CrossRef Singh VP, Hansen KT, Gupta S (2005) Modeling preferences for common attributes in multi-category brand choice. J Mark Res 42(2):195–209CrossRef
Zurück zum Zitat Smith M, Kohn R (2002) Parsimonious covariance matrix estimation for longitudinal data. J Am Stat Assoc 97(460):1141–1153CrossRef Smith M, Kohn R (2002) Parsimonious covariance matrix estimation for longitudinal data. J Am Stat Assoc 97(460):1141–1153CrossRef
Zurück zum Zitat Smith SA, Agrawal N (2000) Management of multi-item retail inventory systems with demand substitution. Oper Res 48:50–64CrossRef Smith SA, Agrawal N (2000) Management of multi-item retail inventory systems with demand substitution. Oper Res 48:50–64CrossRef
Zurück zum Zitat Smyth P (2000) Model selection for probabilistic clustering using cross-validated likelihood. Stat Comp 10(1):63–72CrossRef Smyth P (2000) Model selection for probabilistic clustering using cross-validated likelihood. Stat Comp 10(1):63–72CrossRef
Zurück zum Zitat Song I, Chintagunta PK (2007) A discrete-continuous model for multicategory purchase behavior of households. J Mark Res 44(4):595–612CrossRef Song I, Chintagunta PK (2007) A discrete-continuous model for multicategory purchase behavior of households. J Mark Res 44(4):595–612CrossRef
Zurück zum Zitat Tanner MA, Wong WH (1987) The calculation of posterior distributions by data augmentation. J Am Stat Assoc 82(398):528–540CrossRef Tanner MA, Wong WH (1987) The calculation of posterior distributions by data augmentation. J Am Stat Assoc 82(398):528–540CrossRef
Zurück zum Zitat Trivedi PK, Zimmer DM (2005) Copula modeling: an introduction for practitioners. Found Trends Econom 1(1):1–111CrossRef Trivedi PK, Zimmer DM (2005) Copula modeling: an introduction for practitioners. Found Trends Econom 1(1):1–111CrossRef
Zurück zum Zitat Wang H(D), Kalwani MU, Akçura T (2007) A bayesian multivariate poisson regression model of cross-category store brand purchasing behavior. J Retail Consum Serv 14(6):369–382CrossRef Wang H(D), Kalwani MU, Akçura T (2007) A bayesian multivariate poisson regression model of cross-category store brand purchasing behavior. J Retail Consum Serv 14(6):369–382CrossRef
Zurück zum Zitat Wedel M, Böckenholt U, Kamakura WA (2003) Factor models for multivariate count data. J Multivar Anal 87(2):356–369CrossRef Wedel M, Böckenholt U, Kamakura WA (2003) Factor models for multivariate count data. J Multivar Anal 87(2):356–369CrossRef
Zurück zum Zitat Winkelmann R (2008) Econometric analysis of count data, 5th edn. Springer, Heidelberg Winkelmann R (2008) Econometric analysis of count data, 5th edn. Springer, Heidelberg
Zurück zum Zitat Wong F, Carter CK, Kohn R (2003) Efficient estimation of covariance selection models. Biometrika 90(4):809–830CrossRef Wong F, Carter CK, Kohn R (2003) Efficient estimation of covariance selection models. Biometrika 90(4):809–830CrossRef
Metadaten
Titel
A parsimonious multivariate poisson model for market basket analysis
verfasst von
Katrin Dippold
Harald Hruschka
Publikationsdatum
01.10.2013
Verlag
Springer Berlin Heidelberg
Erschienen in
Review of Managerial Science / Ausgabe 4/2013
Print ISSN: 1863-6683
Elektronische ISSN: 1863-6691
DOI
https://doi.org/10.1007/s11846-012-0088-7

Weitere Artikel der Ausgabe 4/2013

Review of Managerial Science 4/2013 Zur Ausgabe

Premium Partner