
2018 | OriginalPaper | Book chapter

9. Cluster Analysis

Written by: Erik Mooi, Marko Sarstedt, Irma Mooi-Reci

Published in: Market Research

Publisher: Springer Singapore


Abstract

We provide comprehensive and advanced knowledge of cluster analysis. We first introduce the principles of cluster analysis and outline the steps and decisions involved. We discuss how to select appropriate clustering variables and then introduce modern hierarchical and partitioning methods for cluster analysis, using simple examples to illustrate how they work. We also discuss the key measures of similarity and dissimilarity, and offer guidance on how to decide on the number of clusters to extract from the data. Each step in a cluster analysis is then linked to its execution in Stata (using menus and code), enabling readers to analyze, chart, and validate the results. Interpreting Stata output can be difficult, so we walk through it in an annotated case study. We conclude with suggestions for further reading on the use, application, and interpretation of cluster analysis.


Footnotes
1
Tonks (2009) provides a discussion of segment design and the choice of clustering variables in consumer markets.
 
2
See Arabie and Hubert (1994), Sheppard (1996), and Dolnicar and Grün (2009).
 
3
Whereas agglomerative methods have the large task of checking N·(N−1)/2 possible first combinations of observations (note that N represents the number of observations in the dataset), divisive methods have the almost impossible task of checking 2^(N−1)−1 combinations.
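
To give a sense of scale for the two counts in this footnote, the following minimal Python sketch evaluates N·(N−1)/2 and 2^(N−1)−1 for a few sample sizes. The function names are hypothetical and chosen only for illustration; they are not part of the chapter or of Stata.

```python
def agglomerative_first_combinations(n: int) -> int:
    """Number of pairs of observations an agglomerative method can merge first."""
    return n * (n - 1) // 2


def divisive_first_combinations(n: int) -> int:
    """Number of ways a divisive method can split n observations into two non-empty groups."""
    return 2 ** (n - 1) - 1


if __name__ == "__main__":
    # Illustrative sample sizes only; any positive integer works.
    for n in (5, 10, 50):
        print(n, agglomerative_first_combinations(n), divisive_first_combinations(n))
```

The pair count grows quadratically, while the split count roughly doubles with every additional observation, which is why the footnote calls the divisive task almost impossible.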
 
4
There are many other matching coefficients, such as Yule’s Q, Kulczynski, or Ochiai, which are also menu-accessible in Stata. However, since most applications of cluster analysis rely on metric or ordinal data, we will not discuss these. See Wedel and Kamakura (2000) for more information on alternative matching coefficients.
 
5
For details on the implementation of these stopping rules in Stata, see Halpin (2016).
 
6
In the Web Appendix (→Downloads), we offer a Stata .ado file, chomega.ado, to calculate the ω_k. We also offer an Excel sheet (VRC.xlsx) to calculate the ω_k manually.
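
As a rough illustration of the manual calculation, the sketch below assumes the common definition ω_k = (VRC_{k+1} − VRC_k) − (VRC_k − VRC_{k−1}), where VRC_k is the Caliński-Harabasz variance ratio criterion for a k-cluster solution, and assumes that the k with the smallest ω_k is preferred. This definition is not stated on this page, the VRC values are made up, and the function is hypothetical; it does not reproduce chomega.ado or VRC.xlsx.

```python
from typing import Dict


def omega(vrc: Dict[int, float]) -> Dict[int, float]:
    """Compute omega_k for every k whose neighbours k-1 and k+1 also have a VRC value."""
    return {
        k: (vrc[k + 1] - vrc[k]) - (vrc[k] - vrc[k - 1])
        for k in sorted(vrc)
        if (k - 1) in vrc and (k + 1) in vrc
    }


if __name__ == "__main__":
    # Made-up VRC values for 2 to 5 clusters, for illustration only.
    vrc_values = {2: 10.0, 3: 18.0, 4: 20.0, 5: 21.0}
    omegas = omega(vrc_values)
    best_k = min(omegas, key=omegas.get)  # assumed rule: smallest omega_k
    print(omegas, best_k)
```

Under this definition ω_k cannot be computed for the smallest and largest k considered, because each value needs the VRC of both neighbouring solutions.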
 
7
See Punj and Stewart (1983) for additional information on this sequential approach.
 
References
Anderberg, M. R. (1973). Cluster analysis for applications. New York: Academic.
Arabie, P., & Hubert, L. (1994). Cluster analysis in marketing research. In R. P. Bagozzi (Ed.), Advanced methods in marketing research (pp. 160–189). Cambridge: Basil Blackwell & Mott, Ltd.
Arthur, D., & Vassilvitskii, S. (2007). k-means++: The advantages of careful seeding. In Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms (pp. 1027–1035). Philadelphia: Society for Industrial and Applied Mathematics.
Becker, J.-M., Ringle, C. M., Sarstedt, M., & Völckner, F. (2015). How collinearity affects mixture regression results. Marketing Letters, 26(4), 643–659.
Caliński, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics - Theory and Methods, 3(1), 1–27.
Dolnicar, S. (2003). Using cluster analysis for market segmentation - typical misconceptions, established methodological weaknesses and some recommendations for improvement. Australasian Journal of Market Research, 11(2), 5–12.
Dolnicar, S., & Grün, B. (2009). Challenging “factor-cluster segmentation”. Journal of Travel Research, 47(1), 63–71.
Dolnicar, S., & Lazarevski, K. (2009). Methodological reasons for the theory/practice divide in market segmentation. Journal of Marketing Management, 25(3–4), 357–373.
Dolnicar, S., Grün, B., Leisch, F., & Schmidt, F. (2014). Required sample sizes for data-driven market segmentation analyses in tourism. Journal of Travel Research, 53(3), 296–306.
Dolnicar, S., Grün, B., & Leisch, F. (2016). Increasing sample size compensates for data problems in segmentation studies. Journal of Business Research, 69(2), 992–999.
Duda, R. O., & Hart, P. E. (1973). Pattern classification. Hoboken: Wiley.
Duda, R. O., Hart, P. E., & Stork, D. G. (2001). Pattern classification (2nd ed.). Hoboken: Wiley.
Everitt, B. S., & Rabe-Hesketh, S. (2006). Handbook of statistical analyses using Stata (4th ed.). Boca Raton: Chapman & Hall/CRC.
Formann, A. K. (1984). Die Latent-Class-Analyse: Einführung in die Theorie und Anwendung. Weinheim: Beltz.
Gower, J. C. (1971). A general coefficient of similarity and some of its properties. Biometrics, 27(4), 857–871.
Kaufman, L., & Rousseeuw, P. J. (2005). Finding groups in data. An introduction to cluster analysis. Hoboken: Wiley.
Kotler, P., & Keller, K. L. (2015). Marketing management (15th ed.). Upper Saddle River: Prentice Hall.
Milligan, G. W., & Cooper, M. (1985). An examination of procedures for determining the number of clusters in a data set. Psychometrika, 50(2), 159–179.
Milligan, G. W., & Cooper, M. (1988). A study of variable standardization. Journal of Classification, 5(2), 181–204.
Park, H.-S., & Jun, C.-H. (2009). A simple and fast algorithm for K-medoids clustering. Expert Systems with Applications, 36(2), 3336–3341.
Punj, G., & Stewart, D. W. (1983). Cluster analysis in marketing research: Review and suggestions for application. Journal of Marketing Research, 20(2), 134–148.
Qiu, W., & Joe, H. (2009). Cluster generation: Random cluster generation (with specified degree of separation). R package version 1.2.7.
Sheppard, A. (1996). The sequence of factor analysis and cluster analysis: Differences in segmentation and dimensionality through the use of raw and factor scores. Tourism Analysis, 1(1), 49–57.
Tonks, D. G. (2009). Validity and the design of market segments. Journal of Marketing Management, 25(3/4), 341–356.
Wedel, M., & Kamakura, W. A. (2000). Market segmentation: Conceptual and methodological foundations (2nd ed.). Boston: Kluwer Academic.
van der Kloot, W. A., Spaans, A. M. J., & Heiser, W. J. (2005). Instability of hierarchical cluster analysis due to input order of the data: The PermuCLUSTER solution. Psychological Methods, 10(4), 468–476.
Lilien, G. L., & Rangaswamy, A. (2004). Marketing engineering. Computer-assisted marketing analysis and planning (2nd ed.). Bloomington: Trafford Publishing.
Roberts, J. H., Kayande, U., & Stremersch, S. (2014). From academic research to marketing practice: Exploring the marketing science value chain. International Journal of Research in Marketing, 31(2), 127–140.
Metadata
Title
Cluster Analysis
Written by
Erik Mooi
Marko Sarstedt
Irma Mooi-Reci
Copyright year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-5218-7_9