Skip to main content

2018 | OriginalPaper | Buchkapitel

6. Principal Component Analysis and Extensions

verfasst von : Patrick Mair

Erschienen in: Modern Psychometrics with R

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This chapter introduces principal component analysis (PCA), a technique for dimension reduction in multivariate datasets. At its core there is a matrix decomposition technique called singular value decomposition, which is introduced at the beginning of this chapter. This is followed by PCA model formulation, computation, and an application. Relationships with exploratory factor analysis are discussed as well. Subsequently, some PCA variants such as robust and sparse PCA are briefly discussed. The final two sections introduce two extensions of PCA. The first one is three-way PCA for three-way input data structures. The second one is called independent component analysis and illustrated using electroencephalography (EEG) data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
For readers who are more attracted to a sonic introduction to SVD rather than a formal one, Michael Greenacre composed an SVD song (“It had to be U”), available on YouTube.
 
2
In this section we follow a notation for indices of the data and components as well as the number of them which is different from that used before but which is standard in the literature of three-way analysis (see Kiers, 2000).
 
3
We omit the matrix formulation for all three-way PCA models since it requires some 3D matrix operations that are beyond scope of this book (see, e.g., Kroonenberg, 2008, for corresponding expressions).
 
4
Biplots are introduced in more detail in Chap. 10.
 
5
Thanks for Hrag Pailian for sharing this dataset.
 
Literatur
Zurück zum Zitat Arend, A. M., & Zimmer, H. D. (2011). What does ipsilateral delay activity reflect? Inferences from slow potentials in a lateralized visual working memory task. Journal of Cognitive Neuroscience, 23, 4048–4056.CrossRef Arend, A. M., & Zimmer, H. D. (2011). What does ipsilateral delay activity reflect? Inferences from slow potentials in a lateralized visual working memory task. Journal of Cognitive Neuroscience, 23, 4048–4056.CrossRef
Zurück zum Zitat Carroll, J. D., & Arabie, P. (1980). Multidimensional scaling. Annual Review of Psychology, 31, 607–649.CrossRef Carroll, J. D., & Arabie, P. (1980). Multidimensional scaling. Annual Review of Psychology, 31, 607–649.CrossRef
Zurück zum Zitat Carroll, J. D., & Chang, J. J. (1970). Analysis of individual differences in multidimensional scaling via an N-way generalization of Eckart-Young decomposition. Psychometrika, 35, 283–319.CrossRef Carroll, J. D., & Chang, J. J. (1970). Analysis of individual differences in multidimensional scaling via an N-way generalization of Eckart-Young decomposition. Psychometrika, 35, 283–319.CrossRef
Zurück zum Zitat Eckart, C., & Young, G. (1936). The approximation of one matrix by another of lower rank. Psychometrika, 1, 211–218.CrossRef Eckart, C., & Young, G. (1936). The approximation of one matrix by another of lower rank. Psychometrika, 1, 211–218.CrossRef
Zurück zum Zitat Fox, J., & Weisberg, S. (2011). An R companion to applied regression (2nd ed.). Thousand Oaks: Sage. Fox, J., & Weisberg, S. (2011). An R companion to applied regression (2nd ed.). Thousand Oaks: Sage.
Zurück zum Zitat Harshman, R. A. (1970). Foundations of the PARAFAC procedure: Models and conditions for an “explanatory” multi-modal factor analysis (Technical report 16, UCLA Working Papers in Phonetics). Harshman, R. A. (1970). Foundations of the PARAFAC procedure: Models and conditions for an “explanatory” multi-modal factor analysis (Technical report 16, UCLA Working Papers in Phonetics).
Zurück zum Zitat Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441.CrossRef Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441.CrossRef
Zurück zum Zitat Hyvärinen, A., Karhunen, J., & Oja, E. (2001). Independent component analysis. New York: Wiley.CrossRef Hyvärinen, A., Karhunen, J., & Oja, E. (2001). Independent component analysis. New York: Wiley.CrossRef
Zurück zum Zitat James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning with applications in R. New York: Springer.CrossRef James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning with applications in R. New York: Springer.CrossRef
Zurück zum Zitat Joliffe, I. T. (2002). Principal component analysis (2nd ed.). New York: Springer. Joliffe, I. T. (2002). Principal component analysis (2nd ed.). New York: Springer.
Zurück zum Zitat Kiers, H. A. L. (2000). Towards a standardized notation and terminology in multiway analysis. Journal of Chemometrics, 14, 105–122.MathSciNetCrossRef Kiers, H. A. L. (2000). Towards a standardized notation and terminology in multiway analysis. Journal of Chemometrics, 14, 105–122.MathSciNetCrossRef
Zurück zum Zitat Kroonenberg, P. M. (2008). Applied multiway data analysis. Hoboken: Wiley.CrossRef Kroonenberg, P. M. (2008). Applied multiway data analysis. Hoboken: Wiley.CrossRef
Zurück zum Zitat Pearson, K. (1901). On lines and planes of closest fit to systems of points in space. Philosophical Magazine, 2, 559–572.MATH Pearson, K. (1901). On lines and planes of closest fit to systems of points in space. Philosophical Magazine, 2, 559–572.MATH
Zurück zum Zitat Sharma, S. (1996). Applied multivariate techniques. New York: Wiley. Sharma, S. (1996). Applied multivariate techniques. New York: Wiley.
Zurück zum Zitat Sidanius, J., Levin, S., van Laar, C., & Sears, D. O. (2010). The diversity challenge: Social identity and intergroup relations on the college campus. New York: The Russell Sage Foundation. Sidanius, J., Levin, S., van Laar, C., & Sears, D. O. (2010). The diversity challenge: Social identity and intergroup relations on the college campus. New York: The Russell Sage Foundation.
Zurück zum Zitat Sigg, C. D., & Buhmann, J. M. (2008). Expectation-maximization for sparse and non-negative PCA. In Proceedings of the 25th International Conference on Machine Learning. Sigg, C. D., & Buhmann, J. M. (2008). Expectation-maximization for sparse and non-negative PCA. In Proceedings of the 25th International Conference on Machine Learning.
Zurück zum Zitat Stacklies, W., Redestig, H., Scholz, M., Walther, D., & Selbig, J. (2007). pcaMethods: A bioconductor package providing PCA methods for incomplete data. Bioinformatics, 23, 1164–1167.CrossRef Stacklies, W., Redestig, H., Scholz, M., Walther, D., & Selbig, J. (2007). pcaMethods: A bioconductor package providing PCA methods for incomplete data. Bioinformatics, 23, 1164–1167.CrossRef
Zurück zum Zitat Stone, J. V. (2004). Independent component analysis: A tutorial introduction. Cambridge: The MIT Press. Stone, J. V. (2004). Independent component analysis: A tutorial introduction. Cambridge: The MIT Press.
Zurück zum Zitat Timmerman, M. E. (2001). Component analysis of multisubject multivariate longitudinal data. PhD thesis, University of Groningen, Groningen. Timmerman, M. E. (2001). Component analysis of multisubject multivariate longitudinal data. PhD thesis, University of Groningen, Groningen.
Zurück zum Zitat Treiblmaier, H. (2006). Datenqualität und individualisierte Kommunikation [Data Quality and Individualized Communication]. Wiesbaden: DUV Gabler Edition Wissenschaft. Treiblmaier, H. (2006). Datenqualität und individualisierte Kommunikation [Data Quality and Individualized Communication]. Wiesbaden: DUV Gabler Edition Wissenschaft.
Zurück zum Zitat Treiblmaier, H., Bentler, P. M., & Mair, P. (2011). Formative constructs implemented via common factors. Structural Equation Modeling: A Multidisciplinary Journal, 18, 1–17.MathSciNetCrossRef Treiblmaier, H., Bentler, P. M., & Mair, P. (2011). Formative constructs implemented via common factors. Structural Equation Modeling: A Multidisciplinary Journal, 18, 1–17.MathSciNetCrossRef
Zurück zum Zitat Venables, W. N., & Ripley, B. D. (2002). Modern applied statistics with S (4th ed.). New York: Springer.CrossRef Venables, W. N., & Ripley, B. D. (2002). Modern applied statistics with S (4th ed.). New York: Springer.CrossRef
Zurück zum Zitat Vogel, E. K., & Machizawa, M. G. (2004). Neural activity predicts individual differences in visual working memory capacity. Nature, 428, 748–751.CrossRef Vogel, E. K., & Machizawa, M. G. (2004). Neural activity predicts individual differences in visual working memory capacity. Nature, 428, 748–751.CrossRef
Zurück zum Zitat Willerman, L., Schultz, R., Rutledge, J. N., & Bigler, E. (1991). In vivo brain size and intelligence. Intelligence, 15, 223–228.CrossRef Willerman, L., Schultz, R., Rutledge, J. N., & Bigler, E. (1991). In vivo brain size and intelligence. Intelligence, 15, 223–228.CrossRef
Zurück zum Zitat Zou, H., Hastie, T., & Tibshirani, R. (2006). Sparse principal component analysis. Journal of Computational and Graphical Statistics, 15, 262–286.MathSciNetCrossRef Zou, H., Hastie, T., & Tibshirani, R. (2006). Sparse principal component analysis. Journal of Computational and Graphical Statistics, 15, 262–286.MathSciNetCrossRef
Metadaten
Titel
Principal Component Analysis and Extensions
verfasst von
Patrick Mair
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-93177-7_6

Premium Partner