Skip to main content
Top

2018 | OriginalPaper | Chapter

2. Factor Analysis

Author : Patrick Mair

Published in: Modern Psychometrics with R

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter introduces exploratory and confirmatory factor analysis. It starts with a section on correlation coefficients since factor analytic techniques are based on covariance/correlation matrices. Special emphasis is on tetrachoric/polychoric correlations for ordinal input data. This is followed by elaborations on exploratory factor analysis including practical aspects such as determining the number of factors and rotation techniques to facilitate factor interpretation. A recent development is Bayesian exploratory factor analysis which, in addition to the loadings, also estimates the number of factors and allows them to be correlated. This approach is explored in a separate section. The second part of this chapter consists of a detailed treatment of confirmatory factor analysis which lays the groundwork for structural equation models presented in the next chapter. In confirmatory factor analysis, the number of factors and the assignment of indicators to factors are determined by substantive considerations. Several extensions in terms of multigroup, longitudinal, and multilevel settings are presented. The chapter concludes with a Bayesian approach to confirmatory factor analysis.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Eigenvalues will be introduced in Sect. 6.​1.​1.
 
2
This call gives a warning that the matrix is not positive definite.
 
3
In EFA, residuals are defined by \((\mathbf R - \hat {\mathbf P})\), where R is the sample correlation matrix and \(\hat {\mathbf P}\) the estimated model correlation matrix.
 
4
An overview of rotation techniques and corresponding comparisons can be found in Browne (2001).
 
5
Thanks to Rémi Piatek and Sylvia Frühwirth-Schnatter for their support with this application.
 
6
In their original paper, Conti et al. (2014) use the more restrictive assumption of at least three manifest variables per active factor, to rule out potential identification problems due to extreme cases with zero correlation between some factors. With correlated factors, however, the weaker assumption of two manifest variables per factor is sufficient for identification.
 
7
We switch the notation for the input data (Y instead of X) in order to be consistent with the standard SEM model formulation presented in the next chapter.
 
8
Note that compared to Eq. (2.4) we slightly change the notation (i.e., Ψ instead of Φ, and Θ instead of Ψ) in order to be consistent with the names of the output objects in the lavaan package (Rosseel, 2012), which is used throughout this chapter.
 
9
At the time this book was written, lavaan allows for two-level structures only. Also, thanks to Yves Rosseel for sharing the code.
 
Literature
go back to reference Bartholomew, D. J., & Knott, M. (1999). Latent variable models and factor analysis (2nd ed.). London: Hodder Arnold.MATH Bartholomew, D. J., & Knott, M. (1999). Latent variable models and factor analysis (2nd ed.). London: Hodder Arnold.MATH
go back to reference Bartholomew, D. J., Steele, F., Moustaki, I., & Galbraith, J. I. (2008). Analysis of multivariate social science data (2nd ed.). Boca Raton: CRC Press.MATH Bartholomew, D. J., Steele, F., Moustaki, I., & Galbraith, J. I. (2008). Analysis of multivariate social science data (2nd ed.). Boca Raton: CRC Press.MATH
go back to reference Bergh, R., Akrami, N., Sidanius, J., & Sibley, C. (2016). Is group membership necessary for understanding prejudice? A re-evaluation of generalized prejudice and its personality correlates. Journal of Personality and Social Psychology, 111, 367–395.CrossRef Bergh, R., Akrami, N., Sidanius, J., & Sibley, C. (2016). Is group membership necessary for understanding prejudice? A re-evaluation of generalized prejudice and its personality correlates. Journal of Personality and Social Psychology, 111, 367–395.CrossRef
go back to reference Browne, M. W. (2001). An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research, 36, 111–150.CrossRef Browne, M. W. (2001). An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research, 36, 111–150.CrossRef
go back to reference Browne, M. W., & Cudeck, R. (1993). Alternative ways of assessing model fit. In: K. A. Bollen & J. S. Long (Eds.), Testing structural equation models (pp. 136–162). Beverly Hills: Sage. Browne, M. W., & Cudeck, R. (1993). Alternative ways of assessing model fit. In: K. A. Bollen & J. S. Long (Eds.), Testing structural equation models (pp. 136–162). Beverly Hills: Sage.
go back to reference Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research, 1, 245–276.CrossRef Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research, 1, 245–276.CrossRef
go back to reference Conti, G., Frühwirth-Schnatter, S., Heckman, J. J., & Piatek, R. (2014). Bayesian exploratory factor analysis. Journal of Econometrics, 183, 31–57.MathSciNetCrossRef Conti, G., Frühwirth-Schnatter, S., Heckman, J. J., & Piatek, R. (2014). Bayesian exploratory factor analysis. Journal of Econometrics, 183, 31–57.MathSciNetCrossRef
go back to reference Drasgow, F. (1986). Polychoric and polyserial correlations. In: S. Kotz & N. L. Johnson (Eds.), Encyclopedia of statistical sciences (Vol. 7, pp. 68–74). New York: Wiley. Drasgow, F. (1986). Polychoric and polyserial correlations. In: S. Kotz & N. L. Johnson (Eds.), Encyclopedia of statistical sciences (Vol. 7, pp. 68–74). New York: Wiley.
go back to reference Epskamp, S. (2015). semPlot: Unified visualizations of structural equation models. Structural Equation Modeling: A Multidisciplinary Journal, 22, 474–483.MathSciNetCrossRef Epskamp, S. (2015). semPlot: Unified visualizations of structural equation models. Structural Equation Modeling: A Multidisciplinary Journal, 22, 474–483.MathSciNetCrossRef
go back to reference Finney, S. J., & DiStefano, C. (2013). Nonnormal and categorical data in structural equation modeling. In: G. R. Hancock & R. O. Mueller (Eds.), Structural equation modeling: A second course (2nd ed., pp. 439–492). Charlotte: Information Age Publishing. Finney, S. J., & DiStefano, C. (2013). Nonnormal and categorical data in structural equation modeling. In: G. R. Hancock & R. O. Mueller (Eds.), Structural equation modeling: A second course (2nd ed., pp. 439–492). Charlotte: Information Age Publishing.
go back to reference Hendrickson, A. E., & White, P. O. (1964). PROMAX: A quick method for rotation to oblique simple structure. British Journal of Mathematical and Statistical Psychology, 17, 65–70.CrossRef Hendrickson, A. E., & White, P. O. (1964). PROMAX: A quick method for rotation to oblique simple structure. British Journal of Mathematical and Statistical Psychology, 17, 65–70.CrossRef
go back to reference Ho, A. K., Sidanius, J., Kteily, N., Sheehy-Skeffington, J., Pratto, F., Henkel, K. E., Foels, R., & Stewart, A. L. (2015). The nature of social dominance orientation: Theorizing and measuring preferences for intergroup inequality using the new SDO7 scale. Journal of Personality and Social Psychology, 109, 1003–1028.CrossRef Ho, A. K., Sidanius, J., Kteily, N., Sheehy-Skeffington, J., Pratto, F., Henkel, K. E., Foels, R., & Stewart, A. L. (2015). The nature of social dominance orientation: Theorizing and measuring preferences for intergroup inequality using the new SDO7 scale. Journal of Personality and Social Psychology, 109, 1003–1028.CrossRef
go back to reference Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30, 179–185.CrossRef Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30, 179–185.CrossRef
go back to reference Hox, J. J. (2010). Multilevel analysis: Techniques and applications (2nd ed.). New York: Routledge.CrossRef Hox, J. J. (2010). Multilevel analysis: Techniques and applications (2nd ed.). New York: Routledge.CrossRef
go back to reference Hsu, H. Y., Kwok, O. M., Lin, J. H., & Acosta, S. (2015). Detecting misspecified multilevel structural equation models with common fit indices: A Monte Carlo study. Multivariate Behavioral Research, 50, 197–215.CrossRef Hsu, H. Y., Kwok, O. M., Lin, J. H., & Acosta, S. (2015). Detecting misspecified multilevel structural equation models with common fit indices: A Monte Carlo study. Multivariate Behavioral Research, 50, 197–215.CrossRef
go back to reference Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6, 1–55.CrossRef Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6, 1–55.CrossRef
go back to reference Jöreskog, K. G., & Goldberger, A. S. (1975). Estimation of a model with multiple indicators and multiple causes of a single latent variable. Journal of the American Statistical Association, 70, 631–639.MathSciNetMATH Jöreskog, K. G., & Goldberger, A. S. (1975). Estimation of a model with multiple indicators and multiple causes of a single latent variable. Journal of the American Statistical Association, 70, 631–639.MathSciNetMATH
go back to reference Kaiser, H. F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika, 23, 187–200.CrossRef Kaiser, H. F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika, 23, 187–200.CrossRef
go back to reference Kaiser, H. F. (1960). The application of electronic computers to factor analysis. Educational and Psychological Measurement, 20, 141–151.CrossRef Kaiser, H. F. (1960). The application of electronic computers to factor analysis. Educational and Psychological Measurement, 20, 141–151.CrossRef
go back to reference Kaplan, D. (2014). Bayesian statistics for the social sciences. New York: Guilford. Kaplan, D. (2014). Bayesian statistics for the social sciences. New York: Guilford.
go back to reference Kenny, D. A. (1979). Correlation and causality. New York: Wiley.MATH Kenny, D. A. (1979). Correlation and causality. New York: Wiley.MATH
go back to reference Kirk, D. B. (1973). On the numerical approximation of the bivariate normal (tetrachoric) correlation coefficient. Psychometrika, 38, 259–268.CrossRef Kirk, D. B. (1973). On the numerical approximation of the bivariate normal (tetrachoric) correlation coefficient. Psychometrika, 38, 259–268.CrossRef
go back to reference Kline, R. B. (2016). Principles and practice of structural equation modeling (4th ed.). New York: Guilford Press.MATH Kline, R. B. (2016). Principles and practice of structural equation modeling (4th ed.). New York: Guilford Press.MATH
go back to reference Kruschke, J. (2014). Doing Bayesian data analysis: A tutorial with R , JAGS , and Stan (2nd ed.). Cambridge: Academic.MATH Kruschke, J. (2014). Doing Bayesian data analysis: A tutorial with R , JAGS , and Stan (2nd ed.). Cambridge: Academic.MATH
go back to reference Little, T. D. (2013). Longitudinal structural equation modeling. New York: Guilford. Little, T. D. (2013). Longitudinal structural equation modeling. New York: Guilford.
go back to reference MacCallum, R. C. (2009). Factor analysis. In: R. E. Millsap, & A. Maydeu-Olivares (Eds.), The Sage handbook of quantitative methods in psychology (pp. 123–177) London: Sage.CrossRef MacCallum, R. C. (2009). Factor analysis. In: R. E. Millsap, & A. Maydeu-Olivares (Eds.), The Sage handbook of quantitative methods in psychology (pp. 123–177) London: Sage.CrossRef
go back to reference Mair, P., Hofmann, E., Gruber, K., Zeileis, A., & Hornik, K. (2015). Motivation, values, and work design as drivers of participation in the R open source project for statistical computing. Proceedings of the National Academy of Sciences of the United States of America 112, 14788–14792.CrossRef Mair, P., Hofmann, E., Gruber, K., Zeileis, A., & Hornik, K. (2015). Motivation, values, and work design as drivers of participation in the R open source project for statistical computing. Proceedings of the National Academy of Sciences of the United States of America 112, 14788–14792.CrossRef
go back to reference McDonald, R., & Mulaik, S. A. (1979). Determinacy of common factors: A nontechnical review. Psychological Bulletin, 86, 430–445.CrossRef McDonald, R., & Mulaik, S. A. (1979). Determinacy of common factors: A nontechnical review. Psychological Bulletin, 86, 430–445.CrossRef
go back to reference Merkle, E. C., & Rosseel, Y. (2018). blavaan: Bayesian structural equation models via parameter expansion. Journal of Statistical Software, 85(4), 1–30.CrossRef Merkle, E. C., & Rosseel, Y. (2018). blavaan: Bayesian structural equation models via parameter expansion. Journal of Statistical Software, 85(4), 1–30.CrossRef
go back to reference Muthén, B. O., & Hofacker, C. (1988). Testing the assumptions underlying tetrachoric correlations. Psychometrika, 83, 563–578.CrossRef Muthén, B. O., & Hofacker, C. (1988). Testing the assumptions underlying tetrachoric correlations. Psychometrika, 83, 563–578.CrossRef
go back to reference Revelle, W., & Rocklin, T. (1979). Very simple structure: An alternative procedure for estimating the optimal number of interpretable factors. Multivariate Behavioral Research, 14, 403–414.CrossRef Revelle, W., & Rocklin, T. (1979). Very simple structure: An alternative procedure for estimating the optimal number of interpretable factors. Multivariate Behavioral Research, 14, 403–414.CrossRef
go back to reference Rhemtulla, M., Brosseau-Liard, P. E., & Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17, 354–373.CrossRef Rhemtulla, M., Brosseau-Liard, P. E., & Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17, 354–373.CrossRef
go back to reference Savalei, V. (2011). What to do about zero frequency cells when estimating polychoric correlations. Structural Equation Modeling: A Multidisciplinary Journal, 18, 253–273.MathSciNetCrossRef Savalei, V. (2011). What to do about zero frequency cells when estimating polychoric correlations. Structural Equation Modeling: A Multidisciplinary Journal, 18, 253–273.MathSciNetCrossRef
go back to reference Sidanius, J., & Pratto, F. (2001). Social dominance: An intergroup theory of social hierarchy and oppression. Cambridge: Cambridge University Press. Sidanius, J., & Pratto, F. (2001). Social dominance: An intergroup theory of social hierarchy and oppression. Cambridge: Cambridge University Press.
go back to reference Sidanius, J., Levin, S., van Laar, C., & Sears, D. O. (2010). The diversity challenge: Social identity and intergroup relations on the college campus. New York: The Russell Sage Foundation. Sidanius, J., Levin, S., van Laar, C., & Sears, D. O. (2010). The diversity challenge: Social identity and intergroup relations on the college campus. New York: The Russell Sage Foundation.
go back to reference Tisak, J., & Meredith, W. (1990). Longitudinal factor analysis. In: A. von Eye (Ed.), Statistical methods in longitudinal research (Vol. 1, pp. 125–149). San Diego: Academic.CrossRef Tisak, J., & Meredith, W. (1990). Longitudinal factor analysis. In: A. von Eye (Ed.), Statistical methods in longitudinal research (Vol. 1, pp. 125–149). San Diego: Academic.CrossRef
go back to reference Treiblmaier, H. (2006). Datenqualität und individualisierte Kommunikation [Data Quality and Individualized Communication]. Wiesbaden: DUV Gabler Edition Wissenschaft. Treiblmaier, H. (2006). Datenqualität und individualisierte Kommunikation [Data Quality and Individualized Communication]. Wiesbaden: DUV Gabler Edition Wissenschaft.
go back to reference Treiblmaier, H., Bentler, P. M., & Mair, P. (2011). Formative constructs implemented via common factors. Structural Equation Modeling: A Multidisciplinary Journal 18, 1–17.MathSciNetCrossRef Treiblmaier, H., Bentler, P. M., & Mair, P. (2011). Formative constructs implemented via common factors. Structural Equation Modeling: A Multidisciplinary Journal 18, 1–17.MathSciNetCrossRef
go back to reference Tucker, L. R., & Lewis, C. (1973). A reliability coefficient for maximum likelihood factor analysis. Psychometrika, 38, 1–10.CrossRef Tucker, L. R., & Lewis, C. (1973). A reliability coefficient for maximum likelihood factor analysis. Psychometrika, 38, 1–10.CrossRef
go back to reference Vaughn-Coaxum, R., Mair, P., & Weisz, J. R. (2016) Racial/ethnic differences in youth depression indicators: An item response theory analysis of symptoms reported by White, Black, Asian, and Latino youths. Clinical Psychological Science 4, 239–253.CrossRef Vaughn-Coaxum, R., Mair, P., & Weisz, J. R. (2016) Racial/ethnic differences in youth depression indicators: An item response theory analysis of symptoms reported by White, Black, Asian, and Latino youths. Clinical Psychological Science 4, 239–253.CrossRef
go back to reference Velicer, W. F. (1976). Determining the number of components from the matrix of partial correlations. Psychometrika, 41, 321–327.CrossRef Velicer, W. F. (1976). Determining the number of components from the matrix of partial correlations. Psychometrika, 41, 321–327.CrossRef
Metadata
Title
Factor Analysis
Author
Patrick Mair
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-93177-7_2

Premium Partner