Skip to main content
Top
Published in: Advances in Data Analysis and Classification 3/2013

01-09-2013 | Regular Article

A Monte Carlo evaluation of three methods to detect local dependence in binary data latent class models

Authors: Daniel L. Oberski, Geert H. van Kollenburg, Jeroen K. Vermunt

Published in: Advances in Data Analysis and Classification | Issue 3/2013

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Binary data latent class analysis is a form of model-based clustering applied in a wide range of fields. A central assumption of this model is that of conditional independence of responses given latent class membership, often referred to as the “local independence” assumption. The results of latent class analysis may be severely biased when this crucial assumption is violated; investigating the degree to which bivariate relationships between observed variables fit this hypothesis therefore provides vital information. This article evaluates three methods of doing so. The first is the commonly applied method of referring the so-called “bivariate residuals” to a Chi-square distribution. We also introduce two alternative methods that are novel to the investigation of local dependence in latent class analysis: bootstrapping the bivariate residuals, and the asymptotic score test or “modification index”. Our Monte Carlo simulation indicates that the latter two methods perform adequately, while the first method does not perform as intended.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Ahlquist JS, Breunig C (2012) Model-based clustering and typologies in the social sciences. Polit Anal 20(1):92–112CrossRef Ahlquist JS, Breunig C (2012) Model-based clustering and typologies in the social sciences. Polit Anal 20(1):92–112CrossRef
go back to reference Albert P, Dodd L (2004) A cautionary note on the robustness of latent class models for estimating diagnostic error without a gold standard. Biometrics 60(2):427–435MathSciNetMATHCrossRef Albert P, Dodd L (2004) A cautionary note on the robustness of latent class models for estimating diagnostic error without a gold standard. Biometrics 60(2):427–435MathSciNetMATHCrossRef
go back to reference Baughman A, Bisgard K, Cortese M, Thompson W, Sanden G, Strebel P (2008) Utility of composite reference standards and latent class analysis in evaluating the clinical accuracy of diagnostic tests for pertussis. Clin Vaccine Immunol 15(1):106–114CrossRef Baughman A, Bisgard K, Cortese M, Thompson W, Sanden G, Strebel P (2008) Utility of composite reference standards and latent class analysis in evaluating the clinical accuracy of diagnostic tests for pertussis. Clin Vaccine Immunol 15(1):106–114CrossRef
go back to reference Chen F, Mackey A, Vermunt J, Roos D (2007) Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS One 2(4):e383CrossRef Chen F, Mackey A, Vermunt J, Roos D (2007) Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS One 2(4):e383CrossRef
go back to reference Collins LM, Lanza ST (2010) Latent class and latent transition analysis: with applications in the social, behavioral, and health sciences, vol 718. Wiley, New York Collins LM, Lanza ST (2010) Latent class and latent transition analysis: with applications in the social, behavioral, and health sciences, vol 718. Wiley, New York
go back to reference Efron B (1982) The Jackknife, the bootstrap, and other resampling plans. In: Proceedings of the CBMS-NSF Regional Conference Series in Applied Mathematics, Society for Industrial and Applied Mathematics (SIAM), Philadelphia Efron B (1982) The Jackknife, the bootstrap, and other resampling plans. In: Proceedings of the CBMS-NSF Regional Conference Series in Applied Mathematics, Society for Industrial and Applied Mathematics (SIAM), Philadelphia
go back to reference Evers M, Namboodiri N (1979) On the design matrix strategy in the analysis of categorical data. Sociol Methodol 10:86–111CrossRef Evers M, Namboodiri N (1979) On the design matrix strategy in the analysis of categorical data. Sociol Methodol 10:86–111CrossRef
go back to reference Faraone S, Tsuang M (1994) Measuring diagnostic accuracy in. Am J Psychiatry 1(51):651 Faraone S, Tsuang M (1994) Measuring diagnostic accuracy in. Am J Psychiatry 1(51):651
go back to reference Forcina A (2008) Identifiability of extended latent class models with individual covariates. Comput Stat Data Anal 52(12):5263–5268MathSciNetMATHCrossRef Forcina A (2008) Identifiability of extended latent class models with individual covariates. Comput Stat Data Anal 52(12):5263–5268MathSciNetMATHCrossRef
go back to reference Formann A (1992) Linear logistic latent class analysis for polytomous data. J Am Stat Assoc 87(418): 476–486 Formann A (1992) Linear logistic latent class analysis for polytomous data. J Am Stat Assoc 87(418): 476–486
go back to reference Gaffikin L, McGrath J, Arbyn M, Blumenthal P (2007) Visual inspection with acetic acid as a cervical cancer test: accuracy validated using latent class analysis. BMC Med Res Methodol 7(1):36CrossRef Gaffikin L, McGrath J, Arbyn M, Blumenthal P (2007) Visual inspection with acetic acid as a cervical cancer test: accuracy validated using latent class analysis. BMC Med Res Methodol 7(1):36CrossRef
go back to reference Gallego A, Oberski D (2012) Personality and political participation: the mediation hypothesis. Polit Behav 34:424–451CrossRef Gallego A, Oberski D (2012) Personality and political participation: the mediation hypothesis. Polit Behav 34:424–451CrossRef
go back to reference Glas C (1998) Detection of differential item functioning using Lagrange multiplier tests. Stat Sinica 8: 647–668 Glas C (1998) Detection of differential item functioning using Lagrange multiplier tests. Stat Sinica 8: 647–668
go back to reference Glas C (1999) Modification indices for the 2-PL and the nominal response model. Psychometrika 64(3): 273–294 Glas C (1999) Modification indices for the 2-PL and the nominal response model. Psychometrika 64(3): 273–294
go back to reference Hadgu A, Dendukuri N, Hilden J (2005) Evaluation of nucleic acid amplification tests in the absence of a perfect gold-standard test: a review of the statistical and epidemiologic issues. Epidemiology 16(5): 604–612 Hadgu A, Dendukuri N, Hilden J (2005) Evaluation of nucleic acid amplification tests in the absence of a perfect gold-standard test: a review of the statistical and epidemiologic issues. Epidemiology 16(5): 604–612
go back to reference Hagenaars JAP (1988) Latent structure models with direct effects between indicators local dependence models. Sociol Methods Res 16(3):379–405CrossRef Hagenaars JAP (1988) Latent structure models with direct effects between indicators local dependence models. Sociol Methods Res 16(3):379–405CrossRef
go back to reference Hagenaars JAP, McCutcheon AL (2002) Applied latent class analysis. Cambridge University Press, Cambridge Hagenaars JAP, McCutcheon AL (2002) Applied latent class analysis. Cambridge University Press, Cambridge
go back to reference Heinen T (1996) Latent class and discrete latent trait models: similarities and differences. Sage, Thousand Oaks Heinen T (1996) Latent class and discrete latent trait models: similarities and differences. Sage, Thousand Oaks
go back to reference Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1): 177–196 Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1): 177–196
go back to reference Huang G, Bandeen-Roche K (2004) Building an identifiable latent class model with covariate effects on underlying and measured variables. Psychometrika 69(1):5–32MathSciNetCrossRef Huang G, Bandeen-Roche K (2004) Building an identifiable latent class model with covariate effects on underlying and measured variables. Psychometrika 69(1):5–32MathSciNetCrossRef
go back to reference Hybels C, Blazer D, Pieper C, Landerman L, Steffens D (2009) Profiles of depressive symptoms in older adults diagnosed with major depression: a latent cluster analysis. Am J Geriatr Psychiatry 17(5):387CrossRef Hybels C, Blazer D, Pieper C, Landerman L, Steffens D (2009) Profiles of depressive symptoms in older adults diagnosed with major depression: a latent cluster analysis. Am J Geriatr Psychiatry 17(5):387CrossRef
go back to reference Langeheine R, Pannekoek J, Van de Pol F (1996) Bootstrapping goodness-of-fit measures in categorical data analysis. Sociol Methods Res 24(4):492–516CrossRef Langeheine R, Pannekoek J, Van de Pol F (1996) Bootstrapping goodness-of-fit measures in categorical data analysis. Sociol Methods Res 24(4):492–516CrossRef
go back to reference Laumann EO, Paik A, Rosen RC (1999) Sexual dysfunction in the United States. JAMA 281(6):537–544CrossRef Laumann EO, Paik A, Rosen RC (1999) Sexual dysfunction in the United States. JAMA 281(6):537–544CrossRef
go back to reference Maydeu-Olivares A, Joe H (2005) Limited-and full-information estimation and goodness-of-fit testing in \(2^n\) contingency tables. J Am Stat Assoc 100(471):1009–1020MathSciNetMATHCrossRef Maydeu-Olivares A, Joe H (2005) Limited-and full-information estimation and goodness-of-fit testing in \(2^n\) contingency tables. J Am Stat Assoc 100(471):1009–1020MathSciNetMATHCrossRef
go back to reference McLachlan G, Peel D (2000) Finite mixture models volume 299. Wiley-Interscience, New YorkCrossRef McLachlan G, Peel D (2000) Finite mixture models volume 299. Wiley-Interscience, New YorkCrossRef
go back to reference Nyholt D, Gillespie N, Heath A, Merikangas K, Duffy D, Martin N (2004) Latent class and genetic analysis does not support migraine with aura and migraine without aura as separate entities. Genet Epidemiol 26(3):231–244CrossRef Nyholt D, Gillespie N, Heath A, Merikangas K, Duffy D, Martin N (2004) Latent class and genetic analysis does not support migraine with aura and migraine without aura as separate entities. Genet Epidemiol 26(3):231–244CrossRef
go back to reference R Core Team (2012) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0 R Core Team (2012) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0
go back to reference Rao CR (1948) Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. In: Proceedings of the Cambridge philosophical society, vol 44, pp 50–57. Cambridge University Press, Cambridge Rao CR (1948) Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. In: Proceedings of the Cambridge philosophical society, vol 44, pp 50–57. Cambridge University Press, Cambridge
go back to reference Saris W, Satorra A, Sörbom D (1987) The detection and correction of specification errors in structural equation models. Sociol Methodol 17:105–129CrossRef Saris W, Satorra A, Sörbom D (1987) The detection and correction of specification errors in structural equation models. Sociol Methodol 17:105–129CrossRef
go back to reference Satorra A (1989) Alternative test criteria in covariance structure analysis: a unified approach. Psychometrika 54(1):131–151MathSciNetCrossRef Satorra A (1989) Alternative test criteria in covariance structure analysis: a unified approach. Psychometrika 54(1):131–151MathSciNetCrossRef
go back to reference Savage M, Devine F, Cunningham N, Taylor M, Li Y, Hjellbrekke J, Le Roux B, Friedman S, Miles A (2013) A new model of social class? Findings from the BBC’s Great British Class Survey Experiment. Sociology 47(2):219–250CrossRef Savage M, Devine F, Cunningham N, Taylor M, Li Y, Hjellbrekke J, Le Roux B, Friedman S, Miles A (2013) A new model of social class? Findings from the BBC’s Great British Class Survey Experiment. Sociology 47(2):219–250CrossRef
go back to reference Tay L, Newman D, Vermunt J (2011) Using mixed-measurement item response theory with covariates (MM-IRT-C) to ascertain observed and unobserved measurement equivalence. Organ Res Methods 14(1):147–176CrossRef Tay L, Newman D, Vermunt J (2011) Using mixed-measurement item response theory with covariates (MM-IRT-C) to ascertain observed and unobserved measurement equivalence. Organ Res Methods 14(1):147–176CrossRef
go back to reference Torrance-Rynard V, Walter S (1998) Effects of dependent errors in the assessment of diagnostic test performance. Stat Med 16(19):2157–2175CrossRef Torrance-Rynard V, Walter S (1998) Effects of dependent errors in the assessment of diagnostic test performance. Stat Med 16(19):2157–2175CrossRef
go back to reference Vacek P (1985) The effect of conditional dependence on the evaluation of diagnostic tests. Biometrics 41(4):959–968 Vacek P (1985) The effect of conditional dependence on the evaluation of diagnostic tests. Biometrics 41(4):959–968
go back to reference van der Linden W, Glas C (2010) Statistical tests of conditional independence between responses and/or response times on test items. Psychometrika 75(1):120–139 van der Linden W, Glas C (2010) Statistical tests of conditional independence between responses and/or response times on test items. Psychometrika 75(1):120–139
go back to reference Vermunt JK, Magidson J (2005) Technical guide for latent GOLD 4.0: Basic and advanced. Statistical Innovations Inc, Belmont Vermunt JK, Magidson J (2005) Technical guide for latent GOLD 4.0: Basic and advanced. Statistical Innovations Inc, Belmont
go back to reference Walter S, Irwig L (1988) Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review. J Clin Epidemiol 41(9):923–937CrossRef Walter S, Irwig L (1988) Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review. J Clin Epidemiol 41(9):923–937CrossRef
go back to reference Walter SD, Riddell CA, Rabachini T, Villa LL, Franco EL (2013) Accuracy of p53 codon 72 polymorphism status determined by multiple laboratory methods: a latent class model analysis. PloS one 8(2):e56430CrossRef Walter SD, Riddell CA, Rabachini T, Villa LL, Franco EL (2013) Accuracy of p53 codon 72 polymorphism status determined by multiple laboratory methods: a latent class model analysis. PloS one 8(2):e56430CrossRef
go back to reference White N, Johnson H, Silburn P, Mellick G, Dissanayaka N, Mengersen K (2012) Probabilistic subgroup identification using bayesian finite mixture modelling: a case study in Parkinson’s disease phenotype identification. Stat Methods Med Res 21(6):563–583CrossRef White N, Johnson H, Silburn P, Mellick G, Dissanayaka N, Mengersen K (2012) Probabilistic subgroup identification using bayesian finite mixture modelling: a case study in Parkinson’s disease phenotype identification. Stat Methods Med Res 21(6):563–583CrossRef
Metadata
Title
A Monte Carlo evaluation of three methods to detect local dependence in binary data latent class models
Authors
Daniel L. Oberski
Geert H. van Kollenburg
Jeroen K. Vermunt
Publication date
01-09-2013
Publisher
Springer Berlin Heidelberg
Published in
Advances in Data Analysis and Classification / Issue 3/2013
Print ISSN: 1862-5347
Electronic ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-013-0146-2

Other articles of this Issue 3/2013

Advances in Data Analysis and Classification 3/2013 Go to the issue

Premium Partner