Skip to main content
Log in

Randomization-based inference about latent variables from complex samples

  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

Standard procedures for drawing inferences from complex samples do not apply when the variable of interestϑ cannot be observed directly, but must be inferred from the values of secondary random variables that depend onϑ stochastically. Examples are proficiency variables in item response models and class memberships in latent class models. Rubin's “multiple imputation” techniques yield approximations of sample statistics that would have been obtained, hadϑ been observable, and associated variance estimates that account for uncertainty due to both the sampling of respondents and the latent nature ofϑ. The approach is illustrated with data from the National Assessment for Educational Progress.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Andersen, E. B., & Madsen, M. (1977). Estimating the parameters of a latent population distribution.Psychometrika, 42, 357–374.

    Google Scholar 

  • Beaton, A. E. (1987).The NAEP 1983/84 technical report (NAEP Report 15-TR-20). Princeton: Educational Testing Service.

    Google Scholar 

  • Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: An application of an EM algorithm.Psychometrika, 46, 443–459.

    Google Scholar 

  • Bock, R. D., Mislevy, R. J., & Woodson, C. E. M. (1982). The next stage in educational assessment.Educational Researcher, 11, 4–11, 16.

    Google Scholar 

  • Cassel, C. M., Särndal, C. E., & Wretman, J. H. (1977).Foundations of inference in survey sampling. New York: Wiley.

    Google Scholar 

  • Cochran, W. G. (1977).Sampling techniques. New York: Wiley.

    Google Scholar 

  • Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion).Journal of the Royal Statistical Society, Series B, 39, 1–38.

    Google Scholar 

  • Ford, B. L. (1983). An overview of hot-deck procedures. In W. G. Madow, I. Olkin, & D. B. Rubin (Eds.),Incomplete data in sample surveys, Volume 2, Theory and bibliographies (pp. 185–207). New York: Academic Press.

    Google Scholar 

  • Goldstein, H., & McDonald, R. P. (1988). A general model for the analysis of multilevel data.Psychometrika, 53, 455–467.

    Google Scholar 

  • Johnson, E. G. (1987). Estimation of uncertainty due to sampling variation. In A. E. Beaton,The NAEP 1983/84 technical report (NAEP Report 15-TR-20, pp. 505–512). Princeton: Educational Testing Service.

    Google Scholar 

  • Kelley, T. L. (1923).Statistical method. New York: Macmillan.

    Google Scholar 

  • Kirsch, I. S., & Jungeblut, A. (1986).Literacy: Profile of America's young adults (Final Report, No. 16-PL-01). Princeton, NJ: National Assessment of Education Progress.

    Google Scholar 

  • Laird, N. M. (1978). Nonparametric maximum likelihood estimation of a mixing distribution.Journal of the American Statistical Association, 73, 805–811.

    Google Scholar 

  • Little, R. J. A., & Rubin, D. B. (1987).Statistical analysis with missing data. New York: Wiley.

    Google Scholar 

  • Lord, F. M. (1965). A strong true-score theory, with applications.Psychometrika, 30, 239–270.

    Google Scholar 

  • Lord, F. M. (1969). Estimating true score distributions in psychological testing (An empirical Bayes problem).Psychometrika, 34, 259–299.

    Google Scholar 

  • Mislevy, R. J. (1984). Estimating latent distributions.Psychometrika, 49, 359–381.

    Google Scholar 

  • Mislevy, R. J. (1985). Estimation of latent group effects.Journal of the American Statistical Association, 80, 993–997.

    Google Scholar 

  • Mislevy, R. J., & Bock, R. D. (1983).BILOG: Item analysis and test scoring with binary logistic models [computer program]. Mooresville, IN: Scientific Software.

    Google Scholar 

  • Mislevy, R. J., & Sheehan, K. M. (1987). Marginal estimation procedures. In A. E. Beaton,The NAEP 1983/84 technical report (NAEP Report 15-TR-20, pp. 293–360). Princeton: Educational Testing Service.

    Google Scholar 

  • Muthén, B., & Satorra, A. (1989). Multilevel aspects of varying parameters in structural models. In R. D. Bock (Ed.),Multilevel analysis of educational data (pp. 87–99). San Diego: Academic Press.

    Google Scholar 

  • National Assessment of Educational Progress (1985).The reading report card: Progress toward excellence in our schools. Princeton: Educational Testing Service.

    Google Scholar 

  • Rigdon, S. E., & Tsutakawa, R. K. (1983). Parameter estimation in latent trait models.Psychometrika, 48, 567–574.

    Google Scholar 

  • Rubin, D. B. (1977). Formalizing subjective notions about the effect of nonrespondents in sample surveys.Journal of the American Statistical Association, 72, 538–543.

    Google Scholar 

  • Rubin, D. B. (1987).Multiple imputation for nonresponse in surveys. New York: Wiley.

    Google Scholar 

  • Sanathanan, L., & Blumenthal, N. (1978). The logistic model and estimation of latent structure.Journal of the American Statistical Association, 73, 794–798.

    Google Scholar 

  • Sheehan, K. M. (1985).M-GROUP: Estimation of group effects in multivariate models [computer program]. Princeton: Educational Testing Service.

    Google Scholar 

  • Tanner, M., & Wong, W. (1987). The calculation of posterior distributions by data augmentation (with discussion).Journal of the American Statistical Association, 82, 528–550.

    Google Scholar 

  • Zwick, R. (1987). Assessing the dimensionality of NAEP reading data.Journal of Educational Measurement, 24, 293–308.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

This research was supported by Grant No. NIE-G-83-0011 of the Office for Educational Research and Improvement, Center for Education Statistics, and Contract No. N00014-88-K-0304, R&T 4421552 from the Cognitive Sciences Program, Cognitive and Neural Sciences Division, Office of Naval Research. It does not necessarily reflect the views of either agency. I am grateful to R. Darrell Bock for calling my attention to the applicability of multiple imputation to the assessment setting; to Albert Beaton and Eugene Johnson for enlightening discussions on the topic; and to Henry Braun, Ben King, Debra Kline, Gary Phillips, Paul Rosenbaum, Don Rubin, John Tukey, Ming-Mei Wang, Kentaro Yamamoto, Rebecca Zwick, and two anonymous reviewers for comments on earlier drafts. Example 4 is based on the analysis of the 1984 National Assessment for Educational Progress reading survey, carried out at Educational Testing Service through the tireless efforts of too many people to mention by name, under the direction of Albert Beaton, Director of NAEP Data Analyses. David Freund, Bruce Kaplan, and Jennifer Nelson conducted additional analyses of the 1984 and 1988 data for the example.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mislevy, R.J. Randomization-based inference about latent variables from complex samples. Psychometrika 56, 177–196 (1991). https://doi.org/10.1007/BF02294457

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02294457

Key words

Navigation