Psychometric Engineering as Art: Variations on a Theme

Thissen, David

doi:10.1007/978-4-431-66996-8_1

David Thissen¹

637 Accesses

Summary

The Psychometric Society is “devoted to the development of Psychology as a quantitative rational science.” Engineering is often set in contradistinction with science; art is sometimes considered different from science. Why, then, juxtapose the words in the title: psychometric, engineering,and art? Because an important aspect of quantitative psychology is problem-solving, and engineering solves problems. And an essential aspect of a good solution is beauty—hence, art. In overview and with examples, this presentation describes activities that are quantitative psychology as engineering and art—that is, as design. Extended illustrations involve systems for scoring tests in realistic contexts. Allusions are made to other examples that extend the conception of quantitative psychology as engineering and art across a wider range of psychometric activities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barr AH (1946) Picasso: Fifty years of his art. The Museum of Modem Art, New York
Google Scholar
Berkson J (1944) Application of the logistic function to bio-assay. Journal of the American Statistical Association 39: 357–375.
MathSciNet Google Scholar
Berkson J (1953) A statistically precise and relatively simple method of estimating the bioassay with quantal response, based on the logistic function. Journal of the American Statistical Association 48: 565–599.
MATH Google Scholar
Bimbaum A (1968) Some latent trait models and their use in inferring an examinee’s ability. In: Lord FM, Novick MR, Statistical theories of mental test scores. Addison-Wesley, Reading, MA, pp 395–479
Google Scholar
Bock RD, Aitkin M (1981) Marginal maximum likelihood estimation of item parameters: an application of the EM algorithm. Psychometrika 46: 443–459
Article MathSciNet Google Scholar
Bock RD, Lieberman M (1970) Fitting a response model for n dichotomously scored items. Psychometrika 35: 179–197
Article Google Scholar
Bock RD, Mislevy RJ (1982) Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement 6: 431–444
Article Google Scholar
Box GEP (1979) Some problems of statistics and everyday life. Journal of the American Statistical Association 74: 1–4
Article Google Scholar
Brooks FP (1996) The computer scientist as toolsmith II. Communications of the ACM 39: 61–68
Article Google Scholar
Brooks FP (in press) The design of design. Communications of the ACM
Google Scholar
Chen WH (1995) Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores. Unpublished doctoral dissertation, The University of North Carolina at Chapel Hill
Google Scholar
Chen WH, Thissen D (1999) Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores. British Journal of Mathematical and Statistical Psychology 52: 19–37
Article Google Scholar
Cronbach LJ, Gleser GC, Nanda H, Rajaratnam N (1972) The dependability of behavioral measurements: Theory of generalizability for scores and profiles. John Wiley & Sons, New York
Google Scholar
Finney DJ (1952) Probit analysis: A statistical treatment of the sigmoid response curve. Cambridge University Press, London
MATH Google Scholar
Goldstein A (2001, March 12) Making another big score. Time 157: 66–67
Google Scholar
Henriques DB, Steinberg J (2001, May 20) Errors plague testing industry. The New York Times, pp Al, A22–23
Google Scholar
Jones LV (1998) LL Thurstone’s vision of psychology as a quantitative rational science. In: Kimble GA, Wertheimer M (eds) Portraits of pioneers in psychology, vol III. Lawrence Erlbaum Associates, Mahwah, NJ, pp 84–102
Google Scholar
Kelley TL (1927) The interpretation of educational measurements. World Book, New York
Google Scholar
Kelley TL (1947) Fundamentals of statistics. Harvard University Press, Cambridge, MA
Google Scholar
Lazarsfeld PF (1950) The logical and mathematical foundation of latent structure analysis. In: Stouffer SA, Guttman L, Suchman EA, Lazarsfeld PF, Star, SA, Clausen JA, Measurement and Prediction. Wiley, New York, pp 362–412
Google Scholar
Laidlaw DH, Fleischer KW, Barr AH (1995, September) Bayesian Mixture Classification of MRI Data for Geometric Modeling and Visualization. Poster presented at the First International Workshop on Statistical Mixture Modeling, Aussois, France. (Retrieved from the Worldwide Web: http://www.gg.caltech.edu/–dhl/aussois/paper.html)
Google Scholar
Lewis B (1996, March 15) IS survival guide. Infoworld 21: 96
Google Scholar
Lord FM (1953) The relation of test score to the trait underlying the test. Educational and Psychological Measurement 13: 517–548
Article Google Scholar
Lord FM, Novick M (1968) Statistical theories of mental test scores. Addison Wesley, Reading, MA
MATH Google Scholar
Lord FM, Wingersky MS (1984) Comparison of IRT true-score and equipercentile observed-score “equatings.” Applied Psychological Measurement 8: 453–461
Article Google Scholar
Mislevy RM, Johnson EG, Muraki E (1992) Scaling procedures in NAEP. Journal of Educational Statistics 17: 131–154
Article Google Scholar
Muraki E (1992) A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement 16: 159–176
Article Google Scholar
Muraki E (1997) A generalized partial credit model. In: van der Linden W, Hambleton RK (eds) Handbook of modern item response theory. Springer, New York, pp 153–164
Chapter Google Scholar
Orlando M (1997) Item fit in the context of item response theory. Unpublished doctoral dissertation, The University of North Carolina at Chapel Hill
Google Scholar
Orlando M, Thissen D (2000) New item fit indices for dichotomous item response theory models. Applied Psychological Measurement 24: 50–64
Article Google Scholar
Picasso P (1923) Picasso speaks—A statement by the artist. The Arts 3: 315–326
Google Scholar
Raz J, Turetsky BI, Dickerson LW (2001) Inference for a random wavelet packet model of single-channel event-related potentials. Journal of the American Statistical Association 96: 409–420
Article MathSciNet MATH Google Scholar
Robbins H (1952) Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society 58: 527–535
Article MathSciNet MATH Google Scholar
Rosa K, Swygert K, Nelson L, Thissen D (2001) Item response theory applied to combinations of multiple-choice and constructed-response items—scale scores for patterns of summed scores. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 253–292
Google Scholar
Samejima F (1969) Estimation of latent ability using a response pattern of graded scores. Psychometric Monograph No 17
Google Scholar
Samejima F (1997) Graded response model. In: van der Linden W, Hambleton RK (eds) Handbook of modern item response theory. Springer, New York, pp 85–100
Chapter Google Scholar
Thissen D, Flora D, Reeve B, & Vevea J L (2000, July) Linear Approximations for Item Response Theory Response Pattern Scores. Paper presented at the annual meeting of the Psychometric Society, Vancouver, BC, Canada
Google Scholar
Thissen D, Nelson L, Rosa K, McLeod LD (2001) Item response theory for items scored in more than two categories. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 141–186
Google Scholar
Thissen D, Nelson L, Swygert K (2001) Item response theory applied to combinations of multiple-choice and constructed-response items—approximation methods for scale scores. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 293–341
Google Scholar
Thissen D, Orlando M (2001) Item response theory for items scored in two categories. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 73–140
Google Scholar
Thissen D, Pommerich M, Billeaud K, Williams VSL (1995) Item response theory for scores on tests including polytomous items with ordered responses. Applied Psychological Measurement 19: 39–49
Article Google Scholar
Thissen D & Wainer H (eds) (2001) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ
Google Scholar
Thurstone LL (1925) A method of scaling psychological and educational tests. Journal of Educational Psychology 16: 433–449
Article Google Scholar
Thurstone LL (1927) The law of comparative judgment. Psychological Review 34: 278–286
Google Scholar
Thurstone LL (1937) Psychology as a quantitative rational science. Science 85: 227–232
Article Google Scholar
Thurstone LL (1938) Primary mental abilities. University of Chicago Press, Chicago, IL
Google Scholar
Tukey JW (1961) Data analysis and behavioral science or learning to bear the quantitative man’s burden by shunning badmandments. Unpublished ms reprinted in LV Jones (ed) (1986), The collected works of John W Tukey, Vol III, Philosophy and principles of data analysis: 1949–1964. Wadsworth & Brooks-Cole, Monterey, CA, pp 187–389
Google Scholar
Tukey JW (1962) The future of data analysis. Annals of Mathematical Statistics 33:1–67. (Reprinted in LV Jones (ed) (1986), The collected works of John W Tukey, Vol III, Philosophy and principles of data analysis: 1949–1964. Wadsworth & Brooks-Cole, Monterey, CA, pp 391–484 )
Google Scholar
Wainer H, Vevea JL, Camacho F, Reeve B, Rosa K, Nelson L, Swygert K, Thissen D (2001) Augmented scores—“borrowing strength” to compute scores based on small numbers of items. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 343–387
Google Scholar
Williams VSL, Pommerich M, Thissen D (1998) A comparison of developmental scales based on Thurstone methods and item response theory. Journal of Educational Measurement 35: 93–107
Article Google Scholar
Yen WM (1984) Obtaining maximum likelihood trait estimates from number-correct scores for the three-parameter logistic model. Journal of Educational Measurement 21: 93–111
Article Google Scholar

Download references

Author information

Authors and Affiliations

L.L. Thurstone Psychometric Laboratory, University of North Carolina at Chapel Hill, Davie Hall, CB#3270, Chapel Hill, NC, 27599, USA
David Thissen

Authors

David Thissen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

H. Yanai A. Okada K. Shigemasu Y. Kano J. J. Meulman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Thissen, D. (2003). Psychometric Engineering as Art: Variations on a Theme. In: Yanai, H., Okada, A., Shigemasu, K., Kano, Y., Meulman, J.J. (eds) New Developments in Psychometrics. Springer, Tokyo. https://doi.org/10.1007/978-4-431-66996-8_1

Download citation

DOI: https://doi.org/10.1007/978-4-431-66996-8_1
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-66998-2
Online ISBN: 978-4-431-66996-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics