Skip to main content

Psychometric Engineering as Art: Variations on a Theme

  • Conference paper
New Developments in Psychometrics
  • 637 Accesses

Summary

The Psychometric Society is “devoted to the development of Psychology as a quantitative rational science.” Engineering is often set in contradistinction with science; art is sometimes considered different from science. Why, then, juxtapose the words in the title: psychometric, engineering,and art? Because an important aspect of quantitative psychology is problem-solving, and engineering solves problems. And an essential aspect of a good solution is beauty—hence, art. In overview and with examples, this presentation describes activities that are quantitative psychology as engineering and art—that is, as design. Extended illustrations involve systems for scoring tests in realistic contexts. Allusions are made to other examples that extend the conception of quantitative psychology as engineering and art across a wider range of psychometric activities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Barr AH (1946) Picasso: Fifty years of his art. The Museum of Modem Art, New York

    Google Scholar 

  • Berkson J (1944) Application of the logistic function to bio-assay. Journal of the American Statistical Association 39: 357–375.

    MathSciNet  Google Scholar 

  • Berkson J (1953) A statistically precise and relatively simple method of estimating the bioassay with quantal response, based on the logistic function. Journal of the American Statistical Association 48: 565–599.

    MATH  Google Scholar 

  • Bimbaum A (1968) Some latent trait models and their use in inferring an examinee’s ability. In: Lord FM, Novick MR, Statistical theories of mental test scores. Addison-Wesley, Reading, MA, pp 395–479

    Google Scholar 

  • Bock RD, Aitkin M (1981) Marginal maximum likelihood estimation of item parameters: an application of the EM algorithm. Psychometrika 46: 443–459

    Article  MathSciNet  Google Scholar 

  • Bock RD, Lieberman M (1970) Fitting a response model for n dichotomously scored items. Psychometrika 35: 179–197

    Article  Google Scholar 

  • Bock RD, Mislevy RJ (1982) Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement 6: 431–444

    Article  Google Scholar 

  • Box GEP (1979) Some problems of statistics and everyday life. Journal of the American Statistical Association 74: 1–4

    Article  Google Scholar 

  • Brooks FP (1996) The computer scientist as toolsmith II. Communications of the ACM 39: 61–68

    Article  Google Scholar 

  • Brooks FP (in press) The design of design. Communications of the ACM

    Google Scholar 

  • Chen WH (1995) Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores. Unpublished doctoral dissertation, The University of North Carolina at Chapel Hill

    Google Scholar 

  • Chen WH, Thissen D (1999) Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores. British Journal of Mathematical and Statistical Psychology 52: 19–37

    Article  Google Scholar 

  • Cronbach LJ, Gleser GC, Nanda H, Rajaratnam N (1972) The dependability of behavioral measurements: Theory of generalizability for scores and profiles. John Wiley & Sons, New York

    Google Scholar 

  • Finney DJ (1952) Probit analysis: A statistical treatment of the sigmoid response curve. Cambridge University Press, London

    MATH  Google Scholar 

  • Goldstein A (2001, March 12) Making another big score. Time 157: 66–67

    Google Scholar 

  • Henriques DB, Steinberg J (2001, May 20) Errors plague testing industry. The New York Times, pp Al, A22–23

    Google Scholar 

  • Jones LV (1998) LL Thurstone’s vision of psychology as a quantitative rational science. In: Kimble GA, Wertheimer M (eds) Portraits of pioneers in psychology, vol III. Lawrence Erlbaum Associates, Mahwah, NJ, pp 84–102

    Google Scholar 

  • Kelley TL (1927) The interpretation of educational measurements. World Book, New York

    Google Scholar 

  • Kelley TL (1947) Fundamentals of statistics. Harvard University Press, Cambridge, MA

    Google Scholar 

  • Lazarsfeld PF (1950) The logical and mathematical foundation of latent structure analysis. In: Stouffer SA, Guttman L, Suchman EA, Lazarsfeld PF, Star, SA, Clausen JA, Measurement and Prediction. Wiley, New York, pp 362–412

    Google Scholar 

  • Laidlaw DH, Fleischer KW, Barr AH (1995, September) Bayesian Mixture Classification of MRI Data for Geometric Modeling and Visualization. Poster presented at the First International Workshop on Statistical Mixture Modeling, Aussois, France. (Retrieved from the Worldwide Web: http://www.gg.caltech.edu/–dhl/aussois/paper.html)

    Google Scholar 

  • Lewis B (1996, March 15) IS survival guide. Infoworld 21: 96

    Google Scholar 

  • Lord FM (1953) The relation of test score to the trait underlying the test. Educational and Psychological Measurement 13: 517–548

    Article  Google Scholar 

  • Lord FM, Novick M (1968) Statistical theories of mental test scores. Addison Wesley, Reading, MA

    MATH  Google Scholar 

  • Lord FM, Wingersky MS (1984) Comparison of IRT true-score and equipercentile observed-score “equatings.” Applied Psychological Measurement 8: 453–461

    Article  Google Scholar 

  • Mislevy RM, Johnson EG, Muraki E (1992) Scaling procedures in NAEP. Journal of Educational Statistics 17: 131–154

    Article  Google Scholar 

  • Muraki E (1992) A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement 16: 159–176

    Article  Google Scholar 

  • Muraki E (1997) A generalized partial credit model. In: van der Linden W, Hambleton RK (eds) Handbook of modern item response theory. Springer, New York, pp 153–164

    Chapter  Google Scholar 

  • Orlando M (1997) Item fit in the context of item response theory. Unpublished doctoral dissertation, The University of North Carolina at Chapel Hill

    Google Scholar 

  • Orlando M, Thissen D (2000) New item fit indices for dichotomous item response theory models. Applied Psychological Measurement 24: 50–64

    Article  Google Scholar 

  • Picasso P (1923) Picasso speaks—A statement by the artist. The Arts 3: 315–326

    Google Scholar 

  • Raz J, Turetsky BI, Dickerson LW (2001) Inference for a random wavelet packet model of single-channel event-related potentials. Journal of the American Statistical Association 96: 409–420

    Article  MathSciNet  MATH  Google Scholar 

  • Robbins H (1952) Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society 58: 527–535

    Article  MathSciNet  MATH  Google Scholar 

  • Rosa K, Swygert K, Nelson L, Thissen D (2001) Item response theory applied to combinations of multiple-choice and constructed-response items—scale scores for patterns of summed scores. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 253–292

    Google Scholar 

  • Samejima F (1969) Estimation of latent ability using a response pattern of graded scores. Psychometric Monograph No 17

    Google Scholar 

  • Samejima F (1997) Graded response model. In: van der Linden W, Hambleton RK (eds) Handbook of modern item response theory. Springer, New York, pp 85–100

    Chapter  Google Scholar 

  • Thissen D, Flora D, Reeve B, & Vevea J L (2000, July) Linear Approximations for Item Response Theory Response Pattern Scores. Paper presented at the annual meeting of the Psychometric Society, Vancouver, BC, Canada

    Google Scholar 

  • Thissen D, Nelson L, Rosa K, McLeod LD (2001) Item response theory for items scored in more than two categories. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 141–186

    Google Scholar 

  • Thissen D, Nelson L, Swygert K (2001) Item response theory applied to combinations of multiple-choice and constructed-response items—approximation methods for scale scores. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 293–341

    Google Scholar 

  • Thissen D, Orlando M (2001) Item response theory for items scored in two categories. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 73–140

    Google Scholar 

  • Thissen D, Pommerich M, Billeaud K, Williams VSL (1995) Item response theory for scores on tests including polytomous items with ordered responses. Applied Psychological Measurement 19: 39–49

    Article  Google Scholar 

  • Thissen D & Wainer H (eds) (2001) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ

    Google Scholar 

  • Thurstone LL (1925) A method of scaling psychological and educational tests. Journal of Educational Psychology 16: 433–449

    Article  Google Scholar 

  • Thurstone LL (1927) The law of comparative judgment. Psychological Review 34: 278–286

    Google Scholar 

  • Thurstone LL (1937) Psychology as a quantitative rational science. Science 85: 227–232

    Article  Google Scholar 

  • Thurstone LL (1938) Primary mental abilities. University of Chicago Press, Chicago, IL

    Google Scholar 

  • Tukey JW (1961) Data analysis and behavioral science or learning to bear the quantitative man’s burden by shunning badmandments. Unpublished ms reprinted in LV Jones (ed) (1986), The collected works of John W Tukey, Vol III, Philosophy and principles of data analysis: 1949–1964. Wadsworth & Brooks-Cole, Monterey, CA, pp 187–389

    Google Scholar 

  • Tukey JW (1962) The future of data analysis. Annals of Mathematical Statistics 33:1–67. (Reprinted in LV Jones (ed) (1986), The collected works of John W Tukey, Vol III, Philosophy and principles of data analysis: 1949–1964. Wadsworth & Brooks-Cole, Monterey, CA, pp 391–484 )

    Google Scholar 

  • Wainer H, Vevea JL, Camacho F, Reeve B, Rosa K, Nelson L, Swygert K, Thissen D (2001) Augmented scores—“borrowing strength” to compute scores based on small numbers of items. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 343–387

    Google Scholar 

  • Williams VSL, Pommerich M, Thissen D (1998) A comparison of developmental scales based on Thurstone methods and item response theory. Journal of Educational Measurement 35: 93–107

    Article  Google Scholar 

  • Yen WM (1984) Obtaining maximum likelihood trait estimates from number-correct scores for the three-parameter logistic model. Journal of Educational Measurement 21: 93–111

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

H. Yanai A. Okada K. Shigemasu Y. Kano J. J. Meulman

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer Japan

About this paper

Cite this paper

Thissen, D. (2003). Psychometric Engineering as Art: Variations on a Theme. In: Yanai, H., Okada, A., Shigemasu, K., Kano, Y., Meulman, J.J. (eds) New Developments in Psychometrics. Springer, Tokyo. https://doi.org/10.1007/978-4-431-66996-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-4-431-66996-8_1

  • Publisher Name: Springer, Tokyo

  • Print ISBN: 978-4-431-66998-2

  • Online ISBN: 978-4-431-66996-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics