Summary
The Psychometric Society is “devoted to the development of Psychology as a quantitative rational science.” Engineering is often set in contradistinction with science; art is sometimes considered different from science. Why, then, juxtapose the words in the title: psychometric, engineering,and art? Because an important aspect of quantitative psychology is problem-solving, and engineering solves problems. And an essential aspect of a good solution is beauty—hence, art. In overview and with examples, this presentation describes activities that are quantitative psychology as engineering and art—that is, as design. Extended illustrations involve systems for scoring tests in realistic contexts. Allusions are made to other examples that extend the conception of quantitative psychology as engineering and art across a wider range of psychometric activities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barr AH (1946) Picasso: Fifty years of his art. The Museum of Modem Art, New York
Berkson J (1944) Application of the logistic function to bio-assay. Journal of the American Statistical Association 39: 357–375.
Berkson J (1953) A statistically precise and relatively simple method of estimating the bioassay with quantal response, based on the logistic function. Journal of the American Statistical Association 48: 565–599.
Bimbaum A (1968) Some latent trait models and their use in inferring an examinee’s ability. In: Lord FM, Novick MR, Statistical theories of mental test scores. Addison-Wesley, Reading, MA, pp 395–479
Bock RD, Aitkin M (1981) Marginal maximum likelihood estimation of item parameters: an application of the EM algorithm. Psychometrika 46: 443–459
Bock RD, Lieberman M (1970) Fitting a response model for n dichotomously scored items. Psychometrika 35: 179–197
Bock RD, Mislevy RJ (1982) Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement 6: 431–444
Box GEP (1979) Some problems of statistics and everyday life. Journal of the American Statistical Association 74: 1–4
Brooks FP (1996) The computer scientist as toolsmith II. Communications of the ACM 39: 61–68
Brooks FP (in press) The design of design. Communications of the ACM
Chen WH (1995) Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores. Unpublished doctoral dissertation, The University of North Carolina at Chapel Hill
Chen WH, Thissen D (1999) Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores. British Journal of Mathematical and Statistical Psychology 52: 19–37
Cronbach LJ, Gleser GC, Nanda H, Rajaratnam N (1972) The dependability of behavioral measurements: Theory of generalizability for scores and profiles. John Wiley & Sons, New York
Finney DJ (1952) Probit analysis: A statistical treatment of the sigmoid response curve. Cambridge University Press, London
Goldstein A (2001, March 12) Making another big score. Time 157: 66–67
Henriques DB, Steinberg J (2001, May 20) Errors plague testing industry. The New York Times, pp Al, A22–23
Jones LV (1998) LL Thurstone’s vision of psychology as a quantitative rational science. In: Kimble GA, Wertheimer M (eds) Portraits of pioneers in psychology, vol III. Lawrence Erlbaum Associates, Mahwah, NJ, pp 84–102
Kelley TL (1927) The interpretation of educational measurements. World Book, New York
Kelley TL (1947) Fundamentals of statistics. Harvard University Press, Cambridge, MA
Lazarsfeld PF (1950) The logical and mathematical foundation of latent structure analysis. In: Stouffer SA, Guttman L, Suchman EA, Lazarsfeld PF, Star, SA, Clausen JA, Measurement and Prediction. Wiley, New York, pp 362–412
Laidlaw DH, Fleischer KW, Barr AH (1995, September) Bayesian Mixture Classification of MRI Data for Geometric Modeling and Visualization. Poster presented at the First International Workshop on Statistical Mixture Modeling, Aussois, France. (Retrieved from the Worldwide Web: http://www.gg.caltech.edu/–dhl/aussois/paper.html)
Lewis B (1996, March 15) IS survival guide. Infoworld 21: 96
Lord FM (1953) The relation of test score to the trait underlying the test. Educational and Psychological Measurement 13: 517–548
Lord FM, Novick M (1968) Statistical theories of mental test scores. Addison Wesley, Reading, MA
Lord FM, Wingersky MS (1984) Comparison of IRT true-score and equipercentile observed-score “equatings.” Applied Psychological Measurement 8: 453–461
Mislevy RM, Johnson EG, Muraki E (1992) Scaling procedures in NAEP. Journal of Educational Statistics 17: 131–154
Muraki E (1992) A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement 16: 159–176
Muraki E (1997) A generalized partial credit model. In: van der Linden W, Hambleton RK (eds) Handbook of modern item response theory. Springer, New York, pp 153–164
Orlando M (1997) Item fit in the context of item response theory. Unpublished doctoral dissertation, The University of North Carolina at Chapel Hill
Orlando M, Thissen D (2000) New item fit indices for dichotomous item response theory models. Applied Psychological Measurement 24: 50–64
Picasso P (1923) Picasso speaks—A statement by the artist. The Arts 3: 315–326
Raz J, Turetsky BI, Dickerson LW (2001) Inference for a random wavelet packet model of single-channel event-related potentials. Journal of the American Statistical Association 96: 409–420
Robbins H (1952) Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society 58: 527–535
Rosa K, Swygert K, Nelson L, Thissen D (2001) Item response theory applied to combinations of multiple-choice and constructed-response items—scale scores for patterns of summed scores. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 253–292
Samejima F (1969) Estimation of latent ability using a response pattern of graded scores. Psychometric Monograph No 17
Samejima F (1997) Graded response model. In: van der Linden W, Hambleton RK (eds) Handbook of modern item response theory. Springer, New York, pp 85–100
Thissen D, Flora D, Reeve B, & Vevea J L (2000, July) Linear Approximations for Item Response Theory Response Pattern Scores. Paper presented at the annual meeting of the Psychometric Society, Vancouver, BC, Canada
Thissen D, Nelson L, Rosa K, McLeod LD (2001) Item response theory for items scored in more than two categories. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 141–186
Thissen D, Nelson L, Swygert K (2001) Item response theory applied to combinations of multiple-choice and constructed-response items—approximation methods for scale scores. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 293–341
Thissen D, Orlando M (2001) Item response theory for items scored in two categories. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 73–140
Thissen D, Pommerich M, Billeaud K, Williams VSL (1995) Item response theory for scores on tests including polytomous items with ordered responses. Applied Psychological Measurement 19: 39–49
Thissen D & Wainer H (eds) (2001) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ
Thurstone LL (1925) A method of scaling psychological and educational tests. Journal of Educational Psychology 16: 433–449
Thurstone LL (1927) The law of comparative judgment. Psychological Review 34: 278–286
Thurstone LL (1937) Psychology as a quantitative rational science. Science 85: 227–232
Thurstone LL (1938) Primary mental abilities. University of Chicago Press, Chicago, IL
Tukey JW (1961) Data analysis and behavioral science or learning to bear the quantitative man’s burden by shunning badmandments. Unpublished ms reprinted in LV Jones (ed) (1986), The collected works of John W Tukey, Vol III, Philosophy and principles of data analysis: 1949–1964. Wadsworth & Brooks-Cole, Monterey, CA, pp 187–389
Tukey JW (1962) The future of data analysis. Annals of Mathematical Statistics 33:1–67. (Reprinted in LV Jones (ed) (1986), The collected works of John W Tukey, Vol III, Philosophy and principles of data analysis: 1949–1964. Wadsworth & Brooks-Cole, Monterey, CA, pp 391–484 )
Wainer H, Vevea JL, Camacho F, Reeve B, Rosa K, Nelson L, Swygert K, Thissen D (2001) Augmented scores—“borrowing strength” to compute scores based on small numbers of items. In: Thissen D, Wainer H (eds) Test Scoring. Lawrence Erlbaum Associates, Mahwah, NJ, pp 343–387
Williams VSL, Pommerich M, Thissen D (1998) A comparison of developmental scales based on Thurstone methods and item response theory. Journal of Educational Measurement 35: 93–107
Yen WM (1984) Obtaining maximum likelihood trait estimates from number-correct scores for the three-parameter logistic model. Journal of Educational Measurement 21: 93–111
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2003 Springer Japan
About this paper
Cite this paper
Thissen, D. (2003). Psychometric Engineering as Art: Variations on a Theme. In: Yanai, H., Okada, A., Shigemasu, K., Kano, Y., Meulman, J.J. (eds) New Developments in Psychometrics. Springer, Tokyo. https://doi.org/10.1007/978-4-431-66996-8_1
Download citation
DOI: https://doi.org/10.1007/978-4-431-66996-8_1
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-66998-2
Online ISBN: 978-4-431-66996-8
eBook Packages: Springer Book Archive