The Kernel Method of Test Equating

Über dieses Buch

Kernel Equating (KE) is a powerful, modern and unified approach to test equating. It is based on a flexible family of equipercentile-like equating functions and contains the linear equating function as a special case. Any equipercentile equating method has five steps or parts. They are: 1) pre-smoothing; 2) estimation of the score-probabilities on the target population; 3) continuization; 4) computing and diagnosing the equating function; 5) computing the standard error of equating and related accuracy measures. KE brings these steps together in an organized whole rather than treating them as disparate problems.

KE exploits pre-smoothing by fitting log-linear models to score data, and incorporates it into step 5) above. KE provides new tools for diagnosing a given equating function, and for comparing two or more equating functions in order to choose between them. In this book, KE is applied to the four major equating designs and to both Chain Equating and Post-Stratification Equating for the Non-Equivalent groups with Anchor Test Design.

This book will be an important reference for several groups: (a) Statisticians and others interested in the theory behind equating methods and the use of model-based statistical methods for data smoothing in applied work; (b) Practitioners who need to equate tests—including those with these responsibilities in testing companies, state testing agencies and school districts; and (c) Instructors in psychometric and measurement programs. The authors assume some familiarity with linear and equipercentile test equating, and with matrix algebra.

Alina von Davier is an Associate Research Scientist in the Center for Statistical Theory and Practice, at Educational Testing Service. She has been a research collaborator at the Universities of Trier, Magdeburg, and Kiel, an assistant professor at the Politechnical University of Bucharest and a research scientist at the Institute for Psychology in Bucharest.

Paul Holland holds the Frederic M. Lord Chair in Measurement and Statistics at Educational Testing Service. He held faculty positions in the Graduate School of Education, University of California, Berkeley and the Harvard Department of Statistics. He is a Fellow of the American Statistical Association, the Institute of Mathematical Statistics, and the American Association for the Advancement of Science. He is an elected Member of the International Statistical Institute and a past president of the Psychometric society. He was awarded the (AERA/ACT) E. F. Lindquist Award, in 2000, and was designated a National Associate of the National Academies of Science in 2002.

Dorothy Thayer currently is a consultant in the Center of Statistical Theory and Practice, at Educational Testing Service. Her research interests include computational and statistical methodology, empirical Bayes techniques, missing data procedures and exploratory data analysis techniques.

From the reviews:

"The book is nicely laid out, is extremely well written, and is an excellent text for a semester course or a short course…The book is highly recommended." Short Book Reviews of the International Statistical Institute, December 2004

"This book is well-written and the presentation is clear, rigorous, and concise...A rich set of applications is used to illustrate the methods...This book is a gem! I highly recommend it to any statistician or psychometrician who has even a passing interest in test equating." Pscyhometrika, March 2006

"This is a great book, and it is the first to focus on the kernel method of test equating." Applied Psychological Measurement, September 2005

Inhaltsverzeichnis

Frontmatter

Introduction and Notation

1. Introduction and Notation

The Kernel Method of Test Equating: Theory

Frontmatter

2. Data Collection Designs

Summary

This chapter classifies the Equating Designs in order to show the similarities and differences between them. For example, the NEAT Design can be viewed as containing the EG Design as a special case when P=Q and A has only a single score value. Similarly the CB Design contains both EG and SG Designs in it.

Another approach to classifying the designs is based on the estimation of the parameters in the pre-smoothing step, i.e., the numbers and type of the distributions to be estimated—univariate (EG) and bivariate (SG, CB, and NEAT).

We can also classify the designs by the number of populations and samples that are involved. Table 2.5 identifies some of the various ways Equating Designs may be classified.

Some designs are simple, i.e., involve a single population of examinees, have less assumptions, but need either larger sample sizes (EG), or require the same people to take two test forms at the same time (SG and CB). Other designs are more complicated, i.e., involve two populations of test takers, make use of an anchor test, and require that additional assumptions be fulfilled (i.e., NEAT). These complexities are often compensated by their increased versatility.

Springer Professional

The Kernel Method of Test Equating

Über dieses Buch

Inhaltsverzeichnis

Frontmatter

Introduction and Notation

1. Introduction and Notation

The Kernel Method of Test Equating: Theory

Frontmatter

2. Data Collection Designs

3. Kernel Equating: Overview, Pre-smoothing, and Estimation of r and s

4. Kernel Equating: Continuization and Equating

5. Kernel Equating: The SEE and the SEED

6. Kernel Equating versus Other Equating Methods

The Kernel Method of Test Equating: Applications

Frontmatter

7. The Equivalent-Groups Design

8. The Single-Group Design

9. The Counterbalanced Design

10. The NEAT Design: Chain Equating

11. The NEAT Design: Post-Stratification Equating

Backmatter