Skip to main content

1995 | Buch

Test Equating

Methods and Practices

verfasst von: Michael J. Kolen, Robert L. Brennan

Verlag: Springer New York

Buchreihe : Springer Series in Statistics

insite
SUCHEN

Über dieses Buch

In recent years, many researchers in the psychology and statistical communities have paid increasing attention to test equating as issues of using multiple test forms have arisen and in response to criticisms of traditional testing techniques. This book provides a practically oriented introduction to test equating which both discusses the most frequently used equating methodologies and covers many of the practical issues involved. The main themes are: - the purpose of equating - distinguishing between equating and related methodologies - the importance of test equating to test development and quality control - the differences between equating properties, equating designs, and equating methods - equating error, and the underlying statistical assumptions for equating. The authors are acknowledged experts in the field, and the book is based on numerous courses and seminars they have presented. As a result, educators, psychometricians, professionals in measurement, statisticians, and students coming to the subject for the first time as part of their graduate study will find this an invaluable text and reference.

Inhaltsverzeichnis

Frontmatter
Chapter 1. Introduction and Concepts
Abstract
This chapter provides a general overview of equating and briefly considers important concepts. The concept of equating is described, as is why it is needed, and how to distinguish it from other related processes. Equating properties and designs are considered in detail, because these concepts provide the organizing themes for addressing the statistical methods treated in subsequent chapters. Some issues in evaluating equating are also considered. The chapter concludes with a preview of subsequent chapters.
Michael J. Kolen, Robert L. Brennan
Chapter 2. Observed Score Equating Using the Random Groups Design
Abstract
As was stressed in Chapter 1, the same specifications property is an essential property of equating, which means that the forms to be equated must be built to the same content and statistical specifications. We also stressed that the symmetry property is essential for any equating relationship. The focus of the present chapter is on methods that are designed to achieve the observed score equating property, along with the same specifications and symmetry properties. As was described in Chapter 1, these observed score equating methods are developed with the goal that, after equating, converted scores on two forms have at least some of the same score distribution characteristics in a population of examinees.
Michael J. Kolen, Robert L. Brennan
Chapter 3. Random Groups—Smoothing in Equipercentile Equating
Abstract
As described in Chapter 2, sample statistics are used to estimate equating relationships. For mean and linear equating, the use of sample means and standard deviations in place of the parameters typically leads to adequate equating precision, even when the sample size is fairly small. However, when sample percentiles and percentile ranks are used to estimate equi-percentile relationships, equating often is not sufficiently precise for practical purposes because of sampling error.
Michael J. Kolen, Robert L. Brennan
Chapter 4. Nonequivalent Groups—Linear Methods
Abstract
Chapter 1 introduced the common-item nonequivalent groups design. For this design, two groups of examinees from different populations are each administered different test forms that have a set of items in common This design often is used when only one form of a test can be administered on a given test date. As discussed in Chapter 1, the set of common items should be as similar as possible to the full-length forms in both content and statistical characteristics.
Michael J. Kolen, Robert L. Brennan
Chapter 5. Nonequivalent Groups—Equipercentile Methods
Abstract
Equipercentile equating methods have been developed for the common-item nonequivalent groups design. These methods are similar to the equipercentile methods for random groups described in Chapter 2. Equipercentile methods with nonequivalent groups consider the distributions of total score and scores on the common items, rather than only the means, standard deviations, and covariances that were considered in Chapter 4. As has been indicated previously, equipercentile equating is an observed score equating procedure that is developed from the perspective of the observed score equating property described in Chapter 1. Thus, equipercentile equating with the common-item nonequivalent groups design requires that a synthetic population, as defined in Chapter 4, be considered. In this chapter, we present an equipercentile method that we show to be closely allied to the Tucker linear method of Chapter 4. We also describe how smoothing methods, such as those described in Chapter 3, can be used when conducting equipercentile equating with nonequivalent groups. The methods described in this chapter are illustrated using the same data that were used in Chapter 4, and the results are compared to the linear results from Chapter 4.
Michael J. Kolen, Robert L. Brennan
Chapter 6. Item Response Theory Methods
Abstract
The use of item response theory (IRT) in testing applications has grown considerably over the past 15 years. This growth has been reinforced by the many publications in the area (e.g., Baker, 1992a; Hambleton and Swaminathan, 1985; Hambleton et al., 1991; Lord, 1980; Wright and Stone, 1979). Applications of IRT include test development, item banking, differential item functioning, adaptive testing, test equating, and test scaling. A major appeal of IRT is that it provides an integrated psychometric framework for developing and scoring tests. Much of the power of IRT results from it explicitly modeling examinee responses at the item level, whereas, for example, the focus of classical test models and strong true score models is on responses at the level of test scores.
Michael J. Kolen, Robert L. Brennan
Chapter 7. Standard Errors of Equating
Abstract
Two general sources of error in estimating equating relationships are present whenever equating is conducted using data from an equating study: random error and systematic error. Random equating error is present when the scores of examinees who are considered to be samples from a population or populations of examinees are used to estimate equating relationships. When only random equating error is involved in estimating equating relationships, the estimated equating relationship differs from the equating relationship in the population because data were collected from a sample, rather than from the whole population. If the whole population were available, then no random equating error would be present. Thus, the amount of random error in estimating equating relationships becomes negligible as the sample size increases.
Michael J. Kolen, Robert L. Brennan
Chapter 8. Practical Issues in Equating and Scaling to Achieve Comparability
Abstract
Many of the practical issues that are involved in conducting equating are described in this chapter. We describe major issues and provide references that consider these issues in more depth. The early portions of this chapter focus on equating dichotomously scored paper and pencil tests. In later portions, the focus broadens to include practical issues in scaling to achieve comparability, including computerized testing and performance and other types of alternative assessments.
Michael J. Kolen, Robert L. Brennan
Backmatter
Metadaten
Titel
Test Equating
verfasst von
Michael J. Kolen
Robert L. Brennan
Copyright-Jahr
1995
Verlag
Springer New York
Electronic ISBN
978-1-4757-2412-7
Print ISBN
978-1-4757-2414-1
DOI
https://doi.org/10.1007/978-1-4757-2412-7