“Waiting for Life to Arrive”: A history of the regression-discontinuity design in Psychology, Statistics and Economics

doi:10.1016/j.jeconom.2007.05.002

Journal of Econometrics

Volume 142, Issue 2, February 2008, Pages 636-654

Abstract

This paper reviews the history of the regression discontinuity design in three academic disciplines. It describes the design's birth and subsequent demise in Psychology even though most problems with it had been solved there. It further describes the scant interest shown in the design by scholars formally trained in Statistics, and the design's poor reception in Economics from 1972 until about 1995, when its profile and acceptance changed. Reasons are given for this checkered history, which is characterized as waiting for life to arrive.

Introduction

The regression discontinuity design (RDD) occurs when assignment to treatment depends deterministically on a quantified score on some continuous assignment variable. This score is then used as a covariate in a regression of outcome. When RDD is perfectly implemented, the selection process is fully observed and so can be modeled to produce an unbiased causal inference.
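The definition above can be made concrete with a small simulation. What follows is only a minimal sketch under stated assumptions, not a procedure taken from this paper: the data are simulated, the outcome is assumed to be linear in the assignment score, and the function names, cutoff, effect size and noise level are all hypothetical choices made for illustration.

```python
import numpy as np


def simulate_sharp_rdd(n=2000, cutoff=0.0, effect=2.0, seed=0):
    """Simulate data in which treatment depends deterministically on a score.

    All numeric settings (sample size, cutoff, effect, noise) are
    illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    # Continuous assignment variable.
    score = rng.uniform(-1.0, 1.0, size=n)
    # Deterministic assignment: everyone at or above the cutoff is treated.
    treated = (score >= cutoff).astype(float)
    # Outcome: linear in the score, plus a jump of size `effect` at the cutoff.
    noise = rng.normal(0.0, 0.5, size=n)
    outcome = 1.0 + 0.5 * score + effect * treated + noise
    return score, treated, outcome


def estimate_rdd_effect(score, treated, outcome, cutoff=0.0):
    """Regress the outcome on the centered score and the treatment indicator.

    The coefficient on the treatment indicator estimates the discontinuity
    at the cutoff, assuming the linear functional form is correct.
    """
    centered = score - cutoff
    design = np.column_stack([np.ones_like(centered), centered, treated])
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef[2]


score, treated, outcome = simulate_sharp_rdd()
estimate = estimate_rdd_effect(score, treated, outcome)
print(f"estimated discontinuity at the cutoff: {estimate:.2f}")
```

Because assignment here depends only on the observed score, including that score as a covariate accounts for the selection process. If the assumed linear form were wrong, the estimate could be biased, which is why the functional form has to be checked in any real application.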

This paper is about the history of RDD. Although I am not a trained historian, I know enough to respect the primacy historians place on documenting events and trends. I also know that interpreting these events and trends has to be conditioned by independent knowledge of temporal sequence, by archived specifics that are relevant to the explanations offered, and by recourse to substantiated theories of individual and institutional behavior. Fortunately, most of this paper is about such events, trends and interpretations. But a few parts are not, and historians may well become nervous when I try to interpret events that might have happened but did not. Although some historians are developing a taste for counterfactual or virtual history (Ferguson, 1997), it is deservedly a minority taste. As an amateur historian, I will almost certainly fall into other traps professionals learn to avoid. Of those I can recognize, one is the teleological trap of inferring inevitable-seeming links between past events when, with more secure footing in the original time and place, these events might seem more contingent and many other futures possible.

Another problem is that I am not an independent commentator on RDD. I have been peripherally involved in its history, albeit as a disseminator and not a theorist or practitioner. I was also marginally involved in the Northwestern University theory group that developed the design in the early 1970s after its earlier discovery (Thistlethwaite and Campbell, 1960). Doubtless I know more about RDD's history in that context than about other attempts to develop, disseminate or evaluate the design. While I have read many of the original sources reported on here, I have probably relied on secondary sources more than a real historian would. The relatively recent history of RDD has helped me, though, since many of the method's pioneers are still alive and have offered commentary on earlier drafts of this paper as it touched on their work. I have tried to incorporate their memories and sensitivities into this final version, sometimes even citing their notes to me about their work. Nonetheless, the following account is mine and not theirs; and while I respect the simplest norms of writing history, I cannot hope to dip deeply into the historian's bag of analytic tools. So, caveat lector.

This is not the first historical account of RDD. Donald Campbell, the design's originator, wrote his own version of the design's early history (Campbell, 1984), and various scholars have given snapshots of its history since then (Trochim, 1984, 2001). However, the present account is more current, detailed and interdisciplinary than its predecessors. Indeed, it is organized around academic disciplines, tracing the history of the design in Psychology and Education, then in Statistics and Biostatistics, and then in Economics. The account speaks to many themes, including the repeated re-invention of the design across these disciplines. This was often done invoking different names for the design, the upshot being that RDD has not attained consistent “brand” status across the various behavioral, social and health sciences. Another theme speaks to the design's differential waxing and waning by discipline, trying to describe and explain what happened. RDD was invented and initially developed in Psychology and Education, but interest in it waned there after about 1990. It has never had much visible growth in Statistics, though it was acknowledged there. And in Economics RDD had a serendipitous birth, a long period of neglect, and then a renaissance after about 1995. This special journal number is part of that revival. Since its invention in 1960, RDD has been, in Samuel Beckett's words: “waiting for life to happen”. Will this revival breathe life into the design in Economics and, who knows, even beyond?

Section snippets

Psychology and Education

No doubt exists that the first publication on RDD was an application in education by two psychologists, Thistlethwaite and Campbell (1960). No doubt also exists that Campbell was the initiator and that he continued to work on the topic while Thistlethwaite did not. What is less clear is the intuition that led Campbell to develop the design. To probe this we go to Campbell and Stanley (1963) since it provides more conceptual clarification than the earlier paper. This clarification did not

Statistics

For the purposes of this paper I understand Statistics in terms of scholars trained in that field, whether in a Department of Statistics, Mathematics or Biostatistics. Also included are those scholars trained in other fields who later came to hold their major academic appointment in a Statistics Department.

Rubin (1977) is the first published article I could find in Statistics that mentions RDD. This paper has been portrayed as another independent invention of the design together with a formal

Economics

The earliest papers on RDD in Economics were by Goldberger (1972a, 1972b). These unpublished papers represent two main accomplishments for RDD theory, though they were only incidental to Goldberger's main purpose. The first accomplishment was a proof of the basic design, showing formally what Campbell had only intuited. Goldberger's papers were based on the distinction between non-equivalent groups whose difference depends on true ability in one case, and on measured ability in

Conclusions

Several themes stand out in the half century of RDD's history. One is its repeated independent discovery. While this augurs well for the design's validity and relevance across fields, one circumstance of the reinventions has been strange. Campbell first named the design regression-discontinuity; Goldberger referred to it as deterministic selection on the covariate; Sacks and Spiegelman studiously avoided naming it; Rubin first wrote about it as part of a larger discussion of treatment

Acknowledgments

Thanks are due to Richard Berk, Glen Cain, Arthur Goldberger, Guido Imbens, George Knafl, Thomas Lemieux, William Shadish, Clifford Spiegelman, William Trochim and Vivian Wong for feedback on prior drafts. They are not responsible for any errors of fact or taste.

References (83)

Cain, G.C. Regression and selection models to improve nonexperimental comparisons.

Cappelleri, J.C., et al., 1994. An illustrative statistical analysis of cut-off based randomized clinical trials. Journal of Clinical Epidemiology.

Knafl, G., et al., 1982. Model robust confidence intervals. Journal of Statistical Planning and Inference.

Trochim, W.M.K. Regression discontinuity design.

Trochim, W.M.K., et al., 1992. Cutoff assignment strategies for enhancing randomized clinical trials. Controlled Clinical Trials.

Aiken, L.S., et al., 1998. Comparison of a randomized and two quasi-experimental designs in a single outcome evaluation: efficacy of a university-level remedial writing program. Evaluation Review.

Angrist, J.D., et al., 1996. Identification of causal effects using instrumental variables. Journal of the American Statistical Association.

Angrist, J.D., et al., 1991. Does compulsory school attendance affect schooling and earnings? Quarterly Journal of Economics.

Angrist, J.D., et al., 1999. Using Maimonides’ rule to estimate the effect of class size on scholastic achievement. Quarterly Journal of Economics.

Barnow, B.S., et al. Issues in the analysis of selectivity bias.

Battistin, E., Rettore, E., 2005. Ineligibles and eligible non-participants as a double comparison group in...

Berk, R.A., et al., 1999. An evaluation of California's inmate classification system using a regression discontinuity design. Journal of the American Statistical Association.

Bennett, C.A., et al., 1975. Evaluation and Experiment.

Berk, R.A., et al., 1983. Capitalizing on nonrandom assignment to treatments: a regression-discontinuity evaluation of a crime-control program. Journal of the American Statistical Association.

Black, D., Galdo, J., Smith, J.C., 2005. Evaluating the regression discontinuity design using experimental data....

Bloom, H., et al., 2005. Memo on the Evaluation Design of the Reading First National Impact Study.

Boruch, R., 1973. Regression-discontinuity designs revisited. Northwestern University, Evanston, IL, unpublished...

Boruch, R., 1975. Coupling randomized experiments and approximations to experiments in social program evaluation. Sociological Methods and Research.

Buddelmeyer, H., et al., 2003. An Evaluation of the Performance of Regression Discontinuity Design on PROGRESA.

Campbell, D.T., 1969. Reforms as experiments. American Psychologist.

Campbell, D.T. Foreword.

Campbell, D.T., et al. Making the case for randomized assignment to treatments by considering the alternatives: six ways in which quasi-experimental evaluations in compensatory education tend to underestimate effects.

Campbell, D.T., et al. How regression artifacts in quasi-experiments can mistakenly make compensatory education look harmful.

Campbell, D.T., et al. Experimental and quasi-experimental designs for research on teaching.

Cappelleri, J.C., 1991. Cutoff-based designs in comparison and combination with randomized clinical trials. Ph.D....

Cappelleri, J.C., et al., 1994. Power analysis of cutoff-based randomized clinical trials. Evaluation Review.

Cappelleri, J.C., et al., 1995. Ethical and scientific features of cutoff-based designs. Medical Decision Making.

Cook, T.D., et al., 1979. Quasi-Experimentation: Design and Analysis for Field Settings.

Cook, T.D., Shadish, W.R., Wong, V.C., 2005. Within-study comparisons of experiments and non-experiments: can they help...

Cook, T.D., Wong, V.C., in press. Empirical tests of the validation of the regression discontinuity design. Annales...

Ferguson, N., 1997. Virtual History: Alternatives and Counterfactuals.

Finkelstein, M., et al., 1996. Clinical and prophylactic trials with assured new treatment for those at greater risk: I. A design proposal. Journal of Public Health.

Finkelstein, M., et al., 1996. Clinical and prophylactic trials with assured new treatment for those at greater risk: II. Examples. Journal of Public Health.

Glazerman, S., et al., 2003. Nonexperimental vs. experimental estimates of earnings impacts. The Annals of the American Academy.

Goldberger, A.S., 1972a. Selection bias in evaluating treatment effects: some formal illustrations. Madison, WI,...

Goldberger, A.S., 1972b. Selection bias in evaluating treatment effects: the case of interaction. Madison, WI,...

Gormley, W.T., et al., 2005. The effects of universal pre-k in Oklahoma: research highlights and policy implications. The Policy Studies Journal.

Hahn, J., et al., 2001. Identification and estimation of treatment effects with a regression-discontinuity design. Econometrica.

Imbens, G.W., et al., 1994. Identification and estimation of local average treatment effects. Econometrica.

Imbens, G.W., et al., 1995. Evaluating the cost of conscription in the Netherlands. Journal of Business and Economic Statistics.

Jacob, B.A., et al., 2004. The impact of teacher training on student achievement: quasi-experimental evidence from school reform efforts in Chicago. Journal of Human Resources.

