We can set about explaining fertility time-trends in two ways. The strongest form of explanation is a theory entailing a substantive model—that is, a model that represents as far as possible the real-world processes generating the fertility rates analyzed (Cox
1990; Freedman
1985; Hand
2004). An alternative form of explanation is via empirical models, a more common strategy in social science; these models aim to explain as much as possible of the variance in a dependent variable, but they are not designed to represent the
modus operandi of the underlying phenomena.
2 The arguments that follow apply broadly to both approaches, but some distinctions are drawn in discussion.
The question of whether timing effects are distorting when period fertility is the explanandum is addressed here in two stages, considering first the TFR per se as a measure, and then the arguments for and against adjusting the TFR.
Period Synthetic Indicators as Dependent Variables
Adjustment for bias/distortion is proposed primarily for the period TFR and other synthetic indicators. A first question, then, is whether indicators of this kind are suitable as dependent variables when explaining period trends.
3 I think not, for two reasons. First, the TFR is a single figure indicator, but period change is multidimensional rather than unidimensional and thus cannot be represented by a single figure. A summary indicator, however, may be adequate for analyses of the long run that aim to explain gross changes in level. Second, the TFR and analogous indicators are in synthetic cohort form, referring to the cumulative experience of a lifetime, and are therefore in an inappropriate metric to represent the phenomena of a single period (Ní Bhrolcháin
1992). As a result, they are unsuitable as dependent variables in any substantive model of the underlying process in its period aspect. As noted earlier, synthetic indicators can be treated as simple statistical summaries, and in that guise might be suitable as dependent variables for empirical as distinct from behavioral models. However, the problem remains that no single measure can accurately represent fertility in time series when trends differ between exposure categories.
Are Timing Effects a Source of Bias in Period Fertility as Dependent Variable?
The arguments in the preceding section apply to the TFR in all its forms: additive, multiplicative, adjusted, or unadjusted. Nevertheless, let us set aside objections to TFR-like indices and consider the issue of bias/distortion in the TFR when period fertility is the explanandum. Of the types of bias identified earlier, two are potentially relevant to trends in the period TFR as dependent variable: confounding (bias A) and definitional (bias B1). Bias of types B2 and B3 cannot be present because when we are trying to explain trends in period fertility, the dependent variable measures a period phenomenon and is not a cohort estimator.
Year-on-year movements in the TFR can be biased by compositional factors because of confounding between the exposure distribution and calendar time—a spurious timing effect (bias A). The solution here is not adjustment, which is designed to remove genuine timing effects, but to use indices either specific for parity and age, or for parity and duration for orders two and higher, or standardized for these factors. Such indicators, and measures summarizing them—period parity progression ratios, the period parity progression based TFR, and regression-standardized parity-specific rates (Feeney and Yu
1987; Hoem
1993; Ní Bhrolcháin
1987; Rallu and Toulemon
1994)—are free of confounding bias. But, occurrence-exposure rates of this type, and measures derived from them, are influenced by genuine tempo change and thus might be thought of as definitionally distorted (bias B1). That is a mistake. True tempo effects do not distort period indicators in the role of explanandum. Rather, genuine tempo effects are an integral component of the period fertility trends we have to explain, as are the changes in variance highlighted by Kohler and Philipov (
2001)
. Explanation means explaining the whole of the change in period fertility, not just part of it. To remove tempo effects from period fertility as explanandum would denude it of an intrinsic and often substantial component of change (Ní Bhrolcháin
1992,
1994). The greater the timing shift in a period, the worse the impact of adjustment because a larger part of what is happening in a period, from an explanatory angle, is removed. Wachter (
2005) sees tempo adjustment as a way of standardizing for tempo. I argue that a period fertility indicator that is standardized for tempo is misspecified, if the objective is to explain period trends. The issue is not at all comparable to age standardization in mortality analysis. In explaining variation in death rates, age structure effects are regarded as extraneous and not a target of explanation. In explaining change in period fertility, by contrast, identifying the causes of tempo change
is part of the objective.
An extreme example, the Year of the Fire Horse in Japan, illustrates the argument. Japan saw a dip of 27% in the TFR in 1966, due to avoidance of births in a year that, according to popular astrology, would be an inauspicious year of birth for a girl. To explain the phenomenon, it would be absurd to use as the dependent variable a time series of tempo-adjusted TFRs. Any timing element is integral to the impact of folk belief on that year’s fertility. (Of course, if estimating the effect of the folk belief on cohort fertility, cohort outcomes would be the dependent variable.) In the case of the speed premium in Sweden, tempo is a large part of the period effect, and its precise detail allows a strong case to be made for a causal influence of legislation relating to maternity-pay provision on fertility (Andersson
1999; Hoem
1990; Ní Bhrolcháin and Dyson
2007). The same applies to less-pronounced period fluctuations. For example, accelerated childbearing was a sizeable ingredient of the baby boom of the late 1950s and 1960s in the United States, the United Kingdom, and several other developed societies (Butz and Ward
1979; Ní Bhrolcháin
1987; Ryder
1980). If part of the explanation of the baby boom is that postwar prosperity, full employment, and high wages gave rise to accelerated marriage and childbearing, that faster pace of family formation must be represented on the left hand side of the equation. Similarly, later childbearing is a central feature of what is to be explained in the period trends of the last few decades in developed societies.
An analogy may be useful. Consider a car traveling for a fixed duration of time. Its speed varies during the journey: rounding a sharp bend or going uphill, it slows down, while on the straight or downhill, it travels faster. Speed may vary also depending on terrain, traffic, the driver’s inclinations, and so on. The task of explaining period fertility trends is analogous to accounting for the sequence of speeds at which the car travels. Saying that a well-standardized period fertility indicator is distorted is like saying that a measure of the car’s speed at an arbitrarily chosen point in the journey, or when the car is changing speed, is mistaken. It may well give a biased estimate of average speed over the journey as a whole, but it gives an accurate account of the car’s speed at the point at which this was measured. If we think in terms of “underlying” speed or average speed during a journey, and whether and how it can be inferred from speed along the route, we are measuring something other than speed at a particular time-point. In addition, we would have to either construct models and make assumptions for the purpose, or investigate the properties of a large number of such journeys, to generate an empirical basis for the estimate.
The analogy is not perfect, but it helps to spotlight some key points. To adjust the measure of speed during periods of acceleration and deceleration would clearly misrepresent the recorded time sequence of speeds during the journey. Quantifying and explaining that sequence—comparable to measuring and explaining period fertility trends—is a different problem from measuring and explaining average speed or distance traveled during the journey—a task analogous to estimating cohort or longer-run fertility levels. Schoen (
2004) has used the car analogy for a different purpose—to argue for the importance of cohort fertility—and assumes that the driver has an intended destination, although one that may alter during the journey. Here, the analogy is between the car’s trajectory and aggregate fertility movements, and no assumption is needed about intentions regarding destination, speed, or duration of the journey.
Period Quantum and Tempo
The argument thus far is that genuine timing effects are intrinsic to period fertility as dependent variable, but that does not rule out the disaggregation of period fertility into tempo and quantum components, each to be the focus of explanation. Separate estimates of period level and timing effects for explanatory purposes could be argued for if several conditions were to hold. These conditions are that in period mode: (1) the quantum of fertility and its timing are separable in a quantitative sense; (2) quantum and tempo measures reflect distinct aspects of the underlying behavioral process (when we are seeking a behavioral explanation rather than an empirical one); and (3) that quantum and tempo respond differently to change in social, economic, and other determining factors.
Are tempo and quantum separable in a quantitative sense? Indices can be specified that, on their face, represent the quantum and tempo aspects of period fertility (e.g., Bongaarts and Feeney
1998; Butz and Ward
1979; Foster
1990; Kohler and Ortega
2002a; Ryder
1980). However, although separable in theory, quantum and tempo tend to covary in practice. The most familiar evidence of this is the near-universal tendency of cohort fertility series to reflect the fluctuations in corresponding period series, but with a lesser amplitude.
4 Time series presented by Schoen (
2004) illustrate the point further. Schoen’s Table 1 gives a number of quantum indicators for the United States during 1917–2001: the conventional TFR, two versions of the Bongaarts and Feeney adjusted TFR, the Butz and Ward average completed fertility (ACF) measure,
5 together with the mean age at childbearing and corresponding (true) cohort total fertility for part of the series. From these, the period timing indices associated with each quantum measure have been obtained. Corresponding to the ACF, we have the timing measure TFR / ACF (Butz and Ward
1979; Schoen
2004). Two specifications of a Bongaarts and Feeney timing index are calculated for each of the two versions of the BF TFR: additive (BF timing = TFR – TFR*) and ratio versions (BF timing = TFR / TFR*).
In all cases, the timing and quantum indicators are positively associated. The correlation between ACF and its associated timing index is .83 (1917–1997); and for the four versions of the Bongaarts and Feeney timing and quantum indices, the correlations are between .32 and .45 (1918–1997). On the ACF evidence, periods of faster timing are also periods of higher levels of fertility. This is true also of the Bongaarts and Feeney indices, although to a lesser extent, possibly because the TFR
adj indicator can be erratic (Schoen
2004; Smallwood
2002b). Overall, these figures demonstrate that although separable to some degree, period quantum and tempo are not independent (see also van Imhoff and Keilman
2000).
Are period quantum and tempo behaviorally distinct? Estimates of tempo and quantum effects such as those of Butz and Ward (
1979), Ryder (
1980), Foster (
1990), and Bongaarts and Feeney (
1998) are interesting. But, thus far, there have been no studies that investigate whether the mathematical constructs in these analyses correspond to real-world processes. The question is relevant at both micro and macro levels. At the individual level, the question is whether decisions about tempo and quantum are taken separately or jointly. At the aggregate level, whether the tempo/quantum divide corresponds to underlying processes is harder to determine, but it can be studied indirectly by investigating both the independence of tempo and quantum, and how far they are jointly or separately influenced by causal factors.
Ryder’s perspective on the subject is instructive. In a paper that is best known for having estimated the quantum and tempo of cohort fertility and for having analyzed them into their components, Ryder (
1980) concluded that the two are not independent. In his view, quantum and tempo “are to some degree manifestations of the same underlying behavior,” and “we cannot, in principle, make a statistical separation of the tempo and quantum facets of fertility” (Ryder
1980:44–45). For Ryder—the pioneer in estimating tempo and quantum—the two were interlinked at both the individual and at the aggregate levels.
Are period quantum and tempo influenced either by different factors or differentially by the same factors? If yes, then they may reflect genuinely distinct processes. If not, they are a single, undifferentiated entity. While instances can be found of changes in timing in reaction to socioeconomic determinants—the Swedish speed premium effect being a very clear-cut case (Andersson
1999; Andersson et al.
2006)—it is not obvious that such instances are exclusively due to timing effects nor that currently available indices of timing would represent them accurately. Research is still needed to address the real-world justification for separating level and timing, at both individual and aggregate levels.
Validation
Ultimately, measures used for any purpose need formal validation. No independent criterion is available against which to validate period measures in explanatory studies, but we do have an indirect check on the performance of a period measure as dependent variable: namely, explanatory success. Indicators of period fertility as explanandum would be validated by an empirically successful explanation of period trends in which they were embedded—a form of construct validity. As Ryder has suggested, we will know we have the right measures when we have a good explanation of time trends. Thus far, we lack convincing, well-documented explanations for period trends that could help to adjudicate between different measurement approaches. Nevertheless, success in explanation, rather than a check against cohort values, is the appropriate criterion for evaluating indicators of period fertility as dependent variable.