Skip to main content
Top
Published in: Demography 1/2013

01-02-2013

Probabilistic Forecasting Using Stochastic Diffusion Models, With Applications to Cohort Processes of Marriage and Fertility

Authors: Mikko Myrskylä, Joshua R. Goldstein

Published in: Demography | Issue 1/2013

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this article, we show how stochastic diffusion models can be used to forecast demographic cohort processes using the Hernes, Gompertz, and logistic models. Such models have been used deterministically in the past, but both behavioral theory and forecast utility are improved by introducing randomness and uncertainty into the standard differential equations governing population processes. Our approach is to add time-series stochasticity to linearized versions of each process. We derive both Monte Carlo and analytic methods for estimating forecast uncertainty. We apply our methods to several examples of marriage and fertility, extending them to simultaneous forecasting of multiple cohorts and to processes restricted by factors such as declining fecundity.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
The conceptual similarity between stochastic differential equations (SDE) and our results is masked by the fact that we work in an exclusively discrete setup.
 
2
The tendency to treat complex processes as linear is occasionally criticized, as in the “general linear reality” paradigm in which the timing and order of events are irrelevant for the outcome and there is no feedback from the outcome to the effect (see Abbot 1988). The diffusion models we use take this critique into account by allowing for dynamic feedback over time.
 
3
Deterministic time trend with transient disturbances appears to be contrary to the idea of diffusion processes in which the past influences the future. The difference-stationary specification, which has “long memory” (Raffalovich 1994), seems to fit better for diffusion processes.
 
4
Deviations from linearity in g signal deviations from model assumptions. In principle, one can use standard methods to test the linearity (Harvery and Leybourne 2007; Hinich 1982). In demography, the number of observations is typically small, resulting in low test power. Therefore, visual inspection of the process may be preferred over formal testing, as in Wu (1990). These remarks apply also to the Gompertz and logistic models.
 
5
The prediction Eq. (6) is preferred because of its simplicity and linearity in exp(g t ). Alternatives include https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq35_HTML.gif , which follows directly from Eq. (2), and a prediction that is based on solving https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq36_HTML.gif from https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq37_HTML.gif . This quadratic equation arises from the approximation https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq38_HTML.gif . Empirically, these result in similar predictions. The predictions have a small bias resulting from a discrete growth factor https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq39_HTML.gif being applied to https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq40_HTML.gif , whereas optimally, one would apply a continuous growth factor from https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq41_HTML.gif to https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq42_HTML.gif . The bias could be reduced by a midpoint correction, analogous to the Euler method for solving differential equations numerically (see Griffiths and Smith 1991), in which the growth factor is averaged over two observations. The bias, however, is small if the step length is small, and it was negligible in simulations in which the step length corresponded to one year. These remarks apply also to the Gompertz and logistic models: a small bias resulting from the discretization is present and could be reduced using the midpoint correction, but in practice, such a correction is not needed with one-year age groups.
 
6
The prediction intervals are estimated in a standard way using https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq68_HTML.gif as the variance estimate for k-step-ahead prediction https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq69_HTML.gif . Because of the discretization (Eq. (4)), g 20 is not observed.
 
7
Billari and Toulemon (2006) used the Hernes model to forecast cohort childlessness and based the forecast uncertainty on uncertainty in the model parameters, in the same spirit as Goldstein and Kenney (2001).
 
8
In empirical applications, the starting age may influence the results. For example, if very young ages were included, the process of adopting the modeled behavior could in these young ages differ from that of the older ages, resulting in nonlinearities in the underlying process g. As a rule of thumb, a reasonable starting age is when g starts to behave linearly.
 
9
We first estimate the drift parameter for each cohort using data up to age 30. We denote this cohort-specific drift parameter by https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq73_HTML.gif . For each cohort, we construct predictions https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq74_HTML.gif for ages 31–45. We estimate the parameter IFC by minimizing https://static-content.springer.com/image/art%3A10.1007%2Fs13524-012-0154-4/MediaObjects/13524_2012_154_IEq75_HTML.gif , where the weights are w t  = (45 – t – 1) / (15 ⋅ 8). With this specification, the weights decline linearly so that w 31 = 1 / 8 and w 31 = 1 / (15 ⋅ 8). Such a weighting gives more weight to young ages, whose contribution to fertility matters more than that of older ages. We then find the best-fitting IFC using a grid search with IFC ranging from 0 to 1.500 with step length 0.001.
 
10
The prediction interval for 1955 cohort is 33 % wider than for cohort 1950 even though the predictions both start at age 30, and the ultimate completed fertility rates are similar. The reason for this is that the 1955 cohort had children later than the 1950 cohort, which means that from the model’s perspective, predictions for the 1955 cohort start earlier.
 
Literature
go back to reference Abbot, A. (1988). Transcending general linear reality. Sociological Theory, 6, 169–186.CrossRef Abbot, A. (1988). Transcending general linear reality. Sociological Theory, 6, 169–186.CrossRef
go back to reference Alho, J. M. (1990). Stochastic methods in population forecasting. International Journal of Forecasting, 6, 521–530.CrossRef Alho, J. M. (1990). Stochastic methods in population forecasting. International Journal of Forecasting, 6, 521–530.CrossRef
go back to reference Alho, J., Alders, M., Cruijsen, H., Keilman, N., Nikander, T., & Pham, D. Q. (2006). New forecast: Population decline postponed in Europe. Statistical Journal of the United Nations Economic Commission for Europe, 23, 1–10. Alho, J., Alders, M., Cruijsen, H., Keilman, N., Nikander, T., & Pham, D. Q. (2006). New forecast: Population decline postponed in Europe. Statistical Journal of the United Nations Economic Commission for Europe, 23, 1–10.
go back to reference Anselin, L. (1988). Spatial econometrics: Methods and models. Dordrecht, The Netherlands: Kluwer Academic Publishers. Anselin, L. (1988). Spatial econometrics: Methods and models. Dordrecht, The Netherlands: Kluwer Academic Publishers.
go back to reference Bernardi, L. (2003). Channels of social influence on reproduction. Population Research and Policy Review, 22, 527–555.CrossRef Bernardi, L. (2003). Channels of social influence on reproduction. Population Research and Policy Review, 22, 527–555.CrossRef
go back to reference Billari, F. C., & Prskawetz, A. (2003). Agent-based computational demography. Heidelberg, Germany: Physica Verlag.CrossRef Billari, F. C., & Prskawetz, A. (2003). Agent-based computational demography. Heidelberg, Germany: Physica Verlag.CrossRef
go back to reference Billari, F. C., & Toulemon, L. (2006). Cohort childlessness forecasts and analysis using the Hernes model. Liverpool, UK: Paper presented at the European Population Conference. Billari, F. C., & Toulemon, L. (2006). Cohort childlessness forecasts and analysis using the Hernes model. Liverpool, UK: Paper presented at the European Population Conference.
go back to reference Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81, 637–654.CrossRef Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81, 637–654.CrossRef
go back to reference Christakis, N. A., & Fowler, J. H. (2008). The collective dynamics of smoking in a large social network. The New England Journal of Medicine, 358, 2249–2258.CrossRef Christakis, N. A., & Fowler, J. H. (2008). The collective dynamics of smoking in a large social network. The New England Journal of Medicine, 358, 2249–2258.CrossRef
go back to reference Coale, A. J., & McNeil, D. R. (1972). The distribution by age of the frequency of first marriage in a female cohort. Journal of the American Statistical Association, 67, 743–749.CrossRef Coale, A. J., & McNeil, D. R. (1972). The distribution by age of the frequency of first marriage in a female cohort. Journal of the American Statistical Association, 67, 743–749.CrossRef
go back to reference Cowan, R., & Jonard, N. (2004). Network structure and the diffusion of knowledge. Journal of Economic Dynamics and Control, 28, 1557–1575.CrossRef Cowan, R., & Jonard, N. (2004). Network structure and the diffusion of knowledge. Journal of Economic Dynamics and Control, 28, 1557–1575.CrossRef
go back to reference Diekmann, A. (1989). Diffusion and survival models for the process of entry into marriage. Journal of Mathematical Sociology, 14, 31–44.CrossRef Diekmann, A. (1989). Diffusion and survival models for the process of entry into marriage. Journal of Mathematical Sociology, 14, 31–44.CrossRef
go back to reference Doreian, P. (1980). Linear models with spatially distributed data: Spatial disturbances or spatial effects? Sociological Methods & Research, 9, 29–60.CrossRef Doreian, P. (1980). Linear models with spatially distributed data: Spatial disturbances or spatial effects? Sociological Methods & Research, 9, 29–60.CrossRef
go back to reference Frances, P. H. (1994). A method to select between Gompertz and logistic trend curves. Technological Forecasting and Social Change, 46, 45–49.CrossRef Frances, P. H. (1994). A method to select between Gompertz and logistic trend curves. Technological Forecasting and Social Change, 46, 45–49.CrossRef
go back to reference Goldstein, J. R. (2010). A behavioral Gompertz model for cohort fertility schedules in low and moderate fertility populations (MPIDR Working Paper WP 2010–021). Rostock, Germany: Max Planck Institute for Demographic Research. Goldstein, J. R. (2010). A behavioral Gompertz model for cohort fertility schedules in low and moderate fertility populations (MPIDR Working Paper WP 2010–021). Rostock, Germany: Max Planck Institute for Demographic Research.
go back to reference Goldstein, J. R., & Kenney, C. T. (2001). Marriage delayed or marriage forgone? New cohort forecasts of first marriage for U.S. women. American Sociological Review, 66, 506–519.CrossRef Goldstein, J. R., & Kenney, C. T. (2001). Marriage delayed or marriage forgone? New cohort forecasts of first marriage for U.S. women. American Sociological Review, 66, 506–519.CrossRef
go back to reference Griffiths, D. V., & Smith, I. M. (1991). Numerical methods for engineers: A programming approach. Boca Raton, FL: CRC Press. Griffiths, D. V., & Smith, I. M. (1991). Numerical methods for engineers: A programming approach. Boca Raton, FL: CRC Press.
go back to reference Gruber, H., & Verboven, F. (2001). The diffusion of mobile telecommunications services in the European Union. European Economic Review, 45, 577–588.CrossRef Gruber, H., & Verboven, F. (2001). The diffusion of mobile telecommunications services in the European Union. European Economic Review, 45, 577–588.CrossRef
go back to reference Hammel, E. A., Mason, C., & Wachter, K. W. (1990). SOCSIM II, A sociodemographic microsimulation program, Rev. 1.0, operating manual (Graduate Group in Demography Working Paper No. 29). Berkeley: University of California, Institute of International Studies, Program in Population Research. Hammel, E. A., Mason, C., & Wachter, K. W. (1990). SOCSIM II, A sociodemographic microsimulation program, Rev. 1.0, operating manual (Graduate Group in Demography Working Paper No. 29). Berkeley: University of California, Institute of International Studies, Program in Population Research.
go back to reference Harvery, D. I., & Leybourne, S. J. (2007). Testing for time series linearity. The Econometrics Journal, 10, 149–165.CrossRef Harvery, D. I., & Leybourne, S. J. (2007). Testing for time series linearity. The Econometrics Journal, 10, 149–165.CrossRef
go back to reference Harvey, A. C. (1984). Time series forecasting based on the logistic curve. Journal of the Operational Research Society, 35, 641–646. Harvey, A. C. (1984). Time series forecasting based on the logistic curve. Journal of the Operational Research Society, 35, 641–646.
go back to reference Hernes, G. (1972). The process of entry into first marriage. American Sociological Review, 37, 173–182.CrossRef Hernes, G. (1972). The process of entry into first marriage. American Sociological Review, 37, 173–182.CrossRef
go back to reference Hinich, M. J. (1982). Testing for Gaussianity and linearity of a stationary time series. Journal of Time Series Analysis, 3, 169–176.CrossRef Hinich, M. J. (1982). Testing for Gaussianity and linearity of a stationary time series. Journal of Time Series Analysis, 3, 169–176.CrossRef
go back to reference Hoem, J. M., Madsen, D., Lovgreen Nielsen, J., Ohlsen, E.-M., Oluf Hansen, H., & Rennermalm, B. (1981). Experiments in modelling recent Danish fertility curves. Demography, 18, 231–244.CrossRef Hoem, J. M., Madsen, D., Lovgreen Nielsen, J., Ohlsen, E.-M., Oluf Hansen, H., & Rennermalm, B. (1981). Experiments in modelling recent Danish fertility curves. Demography, 18, 231–244.CrossRef
go back to reference Ike, S. (2002). A non-stationary stochastic process model of completed marital fertility in Japan. Journal of Mathematical Sociology, 26, 35–55.CrossRef Ike, S. (2002). A non-stationary stochastic process model of completed marital fertility in Japan. Journal of Mathematical Sociology, 26, 35–55.CrossRef
go back to reference Keilman, N., & Pham, D. Q. (2004). Time series based errors and empirical errors in fertility forecasts in the Nordic countries. International Statistical Review, 72, 5–18.CrossRef Keilman, N., & Pham, D. Q. (2004). Time series based errors and empirical errors in fertility forecasts in the Nordic countries. International Statistical Review, 72, 5–18.CrossRef
go back to reference Kohler, H.-P. (2001). Fertility and social interaction: An economic perspective. Oxford, UK: Oxford University Press.CrossRef Kohler, H.-P. (2001). Fertility and social interaction: An economic perspective. Oxford, UK: Oxford University Press.CrossRef
go back to reference Land, K. C., Deane, G., & Blau, J. R. (1991). Religious pluralism and church membership: A spatial diffusion model. American Sociological Review, 56, 237–249.CrossRef Land, K. C., Deane, G., & Blau, J. R. (1991). Religious pluralism and church membership: A spatial diffusion model. American Sociological Review, 56, 237–249.CrossRef
go back to reference Lee, R. D. (1993). Modeling and forecasting the time series of U.S. fertility: Age patterns, range, and ultimate level. International Journal of Forecasting, 9, 187–202.CrossRef Lee, R. D. (1993). Modeling and forecasting the time series of U.S. fertility: Age patterns, range, and ultimate level. International Journal of Forecasting, 9, 187–202.CrossRef
go back to reference Lee, R. D. (1998). Probabilistic approaches to population forecasting. Population and Development Review, 24, 156–190.CrossRef Lee, R. D. (1998). Probabilistic approaches to population forecasting. Population and Development Review, 24, 156–190.CrossRef
go back to reference Lee, R. D., & Tuljapurkar, S. (1994). Stochastic population forecasts for the United States: Beyond high, medium, and low. Journal of the American Statistical Association, 89, 1175–1189.CrossRef Lee, R. D., & Tuljapurkar, S. (1994). Stochastic population forecasts for the United States: Beyond high, medium, and low. Journal of the American Statistical Association, 89, 1175–1189.CrossRef
go back to reference Li, N., & Wu, Z. (2008, April). Modeling and forecasting first marriage: A latent function approach. Paper presented at the annual meeting of the Population Association of America, New Orleans, LA. Li, N., & Wu, Z. (2008, April). Modeling and forecasting first marriage: A latent function approach. Paper presented at the annual meeting of the Population Association of America, New Orleans, LA.
go back to reference Lutz, W., & Goldstein, J. R. (2004). Introduction: How to deal with uncertainty in population forecasting? International Statistical Review, 72, 1–4.CrossRef Lutz, W., & Goldstein, J. R. (2004). Introduction: How to deal with uncertainty in population forecasting? International Statistical Review, 72, 1–4.CrossRef
go back to reference Lutz, W., Sanderson, W., & Scherbov, S. (2001). The end of world population growth. Nature, 412, 543–545.CrossRef Lutz, W., Sanderson, W., & Scherbov, S. (2001). The end of world population growth. Nature, 412, 543–545.CrossRef
go back to reference Mansfield, E. (1961). Technical change and the rate of imitation. Econometrica, 29, 741–766.CrossRef Mansfield, E. (1961). Technical change and the rate of imitation. Econometrica, 29, 741–766.CrossRef
go back to reference Mar-Molinero, C. (1980). Tractors in Spain: A logistic analysis. Journal of the Operational Research Society, 31, 141–152. Mar-Molinero, C. (1980). Tractors in Spain: A logistic analysis. Journal of the Operational Research Society, 31, 141–152.
go back to reference Marsden, P. V., & Friedkin, N. E. (1993). Network studies of social influence. Sociological Methods & Research, 22, 127–151.CrossRef Marsden, P. V., & Friedkin, N. E. (1993). Network studies of social influence. Sociological Methods & Research, 22, 127–151.CrossRef
go back to reference Martin, S. P. (2004). Reassessing delayed and forgone marriage in the United States (Working paper). New York: Russell Sage. Martin, S. P. (2004). Reassessing delayed and forgone marriage in the United States (Working paper). New York: Russell Sage.
go back to reference Meade, N., & Islam, T. (2006). Modelling and forecasting the diffusion of innovation—A 25-year review. International Journal of Forecasting, 22, 519–545.CrossRef Meade, N., & Islam, T. (2006). Modelling and forecasting the diffusion of innovation—A 25-year review. International Journal of Forecasting, 22, 519–545.CrossRef
go back to reference Øksendael, B. (2003). Stochastic differential equations (6th ed.). Berlin, Germany: Springer.CrossRef Øksendael, B. (2003). Stochastic differential equations (6th ed.). Berlin, Germany: Springer.CrossRef
go back to reference Pearl, R., & Reed, L. J. (1920). On the rate of growth of the population of the United States and its mathematical representation. Proceedings of the National Academy of Sciences, 6, 275–288.CrossRef Pearl, R., & Reed, L. J. (1920). On the rate of growth of the population of the United States and its mathematical representation. Proceedings of the National Academy of Sciences, 6, 275–288.CrossRef
go back to reference Pollard, J., & Valkovics, E. (1992). The Gompertz distribution and its applications. Genus, 48(3–4), 15–29. Pollard, J., & Valkovics, E. (1992). The Gompertz distribution and its applications. Genus, 48(3–4), 15–29.
go back to reference Preston, S. H., Heuveline, P., & Guillot, M. (2001). Demography: Measuring and modeling population processes. Oxford, UK: Blackwell Publishers. Preston, S. H., Heuveline, P., & Guillot, M. (2001). Demography: Measuring and modeling population processes. Oxford, UK: Blackwell Publishers.
go back to reference Raffalovich, L. E. (1994). Detrending time series: A cautionary note. Sociological Methods and Research, 22, 492–519.CrossRef Raffalovich, L. E. (1994). Detrending time series: A cautionary note. Sociological Methods and Research, 22, 492–519.CrossRef
go back to reference Tolnay, S. E., Deane, G., & Beck, E. M. (1996). Vicarious violence: Spatial effects on southern lynchings, 1890–1919. The American Journal of Sociology, 102, 788–815.CrossRef Tolnay, S. E., Deane, G., & Beck, E. M. (1996). Vicarious violence: Spatial effects on southern lynchings, 1890–1919. The American Journal of Sociology, 102, 788–815.CrossRef
go back to reference Valente, T. W. (1995). Network models of the diffusion of innovations. Cresskill, NJ: Hampton Press. Valente, T. W. (1995). Network models of the diffusion of innovations. Cresskill, NJ: Hampton Press.
go back to reference Vaupel, J. W. (2010). Biodemography of human ageing. Nature, 464, 536–542.CrossRef Vaupel, J. W. (2010). Biodemography of human ageing. Nature, 464, 536–542.CrossRef
go back to reference Wachter, K. W. (1987). Microsimulation of household cycles. In J. Bongaarts, T. Burch, & K. W. Wachter (Eds.), Family demography: Methods and their application (pp. 215–227). Oxford, UK: Clarendon. Wachter, K. W. (1987). Microsimulation of household cycles. In J. Bongaarts, T. Burch, & K. W. Wachter (Eds.), Family demography: Methods and their application (pp. 215–227). Oxford, UK: Clarendon.
go back to reference Winsor, C. P. (1932). The Gompertz curve as a growth curve. Proceedings of the National Academy of Sciences of the United States of America, 18(1), 1–8.CrossRef Winsor, C. P. (1932). The Gompertz curve as a growth curve. Proceedings of the National Academy of Sciences of the United States of America, 18(1), 1–8.CrossRef
go back to reference Wu, L. L. (1990). Simple graphical goodness-of-fit tests for hazard rate models. Madison: University of Wisconsin Press. Wu, L. L. (1990). Simple graphical goodness-of-fit tests for hazard rate models. Madison: University of Wisconsin Press.
Metadata
Title
Probabilistic Forecasting Using Stochastic Diffusion Models, With Applications to Cohort Processes of Marriage and Fertility
Authors
Mikko Myrskylä
Joshua R. Goldstein
Publication date
01-02-2013
Publisher
Springer US
Published in
Demography / Issue 1/2013
Print ISSN: 0070-3370
Electronic ISSN: 1533-7790
DOI
https://doi.org/10.1007/s13524-012-0154-4

Other articles of this Issue 1/2013

Demography 1/2013 Go to the issue