Skip to main content
Erschienen in: Quality & Quantity 3/2022

14.10.2021

Multilevel and time-series missing value imputation for combined survey and longitudinal context data

verfasst von: David Wutchiett, Claire Durand

Erschienen in: Quality & Quantity | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Comparative research examining relationships between individual-level survey response data and time-varying country context variables for political or socioeconomic characteristics is often complicated by missing values. Surveys and longitudinal context measures may be produced during alternative years and at differing frequencies. Observations may be intermittent or may only cover few consecutive years across a full longitudinal sequence. Statistical evaluations that do not impute values with consideration to data’s missingness characteristics may produce biased estimates. Model-based approaches for missing value imputation such as multiple imputation and time series imputation offer means through which imputed values may be produced given complex hierarchical and longitudinal relations. Using incomplete survey data for institutional trust measures from 554,104 respondents from twenty-seven Eastern European and Central Asian countries between 1993 and 2016, and corresponding longitudinal context descriptors of demographic, socioeconomic and political conditions, multilevel multiple imputation and time-series imputation methods were compared and evaluated. Where missingness is intermittent across the breadth of longitudinal sequence, time series imputation may produce convincing estimates for national-level variables’ values while understating uncertainty associated with imputation. When missing values are numerous and span tail ends of a sequence, multivariate multilevel multiple imputation with time variable fixed effects may produce better estimates for country-variables through incorporation of information derived from additional covariates and other countries’ concurrent trajectories. Multilevel multiple imputation models with random slopes for time variables were found to have beneficial qualities in that countries’ unique longitudinal trends are emphasized and fit while that effects of pooled observations and additional covariates contribute to estimation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
For example, within the regions of Eastern Europe and West Asia, measurement and public release of several economic and demographic characteristics began in large part with the 1990s. Only since the 2000s has the report of context characteristic data largely increased in frequency and regularity within the countries of Central and Southwest Asia and Southeast Europe.
 
2
These countries are Albania, Armenia, Belarus, Bosnia-Herzegovina, Bulgaria, Croatia, Czech Republic, Estonia, Georgia, Greece, Hungary, Kazakhstan, Kyrgyzstan, Latvia, Lithuania, Macedonia, Moldova, Poland, Romania, Russia, Serbia, Slovakia, Slovenia, Tajikistan, Turkey, Ukraine and Uzbekistan.
 
3
The programs were the World Values Survey, the Caucasus Barometer, Consolidation of Democracy in Central and Eastern Europe, European Quality of Life, European Social Survey, European Values Study, Life in Transition, New Baltic Barometer, the New Europe Barometer, the New Russia Barometer, Values and Change in Post-communist Europe and the Asian Barometer.
 
4
Approaches’ prediction matrices used within the mice imputation function are further defined in Appendix B.
 
5
When conducting univariate time series imputation, Poland and Serbia’s longitudinal sequences for the poverty rate had zero observations. In such a case, the univariate time series method has no data to draw upon and no imputations could be performed. As such, yearly mean values of the complete imputed poverty rate variable across the other twenty-five countries were used to generate estimates for the two countries’ yearly poverty rate values.
 
6
Countries with no observations for a variable were not considered.
 
7
The missing data pattern for the combined two-level data is described in Appendix A.
 
8
A preliminary complete case analysis of associations between poverty rate and all other considered longitudinal context and survey respondent covariates through use of a multilevel model with random effects found all independent variables except respondent age to be significantly associated with a country’s yearly poverty rate.
 
Literatur
Zurück zum Zitat Bryk, A.S., Raudenbush, S.W.: Toward a more appropriate conceptualization of research on school effects: a three-level hierarchical linear model. Am. J. Educ. 97(1), 65–108 (1989)CrossRef Bryk, A.S., Raudenbush, S.W.: Toward a more appropriate conceptualization of research on school effects: a three-level hierarchical linear model. Am. J. Educ. 97(1), 65–108 (1989)CrossRef
Zurück zum Zitat Bryk, A.S., Raudenbush, S.W.: Hierarchical linear models: applications and data analysis methods. Sage Publications, Inc. (1992) Bryk, A.S., Raudenbush, S.W.: Hierarchical linear models: applications and data analysis methods. Sage Publications, Inc. (1992)
Zurück zum Zitat Cave, W., Giovannini, E.: The statistical measurement of services: recent achievements and remaining challenges. Metroeconomica 58(3), 479–501 (2007)CrossRef Cave, W., Giovannini, E.: The statistical measurement of services: recent achievements and remaining challenges. Metroeconomica 58(3), 479–501 (2007)CrossRef
Zurück zum Zitat Center for systemic peace. Polity IV database version. Center for systemic peace, Vienna, VA (2018). Center for systemic peace. Polity IV database version. Center for systemic peace, Vienna, VA (2018).
Zurück zum Zitat Easterly, W., Ritzen, J., Woolcock, M.: Social cohesion, institutions, and growth. Econ Polit 18(2), 103–120 (2006)CrossRef Easterly, W., Ritzen, J., Woolcock, M.: Social cohesion, institutions, and growth. Econ Polit 18(2), 103–120 (2006)CrossRef
Zurück zum Zitat Elahi, A. Challenges of data collection: with special regard to developing countries. In OECD: statistics, knowledge and policy 2007: Measuring and fostering the progress of societies. (2008). Elahi, A. Challenges of data collection: with special regard to developing countries. In OECD: statistics, knowledge and policy 2007: Measuring and fostering the progress of societies. (2008).
Zurück zum Zitat Enders, C.K., Mistler, S.A., Keller, B.T.: Multilevel multiple imputation: a review and evaluation of joint modeling and chained equations imputation. Psychol. Methods 21(2), 222 (2016)CrossRef Enders, C.K., Mistler, S.A., Keller, B.T.: Multilevel multiple imputation: a review and evaluation of joint modeling and chained equations imputation. Psychol. Methods 21(2), 222 (2016)CrossRef
Zurück zum Zitat Gelman, A., Hill, J. Data analysis using regression and hierarchical/multilevel models. Cambridge, New York (2007). Gelman, A., Hill, J. Data analysis using regression and hierarchical/multilevel models. Cambridge, New York (2007).
Zurück zum Zitat Gesthuizen, M., Van der Meer, T., Scheepers, P.: Ethnic diversity and social capital in Europe: tests of Putnam’s thesis in European countries. Scand. Polit. Stud. 32(2), 121–142 (2009)CrossRef Gesthuizen, M., Van der Meer, T., Scheepers, P.: Ethnic diversity and social capital in Europe: tests of Putnam’s thesis in European countries. Scand. Polit. Stud. 32(2), 121–142 (2009)CrossRef
Zurück zum Zitat Grund, S., Lüdtke, O., Robitzsch, A.: Multiple imputation of missing data at level 2: a comparison of fully conditional and joint modeling in multilevel designs. J. Educational Behavioral Statist. 43(3), 316–353 (2018a)CrossRef Grund, S., Lüdtke, O., Robitzsch, A.: Multiple imputation of missing data at level 2: a comparison of fully conditional and joint modeling in multilevel designs. J. Educational Behavioral Statist. 43(3), 316–353 (2018a)CrossRef
Zurück zum Zitat Grund, S., Lüdtke, O., Robitzsch, A.: Multiple imputation of missing data for multilevel models: simulations and recommendations. Organ. Res. Methods 21(1), 111–149 (2018b)CrossRef Grund, S., Lüdtke, O., Robitzsch, A.: Multiple imputation of missing data for multilevel models: simulations and recommendations. Organ. Res. Methods 21(1), 111–149 (2018b)CrossRef
Zurück zum Zitat Honaker, J., King, J.: What to do about missing values in time-series cross-section data. Am. J. Political Sci. 54(2), 561–581 (2010)CrossRef Honaker, J., King, J.: What to do about missing values in time-series cross-section data. Am. J. Political Sci. 54(2), 561–581 (2010)CrossRef
Zurück zum Zitat Hudson, J.: Institutional trust and subjective well-being across the EU. Kyklos 59(1), 43–62 (2006)CrossRef Hudson, J.: Institutional trust and subjective well-being across the EU. Kyklos 59(1), 43–62 (2006)CrossRef
Zurück zum Zitat Joye, D., Sapin, M., Wolf, C.: Weights in comparative surveys? Call for Open. Black Box Newslett. Harmon. Soc. Sci. 5(2), 2–16 (2019) Joye, D., Sapin, M., Wolf, C.: Weights in comparative surveys? Call for Open. Black Box Newslett. Harmon. Soc. Sci. 5(2), 2–16 (2019)
Zurück zum Zitat Kołczyńska, M.: Micro-and Macro-Level Determinants of Participation in Demonstrations: An Analysis of Cross-National Survey Data Harmonized Ex-Post, pp. 1–36. Methods, Data, Analyses pp (2019) Kołczyńska, M.: Micro-and Macro-Level Determinants of Participation in Demonstrations: An Analysis of Cross-National Survey Data Harmonized Ex-Post, pp. 1–36. Methods, Data, Analyses pp (2019)
Zurück zum Zitat Kornai, J.: The great transformation of central Eastern Europe: success and disappointment. Econ. Transit. 14(2), 207–244 (2006)CrossRef Kornai, J.: The great transformation of central Eastern Europe: success and disappointment. Econ. Transit. 14(2), 207–244 (2006)CrossRef
Zurück zum Zitat Lindberg, S.I., Coppedge, M., Gerring, J., Teorell, J.: V-Dem: a new way to measure democracy. J. Democr. 25(3), 159–169 (2014)CrossRef Lindberg, S.I., Coppedge, M., Gerring, J., Teorell, J.: V-Dem: a new way to measure democracy. J. Democr. 25(3), 159–169 (2014)CrossRef
Zurück zum Zitat Little, Roderick J.A., Rubin, D.B. Statistical Analysis with Missing Data. Wiley (2019). Little, Roderick J.A., Rubin, D.B. Statistical Analysis with Missing Data. Wiley (2019).
Zurück zum Zitat McDonald, D.G., Dimmick, J.: The Conceptualization and measurement of diversity. Commun. Res. 30(1), 60–79 (2003)CrossRef McDonald, D.G., Dimmick, J.: The Conceptualization and measurement of diversity. Commun. Res. 30(1), 60–79 (2003)CrossRef
Zurück zum Zitat Medve-Bálint, G., Boda, Z.: The poorer you are, the more you trust? The effect of inequality and income on institutional trust in East-Central Europe. Czech Sociol. Rev. 50(3), 419–454 (2014)CrossRef Medve-Bálint, G., Boda, Z.: The poorer you are, the more you trust? The effect of inequality and income on institutional trust in East-Central Europe. Czech Sociol. Rev. 50(3), 419–454 (2014)CrossRef
Zurück zum Zitat Mistler, S.A., Enders, C.K.: A comparison of joint model and fully conditional specification imputation for multilevel missing data. J. Educ. Behav. Stat. 42(4), 432–466 (2017)CrossRef Mistler, S.A., Enders, C.K.: A comparison of joint model and fully conditional specification imputation for multilevel missing data. J. Educ. Behav. Stat. 42(4), 432–466 (2017)CrossRef
Zurück zum Zitat Montalvo, D. Understanding trust in municipal governments. AmericasBarometer Insights 35 (2010) Montalvo, D. Understanding trust in municipal governments. AmericasBarometer Insights 35 (2010)
Zurück zum Zitat Moritz, S., Bartz-Beielstein, T.: ImputeTS: time series missing value imputation in R. R. J. 9(1), 207–218 (2017)CrossRef Moritz, S., Bartz-Beielstein, T.: ImputeTS: time series missing value imputation in R. R. J. 9(1), 207–218 (2017)CrossRef
Zurück zum Zitat Moritz, S., Sardá, A., Bartz-Beielstein, T., Zaefferer, M., Stork, J.: Comparison of different methods for univariate time series imputation in R. ArXiv Preprint (2015) Moritz, S., Sardá, A., Bartz-Beielstein, T., Zaefferer, M., Stork, J.: Comparison of different methods for univariate time series imputation in R. ArXiv Preprint (2015)
Zurück zum Zitat Nardulli, P.F., Wong, C.J., Singh, A., Peyton, B., Bajjaliegh, J.: The Composition of Religious and Ethnic Groups (CREG) Project. University of Illinois, Urbana-Champaign, Cline Center for Democracy (2012) Nardulli, P.F., Wong, C.J., Singh, A., Peyton, B., Bajjaliegh, J.: The Composition of Religious and Ethnic Groups (CREG) Project. University of Illinois, Urbana-Champaign, Cline Center for Democracy (2012)
Zurück zum Zitat Raghunathan, T.E., Lepkowski, J.M., Van Hoewyk, J., Solenberger, P.: A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv. methodol. 27(1), 85–96 (2001) Raghunathan, T.E., Lepkowski, J.M., Van Hoewyk, J., Solenberger, P.: A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv. methodol. 27(1), 85–96 (2001)
Zurück zum Zitat Robila, M. Families in Eastern Europe: Context, Trends and Variations. In: Families in Eastern Europe. pp 1–14 Emerald Group Publishing Limited (2004). Robila, M. Families in Eastern Europe: Context, Trends and Variations. In: Families in Eastern Europe. pp 1–14 Emerald Group Publishing Limited (2004).
Zurück zum Zitat Rontos, K., Roumeliotou, M.: Generalized social trust in Greece and its association with demographic and socio-economic predictors. Port. J. Soc. Sci. 12(1), 63–84 (2013) Rontos, K., Roumeliotou, M.: Generalized social trust in Greece and its association with demographic and socio-economic predictors. Port. J. Soc. Sci. 12(1), 63–84 (2013)
Zurück zum Zitat Rubin, D.B.: Multiple Imputation for Nonresponse in Surveys. John Wiley, New York (1987)CrossRef Rubin, D.B.: Multiple Imputation for Nonresponse in Surveys. John Wiley, New York (1987)CrossRef
Zurück zum Zitat Schafer, J.L., Yucel, R.M.: Computational strategies for multivariate linear mixed-effects models with missing values. J. Comput. Graph. Stat. 11(2), 437–457 (2002)CrossRef Schafer, J.L., Yucel, R.M.: Computational strategies for multivariate linear mixed-effects models with missing values. J. Comput. Graph. Stat. 11(2), 437–457 (2002)CrossRef
Zurück zum Zitat Seeta Prabhu, K.: Social statistics for human development reports and millennium development goal reports: challenges and constraints. J. Hum. Dev. 6(3), 375–397 (2005)CrossRef Seeta Prabhu, K.: Social statistics for human development reports and millennium development goal reports: challenges and constraints. J. Hum. Dev. 6(3), 375–397 (2005)CrossRef
Zurück zum Zitat Snijders, T. and Bosker, R. Multilevel Analysis. An introduction to Basic and Advanced Multilevel Modeling (Second edition) pp. 354 Sage Publications, London (2012). Snijders, T. and Bosker, R. Multilevel Analysis. An introduction to Basic and Advanced Multilevel Modeling (Second edition) pp. 354 Sage Publications, London (2012).
Zurück zum Zitat Taljaard, M., Donner, A., Klar, N.: Imputation strategies for missing continuous outcomes in cluster randomized trials. Biom. J. 50(3), 329–345 (2008)CrossRef Taljaard, M., Donner, A., Klar, N.: Imputation strategies for missing continuous outcomes in cluster randomized trials. Biom. J. 50(3), 329–345 (2008)CrossRef
Zurück zum Zitat Tang, L., Song, J., Belin, T.R., Unützer, J.: A comparison of imputation methods in a longitudinal randomized clinical trial. Stat. Med. 24(14), 2111–2128 (2005)CrossRef Tang, L., Song, J., Belin, T.R., Unützer, J.: A comparison of imputation methods in a longitudinal randomized clinical trial. Stat. Med. 24(14), 2111–2128 (2005)CrossRef
Zurück zum Zitat Teachman, J.D.: Analysis of population diversity: measures of qualitative variation. Sociol. Methods Res. 8(3), 341–362 (1980)CrossRef Teachman, J.D.: Analysis of population diversity: measures of qualitative variation. Sociol. Methods Res. 8(3), 341–362 (1980)CrossRef
Zurück zum Zitat van Buuren, S.: Multiple imputation of multilevel data. Handb. Adv. Multilevel Anal. 10, 173–196 (2011) van Buuren, S.: Multiple imputation of multilevel data. Handb. Adv. Multilevel Anal. 10, 173–196 (2011)
Zurück zum Zitat van Buuren, S. Flexible Imputation of Missing Data. Chapman and Hall/CRC (2018). van Buuren, S. Flexible Imputation of Missing Data. Chapman and Hall/CRC (2018).
Zurück zum Zitat van Buuren, S., Groothuis-Oudshoorn, K.: Mice: multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–68 (2010) van Buuren, S., Groothuis-Oudshoorn, K.: Mice: multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–68 (2010)
Zurück zum Zitat van Buuren, S., Brand, J.P., Groothuis-Oudshoorn, C.G., Rubin, D.B.: Fully conditional specification in multivariate imputation. J. Stat. Comput. Simul. 76(12), 1049–1064 (2006)CrossRef van Buuren, S., Brand, J.P., Groothuis-Oudshoorn, C.G., Rubin, D.B.: Fully conditional specification in multivariate imputation. J. Stat. Comput. Simul. 76(12), 1049–1064 (2006)CrossRef
Zurück zum Zitat Vodopivec, M., Wörgötter, A., Raju, D.: Unemployment benefit systems in central and Eastern Europe: a review of the 1990s. Comp. Econ. Stud. 47(4), 615–651 (2005)CrossRef Vodopivec, M., Wörgötter, A., Raju, D.: Unemployment benefit systems in central and Eastern Europe: a review of the 1990s. Comp. Econ. Stud. 47(4), 615–651 (2005)CrossRef
Zurück zum Zitat You, J.: Social trust: fairness matters more than homogeneity. Polit. Psychol. 33(5), 701–721 (2012)CrossRef You, J.: Social trust: fairness matters more than homogeneity. Polit. Psychol. 33(5), 701–721 (2012)CrossRef
Zurück zum Zitat Yucel, R.M.: Multiple imputation inference for multivariate multilevel continuous data with ignorable non-response. Philos. Trans. r. Soc. Math. Phys. Eng. Sci. 366(1874), 2389–2403 (2008) Yucel, R.M.: Multiple imputation inference for multivariate multilevel continuous data with ignorable non-response. Philos. Trans. r. Soc. Math. Phys. Eng. Sci. 366(1874), 2389–2403 (2008)
Zurück zum Zitat Zhao, J.H., Schafer, J.L. pan: Multiple Imputation for Multivariate Panel or Clustered Data. R Package Version 1.4 (2016). Zhao, J.H., Schafer, J.L. pan: Multiple Imputation for Multivariate Panel or Clustered Data. R Package Version 1.4 (2016).
Metadaten
Titel
Multilevel and time-series missing value imputation for combined survey and longitudinal context data
verfasst von
David Wutchiett
Claire Durand
Publikationsdatum
14.10.2021
Verlag
Springer Netherlands
Erschienen in
Quality & Quantity / Ausgabe 3/2022
Print ISSN: 0033-5177
Elektronische ISSN: 1573-7845
DOI
https://doi.org/10.1007/s11135-021-01186-8

Weitere Artikel der Ausgabe 3/2022

Quality & Quantity 3/2022 Zur Ausgabe

Premium Partner