Skip to main content
Top
Published in: Quality & Quantity 4/2016

20-06-2015

Entropy balancing: a maximum-entropy reweighting scheme to adjust for coverage error

Authors: Samantha K. Watson, Mark Elliot

Published in: Quality & Quantity | Issue 4/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper presents a newly available technique to adjust for bias in non-probabilistically selected samples. To date, applications of this innovative technique—termed entropy balancing—have been restricted to evaluation settings, where the goal is to reduce model dependence prior to the estimation of treatment effects. In a novel application, we demonstrate the technique’s utility in cases where the goal is to correct for sample bias originating in coverage error. The appeal of entropy balancing in this latter setting lies in its capacity to optimise the twin goals of improved balance in covariate distribution and maximum retention of information. Entropy balancing combines the opportunity to incorporate a large set of moment conditions in the calculation of weights, with the ability to directly implement exact balance. The technique thus builds upon the theoretical appeal of the more widely known and applied propensity score adjustment method, while addressing that method’s practical limitations. We demonstrate the utility of the entropy balancing technique empirically, through an example using the Young Lives Project survey data for rural Andhra Pradesh, South India. We conclude by summarising the potential of this procedure to contribute to robust survey-based research more widely.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
See Hainmueller (2012) for a comprehensive presentation of the theoretical framework.
 
2
In cases where only marginal population probabilities are available (from summarised census data for example) the ebalance procedure allows for values to be manually specified to reweight the non-probability sample covariates in line with available known population targets.
 
3
All analysis is conducted in STATA 13 software; Hainmueller’s “ebalance” suite of commands to perform the entropy balance procedure can be imported to STATA in the usual manner, i.e. “ssc install ebalance, all replace”.
 
4
The survey was sponsored by the UK Department for International Development (DFID), and is led by the Oxford Department of International Development at the University of Oxford, in collaboration with academic institutions in each of the four project countries.
 
5
In the second round of data collection all individuals resident in a selected household were included in the survey.
 
6
Andersson (1996) discusses the general method of sentinel site sampling in some detail.
 
7
At the all India level a total of 124,680 households and 602,833 individuals took part in the survey for schedule 10 of the 61st round of the NSS.
 
8
Household class is calculated on the basis of household landholding and dominant labour relations.
 
9
The default iteration number is 20, the default tolerance level 0.015, and both can be increased if convergence fails.
 
Literature
go back to reference Abadie, A., Imbens, G.W.: Bias-corrected matching estimators for average treatment effects. J. Bus. Econ. Stat. 29(1), 1–11 (2011)CrossRef Abadie, A., Imbens, G.W.: Bias-corrected matching estimators for average treatment effects. J. Bus. Econ. Stat. 29(1), 1–11 (2011)CrossRef
go back to reference Andersson, N.: Evidence-based planning: the philosophy and methods of sentinel community surveillance. Economic Development Institute of the World Bank, Washington (1996) Andersson, N.: Evidence-based planning: the philosophy and methods of sentinel community surveillance. Economic Development Institute of the World Bank, Washington (1996)
go back to reference Duffy, B., Terhanian, G., Bremer, J., Smith, K.: Comparing data from online and face-to-face surveys. Int. J. Market Res. 47(6), 615–639 (2005) Duffy, B., Terhanian, G., Bremer, J., Smith, K.: Comparing data from online and face-to-face surveys. Int. J. Market Res. 47(6), 615–639 (2005)
go back to reference Frölich, M.: Propensity score matching without conditional independence assumption—with an application to the gender wage gap in the United Kingdom. Econ. J. 10(2), 359–407 (2007)CrossRef Frölich, M.: Propensity score matching without conditional independence assumption—with an application to the gender wage gap in the United Kingdom. Econ. J. 10(2), 359–407 (2007)CrossRef
go back to reference Hainmueller, J.: Entropy balancing for causal effects: a multivariate reweighting method to produce balanced samples in observational studies. Polit. Anal. 20(1), 25–46 (2012)CrossRef Hainmueller, J.: Entropy balancing for causal effects: a multivariate reweighting method to produce balanced samples in observational studies. Polit. Anal. 20(1), 25–46 (2012)CrossRef
go back to reference Heckman, J.J., Ichimura, H., Todd, P.: Matching as an econometric evaluation estimator. Rev. Econ. Stud. 65(2), 261–294 (1998)CrossRef Heckman, J.J., Ichimura, H., Todd, P.: Matching as an econometric evaluation estimator. Rev. Econ. Stud. 65(2), 261–294 (1998)CrossRef
go back to reference Ho, D.E., Imai, K., King, G., Stuart, E.A.: Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Polit. Anal. 15(3), 199–236 (2007)CrossRef Ho, D.E., Imai, K., King, G., Stuart, E.A.: Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Polit. Anal. 15(3), 199–236 (2007)CrossRef
go back to reference Isaksson, A., Forsman, G.: A comparison between using the web and using the telephone to survey political opinions. In: Proceedings of the section on survey research methods, American Statistical Association, Alexandria (2003) Isaksson, A., Forsman, G.: A comparison between using the web and using the telephone to survey political opinions. In: Proceedings of the section on survey research methods, American Statistical Association, Alexandria (2003)
go back to reference Kalton, G.: Models in the practice of survey sampling (revisited). J. Off. Stat. 18(2), 129–154 (2002) Kalton, G.: Models in the practice of survey sampling (revisited). J. Off. Stat. 18(2), 129–154 (2002)
go back to reference Lee, S.: Propensity score adjustment as a weighting scheme for volunteer panel web surveys. J. Off. Stat. 22(2), 329–349 (2006) Lee, S.: Propensity score adjustment as a weighting scheme for volunteer panel web surveys. J. Off. Stat. 22(2), 329–349 (2006)
go back to reference MSPI: Annexure II—population projection. National Sample Survey Organisation, New Delhi (2006) MSPI: Annexure II—population projection. National Sample Survey Organisation, New Delhi (2006)
go back to reference MSPI: How to use unit level data. Ministry of Statistics and Programme Implementation, New Delhi (2008) MSPI: How to use unit level data. Ministry of Statistics and Programme Implementation, New Delhi (2008)
go back to reference NSSO: Note on estimation procedure of NSS 61st round. Government of India, New Delhi (2004) NSSO: Note on estimation procedure of NSS 61st round. Government of India, New Delhi (2004)
go back to reference Rosenbaum, P.R., Rubin, D.B.: The central role of the propensity score in observational studies for causal effects. Biometrika 70(1), 41–55 (1983)CrossRef Rosenbaum, P.R., Rubin, D.B.: The central role of the propensity score in observational studies for causal effects. Biometrika 70(1), 41–55 (1983)CrossRef
go back to reference Schonlau, M., Van Soest, A., Kapteyn, A., Couper, M.: Selection bias in web surveys and the use of propensity scores. Sociol. Methods Res. 37(3), 291–318 (2009)CrossRef Schonlau, M., Van Soest, A., Kapteyn, A., Couper, M.: Selection bias in web surveys and the use of propensity scores. Sociol. Methods Res. 37(3), 291–318 (2009)CrossRef
go back to reference Sekhon, J.S.: Opiates for the matches: matching methods for causal inference. Annu. Rev. Polit. Sci. 12, 487–508 (2009)CrossRef Sekhon, J.S.: Opiates for the matches: matching methods for causal inference. Annu. Rev. Polit. Sci. 12, 487–508 (2009)CrossRef
go back to reference Steinmetz, S.M., Bianchi, A., Tijdens, K.G., Biffignandi, S.: Improving web survey quality—potentials and constraints of propensity score weighting. In: Callegaro, M., Baker, R., Bethlehem, J., Göritz, A., Krosnick, J., Lavrakas, P. (eds.) Online panel research: a data quality perspective. Wiley, New York (2014) Steinmetz, S.M., Bianchi, A., Tijdens, K.G., Biffignandi, S.: Improving web survey quality—potentials and constraints of propensity score weighting. In: Callegaro, M., Baker, R., Bethlehem, J., Göritz, A., Krosnick, J., Lavrakas, P. (eds.) Online panel research: a data quality perspective. Wiley, New York (2014)
go back to reference Stuart, E., Cole, S.R., Cole, Stephen R., Bradshaw, C.P., Leaf, P.J.: The use of propensity scores to assess the generalizability of results from randomized trials. J. R. Stat. Soc. 174(2), 369–386 (2011)CrossRef Stuart, E., Cole, S.R., Cole, Stephen R., Bradshaw, C.P., Leaf, P.J.: The use of propensity scores to assess the generalizability of results from randomized trials. J. R. Stat. Soc. 174(2), 369–386 (2011)CrossRef
go back to reference Vavreck, L., Rivers, D.: The 2006 cooperative congressional election study. J. Elect. 18(4), 355–366 (2008) Vavreck, L., Rivers, D.: The 2006 cooperative congressional election study. J. Elect. 18(4), 355–366 (2008)
go back to reference Wilson, I., Huttly, S.R., Fenn, B.: A case study of sample design for longitudinal research: Young Lives. Int. J. Soc. Res. Methodol. 9(5), 351–365 (2006)CrossRef Wilson, I., Huttly, S.R., Fenn, B.: A case study of sample design for longitudinal research: Young Lives. Int. J. Soc. Res. Methodol. 9(5), 351–365 (2006)CrossRef
go back to reference Zhao, Z.: Sensitivity of propensity score methods to the specifications. IZA forschungsinstitut zur Zukunft der Arbeit (Institute for the Study of Labour). Discussion Paper No. 1873 (2005) Bonn. Available at ftp.iza.org/dp1873.pdf Zhao, Z.: Sensitivity of propensity score methods to the specifications. IZA forschungsinstitut zur Zukunft der Arbeit (Institute for the Study of Labour). Discussion Paper No. 1873 (2005) Bonn. Available at ftp.​iza.​org/​dp1873.​pdf
Metadata
Title
Entropy balancing: a maximum-entropy reweighting scheme to adjust for coverage error
Authors
Samantha K. Watson
Mark Elliot
Publication date
20-06-2015
Publisher
Springer Netherlands
Published in
Quality & Quantity / Issue 4/2016
Print ISSN: 0033-5177
Electronic ISSN: 1573-7845
DOI
https://doi.org/10.1007/s11135-015-0235-8

Other articles of this Issue 4/2016

Quality & Quantity 4/2016 Go to the issue

Premium Partner