Top

Data Mining and Knowledge Discovery

Published in:

09-11-2022

Differentiated matching for individual and average treatment effect estimation

Authors: Zhao Ziyu, Kun Kuang, Bo Li, Peng Cui, Runze Wu, Jun Xiao, Fei Wu

Published in: Data Mining and Knowledge Discovery | Issue 1/2023

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

One fundamental problem of causal inference is estimating treatment eect with observational data where variables are confounded. The traditional way of controlling the confounding bias is to match units with different treatments but similar variables. However, traditional matching methods fail on selection and differentiation among the pool of numerous potential confounders, leading to possible under-performance. In this paper, we give a theoretical analysis of confounder differentiation and propose a novel Differentiated Matching (DM) algorithm for both individual and average treatment effect estimation by learning confounder weights for variable differentiation and unit matching. To address the distribution shift in confounder weights learning, we further propose a Propensity Score based DM (PSDM) algorithm by weighted regression with the inverse of the propensity score. Extensive experiments on both synthetic and real-world datasets demonstrate that the proposed algorithms achieve better performance than other matching methods on treatment effect estimation.

previous article A methodology for refined evaluation of neural code completion approaches

next article Informative pseudo-labeling for graph neural networks with few labels

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

The linear assumption can be relaxed by adding high order terms in the regression process.

Higher dimension brings NULL matching in DAME and CEM, we omitted these methods in continuous settings.

Austin PC (2011) An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivar Behav Res 46(3):399–424CrossRef

Bottou L, Peters J, Quiñonero-Candela J, Charles DX, Chickering DM, Portugaly E, Ray D, Simard P, Snelson E (2013) Counterfactual reasoning and learning systems: the example of computational advertising. J Mach Learn Res 14(1):3207–3260MathSciNetMATH

Chan D, Ge R, Gershony O, Hesterberg T, Lambert D (2010) Evaluating online ad campaigns in a pipeline: causal models at scale. In: KDD, pp 7–16

Dehejia RH, Wahba S (1999) Causal effects in nonexperimental studies: reevaluating the evaluation of training programs. J Am Stat Assoc 94(448):1053–1062CrossRef

Diamond A, Sekhon JS (2013) Genetic matching for estimating causal effects: a general multivariate matching method for achieving balance in observational studies. Rev Econ Stat 95(3):932–945CrossRef

Hill JL (2011) Bayesian nonparametric modeling for causal inference. J Comput Graph Stat 20(1):217–240MathSciNetCrossRef

Holland PW (1986) Statistics and causal inference. J Am Stat Assoc 81(396):945–960MathSciNetCrossRefMATH

Iacus SM, King G, Porro G (2012) Causal inference without balance checking: coarsened exact matching. Polit Anal 20(1):1–24CrossRef

Imbens GW, Rubin DB (2015) Causal inference in statistics, social, and biomedical sciences. Cambridge University Press, CambridgeCrossRefMATH

Kallus N (2017) A framework for optimal matching for causal inference. In: Artificial Intelligence and Statistics, pp 372–381

Kallus N (2019) Generalized optimal matching methods for causal inference. J Mach Learn Res (forthcoming)

Kohavi R, Longbotham R (2011) Unexpected results in online controlled experiments. ACM SIGKDD Explor Newsl 12(2):31–35CrossRef

Kuang K, Cui P, Li B, Jiang M, Wang Y, Wu F, Yang S (2019) Treatment effect estimation via differentiated confounder balancing and regression. ACM Trans Knowledge Dis from Data (TKDD) 14(1):1–25

Kuang K, Li L, Geng Z, Xu L, Zhang K, Liao B, Huang H, Ding P, Miao W, Jiang Z (2020) Causal inference. Engineering 6(3):253–263CrossRef

LaLonde RJ (1986) Evaluating the econometric evaluations of training programs with experimental data. Am Econom Rev pp 604–620

Lewis RA, Reiley D (2008) Does retail advertising work? measuring the effects of advertising on sales via a controlled experiment on yahoo! Measuring the Effects of Advertising on Sales Via a Controlled Experiment on Yahoo

Li Y, Kuang K, Li B, Cui P, Tao J, Yang H, Wu F (2020) Continuous treatment effect estimation via generative adversarial de-confounding. In: Proceedings of the 2020 KDD Workshop on Causal Discovery, PMLR, pp 4–22

Liu Y, Dieng A, Roy S, Rudin C, Volfovsky A (2019) Interpretable almost matching exactly for causal inference. AISTATS

Omohundro SM (1989) Five balltree construction algorithms. Int Comput Sci Institute Berkeley

Rosenbaum PR (2017) Imposing minimax and quantile constraints on optimal matching in observational studies. J Comput Graph Stat 26(1):66–78MathSciNetCrossRef

Rosenbaum PR, Rubin DB (1983) The central role of the propensity score in observational studies for causal effects. Biometrika 70(1):41–55MathSciNetCrossRefMATH

Rosenbaum PR, Rubin DB (1985) Constructing a control group using multivariate matched sampling methods that incorporate the propensity score. Am Stat 39(1):33–38

Shalit U, Johansson FD, Sontag D (2017) Estimating individual treatment effect: generalization bounds and algorithms. In: Int Conf Mach Learn, PMLR, pp 3076–3085

Wang T, Morucci M, Awan MU, Liu Y, Roy S, Rudin C, Volfovsky A (2021) Flame: A fast large-scale almost matching exactly approach to causal inference. J Mach Learn Res 22:1–41MathSciNetMATH

Zadrozny B (2004) Learning and evaluating classifiers under sample selection bias. In: Proceedings of the twenty-first international conference on Machine learning, p 114

Title: Differentiated matching for individual and average treatment effect estimation
Authors: Zhao Ziyu
Kun Kuang
Bo Li
Peng Cui
Runze Wu
Jun Xiao
Fei Wu
Publication date: 09-11-2022
Publisher: Springer US
Published in: Data Mining and Knowledge Discovery / Issue 1/2023
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI: https://doi.org/10.1007/s10618-022-00886-5

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 1/2023

Informative pseudo-labeling for graph neural networks with few labels

Using differential evolution for an attribute-weighted inverted specific-class distance measure for nominal attributes

Practical joint human-machine exploration of industrial time series using the matrix profile

ContE: contextualized knowledge graph embedding for circular relations

Structural iterative lexicographic autoencoded node representation

Category tree distance: a taxonomy-based transaction distance for web user analysis

Premium Partner