Machlanski, Damian and Samothrakis, Spyridon and Clarke, Paul (2024) Undersmoothing Causal Estimators With Generative Trees. IEEE Access, 12. pp. 38562-38574. DOI https://doi.org/10.1109/access.2024.3376423
Machlanski, Damian and Samothrakis, Spyridon and Clarke, Paul (2024) Undersmoothing Causal Estimators With Generative Trees. IEEE Access, 12. pp. 38562-38574. DOI https://doi.org/10.1109/access.2024.3376423
Machlanski, Damian and Samothrakis, Spyridon and Clarke, Paul (2024) Undersmoothing Causal Estimators With Generative Trees. IEEE Access, 12. pp. 38562-38574. DOI https://doi.org/10.1109/access.2024.3376423
Abstract
Average causal effects are averages of (heterogeneous) individual treatment effects (ITEs) taken over the entire target population. The estimation of average causal effects has been studied in depth, but averages are insufficient for more individualised decision-making where ITEs are more appropriate. However, estimating ITEs for every population member is challenging, particularly when estimation must be based on observational data rather than data from randomised experiments. One potential problem with observational data arises when there are large differences between the sample distributions of the input features of the treated and control units. This problem is known as covariate shift. It can lead to model misspecification the harmful effects of which can be severe for ITE estimation because point estimation is highly sensitive to regions of the common support of the input space in which the number of treated or control units is very small. Moreover, common solutions are often based on reweighing schemes involving propensity scores which were originally designed for average effects and not ITEs. In this paper, we propose Debiasing Generative Trees, a novel data augmentation method based on generative trees that debiases and undersmooths causal estimators trained on augmented data. It encourages higher modelling complexity that reduces misspecification and improves estimation of ITEs. We show empirically that our proposed approach yields models of higher complexity and more accurate predictions of ITEs, and is competitive with traditional methods for estimating average treatment effects. Our results confirm that reweighing methods can struggle with ITE estimation and that the choice of model class can significantly impact prediction performance.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Conditional average treatment effect; observational data; covariate shift; model misspecification; data augmentation; generative trees |
Divisions: | Faculty of Science and Health Faculty of Social Sciences Faculty of Science and Health > Computer Science and Electronic Engineering, School of Faculty of Social Sciences > Institute for Social and Economic Research |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 08 Oct 2024 15:07 |
Last Modified: | 30 Oct 2024 21:16 |
URI: | http://repository.essex.ac.uk/id/eprint/38065 |
Available files
Filename: Undersmoothing_Causal_Estimators_With_Generative_Trees.pdf
Licence: Creative Commons: Attribution 4.0