Synthetic Design: An Optimization Approach to Experimental Design with Synthetic Controls

Abstract

We investigate the optimal design of experimental studies that have pre-treatment outcome data available. The average treatment effect is estimated as the difference between the weighted average outcomes of the treated and control units. A number of commonly used approaches fit this formulation, including the difference-in-means estimator and a variety of synthetic-control techniques. We propose several methods for choosing the set of treated units in conjunction with the weights. Observing the NP-hardness of the problem, we introduce a mixed-integer programming formulation which selects both the treatment and control sets and unit weightings. We prove that these proposed approaches lead to qualitatively different experimental units being selected for treatment. We use simulations based on publicly available data from the US Bureau of Labor Statistics that show improvements in terms of mean squared error and statistical power when compared to simple and commonly used alternatives such as randomized trials.

Cite

Text

Doudchenko et al. "Synthetic Design: An Optimization Approach to Experimental Design with Synthetic Controls." Neural Information Processing Systems, 2021.

Markdown

[Doudchenko et al. "Synthetic Design: An Optimization Approach to Experimental Design with Synthetic Controls." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/doudchenko2021neurips-synthetic/)

BibTeX

@inproceedings{doudchenko2021neurips-synthetic,
  title     = {{Synthetic Design: An Optimization Approach to Experimental Design with Synthetic Controls}},
  author    = {Doudchenko, Nick and Khosravi, Khashayar and Pouget-Abadie, Jean and Lahaie, Sébastien and Lubin, Miles and Mirrokni, Vahab and Spiess, Jann and Imbens, Guido},
  booktitle = {Neural Information Processing Systems},
  year      = {2021},
  url       = {https://mlanthology.org/neurips/2021/doudchenko2021neurips-synthetic/}
}