Data-Driven Design of Randomized Control Trials with Guaranteed Treatment Effects

Abstract

Randomized controlled trials (RCTs) generate guarantees for treatment effects. However, RCTs often spend unnecessary resources exploring sub-optimal treatments, which can reduce the power of treatment guarantees. To address this, we propose a two-stage RCT design. In the first stage, a data-driven screening procedure prunes low-impact treatments, while the second stage focuses on developing high-probability lower bounds for the best-performing treatment effect. Unlike existing adaptive RCT frameworks, our method is simple enough to be implemented in scenarios with limited adaptivity. We derive optimal designs for two-stage RCTs and demonstrate how such designs can be implemented through sample splitting. Empirically, we demonstrate that two-stage designs improve upon single-stage approaches, especially for scenarios where domain knowledge is available through a prior. Our work is thus, a simple yet effective design for RCTs, optimizing for the ability to certify with high probability the largest possible treatment effect for at least one of the arms studied.

Cite

Text

Cortes-Gomez et al. "Data-Driven Design of Randomized Control Trials with Guaranteed Treatment Effects." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Cortes-Gomez et al. "Data-Driven Design of Randomized Control Trials with Guaranteed Treatment Effects." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/cortesgomez2025icml-datadriven/)

BibTeX

@inproceedings{cortesgomez2025icml-datadriven,
  title     = {{Data-Driven Design of Randomized Control Trials with Guaranteed Treatment Effects}},
  author    = {Cortes-Gomez, Santiago and Raman, Naveen Janaki and Singh, Aarti and Wilder, Bryan},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {11313-11327},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/cortesgomez2025icml-datadriven/}
}