DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design

Abstract

The discovery of therapeutics to treat genetically-driven pathologies relies on identifying genes involved in the underlying disease mechanism. Existing approaches search over the billions of potential interventions to maximize the expected influence on the target phenotype. However, to reduce the risk of failure in future stages of trials, practical experiment design aims to find a set of interventions that maximally change a target phenotype via diverse mechanisms. We propose DiscoBAX - a sample-efficient method for maximizing the rate of significant discoveries per experiment while simultaneously probing for a wide range of diverse mechanisms during a genomic experiment campaign. We provide theoretical guarantees of optimality under standard assumptions, and conduct a comprehensive experimental evaluation covering both synthetic as well as real-world experimental design tasks. DiscoBAX outperforms existing state-of-the-art methods for experimental design, selecting effective and diverse perturbations in biological systems.

Cite

Text

Lyle et al. "DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design." International Conference on Machine Learning, 2023.

Markdown

[Lyle et al. "DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/lyle2023icml-discobax/)

BibTeX

@inproceedings{lyle2023icml-discobax,
  title     = {{DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design}},
  author    = {Lyle, Clare and Mehrjou, Arash and Notin, Pascal and Jesson, Andrew and Bauer, Stefan and Gal, Yarin and Schwab, Patrick},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {23170-23189},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/lyle2023icml-discobax/}
}