PropertyDAG: Multi-Objective Bayesian Optimization of Partially Ordered, Mixed-Variable Properties for Biological Sequence Design

Abstract

Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarchical dependency structure. We consider a common use case where some regions of the Pareto frontier are prioritized over others according to a specified $\textit{partial ordering}$ in the objectives. For instance, when designing antibodies, we maximize the binding affinity to a target antigen only if it can be expressed in live cell culture---modeling the experimental dependency in which affinity can only be measured for antibodies that can be expressed and thus produced in viable quantities. In general, we may want to confer a partial ordering to the properties such that each property is optimized conditioned on its parent properties satisfying some feasibility condition. To this end, we present PropertyDAG, a framework that operates on top of the traditional multi-objective BO to impose a desired partial ordering on the objectives, e.g. expression $\rightarrow$ affinity. We demonstrate its performance over multiple simulated active learning iterations on a penicillin production task, toy numerical problem, and a real-world antibody design task.

Cite

Text

Park et al. "PropertyDAG: Multi-Objective Bayesian Optimization of Partially Ordered, Mixed-Variable Properties for Biological Sequence Design." NeurIPS 2022 Workshops: AI4Science, 2022.

Markdown

[Park et al. "PropertyDAG: Multi-Objective Bayesian Optimization of Partially Ordered, Mixed-Variable Properties for Biological Sequence Design." NeurIPS 2022 Workshops: AI4Science, 2022.](https://mlanthology.org/neuripsw/2022/park2022neuripsw-propertydag/)

BibTeX

@inproceedings{park2022neuripsw-propertydag,
  title     = {{PropertyDAG: Multi-Objective Bayesian Optimization of Partially Ordered, Mixed-Variable Properties for Biological Sequence Design}},
  author    = {Park, Ji Won and Stanton, Samuel Don and Saremi, Saeed and Watkins, Andrew Martin and Dwyer, Henri and Gligorijevic, Vladimir and Bonneau, Richard and Ra, Stephen and Cho, Kyunghyun},
  booktitle = {NeurIPS 2022 Workshops: AI4Science},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/park2022neuripsw-propertydag/}
}