Low N, High N Protein Engineering
Abstract
Machine learning assisted directed evolution often involves experimentally collecting data from a relatively small number of variants to update a surrogate model, due to experimental limitations of characterisation and sequencing at high throughput. We propose an alternative approach, involving collecting high-throughput experimental data in a manner that results in a large number of characterised variants at the cost of reduced information: although the sequences and the measured fitness values are known, their correspondence is not. In particular we explore applying this method to the optimisation of a recently discovered phenomenon: magnetically sensitive fluorescent proteins.
Cite
Text
Abrahams et al. "Low N, High N Protein Engineering." ICLR 2024 Workshops: GEM, 2024.Markdown
[Abrahams et al. "Low N, High N Protein Engineering." ICLR 2024 Workshops: GEM, 2024.](https://mlanthology.org/iclrw/2024/abrahams2024iclrw-low/)BibTeX
@inproceedings{abrahams2024iclrw-low,
title = {{Low N, High N Protein Engineering}},
author = {Abrahams, G J and Outeiral, Carlos and Steel, Harrison and Deane, Charlotte},
booktitle = {ICLR 2024 Workshops: GEM},
year = {2024},
url = {https://mlanthology.org/iclrw/2024/abrahams2024iclrw-low/}
}