You Only Train Once: Differentiable Subset Selection for Omics Data
Abstract
Selecting compact and informative gene subsets from single-cell transcriptomic data is essential for biomarker discovery, improving interpretability, and cost-effective profiling. However, most existing feature selection approaches either operate as multi-stage pipelines or rely on post hoc feature attribution, making selection and prediction weakly coupled. In this work, we present YOTO (you only train once), an end-to-end framework that jointly identifies discrete gene subsets and performs prediction within a single differentiable architecture. In our model, the prediction task directly guides which genes are selected, while the learned subsets, in turn, shape the predictive representation. This closed feedback loop enables the model to iteratively refine both what it selects and how it predicts during training. Unlike existing approaches, YOTO enforces sparsity so that only the selected genes contribute to inference, eliminating the need to train additional downstream classifiers. Through a multi-task learning design, the model learns shared representations across related objectives, allowing different tasks to inform one another, and discovering gene subsets that generalize across tasks without additional training steps. We evaluate YOTO on two representative single-cell RNA-seq datasets, showing that it consistently outperforms state-of-the-art baselines. These results demonstrate that sparse, end-to-end, multi-task gene subset selection improves predictive performance and yields compact and meaningful gene subsets, advancing biomarker discovery and single-cell analysis.
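To make the sparsity claim concrete, here is a minimal inference-time sketch in plain Python. It is not the authors' implementation: the abstract does not specify how the discrete selection is made differentiable, so the sketch assumes per-gene scores have already been learned and simply applies a hard top-k mask shared by several task heads. All names (`top_k_mask`, `linear`, `predict`, the `cell_type` and `condition` heads, and every number) are hypothetical.

```python
def top_k_mask(scores, k):
    """Return a 0/1 mask that keeps the k highest-scoring genes."""
    keep = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    mask = [0.0] * len(scores)
    for i in keep:
        mask[i] = 1.0
    return mask


def linear(weights, bias, x):
    """Apply one linear head: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, bias)]


def predict(x, scores, k, heads):
    """Mask the expression vector once, then feed it to every task head."""
    mask = top_k_mask(scores, k)
    masked = [m * xi for m, xi in zip(mask, x)]
    return {name: linear(W, b, masked) for name, (W, b) in heads.items()}


# Hypothetical learned gene scores for 5 genes; genes 1 and 3 score highest.
scores = [0.1, 2.0, -1.0, 1.5, 0.3]

# Two hypothetical task heads that share the same selected subset.
heads = {
    "cell_type": ([[0.5, -1.0, 0.2, 0.3, 0.0], [0.1, 0.4, -0.3, 0.2, 0.9]], [0.0, 0.1]),
    "condition": ([[1.0, 1.0, 1.0, 1.0, 1.0]], [0.0]),
}

# Two cells that differ only in genes that were NOT selected.
x_a = [3.0, 1.0, 4.0, 2.0, 5.0]
x_b = [9.0, 1.0, 0.0, 2.0, 7.0]

out_a = predict(x_a, scores, 2, heads)
out_b = predict(x_b, scores, 2, heads)
```

Because the mask zeroes every unselected gene before any head sees the input, `out_a` and `out_b` are identical: only the selected genes can influence a prediction, and no separate downstream classifier is needed. The gradient-based training that would learn `scores` jointly with the heads is deliberately left out of this sketch.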
Cite
Text
Chopard et al. "You Only Train Once: Differentiable Subset Selection for Omics Data." Transactions on Machine Learning Research, 2026.
Markdown
[Chopard et al. "You Only Train Once: Differentiable Subset Selection for Omics Data." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/chopard2026tmlr-you/)
BibTeX
@article{chopard2026tmlr-you,
title = {{You Only Train Once: Differentiable Subset Selection for Omics Data}},
author = {Chopard, Daphné and da Silva Gonçalves, Jorge and Cannistraci, Irene and Sutter, Thomas M. and Vogt, Julia E},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/chopard2026tmlr-you/}
}