Generating Informative Samples for Risk-Averse Fine-Tuning of Downstream Tasks
Abstract
Risk-averse modeling is critical in safety-sensitive and high-stakes applications. Conditional Value-at-Risk (CVaR) quantifies such risk by measuring the expected loss in the tail of the loss distribution, and minimizing it provides a principled framework for training robust models. However, direct CVaR minimization remains challenging due to the difficulty of accurately estimating rare, high-loss events—particularly at extreme quantiles. In this work, we propose a novel training framework that synthesizes informative samples for CVaR optimization using score-based generative models. Specifically, we guide a diffusion-based generative model to sample from a reweighted distribution that emphasizes inputs likely to incur high loss under a pretrained reference model. These samples are then incorporated via a loss-weighted importance sampling scheme to reduce noise in stochastic optimization. We establish convergence guarantees and show that the synthesized, high-loss-emphasized dataset substantially contributes to the noise reduction. Empirically, we validate the effectiveness of our approach across multiple settings, including a real-world wireless channel compression task, where our method achieves significant improvements over standard risk minimization strategies.
Cite
Text
Kim et al. "Generating Informative Samples for Risk-Averse Fine-Tuning of Downstream Tasks." Advances in Neural Information Processing Systems, 2025.Markdown
[Kim et al. "Generating Informative Samples for Risk-Averse Fine-Tuning of Downstream Tasks." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/kim2025neurips-generating/)BibTeX
@inproceedings{kim2025neurips-generating,
title = {{Generating Informative Samples for Risk-Averse Fine-Tuning of Downstream Tasks}},
author = {Kim, Heasung and Lee, Taekyun and Kim, Hyeji and De Veciana, Gustavo},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/kim2025neurips-generating/}
}