Zhuang, Vincent

7 publications

ICLR 2025 Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Aviral Kumar, Rishabh Agarwal, Sridhar Thiagarajan, Craig Boutilier, Aleksandra Faust
ICLR 2025 Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning Yunyue Wei, Shanning Zhuang, Vincent Zhuang, Yanan Sui
ICLR 2025 Training Language Models to Self-Correct via Reinforcement Learning Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust
NeurIPS 2024 Scalable Bayesian Optimization via Focalized Sparse Gaussian Processes Yunyue Wei, Vincent Zhuang, Saraswati Soedarmadji, Yanan Sui
AISTATS 2021 No-Regret Reinforcement Learning with Heavy-Tailed Rewards Vincent Zhuang, Yanan Sui
ICML 2018 Stagewise Safe Bayesian Optimization with Gaussian Processes Yanan Sui, Vincent Zhuang, Joel Burdick, Yisong Yue
UAI 2017 Multi-Dueling Bandits with Dependent Arms Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue