Hong, Ilgee

7 publications

NeurIPS 2025 Ask a Strong LLM Judge When Your Reward Model Is Uncertain Zhenghao Xu, Qin Lu, Qingru Zhang, Liang Qiu, Ilgee Hong, Changlong Yu, Wenlin Yao, Yao Liu, Haoming Jiang, Lihong Li, Hyokun Yun, Tuo Zhao
ICML 2025 Discriminative Finetuning of Generative Large Language Models Without Reward Models and Human Preference Data Siqi Guo, Ilgee Hong, Vicente Balmaseda, Changlong Yu, Liang Qiu, Xin Liu, Haoming Jiang, Tuo Zhao, Tianbao Yang
NeurIPS 2025 Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models Ilgee Hong, Changlong Yu, Liang Qiu, Weixiang Yan, Zhenghao Xu, Haoming Jiang, Qingru Zhang, Qin Lu, Xin Liu, Chao Zhang, Tuo Zhao
NeurIPS 2024 Adaptive Preference Scaling for Reinforcement Learning with Human Feedback Ilgee Hong, Zichong Li, Alexander Bukharin, Yixiao Li, Haoming Jiang, Tianbao Yang, Tuo Zhao
NeurIPS 2024 Robust Reinforcement Learning from Corrupted Human Feedback Alexander Bukharin, Ilgee Hong, Haoming Jiang, Zichong Li, Qingru Zhang, Zixuan Zhang, Tuo Zhao
ICML 2023 Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching Ilgee Hong, Sen Na, Michael W. Mahoney, Mladen Kolar
NeurIPSW 2022 Adaptive Inexact Sequential Quadratic Programming via Iterative Randomized Sketching Ilgee Hong, Sen Na, Mladen Kolar