Egert, Daniel

3 publications

ICLR 2026 RLBFF: Binary Flexible Feedback to Bridge Between Human Feedback & Verifiable Rewards Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Ellie Evans, Daniel Egert, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev
ICLR 2025 HelpSteer2-Preference: Complementing Ratings with Preferences Zhilin Wang, Alexander Bukharin, Olivier Delalleau, Daniel Egert, Gerald Shen, Jiaqi Zeng, Oleksii Kuchaiev, Yi Dong
NeurIPS 2024 HelpSteer 2: Open-Source Dataset for Training Top-Performing Reward Models Zhilin Wang, Yi Dong, Olivier Delalleau, Jiaqi Zeng, Gerald Shen, Daniel Egert, Jimmy J. Zhang, Makesh Narsimhan Sreedhar, Oleksii Kuchaiev