Evans, Ellie

3 publications

ICLR 2026. ProfBench: Multi-Domain Rubrics Requiring Professional Knowledge to Answer and Judge. Zhilin Wang, Jaehun Jung, Ximing Lu, Shizhe Diao, Ellie Evans, Jiaqi Zeng, Pavlo Molchanov, Yejin Choi, Jan Kautz, Yi Dong
ICLR 2026. RLBFF: Binary Flexible Feedback to Bridge Between Human Feedback & Verifiable Rewards. Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Ellie Evans, Daniel Egert, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev
NeurIPS 2025. HelpSteer3-Preference: Open Human-Annotated Preference Data Across Diverse Tasks and Languages. Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Hoo-Chang Shin, Felipe Soares, Alexander Bukharin, Ellie Evans, Yi Dong, Oleksii Kuchaiev