Soares, Felipe

2 publications

ICLR 2026 RLBFF: Binary Flexible Feedback to Bridge Between Human Feedback & Verifiable Rewards Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Ellie Evans, Daniel Egert, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev
NeurIPS 2025 HelpSteer3-Preference: Open Human-Annotated Preference Data Across Diverse Tasks and Languages Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Hoo-Chang Shin, Felipe Soares, Alexander Bukharin, Ellie Evans, Yi Dong, Oleksii Kuchaiev