Oren, Yonatan

3 publications

ICLR 2024 Proving Test Set Contamination in Black-Box Language Models Yonatan Oren, Nicole Meister, Niladri S. Chatterji, Faisal Ladhak, Tatsunori Hashimoto
NeurIPS 2024 RedPajama: An Open Dataset for Training Large Language Models Maurice Weber, Daniel Y. Fu, Quentin Anthony, Yonatan Oren, Shane Adams, Anton Alexandrov, Xiaozhong Lyu, Huu Nguyen, Xiaozhe Yao, Virginia Adams, Ben Athiwaratkun, Rahul Chalamala, Kezhen Chen, Max Ryabinin, Tri Dao, Percy Liang, Christopher RĂ©, Irina Rish, Ce Zhang
NeurIPS 2018 A Retrieve-and-Edit Framework for Predicting Structured Outputs Tatsunori B Hashimoto, Kelvin Guu, Yonatan Oren, Percy Liang