Piterbarg, Ulyana

6 publications

ICLR 2026 Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Romain Froger, Pierre Andrews, Matteo Bettini, Amar Budhiraja, Ricardo Silveira Cabral, Virginie Do, Emilien Garreau, Jean-Baptiste Gaya, Hugo Laurençon, Maxime Lecanu, Kunal Malkan, Dheeraj Mekala, Pierre Menard, Gerard Moreno-Torres Bertran, Ulyana Piterbarg, Mikhail Plekhanov, Mathieu Rita, Andrey Rusakov, Vladislav Vorotilov, Mengjue Wang, Ian Yu, Amine Benhalloum, Grégoire Mialon, Thomas Scialom

ICLR 2025 BALROG: Benchmarking Agentic LLM and VLM Reasoning on Games Davide Paglieri, Bartłomiej Cupiał, Samuel Coward, Ulyana Piterbarg, Maciej Wolczyk, Akbir Khan, Eduardo Pignatelli, Łukasz Kuciński, Lerrel Pinto, Rob Fergus, Jakob Nicolaus Foerster, Jack Parker-Holder, Tim Rocktäschel

ICLRW 2025 D3: A Large Dataset for Training Code Language Models to Act Diff-by-Diff Ulyana Piterbarg, Kanishk Gandhi, Lerrel Pinto, Noah Goodman, Rob Fergus

ICLR 2025 Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

ICML 2024 Diff History for Neural Language Agents Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

NeurIPS 2023 NetHack Is Hard to Hack Ulyana Piterbarg, Lerrel Pinto, Rob Fergus