Moghaddam, Roshanak Zilouchian

2 publications

ICLR 2025. RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code. Dhruv Gautam, Spandan Garg, Jinu Jang, Neel Sundaresan, Roshanak Zilouchian Moghaddam
NeurIPSW 2024. RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code. Dhruv Gautam, Spandan Garg, Jinu Jang, Neel Sundaresan, Roshanak Zilouchian Moghaddam