Gumaste, Rohan

2 publications

ICLR 2025 IterGen: Iterative Semantic-Aware Structured LLM Generation with Backtracking Shubham Ugare, Rohan Gumaste, Tarun Suresh, Gagandeep Singh, Sasa Misailovic
TMLR 2025 Two-Step Offline Preference-Based Reinforcement Learning on Explicitly Constrained Policies Yinglun Xu, Tarun Suresh, Rohan Gumaste, David Zhu, Ruirui Li, Zhengyang Wang, Haoming Jiang, Xianfeng Tang, Qingyu Yin, Monica Xiao Cheng, Qi Zeng, Chao Zhang, Gagandeep Singh