Gaurav, Sanket

1 publications

NeurIPS 2025 Prompted Policy Search: Reinforcement Learning Through Linguistic and Numerical Reasoning in LLMs Yifan Zhou, Sachin Grover, Mohamed El Mistiri, Kamalesh Kalirathinam, Pratyush Kerhalkar, Swaroop Mishra, Neelesh Kumar, Sanket Gaurav, Oya Aran, Heni Ben Amor