Misra, Dipendra
27 publications
NeurIPS
2025
Principled Fine-Tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward
ICLR
2024
The Truth Is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
NeurIPSW
2022
Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information