Krishnamurthy, Akshay
89 publications
ICLR, 2025. Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
ICML, 2025. Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
ICLR, 2024. Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression
COLT, 2024. Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning
NeurIPS, 2024. Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic Modularity
NeurIPS, 2021. Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination