Bedi, Amrit Singh
20 publications
CVPR
2025
Immune: Improving Safety Against Jailbreaks in Multi-Modal LLMs via Inference-Time Alignment
NeurIPS
2025
On the Global Optimality of Policy Gradient Methods in General Utility Reinforcement Learning
JMLR
2024
On the Sample Complexity and Metastability of Heavy-Tailed Policy Search in Continuous Control
AAAI
2022
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach