Malik, Dhruv

9 publications

TMLR 2025 Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts Youngseog Chung, Dhruv Malik, Jeff Schneider, Yuanzhi Li, Aarti Singh
NeurIPS 2023 How Does Adaptive Optimization Impact Local Neural Network Geometry? Kaiqi Jiang, Dhruv Malik, Yuanzhi Li
ICML 2023 Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality Dhruv Malik, Conor Igoe, Yuanzhi Li, Aarti Singh
COLT 2022 Complete Policy Regret Bounds for Tallying Bandits Dhruv Malik, Yuanzhi Li, Aarti Singh
ICML 2021 Sample Efficient Reinforcement Learning in Continuous State Spaces: A Perspective Beyond Linearity Dhruv Malik, Aldo Pacchiano, Vishwak Srinivasan, Yuanzhi Li
NeurIPS 2021 When Is Generalizable Reinforcement Learning Tractable? Dhruv Malik, Yuanzhi Li, Pradeep K. Ravikumar
JMLR 2020 Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems Dhruv Malik, Ashwin Pananjady, Kush Bhatia, Koulik Khamaru, Peter L. Bartlett, Martin J. Wainwright
AISTATS 2019 Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems Dhruv Malik, Ashwin Pananjady, Kush Bhatia, Koulik Khamaru, Peter Bartlett, Martin Wainwright
ICML 2018 An Efficient, Generalized Bellman Update for Cooperative Inverse Reinforcement Learning Dhruv Malik, Malayandi Palaniappan, Jaime Fisac, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan