Kumar, Aviral
112 publications
NeurIPS
2025
Bigger, Regularized, Categorical: High-Capacity Value Functions Are Efficient Multi-Task Learners
ICLR
2025
Scaling LLM Test-Time Compute Optimally Can Be More Effective than Scaling Parameters for Reasoning
NeurIPS
2024
Designing Cell-Type-Specific Promoter Sequences Using Conservative Model-Based Optimization
NeurIPS
2024
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning
ICMLW
2024
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning
ICMLW
2024
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning
ICMLW
2024
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning
ICMLW
2024
Learning to Reason by Failing: Offline RL on Sub-Optimal Rollouts Scales Synthetic Data by 8x
NeurIPS
2024
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
ICLRW
2023
Latent Conservative Objective Models for Offline Data-Driven Crystal Structure Prediction
CoRL
2022
Don’t Start from Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
ICMLW
2022
Effective Offline RL Needs Going Beyond Pessimism: Representations and Distributional Shift
NeurIPSW
2022
Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning
NeurIPSW
2022
Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning