Koppel, Alec
37 publications
AISTATS
2025
Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis
ICMLW
2024
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
JMLR
2024
On the Sample Complexity and Metastability of Heavy-Tailed Policy Search in Continuous Control
ICLR
2024
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
NeurIPS
2023
Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities
AAAI
2022
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
NeurIPSW
2022
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning