Zhou, Ruida
24 publications
ICLR
2026
Beyond Binary Preferences: A Principled Framework for Reward Modeling with Ordinal Feedback
ICML
2025
On the Training Convergence of Transformers for In-Context Classification of Gaussian Mixtures
NeurIPSW
2024
Correlational Lagrangian Schrodinger Bridge: Learning Dynamics with Population-Level Regularization
NeurIPSW
2024
From Function to Distribution Modeling: A PAC-Generative Approach to Offline Optimization
NeurIPS
2023
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games
NeurIPS
2022
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning