Song, Yuda

23 publications

ICML 2025 Accelerating Unbiased LLM Evaluation via Synthetic Feedback Zhaoyi Zhou, Yuda Song, Andrea Zanette
ICLRW 2025 Accelerating Unbiased LLM Evaluation via Synthetic Feedback Zhaoyi Zhou, Yuda Song, Andrea Zanette
ICLR 2025 Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models Yuda Song, Hanlin Zhang, Carson Eisenach, Sham M. Kakade, Dean Foster, Udaya Ghai
NeurIPS 2025 To Distill or Decide? Understanding the Algorithmic Trade-Off in Partially Observable RL Yuda Song, Dhruv Rohatgi, Aarti Singh, Drew Bagnell
ICML 2024 Hybrid Reinforcement Learning from Offline Observation Alone Yuda Song, Drew Bagnell, Aarti Singh
NeurIPSW 2024 Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models Yuda Song, Hanlin Zhang, Carson Eisenach, Sham M. Kakade, Dean Foster, Udaya Ghai
ICLR 2024 Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun
ICML 2024 Rich-Observation Reinforcement Learning with Continuous Latent Dynamics Yuda Song, Lili Wu, Dylan J Foster, Akshay Krishnamurthy
NeurIPS 2024 The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage Yuda Song, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun
ICMLW 2024 The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage Yuda Song, Gokul Swamy, Aarti Singh, Drew Bagnell, Wen Sun
ICLR 2023 Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient Yuda Song, Yifei Zhou, Ayush Sekhari, Drew Bagnell, Akshay Krishnamurthy, Wen Sun
COLT 2023 Provable Benefits of Representational Transfer in Reinforcement Learning Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang
ICLR 2023 Representation Learning for Low-Rank General-Sum Markov Games Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Zihan Ding, Chi Jin, Mengdi Wang
ICML 2023 The Virtues of Laziness in Model-Based RL: A Unified Objective and Algorithms Anirudh Vemula, Yuda Song, Aarti Singh, Drew Bagnell, Sanjiban Choudhury
ICML 2022 Efficient Reinforcement Learning in Block MDPs: A Model-Free Representation Learning Approach Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun
NeurIPSW 2022 Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient Yuda Song, Yifei Zhou, Ayush Sekhari, Drew Bagnell, Akshay Krishnamurthy, Wen Sun
ECCV 2022 Multi-Curve Translator for High-Resolution Photorealistic Image Translation Yuda Song, Hui Qian, Xin Du
L4DC 2022 Online No-Regret Model-Based Meta RL for Personalized Navigation Yuda Song, Yuan Ye, Wen Sun, Kris Kitani
NeurIPSW 2022 Provable Benefits of Representational Transfer in Reinforcement Learning Alekh Agarwal, Yuda Song, Kaiwen Wang, Mengdi Wang, Wen Sun, Xuezhou Zhang
ICLR 2022 Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design Ye Yuan, Yuda Song, Zhengyi Luo, Wen Sun, Kris M. Kitani
ICML 2021 PC-MLP: Model-Based Reinforcement Learning with Policy Cover Guided Exploration Yuda Song, Wen Sun
ICCV 2021 StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement Yuda Song, Hui Qian, Xin Du
ICML 2020 Provably Efficient Model-Based Policy Adaptation Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao