ML Anthology
Authors
Search
About
Song, Yuda
23 publications
ICML
2025
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
Zhaoyi Zhou
,
Yuda Song
,
Andrea Zanette
ICLRW
2025
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
Zhaoyi Zhou
,
Yuda Song
,
Andrea Zanette
ICLR
2025
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
Yuda Song
,
Hanlin Zhang
,
Carson Eisenach
,
Sham M. Kakade
,
Dean Foster
,
Udaya Ghai
NeurIPS
2025
To Distill or Decide? Understanding the Algorithmic Trade-Off in Partially Observable RL
Yuda Song
,
Dhruv Rohatgi
,
Aarti Singh
,
Drew Bagnell
ICML
2024
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
,
Drew Bagnell
,
Aarti Singh
NeurIPSW
2024
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
Yuda Song
,
Hanlin Zhang
,
Carson Eisenach
,
Sham M. Kakade
,
Dean Foster
,
Udaya Ghai
ICLR
2024
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Yifei Zhou
,
Ayush Sekhari
,
Yuda Song
,
Wen Sun
ICML
2024
Rich-Observation Reinforcement Learning with Continuous Latent Dynamics
Yuda Song
,
Lili Wu
,
Dylan J Foster
,
Akshay Krishnamurthy
NeurIPS
2024
The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage
Yuda Song
,
Gokul Swamy
,
Aarti Singh
,
J. Andrew Bagnell
,
Wen Sun
ICMLW
2024
The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage
Yuda Song
,
Gokul Swamy
,
Aarti Singh
,
Drew Bagnell
,
Wen Sun
ICLR
2023
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Yuda Song
,
Yifei Zhou
,
Ayush Sekhari
,
Drew Bagnell
,
Akshay Krishnamurthy
,
Wen Sun
COLT
2023
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
,
Yuda Song
,
Wen Sun
,
Kaiwen Wang
,
Mengdi Wang
,
Xuezhou Zhang
ICLR
2023
Representation Learning for Low-Rank General-Sum Markov Games
Chengzhuo Ni
,
Yuda Song
,
Xuezhou Zhang
,
Zihan Ding
,
Chi Jin
,
Mengdi Wang
ICML
2023
The Virtues of Laziness in Model-Based RL: A Unified Objective and Algorithms
Anirudh Vemula
,
Yuda Song
,
Aarti Singh
,
Drew Bagnell
,
Sanjiban Choudhury
ICML
2022
Efficient Reinforcement Learning in Block MDPs: A Model-Free Representation Learning Approach
Xuezhou Zhang
,
Yuda Song
,
Masatoshi Uehara
,
Mengdi Wang
,
Alekh Agarwal
,
Wen Sun
NeurIPSW
2022
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Yuda Song
,
Yifei Zhou
,
Ayush Sekhari
,
Drew Bagnell
,
Akshay Krishnamurthy
,
Wen Sun
ECCV
2022
Multi-Curve Translator for High-Resolution Photorealistic Image Translation
Yuda Song
,
Hui Qian
,
Xin Du
L4DC
2022
Online No-Regret Model-Based Meta RL for Personalized Navigation
Yuda Song
,
Yuan Ye
,
Wen Sun
,
Kris Kitani
NeurIPSW
2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
,
Yuda Song
,
Kaiwen Wang
,
Mengdi Wang
,
Wen Sun
,
Xuezhou Zhang
ICLR
2022
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design
Ye Yuan
,
Yuda Song
,
Zhengyi Luo
,
Wen Sun
,
Kris M. Kitani
ICML
2021
PC-MLP: Model-Based Reinforcement Learning with Policy Cover Guided Exploration
Yuda Song
,
Wen Sun
ICCV
2021
StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement
Yuda Song
,
Hui Qian
,
Xin Du
ICML
2020
Provably Efficient Model-Based Policy Adaptation
Yuda Song
,
Aditi Mavalankar
,
Wen Sun
,
Sicun Gao