Xu, Tian

13 publications

ICML 2025 Improving Reward Model Generalization from Adversarial Process Enhanced Preferences Zhilong Zhang, Tian Xu, Xinghao Du, Xingchen Cao, Yihao Sun, Yang Yu
ICLR 2025 Preserving Diversity in Supervised Fine-Tuning of Large Language Models Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Zhi-Quan Luo, Ruoyu Sun
NeurIPSW 2024 Entropic Distribution Matching for Supervised Fine-Tuning of LLMs: Less Overfitting and Better Diversity Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Ruoyu Sun, Zhi-Quan Luo
ICML 2024 Limited Preference Aided Imitation Learning from Imperfect Demonstrations Xingchen Cao, Fan-Ming Luo, Junyin Ye, Tian Xu, Zhilong Zhang, Yang Yu
ICLR 2024 Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning Chengxing Jia, Chenxiao Gao, Hao Yin, Fuxiang Zhang, Xiong-Hui Chen, Tian Xu, Lei Yuan, Zongzhang Zhang, Zhi-Hua Zhou, Yang Yu
NeurIPS 2024 Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation Tian Xu, Zhilong Zhang, Ruishuo Chen, Yihao Sun, Yang Yu
ICML 2024 ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo
ICLR 2024 Reward-Consistent Dynamics Models Are Strongly Generalizable for Offline Reinforcement Learning Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu
NeurIPS 2023 Imitation Learning from Imperfection: Theoretical Justifications and Algorithms Ziniu Li, Tian Xu, Zeyu Qin, Yang Yu, Zhi-Quan Luo
UAI 2023 Provably Efficient Adversarial Imitation Learning with Unknown Transitions Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo
NeurIPS 2020 Error Bounds of Imitating Policies and Environments Tian Xu, Ziniu Li, Yang Yu
ECCVW 2020 Investigating Bias and Fairness in Facial Expression Recognition Tian Xu, Jennifer White, Sinan Kalkan, Hatice Gunes
CVPRW 2018 Using Psychophysical Methods to Understand Mechanisms of Face Identification in a Deep Neural Network Tian Xu, Oliver G. B. Garrod, H. Steven Scholte, Robin A. A. Ince, Philippe G. Schyns