Zhang, Yufeng

18 publications

AAAI 2025 Enhancing Chain of Thought Prompting in Large Language Models via Reasoning Patterns Yufeng Zhang, Xuepeng Wang, Lingxiang Wu, Jinqiao Wang
ICCV 2025 MCOP: Multi-UAV Collaborative Occupancy Prediction Zefu Lin, Wenbo Chen, Xiaojuan Jin, Yuran Yang, Lue Fan, Yixin Zhang, Yufeng Zhang, Zhaoxiang Zhang
ICML 2025 Reward-Augmented Data Enhances Direct Preference Alignment of LLMs Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
ICLRW 2025 Reward-Augmented Data Enhances Direct Preference Alignment of LLMs Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
AISTATS 2025 What and How Does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization Yufeng Zhang, Fengzhuo Zhang, Zhuoran Yang, Zhaoran Wang
ECCV 2024 BaSIC: BayesNet Structure Learning for Computational Scalable Neural Image Compression Yufeng Zhang, Hang Yu, Shizhan Liu, Wenrui Dai, Weiyao Lin
ICLR 2024 Finite-State Autoregressive Entropy Coding for Efficient Learned Lossless Compression Yufeng Zhang, Hang Yu, Jianguo Li, Weiyao Lin
CVPRW 2024 Super-Resolution of Biomedical Volumes with 2D Supervision Cheng Jiang, Alexander Gedeon, Yiwei Lyu, Eric Landgraf, Yufeng Zhang, Xinhai Hou, Akhil Kondepudi, Asadur Chowdury, Honglak Lee, Todd C. Hollon
NeurIPS 2023 On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions Yufeng Zhang, Jialu Pan, Li Ken Li, Wanwei Liu, Zhenbang Chen, Xinwang Liu, J Wang
ICML 2022 Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation Zhihan Liu, Yufeng Zhang, Zuyue Fu, Zhuoran Yang, Zhaoran Wang
ICML 2022 Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes Hongyi Guo, Qi Cai, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang
AISTATS 2021 Provably Efficient Actor-Critic for Risk-Sensitive and Robust Adversarial RL: A Linear-Quadratic Case Yufeng Zhang, Zhuoran Yang, Zhaoran Wang
AAAI 2021 A Graph-Based Relevance Matching Model for Ad-Hoc Retrieval Yufeng Zhang, Jinghao Zhang, Zeyu Cui, Shu Wu, Liang Wang
ICML 2021 Infinite-Dimensional Optimization for Zero-Sum Games via Variational Transport Lewis Liu, Yufeng Zhang, Zhuoran Yang, Reza Babanezhad, Zhaoran Wang
NeurIPS 2021 Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration Runzhe Wu, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang
NeurIPS 2021 Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic Yufeng Zhang, Siyu Chen, Zhuoran Yang, Michael I. Jordan, Zhaoran Wang
NeurIPS 2020 Can Temporal-Difference and Q-Learning Learn Representation? a Mean-Field Theory Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang
ICML 2020 Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate Yufeng Zhang, Qi Cai, Zhuoran Yang, Zhaoran Wang