Li, Ziniu

16 publications

ICLR 2025. Adam-Mini: Use Fewer Learning Rates to Gain More. Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Diederik P Kingma, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun
ICML 2025. Controlling Large Language Model with Latent Action. Chengxing Jia, Ziniu Li, Pengyuan Wang, Yi-Chen Li, Zhenyu Hou, Yuxiao Dong, Yang Yu
ICLR 2025. Preserving Diversity in Supervised Fine-Tuning of Large Language Models. Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Zhi-Quan Luo, Ruoyu Sun
NeurIPS 2025. Teaching Language Models to Reason with Tools. Chengpeng Li, Zhengyang Tang, Ziniu Li, Mingfeng Xue, Keqin Bao, Tian Ding, Ruoyu Sun, Benyou Wang, Xiang Wang, Junyang Lin, Dayiheng Liu
ICLR 2025. Understanding and Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention. Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu
ICMLW 2024. Adam-Mini: Use Fewer Learning Rates to Gain More. Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun
NeurIPSW 2024. Entropic Distribution Matching for Supervised Fine-Tuning of LLMs: Less Overfitting and Better Diversity. Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Ruoyu Sun, Zhi-Quan Luo
NeurIPSW 2024. Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention. Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu
NeurIPSW 2024. Pruning for Robust Concept Erasing in Diffusion Models. Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu
ICML 2024. ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo
NeurIPS 2024. Why Transformers Need Adam: A Hessian Perspective. Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo
ICMLW 2024. Why Transformers Need Adam: A Hessian Perspective. Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo
NeurIPS 2023. Imitation Learning from Imperfection: Theoretical Justifications and Algorithms. Ziniu Li, Tian Xu, Zeyu Qin, Yang Yu, Zhi-Quan Luo
UAI 2023. Provably Efficient Adversarial Imitation Learning with Unknown Transitions. Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo
ICLR 2022. HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning. Ziniu Li, Yingru Li, Yushun Zhang, Tong Zhang, Zhi-Quan Luo
NeurIPS 2020. Error Bounds of Imitating Policies and Environments. Tian Xu, Ziniu Li, Yang Yu