ML Anthology
Authors
Search
About
Li, Ziniu
16 publications
ICLR
2025
Adam-Mini: Use Fewer Learning Rates to Gain More
Yushun Zhang
,
Congliang Chen
,
Ziniu Li
,
Tian Ding
,
Chenwei Wu
,
Diederik P Kingma
,
Yinyu Ye
,
Zhi-Quan Luo
,
Ruoyu Sun
ICML
2025
Controlling Large Language Model with Latent Action
Chengxing Jia
,
Ziniu Li
,
Pengyuan Wang
,
Yi-Chen Li
,
Zhenyu Hou
,
Yuxiao Dong
,
Yang Yu
ICLR
2025
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
Ziniu Li
,
Congliang Chen
,
Tian Xu
,
Zeyu Qin
,
Jiancong Xiao
,
Zhi-Quan Luo
,
Ruoyu Sun
NeurIPS
2025
Teaching Language Models to Reason with Tools
Chengpeng Li
,
Zhengyang Tang
,
Ziniu Li
,
Mingfeng Xue
,
Keqin Bao
,
Tian Ding
,
Ruoyu Sun
,
Benyou Wang
,
Xiang Wang
,
Junyang Lin
,
Dayiheng Liu
ICLR
2025
Understanding and Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention
Tianyun Yang
,
Ziniu Li
,
Juan Cao
,
Chang Xu
ICMLW
2024
Adam-Mini: Use Fewer Learning Rates to Gain More
Yushun Zhang
,
Congliang Chen
,
Ziniu Li
,
Tian Ding
,
Chenwei Wu
,
Yinyu Ye
,
Zhi-Quan Luo
,
Ruoyu Sun
NeurIPSW
2024
Entropic Distribution Matching for Supervised Fine-Tuning of LLMs: Less Overfitting and Better Diversity
Ziniu Li
,
Congliang Chen
,
Tian Xu
,
Zeyu Qin
,
Jiancong Xiao
,
Ruoyu Sun
,
Zhi-Quan Luo
NeurIPSW
2024
Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention
Tianyun Yang
,
Ziniu Li
,
Juan Cao
,
Chang Xu
NeurIPSW
2024
Pruning for Robust Concept Erasing in Diffusion Models
Tianyun Yang
,
Ziniu Li
,
Juan Cao
,
Chang Xu
ICML
2024
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Ziniu Li
,
Tian Xu
,
Yushun Zhang
,
Zhihang Lin
,
Yang Yu
,
Ruoyu Sun
,
Zhi-Quan Luo
NeurIPS
2024
Why Transformers Need Adam: A Hessian Perspective
Yushun Zhang
,
Congliang Chen
,
Tian Ding
,
Ziniu Li
,
Ruoyu Sun
,
Zhi-Quan Luo
ICMLW
2024
Why Transformers Need Adam: A Hessian Perspective
Yushun Zhang
,
Congliang Chen
,
Tian Ding
,
Ziniu Li
,
Ruoyu Sun
,
Zhi-Quan Luo
NeurIPS
2023
Imitation Learning from Imperfection: Theoretical Justifications and Algorithms
Ziniu Li
,
Tian Xu
,
Zeyu Qin
,
Yang Yu
,
Zhi-Quan Luo
UAI
2023
Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Tian Xu
,
Ziniu Li
,
Yang Yu
,
Zhi-Quan Luo
ICLR
2022
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning
Ziniu Li
,
Yingru Li
,
Yushun Zhang
,
Tong Zhang
,
Zhi-Quan Luo
NeurIPS
2020
Error Bounds of Imitating Policies and Environments
Tian Xu
,
Ziniu Li
,
Yang Yu