Zhang, Qingru

13 publications

NeurIPS 2025 | Ask a Strong LLM Judge When Your Reward Model Is Uncertain | Zhenghao Xu, Qin Lu, Qingru Zhang, Liang Qiu, Ilgee Hong, Changlong Yu, Wenlin Yao, Yao Liu, Haoming Jiang, Lihong Li, Hyokun Yun, Tuo Zhao
NeurIPS 2025 | Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models | Ilgee Hong, Changlong Yu, Liang Qiu, Weixiang Yan, Zhenghao Xu, Haoming Jiang, Qingru Zhang, Qin Lu, Xin Liu, Chao Zhang, Tuo Zhao
NeurIPS 2025 | Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs | Hao Kang, Qingru Zhang, Han Cai, Weiyuan Xu, Tushar Krishna, Yilun Du, Tsachy Weissman
NeurIPS 2024 | Robust Reinforcement Learning from Corrupted Human Feedback | Alexander Bukharin, Ilgee Hong, Haoming Jiang, Zichong Li, Qingru Zhang, Zixuan Zhang, Tuo Zhao
ICLR 2024 | Tell Your Model Where to Attend: Post-Hoc Attention Steering for LLMs | Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
ICLR 2023 | Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning | Qingru Zhang, Minshuo Chen, Alexander Bukharin, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
ICML 2023 | Less Is More: Task-Aware Layer-Wise Distillation for Language Model Compression | Chen Liang, Simiao Zuo, Qingru Zhang, Pengcheng He, Weizhu Chen, Tuo Zhao
ICML 2023 | LoSparse: Structured Compression of Large Language Models Based on Low-Rank and Sparse Approximation | Yixiao Li, Yifan Yu, Qingru Zhang, Chen Liang, Pengcheng He, Weizhu Chen, Tuo Zhao
NeurIPS 2023 | Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms | Alexander Bukharin, Yan Li, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Chao Zhang, Songan Zhang, Tuo Zhao
NeurIPSW 2023 | Tell Your Model Where to Attend: Post-Hoc Attention Steering for LLMs | Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
ICML 2022 | PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance | Qingru Zhang, Simiao Zuo, Chen Liang, Alexander Bukharin, Pengcheng He, Weizhu Chen, Tuo Zhao
NeurIPS 2021 | A Biased Graph Neural Network Sampler with Near-Optimal Regret | Qingru Zhang, David P. Wipf, Quan Gan, Le Song
ICLR 2019 | AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods | Zhiming Zhou, Qingru Zhang, Guansong Lu, Hongwei Wang, Weinan Zhang, Yong Yu