Pan, Rui

16 publications

NeurIPS 2025 ASGO: Adaptive Structured Gradient Optimization Kang An, Yuxing Liu, Rui Pan, Yi Ren, Shiqian Ma, Donald Goldfarb, Tong Zhang
ICLR 2025 AdaGrad Under Anisotropic Smoothness Yuxing Liu, Rui Pan, Tong Zhang
TMLR 2025 Entropy-Regularized Process Reward Model Hanning Zhang, Pengcheng Wang, Shizhe Diao, Yong Lin, Rui Pan, Hanze Dong, Dylan Zhang, Pavlo Molchanov, Tong Zhang
ICML 2025 MA-LoT: Model-Collaboration Lean-Based Long Chain-of-Thought Reasoning Enhances Formal Theorem Proving Ruida Wang, Rui Pan, Yuxin Li, Jipeng Zhang, Yizhen Jia, Shizhe Diao, Renjie Pi, Junjie Hu, Tong Zhang
NeurIPS 2025 NegoCollab: A Common Representation Negotiation Approach for Heterogeneous Collaborative Perception Shao Congzhang, Quan Yuan, Guiyang Luo, Yue Hu, Danni Wang, Liu Yilin, Rui Pan, Bo Chen, Jinglin Li
ICLR 2025 Personalized Visual Instruction Tuning Renjie Pi, Jianshu Zhang, Tianyang Han, Jipeng Zhang, Rui Pan, Tong Zhang
NeurIPS 2025 Safe RLHF-V: Safe Reinforcement Learning from Multi-Modal Human Feedback Jiaming Ji, Xinyu Chen, Rui Pan, Han Zhu, Jiahao Li, Donghai Hong, Boyuan Chen, Jiayi Zhou, Kaile Wang, Juntao Dai, Chi-Min Chan, Sirui Han, Yike Guo, Yaodong Yang
NeurIPS 2025 SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning Rui Pan, Yinwei Dai, Zhihao Zhang, Gabriele Oliaro, Zhihao Jia, Ravi Netravali
ICML 2025 Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods Yifan Hao, Xingyuan Pan, Hanning Zhang, Chenlu Ye, Rui Pan, Tong Zhang
ICLR 2024 Accelerated Convergence of Stochastic Heavy Ball Method Under Anisotropic Gradient Noise Rui Pan, Yuxing Liu, Xiaoyu Wang, Tong Zhang
NeurIPS 2024 Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions Renjie Pi, Jianshu Zhang, Jipeng Zhang, Rui Pan, Zhekai Chen, Tong Zhang
NeurIPS 2024 LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Rui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang
ECCV 2024 Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang
IJCAI 2023 GPLight: Grouped Multi-Agent Reinforcement Learning for Large-Scale Traffic Signal Control Yilin Liu, Guiyang Luo, Quan Yuan, Jinglin Li, Lei Jin, Bo Chen, Rui Pan
TMLR 2023 RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, KaShun Shum, Tong Zhang
ICLR 2022 Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums Rui Pan, Haishan Ye, Tong Zhang