Zhao, Jiawei

17 publications

NeurIPS 2025. Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts. Haizhong Zheng, Yang Zhou, Brian R. Bartoldson, Bhavya Kailkhura, Fan Lai, Jiawei Zhao, Beidi Chen
ICML 2025. From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications. Ajay Kumar Jaiswal, Yifan Wang, Lu Yin, Shiwei Liu, Runjin Chen, Jiawei Zhao, Ananth Grama, Yuandong Tian, Zhangyang Wang
NeurIPS 2025. ParetoQ: Improving Scaling Laws in Extremely Low-Bit LLM Quantization. Zechun Liu, Changsheng Zhao, Hanxian Huang, Sijia Chen, Jing Zhang, Jiawei Zhao, Scott Roy, Lisa Jin, Yunyang Xiong, Yangyang Shi, Lin Xiao, Yuandong Tian, Bilge Soran, Raghuraman Krishnamoorthi, Tijmen Blankevoort, Vikas Chandra
CPAL 2025. Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. Zhenyu Zhang, Ajay Kumar Jaiswal, Lu Yin, Shiwei Liu, Jiawei Zhao, Yuandong Tian, Zhangyang Wang
ICML 2024. GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection. Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian
ICLRW 2024. GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection. Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian
TMLR 2024. Incremental Spatial and Spectral Learning of Neural Operators for Solving Large-Scale PDEs. Robert Joseph George, Jiawei Zhao, Jean Kossaifi, Zongyi Li, Anima Anandkumar
ECCVW 2024. Loop Mining Large-Scale Unlabeled Data for Corner Case Detection in Autonomous Driving. Jiawei Zhao, Yiting Duan, Jinming Su, Wangwang Yang, Tingyi Guo, Xingyue Chen, Junfeng Luo
ICMLW 2024. Mini-Sequence Transformer: Optimizing Intermediate Memory for Long Sequences Training. Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar
NeurIPS 2024. Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training. Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar
NeurIPS 2024. S²FT: Efficient, Scalable and Generalizable LLM Fine-Tuning by Structured Sparsity. Xinyu Yang, Jixuan Leng, Geyang Guo, Jiawei Zhao, Ryumei Nakada, Linjun Zhang, Huaxiu Yao, Beidi Chen
NeurIPSW 2024. Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition. Robert Joseph George, David Pitt, Jiawei Zhao, Jean Kossaifi, Cheng Luo, Yuandong Tian, Anima Anandkumar
ICMLW 2023. Incremental Low-Rank Learning. Jiawei Zhao, Yifei Zhang, Beidi Chen, Florian Tobias Schaefer, Anima Anandkumar
TMLR 2022. ZerO Initialization: Initializing Neural Networks with Only Zeros and Ones. Jiawei Zhao, Florian Tobias Schaefer, Anima Anandkumar
ICCV 2021. Transformer-Based Dual Relation Graph for Multi-Label Image Recognition. Jiawei Zhao, Ke Yan, Yifan Zhao, Xiaowei Guo, Feiyue Huang, Jia Li
NeurIPS 2020. Learning Compositional Functions via Multiplicative Weight Updates. Jeremy Bernstein, Jiawei Zhao, Markus Meister, Ming-Yu Liu, Anima Anandkumar, Yisong Yue
ICLR 2019. signSGD with Majority Vote Is Communication Efficient and Fault Tolerant. Jeremy Bernstein, Jiawei Zhao, Kamyar Azizzadenesheli, Anima Anandkumar