Cai, Ruisi

15 publications

CVPR 2025 FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting Hengyu Liu, Yuehao Wang, Chenxin Li, Ruisi Cai, Kevin Wang, Wuyang Li, Pavlo Molchanov, Peihao Wang, Zhangyang Wang
ICLR 2025 LLaMaFlex: Many-in-One LLMs via Generalized Pruning and Weight Sharing Ruisi Cai, Saurav Muralidharan, Hongxu Yin, Zhangyang Wang, Jan Kautz, Pavlo Molchanov
ICML 2025 Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding Jiajun Zhu, Peihao Wang, Ruisi Cai, Jason D. Lee, Pan Li, Zhangyang Wang
CVPR 2025 Steepest Descent Density Control for Compact 3D Gaussian Splatting Peihao Wang, Yuehao Wang, Dilin Wang, Sreyas Mohan, Zhiwen Fan, Lemeng Wu, Ruisi Cai, Yu-Ying Yeh, Zhangyang Wang, Qiang Liu, Rakesh Ranjan
ICLR 2025 Understanding and Mitigating Bottlenecks of State Space Models Through the Lens of Recency and Over-Smoothing Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li
NeurIPS 2024 Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design Ruisi Cai, Yeonju Ro, Geon-Woo Kim, Peihao Wang, Babak Ehteshami Bejnordi, Aditya Akella, Zhangyang Wang
NeurIPS 2024 Model-GLUE: Democratized LLM Scaling for a Large Model Zoo in the Wild Xinyu Zhao, Guoheng Sun, Ruisi Cai, Yukun Zhou, Pingzhi Li, Peihao Wang, Bowen Tan, Yexiao He, Li Chen, Yi Liang, Beidi Chen, Binhang Yuan, Hongyi Wang, Ang Li, Zhangyang Wang, Tianlong Chen
ICML 2024 Flextron: Many-in-One Flexible Large Language Model Ruisi Cai, Saurav Muralidharan, Greg Heinrich, Hongxu Yin, Zhangyang Wang, Jan Kautz, Pavlo Molchanov
ICML 2024 LoCoCo: Dropping in Convolutions for Long Context Compression Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen
NeurIPS 2023 H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang "Atlas" Wang, Beidi Chen
ICMLW 2023 H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Re, Clark Barrett, Zhangyang Wang, Beidi Chen
CVPRW 2023 Many-Task Federated Learning: A New Problem Setting and a Simple Baseline Ruisi Cai, Xiaohan Chen, Shiwei Liu, Jayanth Srinivasa, Myungjin Lee, Ramana Kompella, Zhangyang Wang
ICCV 2023 Robust Mixture-of-Expert Training for Convolutional Neural Networks Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Huan Zhang, Pin-Yu Chen, Shiyu Chang, Zhangyang Wang, Sijia Liu
ICML 2023 Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights? Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
NeurIPS 2022 Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection Without Clean Datasets Ruisi Cai, Zhenyu Zhang, Tianlong Chen, Xiaohan Chen, Zhangyang Wang