Stoica, Ion

74 publications

ICLR 2025 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli, Siyuan Zhuang, Ion Stoica, Michael W. Mahoney
ICML 2025 Copilot Arena: A Platform for Code LLM Evaluation in the Wild Wayne Chi, Valerie Chen, Anastasios Nikolas Angelopoulos, Wei-Lin Chiang, Aditya Mittal, Naman Jain, Tianjun Zhang, Ion Stoica, Chris Donahue, Ameet Talwalkar
MLOSS 2025 Depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers Kaichao You, Runsheng Bai, Meng Cao, Jianmin Wang, Ion Stoica, Mingsheng Long
NeurIPS 2025 Efficiently Scaling LLM Reasoning Programs with Certaindex Yichao Fu, Junda Chen, Siqi Zhu, Zheyu Fu, Zhongdongming Dai, Yonghao Zhuang, Yian Ma, Aurick Qiao, Tajana Rosing, Ion Stoica, Hao Zhang
NeurIPS 2025 Establishing Best Practices in Building Rigorous Agentic Benchmarks Yuxuan Zhu, Tengjun Jin, Yada Pruksachatkun, Andy K Zhang, Shu Liu, Sasha Cui, Sayash Kapoor, Shayne Longpre, Kevin Meng, Rebecca Weiss, Fazl Barez, Rahul Gupta, Jwala Dhamala, Jacob Merizian, Mario Giulianelli, Harry Coppock, Cozmin Ududec, Antony Kellermann, Jasjeet S Sekhon, Jacob Steinhardt, Sarah Schwettmann, Arvind Narayanan, Matei Zaharia, Ion Stoica, Percy Liang, Daniel Kang
ICML 2025 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards Yangsibo Huang, Milad Nasr, Anastasios Nikolas Angelopoulos, Nicholas Carlini, Wei-Lin Chiang, Christopher A. Choquette-Choo, Daphne Ippolito, Matthew Jagielski, Katherine Lee, Ken Liu, Ion Stoica, Florian Tramèr, Chiyuan Zhang
ICML 2025 Fast Video Generation with Sliding Tile Attention Peiyuan Zhang, Yongqi Chen, Runlong Su, Hangliang Ding, Ion Stoica, Zhengzhong Liu, Hao Zhang
NeurIPS 2025 Faster Video Diffusion with Trainable Sparse Attention Peiyuan Zhang, Yongqi Chen, Haofeng Huang, Will Lin, Zhengzhong Liu, Ion Stoica, Eric P. Xing, Hao Zhang
ICML 2025 From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and Benchbuilder Pipeline Tianle Li, Wei-Lin Chiang, Evan Frick, Lisa Dunlap, Tianhao Wu, Banghua Zhu, Joseph E. Gonzalez, Ion Stoica
NeurIPS 2025 GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents Manish Shetty, Naman Jain, Jinjian Liu, Vijay Kethanaboyina, Koushik Sen, Ion Stoica
ICLR 2025 GameArena: Evaluating LLM Reasoning Through Live Computer Games Lanxiang Hu, Qiyu Li, Anze Xie, Nan Jiang, Ion Stoica, Haojian Jin, Hao Zhang
ICML 2025 HashAttention: Semantic Sparsity for Faster Inference Aditya Desai, Shuo Yang, Alejandro Cuadron, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica
ICLR 2025 How to Evaluate Reward Models for RLHF Evan Frick, Tianle Li, Connor Chen, Wei-Lin Chiang, Anastasios Nikolas Angelopoulos, Jiantao Jiao, Banghua Zhu, Joseph E. Gonzalez, Ion Stoica
ICLR 2025 JudgeBench: A Benchmark for Evaluating LLM-Based Judges Sijun Tan, Siyuan Zhuang, Kyle Montgomery, William Yuan Tang, Alejandro Cuadron, Chenguang Wang, Raluca Popa, Ion Stoica
ICLR 2025 LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica
ICML 2025 OR-Bench: An Over-Refusal Benchmark for Large Language Models Justin Cui, Wei-Lin Chiang, Ion Stoica, Cho-Jui Hsieh
ICML 2025 Prompt-to-Leaderboard: Prompt-Adaptive LLM Evaluations Evan Frick, Connor Chen, Joseph Tennyson, Tianle Li, Wei-Lin Chiang, Anastasios Nikolas Angelopoulos, Ion Stoica
NeurIPS 2025 Radial Attention: $\mathcal{O}(n\log N)$ Sparse Attention with Energy Decay for Long Video Generation Xingyang Li, Muyang Li, Tianle Cai, Haocheng Xi, Shuo Yang, Yujun Lin, Lvmin Zhang, Songlin Yang, Jinbo Hu, Kelly Peng, Maneesh Agrawala, Ion Stoica, Kurt Keutzer, Song Han
ICLRW 2025 Reasoning Without Self-Doubt: More Efficient Chain-of-Thought Through Certainty Probing Yichao Fu, Junda Chen, Yonghao Zhuang, Zheyu Fu, Ion Stoica, Hao Zhang
CoRL 2025 RoboMonkey: Scaling Test-Time Sampling and Verification for Vision-Language-Action Models Jacky Kwok, Christopher Agia, Rohan Sinha, Matt Foutter, Shulu Li, Ion Stoica, Azalia Mirhoseini, Marco Pavone
ICLR 2025 RouteLLM: Learning to Route LLMs from Preference Data Isaac Ong, Amjad Almahairi, Vincent Wu, Wei-Lin Chiang, Tianhao Wu, Joseph E. Gonzalez, M Waleed Kadous, Ion Stoica
ICML 2025 Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity Haocheng Xi, Shuo Yang, Yilong Zhao, Chenfeng Xu, Muyang Li, Xiuyu Li, Yujun Lin, Han Cai, Jintao Zhang, Dacheng Li, Jianfei Chen, Ion Stoica, Kurt Keutzer, Song Han
NeurIPS 2025 Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation Shuo Yang, Haocheng Xi, Yilong Zhao, Muyang Li, Jintao Zhang, Han Cai, Yujun Lin, Xiuyu Li, Chenfeng Xu, Kelly Peng, Jianfei Chen, Song Han, Kurt Keutzer, Ion Stoica
ICML 2025 The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models Shishir G Patil, Huanzhi Mao, Fanjia Yan, Charlie Cheng-Jie Ji, Vishnu Suresh, Ion Stoica, Joseph E. Gonzalez
NeurIPS 2025 Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning Chaofan Lin, Jiaming Tang, Shuo Yang, Hanshuo Wang, Tian Tang, Boyu Tian, Ion Stoica, Song Han, Mingyu Gao
CVPR 2025 VisionArena: 230k Real World User-VLM Conversations with Preference Labels Christopher Chou, Lisa Dunlap, Koki Mashita, Krishna Mandal, Trevor Darrell, Ion Stoica, Joseph E. Gonzalez, Wei-Lin Chiang
NeurIPS 2025 Why Do Multi-Agent LLM Systems Fail? Mert Cemri, Melissa Z Pan, Shuyi Yang, Lakshya A Agrawal, Bhavya Chopra, Rishabh Tiwari, Kurt Keutzer, Aditya Parameswaran, Dan Klein, Kannan Ramchandran, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica
ICLRW 2025 Why Do Multiagent Systems Fail? Melissa Z Pan, Mert Cemri, Lakshya A Agrawal, Shuyi Yang, Bhavya Chopra, Rishabh Tiwari, Kurt Keutzer, Aditya Parameswaran, Kannan Ramchandran, Dan Klein, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica
NeurIPS 2025 WorldModelBench: Judging Video Generation Models as World Models Dacheng Li, Yunhao Fang, Yukang Chen, Shuo Yang, Shiyi Cao, Justin Wong, Michael Luo, Xiaolong Wang, Hongxu Yin, Joseph E. Gonzalez, Ion Stoica, Song Han, Yao Lu
NeurIPS 2024 Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems Lingjiao Chen, Jared Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou
ICML 2024 Break the Sequential Dependency of LLM Inference Using Lookahead Decoding Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang
ICML 2024 Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Banghua Zhu, Hao Zhang, Michael Jordan, Joseph E. Gonzalez, Ion Stoica
NeurIPS 2024 Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions Vinamra Benara, Chandan Singh, John X. Morris, Richard J. Antonello, Ion Stoica, Alexander G. Huth, Jianfeng Gao
NeurIPS 2024 Efficient LLM Scheduling by Learning to Rank Yichao Fu, Siqi Zhu, Runlong Su, Aurick Qiao, Ion Stoica, Hao Zhang
ICLR 2024 LLM-Assisted Code Cleaning for Training Accurate Code Generators Naman Jain, Tianjun Zhang, Wei-Lin Chiang, Joseph E. Gonzalez, Koushik Sen, Ion Stoica
ICLR 2024 LMSYS-Chat-1m: A Large-Scale Real-World LLM Conversation Dataset Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang
ICML 2024 MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang
ICML 2024 Online Speculative Decoding Xiaoxuan Liu, Lanxiang Hu, Peter Bailis, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang
ICML 2024 R2E: Turning Any GitHub Repository into a Programming Agent Environment Naman Jain, Manish Shetty, Tianjun Zhang, King Han, Koushik Sen, Ion Stoica
ICLRW 2024 R2E: Turning Any GitHub Repository into a Programming Agent Environment Naman Jain, Manish Shetty, Tianjun Zhang, King Han, Koushik Sen, Ion Stoica
NeurIPS 2024 SGLang: Efficient Execution of Structured Language Model Programs Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng
NeurIPS 2024 Stylus: Automatic Adapter Selection for Diffusion Models Michael Luo, Justin Wong, Brandon Trabucco, Yanping Huang, Joseph E. Gonzalez, Zhifeng Chen, Ruslan Salakhutdinov, Ion Stoica
ICML 2024 Trustless Audits Without Revealing Data or Models Suppakit Waiwitlikhit, Ion Stoica, Yi Sun, Tatsunori Hashimoto, Daniel Kang
ICML 2023 CLUTR: Curriculum Learning via Unsupervised Task Representation Learning Abdus Salam Azad, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Aleksandra Faust, Pieter Abbeel, Ion Stoica
ICMLW 2023 Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks Daniel Kang, Xuechen Li, Ion Stoica, Carlos Guestrin, Matei Zaharia, Tatsunori Hashimoto
ICML 2023 FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Beidi Chen, Percy Liang, Christopher Re, Ion Stoica, Ce Zhang
NeurIPSW 2023 How Long Can Context Length of Open-Source LLMs Truly Promise? Dacheng Li, Rulin Shao, Anze Xie, Ying Sheng, Lianmin Zheng, Joseph Gonzalez, Ion Stoica, Xuezhe Ma, Hao Zhang
NeurIPSW 2023 Improving Code Style for Accurate Code Generation Naman Jain, Tianjun Zhang, Wei-Lin Chiang, Joseph E. Gonzalez, Koushik Sen, Ion Stoica
NeurIPS 2023 Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric P. Xing, Hao Zhang, Joseph E Gonzalez, Ion Stoica
NeurIPSW 2023 LightSeq: : Sequence Level Parallelism for Distributed Training of Long Context Transformers Dacheng Li, Rulin Shao, Anze Xie, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Xuezhe Ma, Hao Zhang
NeurIPSW 2023 Scaling up Trustless DNN Inference with Zero-Knowledge Proofs Daniel Kang, Tatsunori Hashimoto, Ion Stoica, Yi Sun
JMLR 2023 VCG Mechanism Design with Unknown Agent Values Under Stochastic Bandit Feedback Kirthevasan Kandasamy, Joseph E Gonzalez, Michael I Jordan, Ion Stoica
AISTATS 2022 Learning Competitive Equilibria in Exchange Economies with Bandit Feedback Wenshuo Guo, Kirthevasan Kandasamy, Joseph Gonzalez, Michael Jordan, Ion Stoica
NeurIPSW 2022 CLUTR: Curriculum Learning via Unsupervised Task Representation Learning Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica
NeurIPSW 2022 CLUTR: Curriculum Learning via Unsupervised Task Representation Learning Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica
ECCV 2022 Context-Aware Streaming Perception in Dynamic Environments Gur-Eyal Sela, Ionel Gog, Justin Wong, Kumar Krishna Agrawal, Xiangxi Mo, Sukrit Kalra, Peter Schafhalter, Eric Leong, Xin Wang, Bharathan Balaji, Joseph Gonzalez, Ion Stoica
ICML 2022 POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging Shishir G. Patil, Paras Jain, Prabal Dutta, Ion Stoica, Joseph Gonzalez
AAAI 2022 Programmatic Modeling and Generation of Real-Time Strategic Soccer Environments for Reinforcement Learning Abdus Salam Azad, Edward Kim, Qiancheng Wu, Kimin Lee, Ion Stoica, Pieter Abbeel, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia
NeurIPS 2021 Accelerating Quadratic Optimization with Reinforcement Learning Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato, Goran Banjac, Michael Luo, Francesco Borrelli, Joseph E Gonzalez, Ion Stoica, Ken Goldberg
ICML 2021 ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training Jianfei Chen, Lianmin Zheng, Zhewei Yao, Dequan Wang, Ion Stoica, Michael Mahoney, Joseph Gonzalez
NeurIPS 2021 RLlib Flow: Distributed Reinforcement Learning Is a Dataflow Problem Eric Liang, Zhanghao Wu, Michael Luo, Sven Mika, Joseph E Gonzalez, Ion Stoica
NeurIPS 2021 Representing Long-Range Context for Graph Neural Networks with Global Attention Zhanghao Wu, Paras Jain, Matthew Wright, Azalia Mirhoseini, Joseph E Gonzalez, Ion Stoica
ICML 2021 Resource Allocation in Multi-Armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism Brijen Thananjeyan, Kirthevasan Kandasamy, Ion Stoica, Michael Jordan, Ken Goldberg, Joseph Gonzalez
ICML 2021 TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models Zhuohan Li, Siyuan Zhuang, Shiyuan Guo, Danyang Zhuo, Hao Zhang, Dawn Song, Ion Stoica
ICML 2020 FetchSGD: Communication-Efficient Federated Learning with Sketching Daniel Rothchild, Ashwinee Panda, Enayat Ullah, Nikita Ivkin, Ion Stoica, Vladimir Braverman, Joseph Gonzalez, Raman Arora
ICLR 2020 IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks Michael Luo, Jiahao Yao, Richard Liaw, Eric Liang, Ion Stoica
ICML 2020 Variable Skipping for Autoregressive Range Density Estimation Eric Liang, Zongheng Yang, Ion Stoica, Pieter Abbeel, Yan Duan, Peter Chen
NeurIPS 2019 Communication-Efficient Distributed SGD with Sketching Nikita Ivkin, Daniel Rothchild, Enayat Ullah, Vladimir Braverman, Ion Stoica, Raman Arora
ICMLW 2019 Multi-Task Learning via Task Multi-Clustering Andy Yan, Xin Wang, Ion Stoica, Joseph Gonzalez, Roy Fox
ICML 2019 Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules Daniel Ho, Eric Liang, Xi Chen, Ion Stoica, Pieter Abbeel
ICLR 2018 Parametrized Hierarchical Procedures for Neural Programming Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, Ion Stoica
ICML 2018 RLlib: Abstractions for Distributed Reinforcement Learning Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, Ion Stoica
CoRL 2017 DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations Sanjay Krishnan, Roy Fox, Ion Stoica, Ken Goldberg
ICLR 2016 SparkNet: Training Deep Networks in Spark Philipp Moritz, Robert Nishihara, Ion Stoica, Michael I. Jordan