Wang, Sida

22 publications

ICLR 2025 | LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code | Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica
ICLR 2025 | SWE-Bench Multimodal: Do AI Systems Generalize to Visual Software Domains? | John Yang, Carlos E Jimenez, Alex L Zhang, Kilian Lieret, Joyce Yang, Xindi Wu, Ori Press, Niklas Muennighoff, Gabriel Synnaeve, Karthik R Narasimhan, Diyi Yang, Sida Wang, Ofir Press
NeurIPS 2025 | SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution | Yuxiang Wei, Olivier Duchenne, Jade Copet, Quentin Carbonneaux, Lingming Zhang, Daniel Fried, Gabriel Synnaeve, Rishabh Singh, Sida Wang
ICLR 2025 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | Fangyu Lei, Jixuan Chen, Yuxiao Ye, Ruisheng Cao, Dongchan Shin, Hongjin Su, Zhaoqing Suo, Hongcheng Gao, Wenjing Hu, Pengcheng Yin, Victor Zhong, Caiming Xiong, Ruoxi Sun, Qian Liu, Sida Wang, Tao Yu
ICML 2024 | CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution | Alex Gu, Baptiste Roziere, Hugh James Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida Wang
ICLRW 2024 | CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution | Alex Gu, Baptiste Roziere, Hugh James Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida Wang
ICML 2024 | Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks | Linyuan Gong, Sida Wang, Mostafa Elhoushi, Alvin Cheung
NeurIPS 2024 | Spider2-V: How Far Are Multimodal Agents from Automating Data Science and Engineering Workflows? | Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Yuchen Mao, Wenjing Hu, Tianbao Xie, Hongshen Xu, Danyang Zhang, Sida Wang, Ruoxi Sun, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu, Victor Zhong, Lu Chen, Kai Yu, Tao Yu
NeurIPS 2023 | Accessing Higher Dimensions for Unsupervised Word Translation | Sida Wang
ICML 2023 | Coder Reviewer Reranking for Code Generation | Tianyi Zhang, Tao Yu, Tatsunori Hashimoto, Mike Lewis, Wen-Tau Yih, Daniel Fried, Sida Wang
ICML 2023 | DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation | Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Wen-Tau Yih, Daniel Fried, Sida Wang, Tao Yu
ICLR 2023 | InCoder: A Generative Model for Code Infilling and Synthesis | Daniel Fried, Armen Aghajanyan, Jessy Lin, Sida Wang, Eric Wallace, Freda Shi, Ruiqi Zhong, Scott Yih, Luke Zettlemoyer, Mike Lewis
ICML 2023 | LEVER: Learning to Verify Language-to-Code Generation with Execution | Ansong Ni, Srini Iyer, Dragomir Radev, Veselin Stoyanov, Wen-Tau Yih, Sida Wang, Xi Victoria Lin
AISTATS 2021 | Towards Understanding the Behaviors of Optimal Deep Active Learning Algorithms | Yilun Zhou, Adithya Renduchintala, Xian Li, Sida Wang, Yashar Mehdad, Asish Ghoshal
NeurIPS 2021 | SILG: The Multi-Domain Symbolic Interactive Language Grounding Benchmark | Victor Zhong, Austin W. Hanjie, Sida Wang, Karthik Narasimhan, Luke Zettlemoyer
NeurIPS 2020 | Pre-Training via Paraphrasing | Mike Lewis, Marjan Ghazvininejad, Gargi Ghosh, Armen Aghajanyan, Sida Wang, Luke Zettlemoyer
AAAI 2017 | Phrase-Based Presentation Slides Generation for Academic Papers | Sida Wang, Xiaojun Wan, Shikang Du
NeurIPS 2015 | Estimating Mixture Models via Mixtures of Polynomials | Sida Wang, Arun Tejasvi Chaganty, Percy Liang
NeurIPS 2014 | Altitude Training: Strong Bounds for Single-Layer Dropout | Stefan Wager, William Fithian, Sida Wang, Percy Liang
NeurIPS 2014 | Simple MAP Inference via Low-Rank Relaxations | Roy Frostig, Sida Wang, Percy Liang, Christopher D. Manning
NeurIPS 2013 | Dropout Training as Adaptive Regularization | Stefan Wager, Sida Wang, Percy Liang
ICML 2013 | Fast Dropout Training | Sida Wang, Christopher Manning