Zou, James

139 publications

NeurIPS 2025 4KAgent: Agentic Any Image to 4k Super-Resolution Yushen Zuo, Qi Zheng, Mingyang Wu, Xinrui Jiang, Renjie Li, Jian Wang, Yide Zhang, Gengchen Mai, Lihong Wang, James Zou, Xiaoyu Wang, Ming-Hsuan Yang, Zhengzhong Tu
NeurIPS 2025 A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning Yuzheng Hu, Fan Wu, Haotian Ye, David Forsyth, James Zou, Nan Jiang, Jiaqi W. Ma, Han Zhao
NeurIPS 2025 AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration Andy Zhou, Kevin Wu, Francesco Pinto, Zhaorun Chen, Yi Zeng, Yu Yang, Shuang Yang, Sanmi Koyejo, James Zou, Bo Li
NeurIPS 2025 CGBench: Benchmarking Language Model Scientific Reasoning for Clinical Genetics Research Owen Queen, Harrison G Zhang, James Zou
ICCV 2025 CHORDS: Diffusion Sampling Accelerator with Multi-Core Hierarchical ODE Solvers Jiaqi Han, Haotian Ye, Puheng Li, Minkai Xu, James Zou, Stefano Ermon
ICLR 2025 Capturing the Temporal Dependence of Training Data Influence Jiachen T. Wang, Dawn Song, James Zou, Prateek Mittal, Ruoxi Jia
ICML 2025 CollabLLM: From Passive Responders to Active Collaborators Shirley Wu, Michel Galley, Baolin Peng, Hao Cheng, Gavin Li, Yao Dou, Weixin Cai, James Zou, Jure Leskovec, Jianfeng Gao
ICML 2025 Cost-Efficient Collaboration Between On-Device and Cloud Language Models Avanika Narayan, Dan Biderman, Sabri Eyuboglu, Avner May, Scott Linderman, James Zou, Christopher Re
ICLRW 2025 Cost-Efficient Collaboration Between On-Device and Cloud Language Models Avanika Narayan, Sabri Eyuboglu, Dan Biderman, Avner May, Scott Linderman, James Zou, Christopher Re
DMLR 2025 Data Acquisition: A New Frontier in Data-Centric AI Lingjiao Chen, Bilge Acun, Newsha Ardalani, Yifan Sun, Feiyang Kang, Hanrui Lyu, Yongchan Kwon, Ruoxi Jia, Carole-Jean Wu, Matei Zaharia, James Zou
AISTATS 2025 Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models Haotian Ye, Himanshu Jain, Chong You, Ananda Theertha Suresh, Haowei Lin, James Zou, Felix Yu
NeurIPS 2025 EvoLM: In Search of Lost Training Dynamics for Language Model Reasoning Zhenting Qi, Fan Nie, Alexandre Alahi, James Zou, Himabindu Lakkaraju, Yilun Du, Eric P. Xing, Sham M. Kakade, Hanlin Zhang
NeurIPS 2025 ExGra-Med: Extended Context Graph Alignment for Medical Vision-Language Models Duy Minh Ho Nguyen, Nghiem Tuong Diep, Trung Quoc Nguyen, Hoang-Bao Le, Tai Nguyen, Anh-Tien Nguyen, TrungTin Nguyen, Nhat Ho, Pengtao Xie, Roger Wattenhofer, Daniel Sonntag, James Zou, Mathias Niepert
ICML 2025 FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees Fan Nie, Xiaotian Hou, Shuhang Lin, James Zou, Huaxiu Yao, Linjun Zhang
ICLR 2025 GMValuator: Similarity-Based Data Valuation for Generative Models Jiaxi Yang, Wenlong Deng, Benlin Liu, Yangsibo Huang, James Zou, Xiaoxiao Li
NeurIPS 2025 GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters Wanjia Zhao, Jiaqi Han, Siyi Gu, Mingjian Jiang, James Zou, Stefano Ermon
ICML 2025 Improving Model Alignment Through Collective Intelligence of Open-Source Models Junlin Wang, Roy Xie, Shang Zhu, Jue Wang, Ben Athiwaratkun, Bhuwan Dhingra, Shuaiwen Leon Song, Ce Zhang, James Zou
ICLR 2025 Locality Alignment Improves Vision-Language Models Ian Connick Covert, Tony Sun, James Zou, Tatsunori Hashimoto
ICLR 2025 MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Peng Xia, Kangyu Zhu, Haoran Li, Tianze Wang, Weijia Shi, Sheng Wang, Linjun Zhang, James Zou, Huaxiu Yao
ICLR 2025 MedTrinity-25m: A Large-Scale Multimodal Dataset with Multigranular Annotations for Medicine Yunfei Xie, Ce Zhou, Lang Gao, Juncheng Wu, Xianhang Li, Hong-Yu Zhou, Sheng Liu, Lei Xing, James Zou, Cihang Xie, Yuyin Zhou
ICLR 2025 Mixture-of-Agents Enhances Large Language Model Capabilities Junlin Wang, Jue Wang, Ben Athiwaratkun, Ce Zhang, James Zou
NeurIPS 2025 More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models Zhongxing Xu, Chengzhi Liu, Qingyue Wei, Juncheng Wu, James Zou, Xin Eric Wang, Yuyin Zhou, Sheng Liu
ICLRW 2025 OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Pan Lu, Bowen Chen, Sheng Liu, Rahul Thapa, Joseph Boen, James Zou
ICLRW 2025 Proper Dataset Valuation by Pointwise Mutual Information Shuran Zheng, Xuan Qi, Rui Ray Chen, Yongchan Kwon, James Zou
ICLR 2025 Reducing Hallucinations in Large Vision-Language Models via Latent Space Steering Sheng Liu, Haotian Ye, James Zou
TMLR 2025 Reliable and Responsible Foundation Models Xinyu Yang, Junlin Han, Rishi Bommasani, Jinqi Luo, Wenjie Qu, Wangchunshu Zhou, Adel Bibi, Xiyao Wang, Jaehong Yoon, Elias Stengel-Eskin, Shengbang Tong, Lingfeng Shen, Rafael Rafailov, Runjia Li, Zhaoyang Wang, Yiyang Zhou, Chenhang Cui, Yu Wang, Wenhao Zheng, Huichi Zhou, Jindong Gu, Zhaorun Chen, Peng Xia, Tony Lee, Thomas P Zollo, Vikash Sehwag, Jixuan Leng, Jiuhai Chen, Yuxin Wen, Huan Zhang, Zhun Deng, Linjun Zhang, Pavel Izmailov, Pang Wei Koh, Yulia Tsvetkov, Andrew Gordon Wilson, Jiaheng Zhang, James Zou, Cihang Xie, Hao Wang, Philip Torr, Julian McAuley, David Alvarez-Melis, Florian Tramèr, Kaidi Xu, Suman Jana, Chris Callison-Burch, Rene Vidal, Filippos Kokkinos, Mohit Bansal, Beidi Chen, Huaxiu Yao
NeurIPS 2025 SiriuS: Self-Improving Multi-Agent Systems via Bootstrapped Reasoning Wanjia Zhao, Mert Yuksekgonul, Shirley Wu, James Zou
ICLRW 2025 SiriuS: Self-Improving Multi-Agent Systems via Bootstrapped Reasoning Wanjia Zhao, Mert Yuksekgonul, Shirley Wu, James Zou
NeurIPS 2025 Solving Inequality Proofs with Large Language Models Pan Lu, Jiayi Sheng, Luna Lyu, Jikai Jin, Tony Xia, Alex Gu, James Zou
NeurIPS 2025 metaTextGrad: Automatically Optimizing Language Model Optimizers Guowei Xu, Mert Yuksekgonul, Carlos Guestrin, James Zou
NeurIPS 2024 Accelerating Transformers with Spectrum-Preserving Token Merging Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen, TrungTin Nguyen, Ngan Le, Pengtao Xie, Daniel Sonntag, James Zou, Binh T. Nguyen, Mathias Niepert
NeurIPS 2024 Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems Lingjiao Chen, Jared Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou
ICML 2024 ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations Kailas Vodrahalli, James Zou
NeurIPS 2024 AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning Shirley Wu, Shiyu Zhao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis N. Ioannidis, Karthik Subbian, Jure Leskovec, James Zou
NeurIPS 2024 CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models Peng Xia, Ze Chen, Juanxi Tian, Yangrui Gong, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao
ICMLW 2024 CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models Peng Xia, Ze Chen, Juanxi Tian, Gong Yangrui, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao
NeurIPS 2024 ClashEval: Quantifying the Tug-of-War Between an LLM’s Internal Prior and External Evidence Kevin Wu, Eric Wu, James Zou
DMLR 2024 DMLR: Data-Centric Machine Learning Research - Past, Present and Future Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, Ahmed Alaa, Adji Bousso Dieng, Natasha Noy, Vijay Janapa Reddi, James Zou, Praveen Paritosh, Mihaela van der Schaar, Kurt Bollacker, Lora Aroyo, Ce Zhang, Joaquin Vanschoren, Isabelle Guyon, Peter Mattson
ICLR 2024 DataInf: Efficiently Estimating Data Influence in LoRA-Tuned LLMs and Diffusion Models Yongchan Kwon, Eric Wu, Kevin Wu, James Zou
NeurIPS 2024 Enhancing Large Vision Language Models with Self-Training on Image Comprehension Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, Quanquan Gu, James Zou, Kai-Wei Chang, Wei Wang
TMLR 2024 FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance Lingjiao Chen, Matei Zaharia, James Zou
NeurIPS 2024 GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec
ICML 2024 How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis Federico Bianchi, Patrick John Chia, Mert Yuksekgonul, Jacopo Tagliabue, Dan Jurafsky, James Zou
NeurIPSW 2024 Improving LLM Group Fairness on Tabular Data via In-Context Learning Valeriia Cherepanova, Chia-Jung Lee, Nil-Jana Akpinar, Riccardo Fogliato, Martin Andres Bertran, Michael Kearns, James Zou
NeurIPSW 2024 Improving LLM Group Fairness on Tabular Data via In-Context Learning Valeriia Cherepanova, Chia-Jung Lee, Nil-Jana Akpinar, Riccardo Fogliato, Martin Andres Bertran, Michael Kearns, James Zou
ICML 2024 Learning and Forgetting Unsafe Examples in Large Language Models Jiachen Zhao, Zhun Deng, David Madras, James Zou, Mengye Ren
NeurIPSW 2024 MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Peng Xia, Kangyu Zhu, Haoran Li, Tianze Wang, Weijia Shi, Linjun Zhang, James Zou, Huaxiu Yao
NeurIPSW 2024 MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Peng Xia, Kangyu Zhu, Haoran Li, Tianze Wang, Weijia Shi, Sheng Wang, Linjun Zhang, James Zou, Huaxiu Yao
ICLR 2024 Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace Xinyu Yang, Weixin Liang, James Zou
TMLR 2024 New Evaluation Metrics Capture Quality Degradation Due to LLM Watermarking Karanpartap Singh, James Zou
ICML 2024 Position: TrustLLM: Trustworthiness in Large Language Models Yue Huang, Lichao Sun, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Hanchi Sun, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric P. Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, Huan Zhang, Huaxiu Yao, Manolis Kellis, Marinka Zitnik, Meng Jiang, Mohit Bansal, James Zou, Jian Pei, Jian Liu, Jianfeng Gao, Jiawei Han, Jieyu Zhao, Jiliang Tang, Jindong Wang, Joaquin Vanschoren, John Mitchell, Kai Shu, Kaidi Xu, Kai-Wei Chang, Lifang He, Lifu Huang, Michael Backes, Neil Zhenqiang Gong, Philip S. Yu, Pin-Yu Chen, Quanquan Gu, Ran Xu, Rex Ying, Shuiwang Ji, Suman Jana, Tianlong Chen, Tianming Liu, Tianyi Zhou, William Yang Wang, Xiang Li, Xiangliang Zhang, Xiao Wang, Xing Xie, Xun Chen, Xuyu Wang, Yan Liu, Yanfang Ye, Yinzhi Cao, Yong Chen, Yue Zhao
ICML 2024 Prospector Heads: Generalized Feature Attribution for Large Models & Data Gautam Machiraju, Alexander Derry, Arjun D Desai, Neel Guha, Amir-Hossein Karimi, James Zou, Russ B Altman, Christopher Re, Parag Mallick
TMLR 2024 Provable Membership Inference Privacy Zachary Izzo, Jinsung Yoon, Sercan O Arik, James Zou
CHIL 2024 Regulating AI Adaptation: An Analysis of AI Medical Device Updates Kevin Wu, Eric Wu, Kit Rodolfa, Daniel E Ho, James Zou
ICML 2024 Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits Jiachen T. Wang, Tianji Yang, James Zou, Yongchan Kwon, Ruoxi Jia
NeurIPS 2024 STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec
ICLR 2024 Safety-Tuned LLaMAs: Lessons from Improving the Safety of Large Language Models That Follow Instructions Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Rottger, Dan Jurafsky, Tatsunori Hashimoto, James Zou
ICML 2024 Scaling Laws for the Value of Individual Data Points in Machine Learning Ian Connick Covert, Wenlong Ji, Tatsunori Hashimoto, James Zou
ICML 2024 Selecting Large Language Model to Fine-Tune via Rectified Scaling Law Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, Zihao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang
ICLRW 2024 Selecting Large Language Model to Fine-Tune via Rectified Scaling Law Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, Zihao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang
ICML 2024 Simple Linear Attention Language Models Balance the Recall-Throughput Tradeoff Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, James Zou, Atri Rudra, Christopher Re
ICMLW 2024 Simple Linear Attention Language Models Balance the Recall-Throughput Tradeoff Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Re
ICML 2024 SleepFM: Multi-Modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals Rahul Thapa, Bryan He, Magnus Ruud Kjaer, Hyatt Moore IV, Gauri Ganjoo, Emmanuel Mignot, James Zou
NeurIPS 2024 Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution Ian Covert, Chanwoo Kim, Su-In Lee, James Zou, Tatsunori Hashimoto
NeurIPS 2024 TFG: Unified Training-Free Guidance for Diffusion Models Haotian Ye, Haowei Lin, Jiaqi Han, Minkai Xu, Sheng Liu, Yitao Liang, Jianzhu Ma, James Zou, Stefano Ermon
ICMLW 2024 Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs Valeriia Cherepanova, James Zou
ICMLW 2024 Towards Smaller Language Models via Layer Looping Sabri Eyuboglu, Dylan Zinsley, Jon Saad-Falcon, Simran Arora, Atri Rudra, James Zou, Christopher Re
NeurIPS 2024 UniTox: Leveraging LLMs to Curate a Unified Dataset of Drug-Induced Toxicity from FDA Labels Jake Silberg, Kyle Swanson, Elana Simon, Angela Zhang, Zaniar Ghazizadeh, Scott Ogden, Hisham Hamadeh, James Zou
ICLR 2024 Zoology: Measuring and Improving Recall in Efficient Language Models Simran Arora, Sabri Eyuboglu, Aman Timalsina, Isys Johnson, Michael Poli, James Zou, Atri Rudra, Christopher Re
NeurIPSW 2024 metaTextGrad: Learning to Learn with Language Models as Optimizers Guowei Xu, Mert Yuksekgonul, Carlos Guestrin, James Zou
NeurIPSW 2023 A Theoretical Study of Dataset Distillation Zachary Izzo, James Zou
ICML 2023 Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations Weixin Liang, Yining Mao, Yongchan Kwon, Xinyu Yang, James Zou
NeurIPSW 2023 Analyzing ChatGPT’s Behavior Shifts over Time Lingjiao Chen, Matei Zaharia, James Zou
ICMLW 2023 Beyond Confidence: Reliable Models Should Also Consider Atypicality Mert Yuksekgonul, Linjun Zhang, James Zou, Carlos Guestrin
ICLRW 2023 Beyond Confidence: Reliable Models Should Also Quantify Atypicality Mert Yuksekgonul, Linjun Zhang, James Zou, Carlos Guestrin
TMLR 2023 Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Johan Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew Kyle Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartłomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cesar Ferri, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Christopher Waites, Christian Voigt, Christopher D Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, C. Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A Chi, Ethan Dyer, Ethan Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, Francois Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germàn Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Xinyue Wang, Gonzalo Jaimovitch-Lopez, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Francis Anthony Shevlin, Hinrich Schuetze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, Jose Hernandez-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh Dhole, Kevin Gimpel, Kevin Omondi, Kory Wallace Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras-Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros-Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje Ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, Maria Jose Ramirez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L Leavitt, Matthias Hagen, Mátyás Schubert, Medina Orduna Baitemirova, Melody Arnaud, Melvin McElrath, Michael Andrew Yee, Michael Cohen, Michael Gu, Michael Ivanitskiy, Michael Starritt, Michael Strube, Michał Swędrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T, Nanyun Peng, Nathan Andrew Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha S. Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter W Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Miłkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan Le Bras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Russ Salakhutdinov, Ryan Andrew Chi, Seungjae Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel Stern Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima Shammie Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven Piantadosi, Stuart Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsunori Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay Venkatesh Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Sophie Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu
CHIL 2023 Collecting Data When Missingness Is Unknown: A Method for Improving Model Performance Given Under-Reporting in Patient Populations Kevin Wu, Dominik Dahlem, Christopher Hane, Eran Halperin, James Zou
ICML 2023 Data-Driven Subgroup Identification for Linear Regression Zachary Izzo, Ruishan Liu, James Zou
ICML 2023 Data-OOB: Out-of-Bag Estimate as a Simple and Efficient Data Value Yongchan Kwon, James Zou
ICLR 2023 Diagnosing and Rectifying Vision Models Using Language Yuhui Zhang, Jeff Z. HaoChen, Shih-Cheng Huang, Kuan-Chieh Wang, James Zou, Serena Yeung
ICML 2023 Discover and Cure: Concept-Aware Mitigation of Spurious Correlation Shirley Wu, Mert Yuksekgonul, Linjun Zhang, James Zou
ICLR 2023 FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data Zhun Deng, Jiayao Zhang, Linjun Zhang, Ting Ye, Yates Coley, Weijie J Su, James Zou
ICLR 2023 FaiREE: Fair Classification with Finite-Sample and Distribution-Free Guarantee Puheng Li, James Zou, Linjun Zhang
AISTATS 2023 Freeze Then Train: Towards Provable Representation Learning Under Spurious Correlations and Feature Noise Haotian Ye, James Zou, Linjun Zhang
ICLRW 2023 GPT Detectors Are Biased Against Non-Native English Writers Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu, James Zou
NeurIPSW 2023 Generative AI for Designing and Validating Easily Synthesizable and Structurally Novel Antibiotics Kyle Swanson, Gary Liu, Denise Catacutan, James Zou, Jonathan Stokes
AAAI 2023 HAPI Explorer: Comprehension, Discovery, and Explanation on History of ML APIs Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Huamin Qu, Christopher Ré, Matei Zaharia, James Zou
ICMLW 2023 Less Is More: Using Multiple LLMs for Applications with Lower Costs Lingjiao Chen, Matei Zaharia, James Zou
NeurIPSW 2023 Navigating Dataset Documentation in ML: A Large-Scale Analysis of Dataset Cards on Hugging Face Xinyu Yang, Weixin Liang, James Zou
ICLR 2023 Post-Hoc Concept Bottleneck Models Mert Yuksekgonul, Maggie Wang, James Zou
ICMLW 2023 Prospectors: Leveraging Short Contexts to Mine Salient Objects in High-Dimensional Imagery Gautam Machiraju, Arjun D Desai, James Zou, Christopher Re, Parag Mallick
JMLR 2023 The Power of Contrast for Feature Learning: A Theoretical Analysis Wenlong Ji, Zhun Deng, Ryumei Nakada, James Zou, Linjun Zhang
AISTATS 2023 Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data Ryumei Nakada, Halil Ibrahim Gulluk, Zhun Deng, Wenlong Ji, James Zou, Linjun Zhang
CHIL 2023 Understanding and Predicting the Effect of Environmental Factors on People with Type 2 Diabetes Kailas Vodrahalli, Gregory D Lyng, Brian L Hill, Kimmo Karkkainen, Jeffrey Hertzberg, James Zou, Eran Halperin
ICLR 2023 When and Why Vision-Language Models Behave like Bags-of-Words, and What to Do About It? Mert Yuksekgonul, Federico Bianchi, Pratyusha Kalluri, Dan Jurafsky, James Zou
AISTATS 2022 Beta Shapley: A Unified and Noise-Reduced Data Valuation Framework for Machine Learning Yongchan Kwon, James Zou
AISTATS 2022 How to Learn When Data Gradually Reacts to Your Model Zachary Izzo, James Zou, Lexing Ying
AISTATS 2022 MLDemon: Deployment Monitoring for Machine Learning Systems Tony Ginart, Martin Jinye Zhang, James Zou
CVPR 2022 Clustering Plotted Data by Image Segmentation Tarek Naous, Srinjay Sarkar, Abubakar Abid, James Zou
TMLR 2022 Competition over Data: How Does Data Purchase Affect Users? Yongchan Kwon, Tony A Ginart, James Zou
NeurIPSW 2022 Data-Driven Subgroup Identification for Linear Regression Zachary Izzo, Ruishan Liu, James Zou
ICLR 2022 Domino: Discovering Systematic Errors with Cross-Modal Embeddings Sabri Eyuboglu, Maya Varma, Khaled Kamal Saab, Jean-Benoit Delbrouck, Christopher Lee-Messer, Jared Dunnmon, James Zou, Christopher Re
NeurIPSW 2022 DrML: Diagnosing and Rectifying Vision Models Using Language Yuhui Zhang, Jeff Z. HaoChen, Shih-Cheng Huang, Kuan-Chieh Wang, James Zou, Serena Yeung
NeurIPSW 2022 DrML: Diagnosing and Rectifying Vision Models Using Language Yuhui Zhang, Jeff Z. HaoChen, Shih-Cheng Huang, Kuan-Chieh Wang, James Zou, Serena Yeung
ICML 2022 Efficient Online ML API Selection for Multi-Label Classification Tasks Lingjiao Chen, Matei Zaharia, James Zou
ICLR 2022 How Did the Model Change? Efficiently Assessing Machine Learning API Shifts Lingjiao Chen, Matei Zaharia, James Zou
ICML 2022 Improving Out-of-Distribution Robustness via Selective Augmentation Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou, Chelsea Finn
ICML 2022 Meaningfully Debugging Model Mistakes Using Conceptual Counterfactual Explanations Abubakar Abid, Mert Yuksekgonul, James Zou
ICLR 2022 MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts Weixin Liang, James Zou
ICMLW 2022 Mind the Gap: Understanding the Modality Gap in Multi-Modal Contrastive Representation Learning Weixin Liang, Yuhui Zhang, Yongchan Kwon, Serena Yeung, James Zou
ICMLW 2022 On the Nonlinear Correlation of ML Performance Across Data Subpopulations Weixin Liang, Yining Mao, Yongchan Kwon, Xinyu Yang, James Zou
ICLRW 2022 Post-Hoc Concept Bottleneck Models Mert Yuksekgonul, Maggie Wang, James Zou
NeurIPSW 2022 Predicting Immune Escape with Pretrained Protein Language Model Embeddings Kyle Swanson, Howard Chang, James Zou
NeurIPSW 2022 Provable Membership Inference Privacy Zachary Izzo, Jinsung Yoon, Sercan O Arik, James Zou
NeurIPSW 2022 Recommendation for New Drugs with Limited Prescription Data Zhenbang Wu, Huaxiu Yao, Zhe Su, David Liebovitz, Lucas M Glass, James Zou, Chelsea Finn, Jimeng Sun
ICML 2022 When and How Mixup Improves Calibration Linjun Zhang, Zhun Deng, Kenji Kawaguchi, James Zou
AISTATS 2021 Approximate Data Deletion from Machine Learning Models Zachary Izzo, Mary Anne Smart, Kamalika Chaudhuri, James Zou
AISTATS 2021 Competing AI: How Does Competition Feedback Affect Machine Learning? Tony Ginart, Eva Zhang, Yongchan Kwon, James Zou
AISTATS 2021 Efficient Computation and Analysis of Distributional Shapley Values Yongchan Kwon, Manuel A. Rivas, James Zou
AISTATS 2021 Improving Adversarial Robustness via Unlabeled Out-of-Domain Data Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou
ICLR 2021 How Does Mixup Help with Robustness and Generalization? Linjun Zhang, Zhun Deng, Kenji Kawaguchi, Amirata Ghorbani, James Zou
ICML 2021 How to Learn When Data Reacts to Your Model: Performative Gradient Descent Zachary Izzo, Lexing Ying, James Zou
ICML 2021 Improving Generalization in Meta-Learning via Task Augmentation Huaxiu Yao, Long-Kai Huang, Linjun Zhang, Ying Wei, Li Tian, James Zou, Junzhou Huang, Zhenhui Li
NeurIPSW 2021 Language Models as Recommender Systems: Evaluations and Limitations Yuhui Zhang, Hao Ding, Zeren Shui, Yifei Ma, James Zou, Anoop Deoras, Hao Wang
ICML 2020 A Distributional Framework for Data Valuation Amirata Ghorbani, Michael Kim, James Zou
ICLR 2020 Learning Transport Cost from Subset Correspondence Ruishan Liu, Akshay Balsubramani, James Zou
ICML 2019 Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits Martin Zhang, James Zou, David Tse
ICML 2019 Data Shapley: Equitable Valuation of Data for Machine Learning Amirata Ghorbani, James Zou
ICML 2019 Discovering Conditionally Salient Features with Statistical Guarantees Jaime Roquero Gimenez, James Zou
AISTATS 2019 Improving the Stability of the Knockoff Procedure: Multiple Simultaneous Knockoffs and Entropy Maximization Jaime Roquero Gimenez, James Zou
AISTATS 2019 Knockoffs for the Mass: New Feature Importance Statistics with False Discovery Guarantees Jaime Roquero Gimenez, Amirata Ghorbani, James Zou
ICML 2018 CoVeR: Learning Covariate-Specific Vector Representations with Tensor Decompositions Kevin Tian, Teng Zhang, James Zou
AISTATS 2018 Why Adaptively Collected Data Have Negative Bias and How to Correct for It Xinkun Nie, Xiaoying Tian, Jonathan Taylor, James Zou
ICML 2017 Estimating the Unseen from Multiple Populations Aditi Raghunathan, Gregory Valiant, James Zou
ICML 2017 Learning Latent Space Models with Angular Constraints Pengtao Xie, Yuntian Deng, Yi Zhou, Abhimanu Kumar, Yaoliang Yu, James Zou, Eric P. Xing
AISTATS 2017 Quantifying the Accuracy of Approximate Diffusions and Markov Chains Jonathan Huggins, James Zou
AISTATS 2016 Controlling Bias in Adaptive Data Analysis Using Information Theory Daniel Russo, James Zou
ICML 2016 Rich Component Analysis Rong Ge, James Zou
ICML 2015 Intersecting Faces: Non-Negative Matrix Factorization with New Guarantees Rong Ge, James Zou