Jurafsky, Dan

23 publications

ICML 2025 AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders Zhengxuan Wu, Aryaman Arora, Atticus Geiger, Zheng Wang, Jing Huang, Dan Jurafsky, Christopher D. Manning, Christopher Potts

ICLR 2025 H4rm3l: A Language for Composable Jailbreak Attack Synthesis Moussa Koulako Bala Doumbouya, Ananjan Nandi, Gabriel Poesia, Davide Ghilardi, Anna Goldie, Federico Bianchi, Dan Jurafsky, Christopher D. Manning

ICML 2025 What Can Large Language Models Do for Sustainable Food? Anna Thomas, Adam Yee, Andrew Mayne, Maya B. Mathur, Dan Jurafsky, Kristina Gligorić

ICLR 2024 A Benchmark for Learning to Translate a New Language from One Grammar Book Garrett Tanzer, Mirac Suzgun, Eline Visser, Dan Jurafsky, Luke Melas-Kyriazi

NeurIPSW 2024 AI-Generated Content and Public Persuasion: The Limited Effect of AI Authorship Labels Isabel O. Gallegos, Chen Shani, Weiyan Shi, Federico Bianchi, Robb Willer, Dan Jurafsky

ICML 2024 How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis Federico Bianchi, Patrick John Chia, Mert Yuksekgonul, Jacopo Tagliabue, Dan Jurafsky, James Zou

ICML 2024 Model Alignment as Prospect Theoretic Optimization Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky, Douwe Kiela

NeurIPS 2024 ReFT: Representation Finetuning for Language Models Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts

ICLR 2024 Safety-Tuned LLaMAs: Lessons from Improving the Safety of Large Language Models That Follow Instructions Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou

NeurIPS 2023 Ecosystem-Level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes Connor Toups, Rishi Bommasani, Kathleen Creel, Sarah Bana, Dan Jurafsky, Percy Liang

JMLR 2023 Foundation Models and Fair Use Peter Henderson, Xuechen Li, Dan Jurafsky, Tatsunori Hashimoto, Mark A. Lemley, Percy Liang

ICLR 2023 When and Why Vision-Language Models Behave like Bags-of-Words, and What to Do About It? Mert Yuksekgonul, Federico Bianchi, Pratyusha Kalluri, Dan Jurafsky, James Zou

NeurIPS 2022 Picking on the Same Person: Does Algorithmic Monoculture Lead to Outcome Homogenization? Rishi Bommasani, Kathleen A. Creel, Ananya Kumar, Dan Jurafsky, Percy Liang

NeurIPS 2022 Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset Peter Henderson, Mark Krass, Lucia Zheng, Neel Guha, Christopher D. Manning, Dan Jurafsky, Daniel Ho

ICMLW 2022 Self-Destructing Models: Increasing the Costs of Harmful Dual Uses in Foundation Models Eric Mitchell, Peter Henderson, Christopher D. Manning, Dan Jurafsky, Chelsea Finn

ICLR 2021 Nearest Neighbor Machine Translation Urvashi Khandelwal, Angela Fan, Dan Jurafsky, Luke Zettlemoyer, Mike Lewis

AAAI 2020 Automatically Neutralizing Subjective Bias in Text Reid Pryzant, Richard Diehl Martinez, Nathan Dass, Sadao Kurohashi, Dan Jurafsky, Diyi Yang

ICLR 2020 Generalization Through Memorization: Nearest Neighbor Language Models Urvashi Khandelwal, Omer Levy, Dan Jurafsky, Luke Zettlemoyer, Mike Lewis

NeurIPS 2020 Language Through a Prism: A Spectral Approach for Multiscale Language Representations Alex Tamkin, Dan Jurafsky, Noah Goodman

JMLR 2020 Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning Peter Henderson, Jieru Hu, Joshua Romoff, Emma Brunskill, Dan Jurafsky, Joelle Pineau

NeurIPS 2018 Embedding Logical Queries on Knowledge Graphs Will Hamilton, Payal Bajaj, Marinka Zitnik, Dan Jurafsky, Jure Leskovec

ICLR 2017 Data Noising as Smoothing in Neural Network Language Models Ziang Xie, Sida I. Wang, Jiwei Li, Daniel Lévy, Aiming Nie, Dan Jurafsky, Andrew Y. Ng

ICML 2012 Learning the Central Events and Participants in Unlabeled Text Nathanael Chambers, Dan Jurafsky