Goel, Shashwat

9 publications

ICLR 2026 Pitfalls in Evaluating Language Model Forecasters Daniel Paleka, Shashwat Goel, Jonas Geiping, Florian Tramèr
ICLR 2026 The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs Akshit Sinha, Arvindh Arun, Shashwat Goel, Steffen Staab, Jonas Geiping
ICML 2025 A Cognac Shot to Forget Bad Memories: Corrective Unlearning for Graph Neural Networks Varshita Kolipaka, Akshit Sinha, Debangan Mishra, Sumit Kumar, Arvindh Arun, Shashwat Goel, Ponnurangam Kumaraguru
ICLRW 2025 Can Language Models Falsify? the Need for Inverse Benchmarking Shiven Sinha, Shashwat Goel, Ponnurangam Kumaraguru, Jonas Geiping, Matthias Bethge, Ameya Prabhu
ICML 2025 Great Models Think Alike and This Undermines AI Oversight Shashwat Goel, Joschka Strüber, Ilze Amanda Auzina, Karuna K Chandra, Ponnurangam Kumaraguru, Douwe Kiela, Ameya Prabhu, Matthias Bethge, Jonas Geiping
ICLRW 2025 Great Models Think Alike and This Undermines AI Oversight Shashwat Goel, Joschka Strüber, Ilze Amanda Auzina, Karuna K Chandra, Ponnurangam Kumaraguru, Douwe Kiela, Ameya Prabhu, Matthias Bethge, Jonas Geiping
TMLR 2024 Corrective Machine Unlearning Shashwat Goel, Ameya Prabhu, Philip Torr, Ponnurangam Kumaraguru, Amartya Sanyal
AAAI 2024 Proportional Aggregation of Preferences for Sequential Decision Making Nikhil Chandak, Shashwat Goel, Dominik Peters
ICML 2024 The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew Bo Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Ariel Herbert-Voss, Cort B Breuer, Andy Zou, Mantas Mazeika, Zifan Wang, Palash Oswal, Weiran Lin, Adam Alfred Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Ian Steneker, David Campbell, Brad Jokubaitis, Steven Basart, Stephen Fitz, Ponnurangam Kumaraguru, Kallol Krishna Karmakar, Uday Tupakula, Vijay Varadharajan, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks