Kotha, Suhas

6 publications

ICLR 2025 Repetition Improves Language Model Embeddings Jacob Mitchell Springer, Suhas Kotha, Daniel Fried, Graham Neubig, Aditi Raghunathan

ICML 2024 Position: A Safe Harbor for AI Evaluation and Red Teaming Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Alex Pentland, Arvind Narayanan, Percy Liang, Peter Henderson

NeurIPSW 2024 Testing the Limits of Jailbreaking Defenses with the Purple Problem Taeyoun Kim, Suhas Kotha, Aditi Raghunathan

ICLR 2024 Understanding Catastrophic Forgetting in Language Models via Implicit Inference Suhas Kotha, Jacob Mitchell Springer, Aditi Raghunathan

NeurIPS 2023 Provably Bounding Neural Network Preimages Suhas Kotha, Christopher Brix, J. Zico Kolter, Krishnamurthy Dvijotham, Huan Zhang

NeurIPSW 2023 Understanding Catastrophic Forgetting in Language Models via Implicit Inference Suhas Kotha, Jacob Springer, Aditi Raghunathan