Kim, Taeyoun

5 publications

NeurIPS 2025: Reasoning as an Adaptive Defense for Safety. Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan, Aviral Kumar
NeurIPS 2024: Predicting the Performance of Foundation Models via Agreement-on-the-Line. Rahul Saxena, Taeyoun Kim, Aman Mehra, Christina Baek, Zico Kolter, Aditi Raghunathan
NeurIPSW 2024: Testing the Limits of Jailbreaking Defenses with the Purple Problem. Taeyoun Kim, Suhas Kotha, Aditi Raghunathan
NeurIPSW 2023: Predicting the Performance of Foundation Models via Agreement-on-the-Line. Aman Mehra, Rahul Saxena, Taeyoun Kim, Christina Baek, J Zico Kolter, Aditi Raghunathan
NeurIPSW 2023: Predicting the Performance of Foundation Models via Agreement-on-the-Line. Rahul Saxena, Aman Mehra, Taeyoun Kim, Christina Baek, J Zico Kolter, Aditi Raghunathan