ML Anthology
Authors
Search
About
Kim, Taeyoun
5 publications
NeurIPS
2025
Reasoning as an Adaptive Defense for Safety
Taeyoun Kim
,
Fahim Tajwar
,
Aditi Raghunathan
,
Aviral Kumar
NeurIPS
2024
Predicting the Performance of Foundation Models via Agreement-on-the-Line
Rahul Saxena
,
Taeyoun Kim
,
Aman Mehra
,
Christina Baek
,
Zico Kolter
,
Aditi Raghunathan
NeurIPSW
2024
Testing the Limits of Jailbreaking Defenses with the Purple Problem
Taeyoun Kim
,
Suhas Kotha
,
Aditi Raghunathan
NeurIPSW
2023
Predicting the Performance of Foundation Models via Agreement-on-the-Line
Aman Mehra
,
Rahul Saxena
,
Taeyoun Kim
,
Christina Baek
,
J Zico Kolter
,
Aditi Raghunathan
NeurIPSW
2023
Predicting the Performance of Foundation Models via Agreement-on-the-Line
Rahul Saxena
,
Aman Mehra
,
Taeyoun Kim
,
Christina Baek
,
J Zico Kolter
,
Aditi Raghunathan