ML Anthology
Authors
Search
About
Huebner, Curtis
1 publications
NeurIPSW
2023
Eliciting Language Model Behaviors Using Reverse Language Models
Jacob Pfau
,
Alex Infanger
,
Abhay Sheshadri
,
Ayush Panda
,
Julian Michael
,
Curtis Huebner