Huebner, Curtis

1 publications

NeurIPSW 2023 Eliciting Language Model Behaviors Using Reverse Language Models Jacob Pfau, Alex Infanger, Abhay Sheshadri, Ayush Panda, Julian Michael, Curtis Huebner