Kaufmann, Maximilian

3 publications

NeurIPSW 2024 RenderAttack: Hundreds of Adversarial Attacks Through Differentiable Texture Generation Dron Hazra, Alex Bie, Mantas Mazeika, Xuwang Yin, Andy Zou, Dan Hendrycks, Maximilian Kaufmann
ICLR 2024 The Reversal Curse: LLMs Trained on “a Is B” Fail to Learn “b Is A” Lukas Berglund, Meg Tong, Maximilian Kaufmann, Mikita Balesni, Asa Cooper Stickland, Tomasz Korbak, Owain Evans
NeurIPSW 2023 The Reversal Curse: LLMs Trained on "a Is B" Fail to Learn "b Is A" Lukas Berglund, Meg Tong, Maximilian Kaufmann, Mikita Balesni, Asa Stickland, Tomasz Korbak, Owain Evans