Attanasio, Giuseppe

1 publication

ICLR 2024. Safety-Tuned LLaMAs: Lessons from Improving the Safety of Large Language Models That Follow Instructions. Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Rottger, Dan Jurafsky, Tatsunori Hashimoto, James Zou.