ML Anthology
Authors
Search
About
Smith, Eric Michael
4 publications
ICLR
2026
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
Jingyu Zhang
,
Haozhu Wang
,
Eric Michael Smith
,
Sid Wang
,
Amr Sharaf
,
Mahesh Pasupuleti
,
Benjamin Van Durme
,
Daniel Khashabi
,
Jason E Weston
,
Hongyuan Zhan
ICLR
2025
Backtracking Improves Generation Safety
Yiming Zhang
,
Jianfeng Chi
,
Hailey Nguyen
,
Kartikeya Upasani
,
Daniel M. Bikel
,
Jason E Weston
,
Eric Michael Smith
ICLR
2025
Persistent Pre-Training Poisoning of LLMs
Yiming Zhang
,
Javier Rando
,
Ivan Evtimov
,
Jianfeng Chi
,
Eric Michael Smith
,
Nicholas Carlini
,
Florian Tramèr
,
Daphne Ippolito
NeurIPSW
2022
Perturbation Augmentation for Fairer NLP
Rebecca Qian
,
Candace Ross
,
Jude Fernandes
,
Eric Michael Smith
,
Douwe Kiela
,
Adina Williams