ML Anthology
Cai, Will
2 publications
ICML 2025
Improving LLM Safety Alignment with Dual-Objective Optimization
Xuandong Zhao, Will Cai, Tianneng Shi, David Huang, Licong Lin, Song Mei, Dawn Song
AAAI 2025
Scaling Trends for Data Poisoning in LLMs
Dillon Bowen, Brendan Murphy, Will Cai, David Khachaturov, Adam Gleave, Kellin Pelrine