ML Anthology
Authors
Search
About
Griffin, Avery
1 publications
ICLR
2026
Eliciting Harmful Capabilities by Fine-Tuning on Safeguarded Outputs
Jackson Kaunismaa
,
John Hughes
,
Christina Q Knight
,
Avery Griffin
,
Mrinank Sharma
,
Erik Jones