Griffin, Avery

1 publication

ICLR 2026. Eliciting Harmful Capabilities by Fine-Tuning on Safeguarded Outputs. Jackson Kaunismaa, John Hughes, Christina Q Knight, Avery Griffin, Mrinank Sharma, Erik Jones