ML Anthology
Elson, David
1 publication
ICML 2025
MONA: Myopic Optimization with Non-Myopic Approval Can Mitigate Multi-Step Reward Hacking
Sebastian Farquhar, Vikrant Varma, David Lindner, David Elson, Caleb Biddulph, Ian Goodfellow, Rohin Shah