ML Anthology
Authors
Search
About
Brown, Samuel F.
4 publications
ICLR
2025
AI Sandbagging: Language Models Can Strategically Underperform on Evaluations
Teun van der Weij
,
Felix Hofstätter
,
Oliver Jaffe
,
Samuel F. Brown
,
Francis Rhys Ward
NeurIPSW
2024
AI Sandbagging: Language Models Can Selectively Underperform on Evaluations
Teun van der Weij
,
Felix Hofstätter
,
Oliver Jaffe
,
Samuel F. Brown
,
Francis Rhys Ward
NeurIPSW
2024
Auto-Enhance: Towards a Meta-Benchmark to Evaluate AI Agents' Ability to Improve Other Agents
Samuel F. Brown
,
Basil Labib
,
Codruta Lugoj
,
Sai Sasank Y
NeurIPSW
2024
Auto-Enhance: Towards a Meta-Benchmark to Evaluate AI Agents' Ability to Improve Other Agents
Samuel F. Brown
,
Basil Labib
,
Codruta Lugoj
,
Sai Sasank Y