ML Anthology
Authors
Search
About
Jose, Arun
2 publications
NeurIPS
2025
Reasoning Models Sometimes Output Illegible Chains of Thought
Arun Jose
NeurIPS
2025
Why Do Some Language Models Fake Alignment While Others Don't?
Abhay Sheshadri
,
John Hughes
,
Julian Michael
,
Alex Troy Mallen
,
Arun Jose
,
Fabien Roger