Jose, Arun

2 publications

NeurIPS 2025. Reasoning Models Sometimes Output Illegible Chains of Thought. Arun Jose
NeurIPS 2025. Why Do Some Language Models Fake Alignment While Others Don't? Abhay Sheshadri, John Hughes, Julian Michael, Alex Troy Mallen, Arun Jose, Fabien Roger