Whittingham, Hannes

2 publications

NeurIPS 2025. CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring. Benjamin Arnav, Pablo Bernabeu-Perez, Nathan Helm-Burger, Timothy Kostolansky, Hannes Whittingham, Mary Phuong
NeurIPS 2025. Large Language Models Can Learn and Generalize Steganographic Chain-of-Thought Under Process Supervision. Robert McCarthy, Joey Skaf, Luis Ibanez-Lissen, Vasil Georgiev, Connor Watts, Hannes Whittingham, Lorena Gonzalez-Manzano, Cameron Tice, Edward James Young, Puria Radmard, David Lindner