Tan, Daniel Chee Hian

5 publications

ICML 2025 Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs Jan Betley, Daniel Chee Hian Tan, Niels Warncke, Anna Sztyber-Betley, Xuchan Bao, Martín Soto, Nathan Labenz, Owain Evans
ICLRW 2025 Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs Jan Betley, Daniel Chee Hian Tan, Niels Warncke, Anna Sztyber-Betley, Xuchan Bao, Martín Soto, Nathan Labenz, Owain Evans
JAIR 2025 Towards Generalist Robot Learning from Internet Video: A Survey Robert McCarthy, Daniel Chee Hian Tan, Dominik Schmidt, Fernando Acero, Nathan Herr, Yilun Du, Thomas George Thuruthel, Zhibin Li
ICMLW 2024 Analyzing the Generalization and Reliability of Steering Vectors Daniel Chee Hian Tan, David Chanin, Aengus Lynch, Adrià Garriga-Alonso, Dimitrios Kanoulas, Brooks Paige, Robert Kirk
NeurIPSW 2024 H-Space Sparse Autoencoders Ayodeji Ijishakin, Ming Liang Ang, Levente Baljer, Daniel Chee Hian Tan, Hugo Laurence Fry, Ahmed Abdulaal, Aengus Lynch, James H. Cole