Bao, Xuchan

11 publications

ICML 2025 Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs Jan Betley, Daniel Chee Hian Tan, Niels Warncke, Anna Sztyber-Betley, Xuchan Bao, Martín Soto, Nathan Labenz, Owain Evans
ICLRW 2025 Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs Jan Betley, Daniel Chee Hian Tan, Niels Warncke, Anna Sztyber-Betley, Xuchan Bao, Martín Soto, Nathan Labenz, Owain Evans
ICLR 2025 Tell Me About Yourself: LLMs Are Aware of Their Learned Behaviors Jan Betley, Xuchan Bao, Martín Soto, Anna Sztyber-Betley, James Chua, Owain Evans
NeurIPSW 2024 Language Models Can Articulate Their Implicit Goals Jan Betley, Xuchan Bao, Martín Soto, Anna Sztyber-Betley, James Chua, Owain Evans
TMLR 2023 Finding and Only Finding Differential Nash Equilibria by Both Pretending to Be a Follower Xuchan Bao, Guodong Zhang
ICMLW 2023 Statistics Estimation in Neural Network Training: A Recursive Identification Approach Ruth Crasto, Xuchan Bao, Roger Baker Grosse
ICLRW 2022 Finding and Only Finding Local Nash Equilibria by Both Pretending to Be a Follower Xuchan Bao, Guodong Zhang
JMLR 2021 A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints Guodong Zhang, Xuchan Bao, Laurent Lessard, Roger Grosse
NeurIPS 2021 Learning to Elect Cem Anil, Xuchan Bao
NeurIPS 2020 Regularized Linear Autoencoders Recover the Principal Components, Eventually Xuchan Bao, James Lucas, Sushant Sachdeva, Roger B Grosse
ICLR 2019 TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer Sicong Huang, Qiyang Li, Cem Anil, Xuchan Bao, Sageev Oore, Roger B. Grosse