ML Anthology
Authors
Search
About
Bao, Xuchan
11 publications
ICML
2025
Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs
Jan Betley
,
Daniel Chee Hian Tan
,
Niels Warncke
,
Anna Sztyber-Betley
,
Xuchan Bao
,
Martı́n Soto
,
Nathan Labenz
,
Owain Evans
ICLRW
2025
Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs
Jan Betley
,
Daniel Chee Hian Tan
,
Niels Warncke
,
Anna Sztyber-Betley
,
Xuchan Bao
,
Martín Soto
,
Nathan Labenz
,
Owain Evans
ICLR
2025
Tell Me About Yourself: LLMs Are Aware of Their Learned Behaviors
Jan Betley
,
Xuchan Bao
,
Martín Soto
,
Anna Sztyber-Betley
,
James Chua
,
Owain Evans
NeurIPSW
2024
Language Models Can Articulate Their Implicit Goals
Jan Betley
,
Xuchan Bao
,
Martín Soto
,
Anna Sztyber-Betley
,
James Chua
,
Owain Evans
TMLR
2023
Finding and Only Finding Differential Nash Equilibria by Both Pretending to Be a Follower
Xuchan Bao
,
Guodong Zhang
ICMLW
2023
Statistics Estimation in Neural Network Training: A Recursive Identification Approach
Ruth Crasto
,
Xuchan Bao
,
Roger Baker Grosse
ICLRW
2022
Finding and Only Finding Local Nash Equilibria by Both Pretending to Be a Follower
Xuchan Bao
,
Guodong Zhang
JMLR
2021
A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints
Guodong Zhang
,
Xuchan Bao
,
Laurent Lessard
,
Roger Grosse
NeurIPS
2021
Learning to Elect
Cem Anil
,
Xuchan Bao
NeurIPS
2020
Regularized Linear Autoencoders Recover the Principal Components, Eventually
Xuchan Bao
,
James Lucas
,
Sushant Sachdeva
,
Roger B Grosse
ICLR
2019
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang
,
Qiyang Li
,
Cem Anil
,
Xuchan Bao
,
Sageev Oore
,
Roger B. Grosse