Yao, Wei
15 publications
NeurIPS
2025
Bilevel Optimization for Adversarial Learning Problems: Sharpness, Generation, and Beyond
ICLR
2025
Super(ficial)-Alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
15 publications