Tao, Junyi

1 publications

ICML 2025 Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors Jing Huang, Junyi Tao, Thomas Icard, Diyi Yang, Christopher Potts