Bai, Yu
58 publications
NeurIPS
2025
Accelerated Vertical Federated Adversarial Learning Through Decoupling Layer-Wise Dependencies
NeurIPSW
2024
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
NeurIPS
2023
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations
NeurIPSW
2023
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations
NeurIPSW
2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
NeurIPSW
2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
NeurIPS
2023
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection
ICMLW
2023
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection
NeurIPS
2022
Identifying Good Directions to Escape the NTK Regime and Efficiently Learn Low-Degree Plus Sparse Polynomials
IJCAI
2022
Stage-Wise Stylistic Headline Generation: Style Generation and Summarized Content Insertion
ICLR
2022
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?