Luo, Tao
16 publications
NeurIPS
2025
From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics
ICLR
2023
MA-BERT: Towards Matrix Arithmetic-Only BERT Inference by Eliminating Complex Non-Linear Functions