Bhojanapalli, Srinadh
34 publications
NeurIPS
2024
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
ICMLW
2024
Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers
NeurIPS
2020
O(n) Connections Are Expressive Enough: Universal Approximability of Sparse Transformers