Awasthi, Pranjal
64 publications
NeurIPSW
2024
Majority Kernels: An Approach to Leverage Big Model Dynamics for Efficient Small Model Training
NeurIPS
2024
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
ICMLW
2024
Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers