Chen, Brian K

2 publications

ICML 2024 Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers Brian K Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi
NeurIPSW 2024 In-Context Learning Behaves as a Greedy Layer-Wise Gradient Descent Algorithm Brian K Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi