Kajitsuka, Tokio

2 publications

ICLR 2025 On the Optimal Memorization Capacity of Transformers Tokio Kajitsuka, Issei Sato
ICLR 2024 Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators? Tokio Kajitsuka, Issei Sato