Khalitov, Ruslan

2 publications

ICLR 2023 ChordMixer: A Scalable Neural Attention Model for Sequences with Different Length Ruslan Khalitov, Tong Yu, Lei Cheng, Zhirong Yang
CVPR 2022 Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention Tong Yu, Ruslan Khalitov, Lei Cheng, Zhirong Yang