Su, Zhan

7 publications

ICLRW 2025 Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts Samin Yeasar Arnob, Zhan Su, Minseon Kim, Oleksiy Ostapenko, Doina Precup, Lucas Caccia, Alessandro Sordoni
TMLR 2024 Mixture of Latent Experts Using Tensor Products Zhan Su, Fengran Mo, Prayag Tiwari, Benyou Wang, Qiuchi Li, Jian-Yun Nie, Jakob Grue Simonsen
ICML 2024 Towards Modular LLMs by Building and Reusing a Library of LoRAs Oleksiy Ostapenko, Zhan Su, Edoardo Ponti, Laurent Charlin, Nicolas Le Roux, Lucas Caccia, Alessandro Sordoni
NeurIPSW 2023 A Case Study of Instruction Tuning with Mixture of Parameter-Efficient Experts Oleksiy Ostapenko, Lucas Caccia, Zhan Su, Nicolas Le Roux, Laurent Charlin, Alessandro Sordoni
NeurIPS 2023 Multi-Head Adapter Routing for Cross-Task Generalization Lucas Page-Caccia, Edoardo Maria Ponti, Zhan Su, Matheus Pereira, Nicolas Le Roux, Alessandro Sordoni
AAAI 2019 A Generalized Language Model in Tensor Space Lipeng Zhang, Peng Zhang, Xindian Ma, Shuqin Gu, Zhan Su, Dawei Song
AAAI 2018 End-to-End Quantum-like Language Models with Application to Question Answering Peng Zhang, Jiabin Niu, Zhan Su, Benyou Wang, Liqun Ma, Dawei Song