Liang, Weixin
21 publications
TMLR
2025
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
ICLRW
2025
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
NeurIPS
2025
ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code
ICLR
2024
Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace
ICML
2023
Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations
NeurIPSW
2023
Navigating Dataset Documentation in ML: A Large-Scale Analysis of Dataset Cards on Hugging Face