Gao, Chaochen

4 publications

NeurIPS 2025 LongMagpie: A Self-Synthesis Method for Generating Large-Scale Long-Context Instructions Chaochen Gao, Xing W, Zijia Lin, Debing Zhang, Songlin Hu
ICML 2025 NExtLong: Toward Effective Long-Context Training Without Long Documents Chaochen Gao, Xing W, Zijia Lin, Debing Zhang, Songlin Hu
ICLR 2025 Quest: Query-Centric Data Synthesis Approach for Long-Context Scaling of Large Language Model Chaochen Gao, Xing Wu, Qi Fu, Songlin Hu
ICMLW 2022 Boosting Monolingual Sentence Representation with Large-Scale Parallel Translation Datasets Jue Wang, Haofan Wang, Xing Wu, Chaochen Gao, Debing Zhang