Yu, Zichun

3 publications

NeurIPS 2025 Group-Level Data Selection for Efficient Pretraining Zichun Yu, Fei Peng, Jie Lei, Arnold Overwijk, Wen-tau Yih, Chenyan Xiong
ICLR 2025 Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning Xiaochuan Li, Zichun Yu, Chenyan Xiong
NeurIPS 2024 MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models Zichun Yu, Spandan Das, Chenyan Xiong