Shen, Yingli

1 publications

NeurIPS 2025 DCAD-2000: A Multilingual Dataset Across 2000+ Languages with Data Cleaning as Anomaly Detection Wen Lai, Yingli Shen, Shuo Wang, Xueren Zhang, Kangyang Luo, Alexander Fraser, Maosong Sun