Sabolčec, Vinko

3 publications

NeurIPS 2025. Enhancing Multilingual LLM Pretraining with Model-Based Data Selection. Bettina Messmer, Vinko Sabolčec, Martin Jaggi.
ICLRW 2025. Enhancing Multilingual LLM Pretraining with Model-Based Data Selection. Bettina Messmer, Vinko Sabolčec, Martin Jaggi.
NeurIPS 2025. URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training. Dongyang Fan, Vinko Sabolčec, Martin Jaggi.