Sun, Zetian
4 publications
ICLR
2026
Is On-Policy Data Always the Best Choice for Direct Preference Optimization-Based LM Alignment?
ICLR
2026
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire a Versatile Embedding Model
4 publications