Li, Dongfang
9 publications
ICLR
2026
Is On-Policy Data Always the Best Choice for Direct Preference Optimization-Based LM Alignment?
ICLR
2026
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire a Versatile Embedding Model