Hu, Baotian
15 publications
ICLR
2026
Is On-Policy Data Always the Best Choice for Direct Preference Optimization-Based LM Alignment?
ICLR
2026
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire a Versatile Embedding Model
AAAI
2025
CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models
ICML
2024
VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context