Xiao, Jiancong
12 publications
ICLR
2025
Magnetic Preference Optimization: Achieving Last-Iterate Convergence for Language Model Alignment
ICML
2025
Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach
NeurIPSW
2024
Entropic Distribution Matching for Supervised Fine-Tuning of LLMs: Less Overfitting and Better Diversity