Xu, Tian
13 publications
NeurIPSW
2024
Entropic Distribution Matching for Supervised Fine-Tuning of LLMs: Less Overfitting and Better Diversity
NeurIPS
2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation