Forward KL Regularized Preference Optimization for Aligning Diffusion Policies

Cite

Text

Shan et al. "Forward KL Regularized Preference Optimization for Aligning Diffusion Policies." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I13.33576

Markdown

[Shan et al. "Forward KL Regularized Preference Optimization for Aligning Diffusion Policies." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/shan2025aaai-forward/) doi:10.1609/AAAI.V39I13.33576

BibTeX

@inproceedings{shan2025aaai-forward,
  title     = {{Forward KL Regularized Preference Optimization for Aligning Diffusion Policies}},
  author    = {Shan, Zhao and Fan, Chenyou and Qiu, Shuang and Shi, Jiyuan and Bai, Chenjia},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {14386--14395},
  doi       = {10.1609/AAAI.V39I13.33576},
  url       = {https://mlanthology.org/aaai/2025/shan2025aaai-forward/}
}