Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment

Cite

Text

Cai et al. "Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I22.34519

Markdown

[Cai et al. "Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/cai2025aaai-approximated/) doi:10.1609/AAAI.V39I22.34519

BibTeX

@inproceedings{cai2025aaai-approximated,
  title     = {{Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment}},
  author    = {Cai, Yuang and Yuan, Yuyu and Shi, Jinsheng and Lin, Qinhong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {23505-23513},
  doi       = {10.1609/AAAI.V39I22.34519},
  url       = {https://mlanthology.org/aaai/2025/cai2025aaai-approximated/}
}