PPO-CLIP Attains Global Optimality: Towards Deeper Understandings of Clipping

Cite

Text

Huang et al. "PPO-CLIP Attains Global Optimality: Towards Deeper Understandings of Clipping." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I11.29154

Markdown

[Huang et al. "PPO-CLIP Attains Global Optimality: Towards Deeper Understandings of Clipping." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/huang2024aaai-ppo/) doi:10.1609/AAAI.V38I11.29154

BibTeX

@inproceedings{huang2024aaai-ppo,
  title     = {{PPO-CLIP Attains Global Optimality: Towards Deeper Understandings of Clipping}},
  author    = {Huang, Nai-Chieh and Hsieh, Ping-Chun and Ho, Kuo-Hao and Wu, I-Chen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {12600--12607},
  doi       = {10.1609/AAAI.V38I11.29154},
  url       = {https://mlanthology.org/aaai/2024/huang2024aaai-ppo/}
}