Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles
Abstract
In this study, we delve into an emerging optimization challenge involving a black-box objective function that can only be gauged via a ranking oracle—a situation frequently encountered in real-world scenarios, especially when the function is evaluated by human judges. A prominent instance of such a situation is Reinforcement Learning with Human Feedback (RLHF), an approach recently employed to enhance the performance of Large Language Models (LLMs) using human guidance \citep{ouyang2022training,liu2023languages,chatgpt,bai2022training}. We introduce ZO-RankSGD, an innovative zeroth-order optimization algorithm designed to tackle this optimization problem, accompanied by theoretical assurances. Our algorithm utilizes a novel rank-based random estimator to determine the descent direction and guarantees convergence to a stationary point. We demonstrate the effectiveness of ZO-RankSGD in a novel application: improving the quality of images generated by a diffusion generative model with human ranking feedback. Throughout experiments, we found that ZO-RankSGD can significantly enhance the detail of generated images with only a few rounds of human feedback. Overall, our work advances the field of zeroth-order optimization by addressing the problem of optimizing functions with only ranking feedback, and offers a new and effective approach for aligning Artificial Intelligence (AI) with human intentions.
Cite
Text
Tang et al. "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles." ICML 2023 Workshops: MFPL, 2023.Markdown
[Tang et al. "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles." ICML 2023 Workshops: MFPL, 2023.](https://mlanthology.org/icmlw/2023/tang2023icmlw-zerothorder/)BibTeX
@inproceedings{tang2023icmlw-zerothorder,
title = {{Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles}},
author = {Tang, Zhiwei and Rybin, Dmitry and Chang, Tsung-Hui},
booktitle = {ICML 2023 Workshops: MFPL},
year = {2023},
url = {https://mlanthology.org/icmlw/2023/tang2023icmlw-zerothorder/}
}