Modeling the Plurality of Human Preferences via Ideal Points
Abstract
Large foundation models require extensive \textit{alignment} to human preferences before deployment. Existing methods utilize the Bradley-Terry-Luce (BTL) model \cite{bradley1952rank} and often assume a universal preference, neglecting the diversity of individual opinions. We introduce \PAL, a framework that models the plurality of human preferences using the ideal point model and mixture modeling. \PAL captures the plurality while learning a common preference latent space, enabling few-shot generalization to new users. With simple multi-layer perceptron, \PAL achieves competitive reward model accuracy on Summary \cite{stiennon2020learning} (language), Pick-a-Pic \cite{kirstain2024pick} (image generation), and Persona \cite{perez2022discovering} (semisynthetic) induced heterogeneous datasets, matching state-of-the-art performance with greater efficiency. Lastly, our findings highlight the need for more nuanced data collection to capture the heterogeneity of human preferences.
Cite
Text
Chen et al. "Modeling the Plurality of Human Preferences via Ideal Points." ICML 2024 Workshops: MFHAIA, 2024.Markdown
[Chen et al. "Modeling the Plurality of Human Preferences via Ideal Points." ICML 2024 Workshops: MFHAIA, 2024.](https://mlanthology.org/icmlw/2024/chen2024icmlw-modeling/)BibTeX
@inproceedings{chen2024icmlw-modeling,
title = {{Modeling the Plurality of Human Preferences via Ideal Points}},
author = {Chen, Daiwei and Chen, Yi and Rege, Aniket and Vinayak, Ramya Korlakai},
booktitle = {ICML 2024 Workshops: MFHAIA},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/chen2024icmlw-modeling/}
}