Plurality of Value Pluralism and AI Value Alignment

Abstract

AI value alignment efforts increasingly emphasize value pluralism, but implementing value pluralism itself involves contested choices. This paper introduces a two-level framework distinguishing between first-order value choices (implementing specific accounts of values) and second-order value choices (determining the legitimacy of these first-order value selections and implementations). I argue that genuine pluralistic value alignment requires explicit engagement with both levels. While first-order choices involve defining and measuring values, second-order choices address who has legitimate authority to make first-order value decisions and through what processes. The framework yields two critical insights by decomposing value pluralism into distinct components. First, it helps prevent "pluralistic value-washing," where superficial appeals to insignificant pluralism could mask fundamentally monistic alignment approaches. Second, it reveals that there is no single "correct" implementation of value pluralism: attempts to converge on "pluralism" as a universal good approach fundamentally contradict pluralistic principles themselves. To enable more meaningful tracking of pluralistic value alignment in both single and multi-agent AI systems, I propose developing "value cards" based on the components of this normative framework.

Cite

Text

Kasirzadeh. "Plurality of Value Pluralism and AI Value Alignment." NeurIPS 2024 Workshops: Pluralistic-Alignment, 2024.

Markdown

[Kasirzadeh. "Plurality of Value Pluralism and AI Value Alignment." NeurIPS 2024 Workshops: Pluralistic-Alignment, 2024.](https://mlanthology.org/neuripsw/2024/kasirzadeh2024neuripsw-plurality/)

BibTeX

@inproceedings{kasirzadeh2024neuripsw-plurality,
  title     = {{Plurality of Value Pluralism and AI Value Alignment}},
  author    = {Kasirzadeh, Atoosa},
  booktitle = {NeurIPS 2024 Workshops: Pluralistic-Alignment},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/kasirzadeh2024neuripsw-plurality/}
}