Pareto Multi-Objective Alignment for Language Models

He, Qiang; Maghsudi, Setareh

doi:10.1007/978-3-032-06078-5_15

Pareto Multi-Objective Alignment for Language Models

Qiang He, Setareh Maghsudi

ECML-PKDD 2025 pp. 257-272

doi:10.1007/978-3-032-06078-5_15 /ecmlpkdd/2025/he2025ecmlpkdd-pareto/

Abstract

Large language models (LLMs) are increasingly deployed in real-world applications that require careful balancing of multiple, often conflicting, objectives, such as informativeness versus conciseness, or helpfulness versus creativity. However, current alignment methods, primarily based on reinforcement learning from human feedback (RLHF), optimize LLMs toward a single reward function, resulting in rigid behavior that fails to capture the complexity and diversity of human preferences. This limitation hinders the adaptability of LLMs to practical scenarios, making multi-objective alignment (MOA) a critical yet underexplored area. To bridge this gap, we propose PA reto M ulti-Objective A lignment (PAMA), a principled and computationally efficient algorithm designed explicitly for MOA in LLMs. In contrast to computationally prohibitive gradient-based multi-objective optimization (MOO) methods, PAMA transforms multi-objective RLHF into a convex optimization problem with a closed-form solution, significantly enhancing scalability. Traditional gradient-based MOO approaches suffer from prohibitive $\mathcal {O}(n^2d)$ O ( n 2 d ) complexity, where d represents the number of model parameters, typically in the billions for LLMs, rendering direct optimization infeasible. PAMA reduces this complexity to $\mathcal {O}(n)$ O ( n ) where n is the number of objectives, enabling optimization to be completed within milliseconds. We provide theoretical guarantees that PAMA converges to a Pareto stationary point, where no objective can be improved without degrading at least one other. Extensive experiments across language models ranging from 125M to 7B parameters demonstrate PAMA’s robust and effective multi-objective alignment capabilities, consistently outperforming baseline methods, aligning with its theoretical advantages. PAMA provides a highly efficient solution to the MOA problem that was previously considered intractable, offering a practical and theoretically grounded approach to aligning LLMs with diverse human values, paving the way for versatile and adaptable real-world AI deployments.

PDF ECML-PKDD Semantic Scholar

Cite

Text

He and Maghsudi. "Pareto Multi-Objective Alignment for Language Models." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06078-5_15

Markdown

[He and Maghsudi. "Pareto Multi-Objective Alignment for Language Models." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/he2025ecmlpkdd-pareto/) doi:10.1007/978-3-032-06078-5_15

BibTeX

@inproceedings{he2025ecmlpkdd-pareto,
  title     = {{Pareto Multi-Objective Alignment for Language Models}},
  author    = {He, Qiang and Maghsudi, Setareh},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2025},
  pages     = {257-272},
  doi       = {10.1007/978-3-032-06078-5_15},
  url       = {https://mlanthology.org/ecmlpkdd/2025/he2025ecmlpkdd-pareto/}
}