ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models Using Pareto High-Quality Data

Haoran Gu; Handing Wang; Yi Mei; Mengjie Zhang; Yaochu Jin

doi:10.1609/aaai.v40i21.38799

ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models Using Pareto High-Quality Data

Authors

Haoran Gu Xidian University
Handing Wang Xidian University
Yi Mei Victoria University of Wellington
Mengjie Zhang Victoria University of Wellington
Yaochu Jin Westlake University

DOI:

https://doi.org/10.1609/aaai.v40i21.38799

Abstract

Aligning large language models with multiple human expectations and values is crucial for ensuring that they adequately serve a variety of user needs. To this end, offline multiobjective alignment algorithms such as the Rewards-in-Context algorithm have shown strong performance and efficiency. However, inappropriate preference representations and training with imbalanced reward scores limit the performance of such algorithms. In this work, we introduce ParetoHqD that addresses the above issues by representing human preferences as preference directions in the objective space and regarding data near the Pareto front as ''high-quality'' data. For each preference, ParetoHqD follows a two-stage supervised fine-tuning process, where each stage uses an individual Pareto high-quality training set that best matches its preference direction. The experimental results have demonstrated the superiority of ParetoHqD over five baselines on two multiobjective alignment tasks.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

PDF
Poster

Published

2026-03-14

How to Cite

Gu, H., Wang, H., Mei, Y., Zhang, M., & Jin, Y. (2026). ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models Using Pareto High-Quality Data. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17454–17462. https://doi.org/10.1609/aaai.v40i21.38799

Download Citation

Issue

Vol. 40 No. 21: AAAI-26 Technical Tracks 21

Section

AAAI Technical Track on Humans and AI

ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models Using Pareto High-Quality Data

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information