ALPHA: Action-Based Learning for Pluralistic Human Alignment in Large Language Models

Authors

  • Aanisha Bhattacharyya Adobe Media and Data Science Research (MDSR)
  • Susmit Agrawal Adobe Media and Data Science Research (MDSR)
  • Yaman Kumar Singla Adobe Media and Data Science Research (MDSR)
  • Tarun Ram Menta Adobe Media and Data Science Research (MDSR)
  • Nikitha Sr Adobe Media and Data Science Research (MDSR)
  • Rajiv Ratn Shah Indraprastha Institute of Information Technology Delhi (IIIT Delhi)
  • Changyou Chen State University of New York at Buffalo
  • Balaji Krishnamurthy Adobe Media and Data Science Research (MDSR)

DOI:

https://doi.org/10.1609/aaai.v40i44.41056

Abstract

Large language models are widely used, but aligning them with societal values remains challenging. Current approaches often rely on human annotations, which are hard to scale, or synthetic data produced by models that may themselves be misaligned, making it difficult to capture genuine public opinion. This limits scalability and introduces demographic biases that reduce the representativeness and fairness of model behavior. We introduce a novel approach to pluralistic alignment through behavioral learning, grounded in the psychological principle that actions (behavior) have strong consistency with opinions. Specifically, we present ALPHA50M, a dataset of over 50 million samples derived from 1.5 million real-world advertisements, incorporating rich behavioral signals inferred from demographic engagement patterns. Models trained on this data achieve state-of-the-art zero-shot performance on diverse alignment benchmarks spanning cultural reasoning, political views, and social values. We also propose two new benchmarks: OpinionQA-XL, which covers surveys across 100+ societal topics, and GSS, which evaluates temporal opinion shift modeling over decades. Our results demonstrate that learning from behavioral signals, derived from observed human actions, enables models to align with diverse demographic opinions, capture underlying social and cultural norms, and generalize to new topics and surveys beyond training data. This behavioral learning paradigm offers a scalable and demographically broad alternative to existing alignment techniques.

Downloads

Published

2026-03-14

How to Cite

Bhattacharyya, A., Agrawal, S., Singla, Y. K., Menta, T. R., Sr, N., Shah, R. R., … Krishnamurthy, B. (2026). ALPHA: Action-Based Learning for Pluralistic Human Alignment in Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 37249–37258. https://doi.org/10.1609/aaai.v40i44.41056

Issue

Section

AAAI Special Track on AI Alignment