Learning Label Distribution with Dirichlet Process Mixture Model

Authors

  • Minglong Wang, Nanjing University of Aeronautics and Astronautics
  • Weiwei Li, Nanjing University of Aeronautics and Astronautics
  • Yunan Lu, The Hong Kong Polytechnic University; Nanjing University of Science and Technology
  • Xiuyi Jia, Nanjing University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v40i31.39846

Abstract

Label Distribution Learning (LDL) is an effective machine learning paradigm for addressing label ambiguity, where each sample is annotated with a distribution that conveys rich semantic information. However, during the actual annotation process of label distributions, annotators often exhibit divergent labeling preferences for the same sample. Most existing LDL methods overlook this heterogeneity, assuming that the observed label distribution originates from a single labeling pattern. Such an assumption limits their capacity to manage inter-annotator disagreement and constrains the generalization of the resulting models. To address this issue, we propose, for the first time, a Dirichlet process mixture model (DPMM)-based framework for LDL. This framework leverages nonparametric Bayesian methods to adaptively uncover diverse latent labeling patterns from the data and to accurately model annotator heterogeneity. Specifically, the ground-truth label distribution of each sample is modeled as a weighted mixture of multiple latent components, where a feature-conditioned gating mechanism adaptively controls the contribution of each component. Experimental results demonstrate that the proposed model consistently achieves competitive performance on several widely used benchmark datasets.
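
The mixture described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the truncated stick-breaking gate, the linear-softmax components, and all names (`stick_breaking_gate`, `mixture_label_distribution`, `W_gate`, `W_comp`, the truncation level K) are assumptions chosen only to show how feature-conditioned weights might combine several latent label distributions into one.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax along the given axis.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def stick_breaking_gate(x, W_gate):
    """Feature-conditioned mixture weights (assumed form, truncated at K).

    x: (n, d) features; W_gate: (d, K-1) hypothetical gate parameters.
    Returns an (n, K) array whose rows are nonnegative and sum to 1.
    """
    v = 1.0 / (1.0 + np.exp(-(x @ W_gate)))   # break fractions in (0, 1)
    remaining = np.cumprod(1.0 - v, axis=1)   # stick left after each break
    pi = np.empty((x.shape[0], W_gate.shape[1] + 1))
    pi[:, 0] = v[:, 0]
    pi[:, 1:-1] = v[:, 1:] * remaining[:, :-1]
    pi[:, -1] = remaining[:, -1]              # last component takes the rest
    return pi

def mixture_label_distribution(x, W_gate, W_comp):
    """Weighted mixture of K component label distributions.

    W_comp: (K, d, c) hypothetical per-component parameters, c labels.
    Returns an (n, c) array of predicted label distributions.
    """
    pi = stick_breaking_gate(x, W_gate)                           # (n, K)
    comp = softmax(np.einsum("nd,kdc->nkc", x, W_comp), axis=-1)  # (n, K, c)
    return np.einsum("nk,nkc->nc", pi, comp)

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 4))                # 5 samples, 4 features
W_gate = rng.normal(size=(4, 2))           # K = 3 components
W_comp = rng.normal(size=(3, 4, 6))        # 6 labels
d_hat = mixture_label_distribution(x, W_gate, W_comp)
```

Because each component output and each row of gate weights lies on a probability simplex, every row of `d_hat` is itself a valid label distribution. A nonparametric treatment would infer the effective number of components from the data rather than fixing K as this sketch does.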

Published

2026-03-14

How to Cite

Wang, M., Li, W., Lu, Y., & Jia, X. (2026). Learning Label Distribution with Dirichlet Process Mixture Model. Proceedings of the AAAI Conference on Artificial Intelligence, 40(31), 26398–26406. https://doi.org/10.1609/aaai.v40i31.39846

Issue

Vol. 40 No. 31

Section

AAAI Technical Track on Machine Learning VIII