Towards Better IncomLDL: We Are Unaware of Hidden Labels in Advance

Authors

  • Jiecheng Jiang School of Computer Science and Engineering, Southeast University, Nanjing 210096, China School of Computing, National University of Singapore, Singapore
  • Jiawei Tang School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
  • Jiahao Jiang School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
  • Hui Liu School of Computing Information Sciences, Saint Francis University, Hong Kong, China
  • Junhui Hou Department of Computer Science, City University of Hong Kong, Hong Kong, China
  • Yuheng Jia School of Computer Science and Engineering, Southeast University, Nanjing 210096, China School of Computing Information Sciences, Saint Francis University, Hong Kong, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China

DOI:

https://doi.org/10.1609/aaai.v40i27.39389

Abstract

Label distribution learning (LDL) is a novel paradigm that describe the samples by label distribution of a sample. However, acquiring LDL dataset is costly and time-consuming, which leads to the birth of incomplete label distribution learning (IncomLDL). All the previous IncomLDL methods set the description degrees of "missing" labels in an instance to 0, but remains those of other labels unchanged. This setting is unrealistic because when certain labels are missing, the degrees of the remaining labels will increase accordingly. We fix this unrealistic setting in IncomLDL and raise a new problem: LDL with hidden labels (HidLDL), which aims to recover a complete label distribution from a real-world incomplete label distribution where certain labels in an instance are omitted during annotation. To solve this challenging problem, we discover the significance of proportional information of the observed labels and capture it by an innovative constraint to utilize it during the optimization process. We simultaneously use local feature similarity and the global low-rank structure to reveal the mysterious veil of hidden labels. Moreover, we **theoretically** give the recovery bound of our method, proving the feasibility of our method in learning from hidden labels. Extensive recovery and predictive experiments on various datasets prove the superiority of our method to state-of-the-art LDL and IncomLDL methods.

Downloads

Published

2026-03-14

How to Cite

Jiang, J., Tang, J., Jiang, J., Liu, H., Hou, J., & Jia, Y. (2026). Towards Better IncomLDL: We Are Unaware of Hidden Labels in Advance. Proceedings of the AAAI Conference on Artificial Intelligence, 40(27), 22318-22326. https://doi.org/10.1609/aaai.v40i27.39389

Issue

Section

AAAI Technical Track on Machine Learning IV