Generative Label Enhancement with Gaussian Mixture and Partial Ranking
DOI:
https://doi.org/10.1609/aaai.v37i7.26078Keywords:
ML: Multi-Class/Multi-Label Learning & Extreme Classification, ML: Unsupervised & Self-Supervised LearningAbstract
Label distribution learning (LDL) is an effective learning paradigm for dealing with label ambiguity. When applying LDL, the datasets annotated with label distributions (i.e., the real-valued vectors like the probability distribution) are typically required. Unfortunately, most existing datasets only contain the logical labels, and manual annotating with label distributions is costly. To address this problem, we treat the label distribution as a latent vector and infer its posterior by variational Bayes. Specifically, we propose a generative label enhancement model to encode the process of generating feature vectors and logical label vectors from label distributions in a principled way. In terms of features, we assume that the feature vector is generated by a Gaussian mixture dominated by the label distribution, which captures the one-to-many relationship from the label distribution to the feature vector and thus reduces the feature generation error. In terms of logical labels, we design a probability distribution to generate the logical label vector from a label distribution, which captures partial label ranking in the logical label vector and thus provides a more accurate guidance for inferring the label distribution. Besides, to approximate the posterior of the label distribution, we design a inference model, and derive the variational learning objective. Finally, extensive experiments on real-world datasets validate our proposal.Downloads
Published
2023-06-26
How to Cite
Lu, Y., He, L., Min, F., Li, W., & Jia, X. (2023). Generative Label Enhancement with Gaussian Mixture and Partial Ranking. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 8975-8983. https://doi.org/10.1609/aaai.v37i7.26078
Issue
Section
AAAI Technical Track on Machine Learning II