Probability-Density-aware Semi-supervised Learning
DOI:
https://doi.org/10.1609/aaai.v39i18.34085Abstract
In Semi-supervised learning(SSL), we always accept cluster assumption, assuming features in different high-density regions belong to other categories. However, it is always ignored by existing algorithms and needs mathematical explanations. This paper first proposes a theorem to statistically explain cluster assumption and prove that the probability density can significantly help to use the prior fully. A Probability-Density-Aware Measure(PM) is proposed based on the theorem to discern the similarity between neighbor points. The PM is deployed to improve Label Propagation and a new pseudo-labeling algorithm, the Probability-Density-Aware Label Propagation(PMLP), is proposed. We also prove that traditional first-order similarity pseudo-labeling could be viewed as a particular case of PMLP, which provides a comprehensive theoretical understanding of PMLP's superior performance. Extensive experiments demonstrate that PMLP achieves outstanding performance compared with other recent methods.Downloads
Published
2025-04-11
How to Cite
Liu, S., Zheng, R., Shen, Y., Yu, Z., Li, K., Sun, X., & Lin, S. (2025). Probability-Density-aware Semi-supervised Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(18), 18943–18950. https://doi.org/10.1609/aaai.v39i18.34085
Issue
Section
AAAI Technical Track on Machine Learning IV