Sample Weighted Incomplete Multimodal Clustering Based on Graph Coarsening Label Extraction

Authors

  • Zhenjiao Liu Inspur Cloud Information Technology Co., Ltd.
  • Xue Xiao Inspur Cloud Information Technology Co., Ltd.
  • Yao Chen Inspur Cloud Information Technology Co., Ltd.
  • Jiao Xue Inspur Cloud Information Technology Co., Ltd.
  • Shubin Ma Dalian University of Technology
  • Liang Zhao Dalian University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i28.39575

Abstract

Multimodal data is typically collected through heterogeneous sensors and processing pipelines. However, due to variations in acquisition environments, device capabilities, and feature extraction methods, such data often suffers from incompleteness and inconsistent quality across modalities. To address these challenges, prior studies have explored modality selection and data completion strategies to improve information fusion. Nevertheless, these approaches face two main limitations: (1) they struggle to simultaneously ensure computational efficiency for large-scale graph data and maintain structural and semantic consistency across heterogeneous modality graphs; and (2) most of them operate at the modality level and fail to capture fine-grained, sample-specific quality variations. To overcome these issues, we propose a novel clustering framework, Sample Weighted Incomplete Multimodal Clustering Based on Graph Coarsening Label Extraction (IMC-GCSW). The proposed method introduces a graph coarsening-based label extraction strategy. It significantly reduces the computational cost of multimodal graph processing, while preserving key node information and local topological structures. Furthermore, a quality-aware sample weighting strategy is designed to enable fine-grained modeling of modality-specific data quality, allowing the model to dynamically suppress the influence of low-quality modalities on individual samples. Experiments on both general-purpose datasets and the Fructus Aurantii Disease and Pest Datasets demonstrate that the proposed method exhibits superior performance and strong adaptability in handling multimodal data with incompleteness and quality inconsistency.

Downloads

Published

2026-03-14

How to Cite

Liu, Z., Xiao, X., Chen, Y., Xue, J., Ma, S., & Zhao, L. (2026). Sample Weighted Incomplete Multimodal Clustering Based on Graph Coarsening Label Extraction. Proceedings of the AAAI Conference on Artificial Intelligence, 40(28), 23981–23989. https://doi.org/10.1609/aaai.v40i28.39575

Issue

Section

AAAI Technical Track on Machine Learning V