Rethinking Multi-Instance Learning Through Graph-Driven Fusion: A Dual-Path Approach to Adaptive Representation

Authors

  • Yu-Xuan Zhang School of Information Science and Technology, Southwest Jiaotong University
  • Zhengchun Zhou School of Information Science and Technology, Southwest Jiaotong University
  • Weisha Liu SWJTU-Leeds Joint School, Southwest Jiaotong University
  • Mingxing Zhang School of Mathematics, Southwest Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v40i34.40081

Abstract

Multi-instance learning (MIL) has become a powerful paradigm for weakly supervised learning tasks, where each sample is a bag of unlabeled instances with only the bag-level label. While graph-based MIL methods enhance bag topological structure modeling, they often suffer from high computation costs and limited representation due to rigid graph construction and insufficient integration of intra-bag semantics. To address these challenges, we propose GDF-MIL, a novel graph-driven MIL framework, which introduces a dual-path feature fusion mechanism to adaptively balance topological structure modeling and semantic feature preservation. First, the adaptive bag mapping module (ABMM) performs soft clustering to extract compact and informative representations. Subsequently, a dynamic graph structure learning (DGSL) component efficiently learns sparse topological structures via weighted connectivity, aggregating them into a comprehensive graph-level representation. Finally, to balance fast graph construction and bag-level knowledge, dual-path feature fusion (DPFF) employs a dual-path gating mechanism to integrate both types of features, which are then passed to the classifier for bag label prediction. Extensive experiments on twenty-four datasets across four domains show that GDF-MIL significantly outperforms eighteen state-of-the-art methods on the majority of datasets.

Downloads

Published

2026-03-14

How to Cite

Zhang, Y.-X., Zhou, Z., Liu, W., & Zhang, M. (2026). Rethinking Multi-Instance Learning Through Graph-Driven Fusion: A Dual-Path Approach to Adaptive Representation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 28510–28518. https://doi.org/10.1609/aaai.v40i34.40081

Issue

Section

AAAI Technical Track on Machine Learning XI