Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation

Authors

  • Jiarui Yang Shanghai Key Lab of Intell. Info. Processing, School of Computer Science, Fudan University Shanghai Collaborative Innovation Center on Intelligent Visual Computing Institute of Information Engineering, CAS
  • Chuan Wang School of Computer Science and Technology, Beijing JiaoTong University Institute of Information Engineering, CAS Guangdong Provincial Key Lab of Intell. Info. Processing & Shenzhen Key Lab of Media Security, Shenzhen University
  • Jun Zhang Institute of Information Engineering, CAS
  • Shuyi Wu Information Research Center of Military Science, PLA Academy of Military Science
  • Jinjing Zhao National Key Laboratory of Science and Technology on Information System Security
  • Zeming Liu School of Computer Science and Engineering, Beihang University
  • Liang Yang School of Artificial Intelligence, Hebei University of Technology

DOI:

https://doi.org/10.1609/aaai.v39i9.32998

Abstract

Scene Graph Generation (SGG) aims to detect all objects and identify their pairwise relationships existing in the scene. Considering the substantial human labor costs, existing scene graph annotations are often sparse and biased, which result in confusion training with low-frequency predicates. In this work, we design a Semi-Supervised Clustering framework for Scene Graph Generation (SSC-SGG) that uses the sparse labeled data to guide the generation of effective pseudo-labels from unlabeled object pairs, thus enriching the labeled sample space, especially for low-frequency interaction samples. We approach from the perspective of clustering, reducing the problem of confirmation bias in a self-training manner. Specifically, we first enhance the model's robustness to feature extraction via prototype-based clustering, aggregating different relationship augmented features onto the same prototype. Secondly, we design a dynamic pseudo-label assignment algorithm based on a mini-batch, which adjusts the detection sensitivity to different frequency samples from the historical assignment. Finally, we conduct joint training on the pseudo-labels and the labeled data. We conduct experiments on various SGG models and achieve substantial overall performance improvements, demonstrating the effectiveness of SSC-SGG.

Downloads

Published

2025-04-11

How to Cite

Yang, J., Wang, C., Zhang, J., Wu, S., Zhao, J., Liu, Z., & Yang, L. (2025). Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(9), 9220–9228. https://doi.org/10.1609/aaai.v39i9.32998

Issue

Section

AAAI Technical Track on Computer Vision VIII