FOCUS: Towards Universal Foreground Segmentation

Authors

  • Zuyao You Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University Shanghai Collaborative Innovation Center of Intelligent Visual Computing
  • Lingyu Kong Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University
  • Lingchen Meng Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University Shanghai Collaborative Innovation Center of Intelligent Visual Computing
  • Zuxuan Wu Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University Shanghai Collaborative Innovation Center of Intelligent Visual Computing

DOI:

https://doi.org/10.1609/aaai.v39i9.33038

Abstract

Foreground segmentation is a fundamental task in computer vision, encompassing various subdivision tasks. Previous research has typically designed task-specific architectures for each task, leading to a lack of unification. Moreover, they primarily focus on recognizing foreground objects without effectively distinguishing them from the background. In this paper, we emphasize the importance of the background and its relationship with the foreground. We introduce FOCUS, the Foreground ObjeCts Universal Segmentation framework that can handle multiple foreground tasks. We develop a multi-scale semantic network using the edge information of objects to enhance image features. To achieve boundary-aware segmentation, we propose a novel distillation method, integrating the contrastive learning strategy to refine the prediction mask in multi-modal feature space. We conduct extensive experiments on a total of 13 datasets across 5 tasks, and the results demonstrate that FOCUS consistently outperforms the state-of-the-art task-specific models on most metrics.

Downloads

Published

2025-04-11

How to Cite

You, Z., Kong, L., Meng, L., & Wu, Z. (2025). FOCUS: Towards Universal Foreground Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(9), 9580–9588. https://doi.org/10.1609/aaai.v39i9.33038

Issue

Section

AAAI Technical Track on Computer Vision VIII