Forget What Has Seen: Selective Concept Unlearning in Segmentation Foundation Models

Miaozeng Du; Jiaqi Li; Sirui Pan; Yi Zhan; Guilin Qi; Yuxin Zhang; Rihui Jin; Yinjia Shu; Qianshan Wei

doi:10.1609/aaai.v40i25.39233

Authors

Miaozeng Du College of Software Engineering, Southeast University, Nanjing, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Jiaqi Li School of Computer Science and Engineering, Southeast University, Nanjing, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Sirui Pan School of Computer Science and Engineering, Southeast University, Nanjing, China
Yi Zhan School of Computer Science, Peking University, Beijing, China
Guilin Qi School of Computer Science and Engineering, Southeast University, Nanjing, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Yuxin Zhang School of Computer Science and Engineering, Southeast University, Nanjing, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Rihui Jin School of Computer Science and Engineering, Southeast University, Nanjing, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Yinjia Shu School of Computer Science and Engineering, Southeast University, Nanjing, China
Qianshan Wei School of Computer Science and Engineering, Southeast University, Nanjing, China

DOI:

https://doi.org/10.1609/aaai.v40i25.39233

Abstract

Machine unlearning (MU) has emerged as a critical tool for removing sensitive or personal information from machine learning models, empowering individuals with the right to be forgotten. While MU has achieved success in classification and generative tasks, whether this technique can be effectively applied to segmentation foundation models remains uncertain. To address this issue, we propose an efficient method, Selective Concept Unlearning (SCU), to unlearn the segmentation capability of target concepts. SCU consists of several key aspects: (1) The Multi-level Forgetting Module, designed with a hierarchical three-level suppression strategy, including (i) distillation-level: Negative distillation steers model’s output distribution away from teacher’s correct outputs, erasing its learned concept recognition. (ii) attention-level: Attention suppression minimizes model’s attention to target regions. (iii) output-level: Directly erases predictions for the target by relabeling as background. (2) The Preservation Module ensures maintaining segmentation quality for non-target concepts. Additionally, we introduce a set of metrics to evaluate segmentation unlearning methods. Experiments demonstrate that SCU consistently outperforms existing baselines.

Forget What Has Seen: Selective Concept Unlearning in Segmentation Foundation Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information