DIET: Machine Unlearning on a Data-Diet
DOI:
https://doi.org/10.1609/aaai.v40i27.39431Abstract
Machine Unlearning (MU) aims to remove the influence of specific knowledge from a pretrained model. Existing methods often rely on retained training data to preserve utility; such dependence is impractical due to privacy and scalability constraints. A further complication arises when unlearning is applied to vision-language models (VLMs), where entangled multimodal representations make targeted forgetting especially challenging. We propose DIET, a principled retain-data-free unlearning method for VLMs that addresses these challenges by leveraging the geometry of hyperbolic space. The core idea is to push forget embeddings toward class-mismatched prototypes located at the boundary of the hyperbolic space. In hyperbolic geometry, points near the boundary become infinitely distant from interior points. As a result, moving forget embeddings to the boundary makes their influence on the model asymptotically negligible. To formalize this, we guide the forgetting process using the Busemann function, which quantifies directional distance to the boundary. We further develop an adaptive scheme based on optimal transport that selects mismatched prototypes for each forget embedding, enabling flexible unlearning dynamics. Extensive experiments on fine-grained datasets such as Flowers102, OxfordPets, and StanfordCars show that DIET achieves an average forget accuracy of 8.06%, while preserving 69.04% utility using only 16 samples per concept, significantly outperforming the best retain-free baselines with a 117.5% improvement in model utility, and showing competitive performance to retain-data baselines with only a 3.79% dropDownloads
Published
2026-03-14
How to Cite
Kunananthaseelan, N., Wu, J., Le, T., Haffari, G., & Harandi, M. (2026). DIET: Machine Unlearning on a Data-Diet. Proceedings of the AAAI Conference on Artificial Intelligence, 40(27), 22698–22706. https://doi.org/10.1609/aaai.v40i27.39431
Issue
Section
AAAI Technical Track on Machine Learning IV