On the Misalignment Between Data Learnability and Forgettability in Machine Unlearning

Authors

  • Zijie Pan, City University of Macau
  • Zuobin Ying, City University of Macau
  • Yajie Wang, Beijing Institute of Technology
  • Wanlei Zhou, City University of Macau

DOI:

https://doi.org/10.1609/aaai.v40i29.39657

Abstract

We report a structural mismatch between a data point’s learnability (how quickly it improves the loss) and its forgettability (how much it anchors the final parameters), an aspect ignored by prior machine unlearning frameworks such as SISA, Fisher-Forget, and influence-based fine-tuning. To make this gap measurable, we introduce Unlearning Gradient Sensitivity (UGS), an influence score computable with a single Hutch++ sketch, and derive the Learnability–Forgettability Divergence (LFD), the Jensen–Shannon distance between the model’s learning and forgetting distributions. We prove that UGS dispersion decays exponentially only under explicit regularization and that LFD converges to zero when its weight grows sub-linearly relative to the UGS term. Building on these findings, we introduce Dual-Aware Training (DAT), a lightweight regularization method that reduces variability in how easily data points can be forgotten and aligns learning and forgetting behaviors during training. On CIFAR-10, MNIST, and IMDB, DAT maintains the original model accuracy while cutting forgettability divergence in half and significantly lowering the cost of certified unlearning, showing that it is effective to make models forgettable from the start.
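
As a rough illustration of the quantity behind LFD, the sketch below computes the Jensen–Shannon distance between two discrete distributions over the same data points. The abstract does not say how the learning and forgetting distributions are constructed, so the per-sample weights `learning_scores` and `forgetting_scores` are hypothetical placeholders, and this should be read as a generic Jensen–Shannon computation under those assumptions rather than the authors' implementation.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) in bits; terms with p_i == 0 contribute zero."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_distance(p, q):
    """Jensen-Shannon distance (square root of the base-2 JS divergence).

    Inputs are non-negative weights of equal length; each is normalised
    to sum to one. The result lies in [0, 1].
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total_p, total_q = sum(p), sum(q)
    if total_p <= 0 or total_q <= 0:
        raise ValueError("each distribution needs positive total mass")
    p = [x / total_p for x in p]
    q = [x / total_q for x in q]
    # Mixture distribution m = (p + q) / 2
    m = [(a + b) / 2 for a, b in zip(p, q)]
    jsd = 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)
    # Guard against tiny negative values from floating-point rounding
    return math.sqrt(max(jsd, 0.0))


# Hypothetical per-sample weights (assumed for illustration only)
learning_scores = [0.40, 0.30, 0.20, 0.10]
forgetting_scores = [0.10, 0.20, 0.30, 0.40]
lfd_example = js_distance(learning_scores, forgetting_scores)
```

A distance of zero means the two distributions coincide, and a distance of one means they place their mass on disjoint sets of points.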

Published

2026-03-14

How to Cite

Pan, Z., Ying, Z., Wang, Y., & Zhou, W. (2026). On the Misalignment Between Data Learnability and Forgettability in Machine Unlearning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(29), 24718–24726. https://doi.org/10.1609/aaai.v40i29.39657

Section

AAAI Technical Track on Machine Learning VI