Input Margins Can Predict Generalization Too

Authors

  • Coenraad Mouton Faculty of Engineering, North-West University, South Africa Centre for Artificial Intelligence Research, South Africa South African National Space Agency
  • Marthinus Wilhelmus Theunissen Faculty of Engineering, North-West University, South Africa Centre for Artificial Intelligence Research, South Africa
  • Marelie H Davel Faculty of Engineering, North-West University, South Africa Centre for Artificial Intelligence Research, South Africa National Institute for Theoretical and Computational Sciences, South Africa

DOI:

https://doi.org/10.1609/aaai.v38i13.29351

Keywords:

ML: Representation Learning, ML: Adversarial Learning & Robustness, ML: Deep Learning Theory

Abstract

Understanding generalization in deep neural networks is an active area of research. A promising avenue of exploration has been that of margin measurements: the shortest distance to the decision boundary for a given sample or its representation internal to the network. While margins have been shown to be correlated with the generalization ability of a model when measured at its hidden representations (hidden margins), no such link between large margins and generalization has been established for input margins. We show that while input margins are not generally predictive of generalization, they can be if the search space is appropriately constrained. We develop such a measure based on input margins, which we refer to as 'constrained margins'. The predictive power of this new measure is demonstrated on the 'Predicting Generalization in Deep Learning' (PGDL) dataset and contrasted with hidden representation margins. We find that constrained margins achieve highly competitive scores and outperform other margin measurements in general. This provides a novel insight on the relationship between generalization and classification margins, and highlights the importance of considering the data manifold for investigations of generalization in DNNs.

Published

2024-03-24

How to Cite

Mouton, C., Theunissen, M. W., & Davel, M. H. (2024). Input Margins Can Predict Generalization Too. Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), 14379-14387. https://doi.org/10.1609/aaai.v38i13.29351

Issue

Section

AAAI Technical Track on Machine Learning IV