Approximating Metric Magnitude of Point Sets

Rayna Andreeva; James Ward; Primoz Skraba; Jie Gao; Rik Sarkar

doi:10.1609/aaai.v39i15.33687

Authors

Rayna Andreeva University of Edinburgh
James Ward University of Edinburgh
Primoz Skraba Queen Mary, University of London
Jie Gao Rutgers University
Rik Sarkar University of Edinburgh

DOI:

https://doi.org/10.1609/aaai.v39i15.33687

Abstract

Metric magnitude of a point cloud is a measure of its ``size." It has been adapted to various mathematical contexts and recent work suggests that it can enhance machine learning and optimization algorithms. But its usability is limited due to the computational cost when the dataset is large or when the computation must be carried out repeatedly (e.g. in model training). In this paper, we study the magnitude computation problem, and show efficient ways of approximating it. We show that it can be cast as a convex optimization problem, but not as a submodular optimization. The paper describes two new algorithms -- an iterative approximation algorithm that converges fast and is accurate in practice, and a subset selection method that makes the computation even faster. It has previously been proposed that the magnitude of model sequences generated during stochastic gradient descent is correlated to the generalization gap. Extension of this result using our more scalable algorithms shows that longer sequences bear higher correlations. We also describe new applications of magnitude in machine learning -- as an effective regularizer for neural network training, and as a novel clustering criterion.

Approximating Metric Magnitude of Point Sets

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information