Flattening the Density Gradient for Eliminating Spatial Centrality to Reduce Hubness

Authors

  • Kazuo Hara, National Institute of Genetics
  • Ikumi Suzuki, Yamagata University
  • Kei Kobayashi, The Institute of Statistical Mathematics
  • Kenji Fukumizu, The Institute of Statistical Mathematics
  • Milos Radovanovic, University of Novi Sad

DOI:

https://doi.org/10.1609/aaai.v30i1.10240

Keywords:

Hubness, Density gradient, Spatial centrality, k-nearest-neighbor method

Abstract

Spatial centrality, whereby samples closer to the center of a dataset tend to be closer to all other samples, is regarded as one source of hubness. Hubness is well known to degrade k-nearest-neighbor (k-NN) classification. When inner product similarity is used, spatial centrality can be removed by centering, i.e., shifting the origin to the global center of the dataset. However, when Euclidean distance is used, centering has no effect on spatial centrality because pairwise distances between samples are unchanged by centering. In this paper, we propose a solution to the hubness problem for the case in which Euclidean distance is used. We provide a theoretical explanation of how the solution eliminates spatial centrality and reduces hubness, and we discuss why it works from the viewpoint of the density gradient, which is regarded as the origin of both spatial centrality and hubness: the proposed solution corresponds to flattening the density gradient. Using real-world datasets, we demonstrate that the proposed method improves k-NN classification performance and outperforms an existing hub-reduction method.
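The following is a minimal sketch, not the authors' implementation, illustrating two points from the abstract under assumed toy data: hubness can be quantified by the skewness of the k-occurrence distribution N_k (how often each sample appears in other samples' k-NN lists), and centering changes k-NN lists under inner-product similarity but leaves Euclidean-distance k-NN lists, and hence the hubness score, unchanged. Python with NumPy/SciPy is assumed; all function names here are illustrative.

```python
# Hypothetical sketch: measuring hubness via k-occurrence skewness and
# checking the effect of centering under two similarity measures.
import numpy as np
from scipy.stats import skew


def k_occurrence_skewness(neighbors: np.ndarray) -> float:
    """Skewness of N_k: how often each sample appears in others' k-NN lists."""
    n = neighbors.shape[0]
    counts = np.bincount(neighbors.ravel(), minlength=n)
    return float(skew(counts))


def knn_inner_product(X: np.ndarray, k: int) -> np.ndarray:
    sim = X @ X.T
    np.fill_diagonal(sim, -np.inf)           # exclude self-similarity
    return np.argsort(-sim, axis=1)[:, :k]   # top-k most similar samples


def knn_euclidean(X: np.ndarray, k: int) -> np.ndarray:
    sq = np.sum(X ** 2, axis=1)
    dist = sq[:, None] + sq[None, :] - 2 * X @ X.T   # squared distances
    np.fill_diagonal(dist, np.inf)           # exclude self-distance
    return np.argsort(dist, axis=1)[:, :k]   # top-k nearest samples


rng = np.random.default_rng(0)
X = rng.normal(loc=3.0, scale=1.0, size=(500, 50))   # toy high-dimensional data
Xc = X - X.mean(axis=0)                              # centering

k = 10
print("inner product, raw:     ", k_occurrence_skewness(knn_inner_product(X, k)))
print("inner product, centered:", k_occurrence_skewness(knn_inner_product(Xc, k)))
# Euclidean distance is translation-invariant, so the k-NN lists (and the
# resulting hubness score) are identical before and after centering.
print("euclidean, raw:         ", k_occurrence_skewness(knn_euclidean(X, k)))
print("euclidean, centered:    ", k_occurrence_skewness(knn_euclidean(Xc, k)))
```

Higher skewness of N_k indicates stronger hubness; the Euclidean results match before and after centering, which is the limitation the paper's density-gradient flattening approach is designed to address.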

Published

2016-02-21

How to Cite

Hara, K., Suzuki, I., Kobayashi, K., Fukumizu, K., & Radovanovic, M. (2016). Flattening the Density Gradient for Eliminating Spatial Centrality to Reduce Hubness. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10240

Section

Technical Papers: Machine Learning Methods