Investigating the Effectiveness of Laplacian-Based Kernels in Hub Reduction

Authors

  • Ikumi Suzuki Nara Institute of Science and Technology
  • Kazuo Hara National Institute of Genetics
  • Masashi Shimbo Nara Institute of Science and Technology
  • Yuji Matsumoto Nara Institute of Science and Technology
  • Marco Saerens Universite Catholique de Louvain

DOI:

https://doi.org/10.1609/aaai.v26i1.8295

Keywords:

High-dimensional data, Hub, Data centroid, Laplacian-based kernels

Abstract

A “hub” is an object closely surrounded by, or very similar to, many other objects in the dataset. Recent studies by Radovanovi´c et al. indicate that in high dimensional spaces, hubs almost always emerge, and objects close to the data centroid tend to become hubs. In this paper, we show that the family of kernels based on the graph Laplacian makes all objects in the dataset equally similar to the centroid, and thus they are expected to make less hubs when used as a similarity measure. We investigate this hypothesis using both synthetic and real-world data. It turns out that these kernels suppress hubs in some cases but not always, and the results seem to be affected by the size of the data—a factor not discussed previously. However, for the datasets in which hubs are indeed reduced by the Laplacian-based kernels, these kernels work well in ranking and classification tasks. This result suggests that the amount of hubs, which can be readily computed in an unsupervised fashion, can be a yardstick of whether Laplacian-based kernels work effectively for a given data.

Downloads

Published

2021-09-20

How to Cite

Suzuki, I., Hara, K., Shimbo, M., Matsumoto, Y., & Saerens, M. (2021). Investigating the Effectiveness of Laplacian-Based Kernels in Hub Reduction. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 1112-1118. https://doi.org/10.1609/aaai.v26i1.8295

Issue

Section

AAAI Technical Track: Machine Learning