Deep Embedding for Determining the Number of Clusters

Authors

  • Yiqi Wang National University of Defense Technology
  • Zhan Shi University of Texas at Austin
  • Xifeng Guo National University of Defense Technology
  • Xinwang Liu National University of Defense Technology
  • En Zhu National University of Defense Technology
  • Jianping Yin Dongguan University of Technology

DOI:

https://doi.org/10.1609/aaai.v32i1.12150

Keywords:

clustering, deep learning, dimension reduction

Abstract

Determining the number of clusters is important but challenging, especially for data of high dimension. In this paper, we propose Deep Embedding Determination (DED), a method that can solve jointly for the unknown number of clusters and feature extraction. DED first combines the virtues of the convolutional autoencoder and the t-SNE technique to extract low dimensional embedded features. Then it determines the number of clusters using an improved density-based clustering algorithm. Our experimental evaluation on image datasets shows significant improvement over state-of-the-art methods and robustness with respect to hyperparameter settings.

Downloads

Published

2018-04-29

How to Cite

Wang, Y., Shi, Z., Guo, X., Liu, X., Zhu, E., & Yin, J. (2018). Deep Embedding for Determining the Number of Clusters. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12150