Group-Pair Convolutional Neural Networks for Multi-View Based 3D Object Retrieval

Authors

  • Zan Gao Tianjin University of Technology
  • Deyu Wang Tianjin University of Technology
  • Xiangnan He National University of Singapore
  • Hua Zhang Tianjin University of Technology

DOI:

https://doi.org/10.1609/aaai.v32i1.11899

Keywords:

3D object retrieval, Group-pair CNN

Abstract

In recent years, research interest in object retrieval has shifted from 2D towards 3D data. Despite many well-designed approaches, we point out that limitations still exist and there is tremendous room for improvement, including the heavy reliance on hand-crafted features, the separated optimization of feature extraction and object retrieval, and the lack of sufficient training samples. In this work, we address the above limitations for 3D object retrieval by developing a novel end-to-end solution named Group Pair Convolutional Neural Network (GPCNN). It can jointly learn the visual features from multiple views of a 3D model and optimize towards the object retrieval task. To tackle the insufficient training data issue, we innovatively employ a pair-wise learning scheme, which learns model parameters from the similarity of each sample pair, rather than the traditional way of learning from sparse label–sample matching. Extensive experiments on three public benchmarks show that our GPCNN solution significantly outperforms the state-of-the-art methods with 3% to 42% improvement in retrieval accuracy.

Downloads

Published

2018-04-26

How to Cite

Gao, Z., Wang, D., He, X., & Zhang, H. (2018). Group-Pair Convolutional Neural Networks for Multi-View Based 3D Object Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.11899

Issue

Section

Main Track: Machine Learning Applications