Efficient Virtual View Selection for 3D Hand Pose Estimation

Authors

  • Jian Cheng Institute of Software Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Yanguang Wan Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Dexin Zuo Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Cuixia Ma Institute of Software Chinese Academy of Sciences
  • Jian Gu Alibaba
  • Ping Tan Simon Fraser University Alibaba
  • Hongan Wang Institute of Software, Chinese Academy of Sciences
  • Xiaoming Deng Institute of Software, Chinese Academy of Sciences
  • Yinda Zhang Google

DOI:

https://doi.org/10.1609/aaai.v36i1.19919

Keywords:

Computer Vision (CV)

Abstract

3D hand pose estimation from single depth is a fundamental problem in computer vision, and has wide applications. However, the existing methods still can not achieve satisfactory hand pose estimation results due to view variation and occlusion of human hand. In this paper, we propose a new virtual view selection and fusion module for 3D hand pose estimation from single depth. We propose to automatically select multiple virtual viewpoints for pose estimation and fuse the results of all and find this empirically delivers accurate and robust pose estimation. In order to select most effective virtual views for pose fusion, we evaluate the virtual views based on the confidence of virtual views using a light-weight network via network distillation. Experiments on three main benchmark datasets including NYU, ICVL and Hands2019 demonstrate that our method outperforms the state-of-the-arts on NYU and ICVL, and achieves very competitive performance on Hands2019-Task1, and our proposed virtual view selection and fusion module is both effective for 3D hand pose estimation.

Downloads

Published

2022-06-28

How to Cite

Cheng, J., Wan, Y., Zuo, D., Ma, C., Gu, J., Tan, P., Wang, H., Deng, X., & Zhang, Y. (2022). Efficient Virtual View Selection for 3D Hand Pose Estimation. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 419-426. https://doi.org/10.1609/aaai.v36i1.19919

Issue

Section

AAAI Technical Track on Computer Vision I