Cross-Constrained Progressive Inference for 3D Hand Pose Estimation with Dynamic Observer-Decision-Adjuster Networks

Authors

  • Zhehan Kan, Southern University of Science and Technology
  • Xueting Hu, Southern University of Science and Technology
  • Zihan Liao, Southern University of Science and Technology
  • Ke Yu, Southern University of Science and Technology
  • Zhihai He, Southern University of Science and Technology; Pengcheng Laboratory, Shenzhen, China

DOI:

https://doi.org/10.1609/aaai.v38i3.28048

Keywords:

CV: 3D Computer Vision, CV: Biometrics, Face, Gesture & Pose

Abstract

Generalization is critical for pose estimation, especially 3D pose estimation, where small changes in the 2D image can trigger structural changes in 3D space. To generalize, a system needs to detect estimation errors by checking the projection coherence between the 3D and 2D spaces, and to adapt its inference process based on this feedback. Current pose estimation methods run a single feed-forward pass and cannot gather feedback or adapt their inference outcome. To address this problem, we explore progressive inference, in which the network learns an observer that continuously detects the prediction error through constraint matching, and an adjuster that refines the inference outcome based on these constraint errors. In the context of 3D hand pose estimation, we find that this observer-adjuster design is relatively unstable because the observer operates in the 2D image domain while the adjuster operates in the 3D domain. To address this issue, we construct two sets of observers and adjusters with complementary constraints from different perspectives. They operate in a dynamic sequential manner, controlled by a decision network, to progressively improve the 3D pose estimation. We refer to this method as Cross-Constrained Progressive Inference (CCPI). Extensive experiments on the FreiHAND and HO-3D benchmark datasets demonstrate that CCPI significantly improves the generalization capability and performance of 3D hand pose estimation.
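
The observe-then-adjust loop described in the abstract can be illustrated with a small NumPy sketch. This is not the CCPI method: the paper's observer, adjuster, and decision modules are learned networks, whereas everything below is a hand-written stand-in. The function names (project, observe, adjust, progressive_inference), the pinhole camera with unit focal length, the damped update rule, and the tolerance-based stopping test are all assumptions made for illustration.

```python
import numpy as np

def project(joints_3d, focal=1.0, center=(0.0, 0.0)):
    """Pinhole projection of (N, 3) camera-space joints to (N, 2) image points."""
    xy = joints_3d[:, :2]
    z = joints_3d[:, 2:3]
    return focal * xy / z + np.asarray(center)

def observe(joints_3d, keypoints_2d, focal=1.0):
    """Hypothetical observer: per-joint 2D reprojection residual (not learned)."""
    return keypoints_2d - project(joints_3d, focal)

def adjust(joints_3d, residual_2d, focal=1.0, step=0.5):
    """Hypothetical adjuster: shift x and y to shrink the residual, depth held fixed."""
    updated = joints_3d.copy()
    updated[:, :2] += step * residual_2d * joints_3d[:, 2:3] / focal
    return updated

def progressive_inference(joints_3d, keypoints_2d, tol=1e-6, max_steps=50):
    """Repeat observe -> decide -> adjust until the mean residual drops below tol."""
    errors = []
    for _ in range(max_steps):
        residual = observe(joints_3d, keypoints_2d)
        error = float(np.linalg.norm(residual, axis=1).mean())
        errors.append(error)
        if error < tol:  # stand-in for the decision step: stop when coherent
            break
        joints_3d = adjust(joints_3d, residual)
    return joints_3d, errors

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic 21-joint hand in camera space (made-up data, metres).
    true_joints = np.column_stack(
        [rng.uniform(-0.1, 0.1, (21, 2)), rng.uniform(0.4, 0.6, 21)]
    )
    keypoints_2d = project(true_joints)
    initial = true_joints.copy()
    initial[:, :2] += rng.normal(0.0, 0.02, (21, 2))  # perturb x and y only
    refined, errors = progressive_inference(initial, keypoints_2d)
    print(f"steps: {len(errors)}, first error: {errors[0]:.4f}, last error: {errors[-1]:.2e}")
```

With a step of 0.5, each pass halves the reprojection residual, so the error shrinks geometrically. Because this toy observer only sees the 2D projection, it cannot correct an error in depth; that limitation is one way to picture why a second, complementary constraint could be useful, although the sketch makes no claim about how the paper's observers are actually built.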

Published

2024-03-24

How to Cite

Kan, Z., Hu, X., Liao, Z., Yu, K., & He, Z. (2024). Cross-Constrained Progressive Inference for 3D Hand Pose Estimation with Dynamic Observer-Decision-Adjuster Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2697-2704. https://doi.org/10.1609/aaai.v38i3.28048

Issue

Section

AAAI Technical Track on Computer Vision II