Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis

Authors

  • Haopeng Zhang Beihang University
  • Tarek El-Gaaly Rutgers University
  • Ahmed Elgammal Rutgers University
  • Zhiguo Jiang Beihang University

DOI:

https://doi.org/10.1609/aaai.v27i1.8634

Keywords:

vision, kinect, object recognition, machine learning, multi-modal visual learning

Abstract

Object recognition is a key precursory challenge in the fields of object manipulation and robotic/AI visual reasoning in general. Recognizing object categories, particular instances of objects and viewpoints/poses of objects are three critical subproblems robots must solve in order to accurately grasp/manipulate objects and reason about their environ- ments. Multi-view images of the same object lie on intrinsic low-dimensional manifolds in descriptor spaces (e.g. visual/depth descriptor spaces). These object manifolds share the same topology despite being geometrically different. Each object manifold can be represented as a deformed version of a unified manifold. The object manifolds can thus be parametrized by its homeomorphic mapping/reconstruction from the unified manifold. In this work, we construct a manifold descriptor from this mapping between homeomorphic manifolds and use it to jointly solve the three challenging recognition sub-problems. We extensively experiment on a challenging multi-modal (i.e. RGBD) dataset and other object pose datasets and achieve state-of-the-art results.

Downloads

Published

2013-06-30

How to Cite

Zhang, H., El-Gaaly, T., Elgammal, A., & Jiang, Z. (2013). Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 27(1), 1012-1019. https://doi.org/10.1609/aaai.v27i1.8634