Precision-Enhanced Human-Object Contact Detection via Depth-Aware Perspective Interaction and Object Texture Restoration

Authors

  • Yuxiao Wang South China University of Technology
  • Wenpeng Neng Ningbo Konfoong Bioinformation Tech Co., Ltd.
  • Zhenao Wei South China University of Technology
  • Yu Lei Southwest Jiaotong University
  • Weiying Xue South China University of Technology
  • Nan Zhuang Zhejiang University
  • Yanwu Xu South China University of Technology
  • Xinyu Jiang South China University of Technology
  • Qi Liu South China University of Technology

DOI:

https://doi.org/10.1609/aaai.v39i8.32883

Abstract

Human-object contact (HOT) is designed to accurately identify the areas where humans and objects come into contact. Current methods frequently fail to account for scenarios where objects are frequently blocking the view, resulting in inaccurate identification of contact areas. To tackle this problem, we suggest using a perspective interaction HOT detector called PIHOT, which utilizes a depth map generation model to offer depth information of humans and objects related to the camera, thereby preventing false interaction detection. Furthermore, we use mask dilatation and object restoration techniques to restore the texture details in covered areas, improve the boundaries between objects, and enhance the perception of humans interacting with objects. Moreover, a spatial awareness perception is intended to concentrate on the characteristic features close to the points of contact. The experimental results show that the PIHOT algorithm achieves state-of-the-art performance on three benchmark datasets for HOT detection tasks. Compared to the most recent DHOT, our method enjoys an average improvement of 13%, 27.5%, 16%, and 18.5% on SC-Acc., C-Acc., mIoU, and wIoU metrics, respectively.

Published

2025-04-11

How to Cite

Wang, Y., Neng, W., Wei, Z., Lei, Y., Xue, W., Zhuang, N., … Liu, Q. (2025). Precision-Enhanced Human-Object Contact Detection via Depth-Aware Perspective Interaction and Object Texture Restoration. Proceedings of the AAAI Conference on Artificial Intelligence, 39(8), 8187–8195. https://doi.org/10.1609/aaai.v39i8.32883

Issue

Section

AAAI Technical Track on Computer Vision VII