Gaze Target Detection by Merging Human Attention and Activity Cues

Yaokun Yang; Yihan Yin; Feng Lu

doi:10.1609/aaai.v38i7.28480

Authors

Yaokun Yang, Beihang University
Yihan Yin, Beihang University
Feng Lu, Beihang University

DOI:

https://doi.org/10.1609/aaai.v38i7.28480

Keywords:

CV: Biometrics, Face, Gesture & Pose, CV: Scene Analysis & Understanding

Abstract

Current gaze target detection methods, which rely on visual saliency and spatial scene geometry, achieve impressive performance but still struggle to locate gaze targets against complex image backgrounds. A primary reason is that they overlook the close connection between human attention and activity cues. In this study, we introduce an approach that combines visual saliency detection with body-part and object interaction, both guided by soft gaze attention. This fusion enables precise and reliable gaze target detection in complex image backgrounds. Our approach attains state-of-the-art performance on both the GazeFollow benchmark and the GazeVideoAttn benchmark. Compared with recent methods that rely on complex 3D reconstruction from a single input image, our approach uses only 2D image information yet still leads by a substantial margin across all evaluation metrics, bringing it closer to human-level performance. These results demonstrate the effectiveness of the proposed method for gaze target detection.
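
The fusion idea in the abstract can be illustrated with a toy sketch. This is not the authors' architecture: the abstract gives no implementation details, so the cone-shaped attention map, the weighted blend, and every name here (`soft_gaze_attention`, `fuse`, `argmax_2d`, `sharpness`, `alpha`) are assumptions made purely for illustration. The sketch only shows how a soft gaze attention map might weight a saliency map and an interaction map so that a salient distractor away from the gaze direction is suppressed.

```python
import math

def soft_gaze_attention(height, width, head_xy, gaze_dir, sharpness=4.0):
    """Hypothetical soft attention map: cells aligned with the gaze direction score higher."""
    hx, hy = head_xy
    gx, gy = gaze_dir
    norm = math.hypot(gx, gy) or 1.0
    gx, gy = gx / norm, gy / norm
    attn = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            dx, dy = x - hx, y - hy
            dist = math.hypot(dx, dy)
            if dist == 0:
                continue  # the head cell itself is not a gaze target
            cos = (dx * gx + dy * gy) / dist
            attn[y][x] = max(cos, 0.0) ** sharpness
    return attn

def fuse(saliency, interaction, attention, alpha=0.5):
    """Assumed fusion rule: blend the two cue maps, then weight by attention."""
    return [
        [attention[y][x] * (alpha * saliency[y][x] + (1 - alpha) * interaction[y][x])
         for x in range(len(saliency[0]))]
        for y in range(len(saliency))
    ]

def argmax_2d(grid):
    """Return the (x, y) cell with the highest value."""
    return max((v, (x, y)) for y, row in enumerate(grid) for x, v in enumerate(row))[1]

# Toy 5x5 scene: the person's head is at (0, 2) and looks to the right.
H, W = 5, 5
saliency = [[0.0] * W for _ in range(H)]
interaction = [[0.0] * W for _ in range(H)]
saliency[0][4] = 1.0      # highly salient distractor, off the gaze line
saliency[2][3] = 0.6      # moderately salient object on the gaze line
interaction[2][3] = 0.8   # the same object is being interacted with

attention = soft_gaze_attention(H, W, head_xy=(0, 2), gaze_dir=(1, 0))
fused = fuse(saliency, interaction, attention)

print(argmax_2d(saliency))  # (4, 0): saliency alone picks the distractor
print(argmax_2d(fused))     # (3, 2): the fused map picks the object on the gaze line
```

In this toy setup the attention weight drops to about 0.64 at the distractor, so its fused score (about 0.32) falls below that of the object on the gaze line (0.7). A learned model would replace each hand-written map with a network output.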
