Temporal Action Proposal Generation with Background Constraint

Authors

  • Haosen Yang Harbin Institute of Technology Department of Computer Vision Technology (VIS), Baidu Inc
  • Wenhao Wu Department of Computer Vision Technology (VIS), Baidu Inc
  • Lining Wang Harbin Institute of Technology
  • Sheng Jin Harbin Institute of Technology
  • Boyang Xia Institute of Computing Technology, Chinese Academy of Science Department of Computer Vision Technology (VIS), Baidu Inc
  • Hongxun Yao Harbin Institute of Technology
  • Hujie Huang Harbin Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v36i3.20212

Keywords:

Computer Vision (CV)

Abstract

Temporal action proposal generation (TAPG) is a challenging task that aims to locate action instances in untrimmed videos with temporal boundaries. To evaluate the confidence of proposals, the existing works typically predict action score of proposals that are supervised by the temporal Intersection-over-Union (tIoU) between proposal and the ground-truth. In this paper, we innovatively propose a general auxiliary Background Constraint idea to further suppress low-quality proposals, by utilizing the background prediction score to restrict the confidence of proposals. In this way, the Background Constraint concept can be easily plug-and-played into existing TAPG methods (BMN, GTAD). From this perspective, we propose the Background Constraint Network (BCNet) to further take advantage of the rich information of action and background. Specifically, we introduce an Action-Background Interaction module for reliable confidence evaluation, which models the inconsistency between action and background by attention mechanisms at the frame and clip levels. Extensive experiments are conducted on two popular benchmarks, ActivityNet-1.3 and THUMOS14. The results demonstrate that our method outperforms state-of-the-art methods. Equipped with the existing action classifier, our method also achieves remarkable performance on the temporal action localization task.

Downloads

Published

2022-06-28

How to Cite

Yang, H., Wu, W., Wang, L., Jin, S., Xia, B., Yao, H., & Huang, H. (2022). Temporal Action Proposal Generation with Background Constraint. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3054-3062. https://doi.org/10.1609/aaai.v36i3.20212

Issue

Section

AAAI Technical Track on Computer Vision III