Intentional Evolutionary Learning for Untrimmed Videos with Long Tail Distribution

Authors

  • Yuxi Zhou Tianjin University of Technology Tsinghua University
  • Xiujie Wang Tianjin University of Technology
  • Jianhua Zhang Tianjin University of Technology
  • Jiajia Wang Tianjin University of Technology
  • Jie Yu Tianjin University of Technology
  • Hao Zhou Tianjin University of Technology
  • Yi Gao Tianjin University of Technology
  • Shengyong Chen Tianjin University of Technology

DOI:

https://doi.org/10.1609/aaai.v38i7.28605

Keywords:

CV: Video Understanding & Activity Analysis, APP: Other Applications, CV: 3D Computer Vision, CV: Applications, DMKM: Mining of Visual, Multimedia & Multimodal Data, HAI: Applications, HAI: Human-Aware Planning and Behavior Prediction, HAI: Human-Computer Interaction, ML: Applications, ML: Deep Neural Architectures and Foundation Models

Abstract

Human intention understanding in untrimmed videos aims to watch a natural video and predict what the person’s intention is. Currently, exploration of predicting human intentions in untrimmed videos is far from enough. On the one hand, untrimmed videos with mixed actions and backgrounds have a significant long-tail distribution with concept drift characteristics. On the other hand, most methods can only perceive instantaneous intentions, but cannot determine the evolution of intentions. To solve the above challenges, we propose a loss based on Instance Confidence and Class Accuracy (ICCA), which aims to alleviate the prediction bias caused by the long-tail distribution with concept drift characteristics in video streams. In addition, we propose an intention-oriented evolutionary learning method to determine the intention evolution pattern (from what action to what action) and the time of evolution (when the action evolves). We conducted extensive experiments on two untrimmed video datasets (THUMOS14 and ActivityNET v1.3), and our method has achieved excellent results compared to SOTA methods. The code and supplementary materials are available at https://github.com/Jennifer123www/UntrimmedVideo.

Downloads

Published

2024-03-24

How to Cite

Zhou, Y., Wang, X., Zhang, J., Wang, J., Yu, J., Zhou, H., Gao, Y., & Chen, S. (2024). Intentional Evolutionary Learning for Untrimmed Videos with Long Tail Distribution. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7713-7721. https://doi.org/10.1609/aaai.v38i7.28605

Issue

Section

AAAI Technical Track on Computer Vision VI