HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video Tracking

Authors

  • Jiahao Wang Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Fang Liu Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Licheng Jiao Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Hao Wang Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Shuo Li Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Xinyi Wang Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Lingling Li Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Puhua Chen Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China
  • Xu Liu Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education Joint International Research Laboratory of Intelligent Perception and Computation School of Artificial Intelligent, Xidian University, Xi’an,710071, P.R. China

DOI:

https://doi.org/10.1609/aaai.v40i12.37950

Abstract

In recent years, the rapid progress of deep learning has driven notable advancements in satellite video tracking, a critical task for applications such as environmental monitoring, disaster management, and defense. Despite these strides, existing approaches remain constrained by their inability to handle dynamic challenges, such as target appearance variations, complex motion patterns, and occlusions. Traditional methods often suffer from static template matching or overly complex update mechanisms, compromising their robustness and practicality in real-world scenarios. To address these limitations, we propose a paradigm shift in satellite video tracking by integrating historical trajectory knowledge with visual features. This fusion enhances the tracker's perceptual understanding of targets over time, enabling more adaptive and resilient tracking. By aligning spatial, temporal, and cross-modal information, our approach effectively bridges the gap between fragmented observations and coherent tracking performance, even under challenging conditions like small target detection and cluttered backgrounds. Extensive experiments conducted on multiple satellite video tracking benchmarks demonstrate the superiority of our method, with HTTrack achieving success rates of 51.5% on SV248S, 52.9% on SatSOT, and 32.6% on VISO, significantly outperforming state-of-the-art trackers and marking a step forward in achieving robust, accurate, and scalable satellite video tracking.

Published

2026-03-14

How to Cite

Wang, J., Liu, F., Jiao, L., Wang, H., Li, S., Wang, X., Li, L., Chen, P., & Liu, X. (2026). HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video Tracking. Proceedings of the AAAI Conference on Artificial Intelligence, 40(12), 9856-9866. https://doi.org/10.1609/aaai.v40i12.37950

Issue

Section

AAAI Technical Track on Computer Vision IX