Motion-Aware Object Tracking via Motion and Geometry-Aware Cues

Authors

  • Hongtao Yang Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin 541004, China University Engineering Research Center of Educational Intelligent Technology, Guangxi Normal University, Guilin 541004, China
  • Bineng Zhong Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin 541004, China University Engineering Research Center of Educational Intelligent Technology, Guangxi Normal University, Guilin 541004, China
  • Qihua Liang Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin 541004, China University Engineering Research Center of Educational Intelligent Technology, Guangxi Normal University, Guilin 541004, China
  • Xiantao Hu Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin 541004, China Nanjing University of Science and Technology
  • Yufei Tan Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin 541004, China University Engineering Research Center of Educational Intelligent Technology, Guangxi Normal University, Guilin 541004, China Guangxi Key Laboratory of Brain-inspired Computing and Intelligent Chips, School of Electronic and Information Engineering, Guangxi Normal University, Guilin 541004, China
  • Haiying Xia Guangxi Key Laboratory of Brain-inspired Computing and Intelligent Chips, School of Electronic and Information Engineering, Guangxi Normal University, Guilin 541004, China
  • Shuxiang Song Guangxi Key Laboratory of Brain-inspired Computing and Intelligent Chips, School of Electronic and Information Engineering, Guangxi Normal University, Guilin 541004, China

DOI:

https://doi.org/10.1609/aaai.v40i14.38144

Abstract

Understanding motion is essential for visual object tracking, especially in complex and dynamic scenarios. Yet, many existing methods rely on simplistic strategies such as template updates or temporal feature propagation, often overlooking the deeper modeling of motion information. To mitigate this limitation, we introduce a motion-aware spatio-temporal framework that enhances motion perception by explicitly matching motion patterns and modeling inter-frame motion relationships. Central to our design is a motion pattern dictionary, which encodes a diverse set of representative motion cues as learnable features. During tracking, features from the search region interact with the dictionary to retrieve the most relevant motion patterns, allowing the model to adapt to the current motion state. A dedicated decoder further incorporates temporal correlations to refine motion awareness. To complement motion modeling, we embed geometric cues into the search region features, which strengthens spatial perception, reduces ambiguity under occlusion, and improves foreground-background separation. Extensive evaluations on seven challenging benchmarks demonstrate the effectiveness of our design. In particular, MoDTrack_384 surpasses recent SOTA trackers on LaSOT by 1.2% in AUC, highlighting the benefits of motion pattern modeling and geometry-guided enhancement in mitigating tracking drift.

Published

2026-03-14

How to Cite

Yang, H., Zhong, B., Liang, Q., Hu, X., Tan, Y., Xia, H., & Song, S. (2026). Motion-Aware Object Tracking via Motion and Geometry-Aware Cues. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 11604-11612. https://doi.org/10.1609/aaai.v40i14.38144

Issue

Section

AAAI Technical Track on Computer Vision XI