LVPTrack: High Performance Domain Adaptive UAV Tracking with Label Aligned Visual Prompt Tuning

Authors

  • Hongjing Wu Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
  • Siyuan Yao Beijing University of Posts and Telecommunications, Beijing, China
  • Feng Huang School of Mechanical Engineering and Automation, Fuzhou University, Fuzhou, China
  • Shu Wang School of Mechanical Engineering and Automation, Fuzhou University, Fuzhou, China
  • Linchao Zhang Artificial Intelligence Institute of China Electronics Technology Group Corporation, Beijing, China
  • Zhuoran Zheng Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
  • Wenqi Ren Shenzhen Campus of Sun Yat-sen University, Shenzhen, China

DOI:

https://doi.org/10.1609/aaai.v39i8.32906

Abstract

Visual object tracking is essentially crucial for unmanned aerial vehicles (UAVs). Despite the substantial progress, most of the existing UAV trackers are designed for well-conditioned daytime data, while for the scenarios in challenging weather condition, e.g. foggy or nighttime environment, the tremendous domain gap leads to significant performance degradation. To address this issue, in this paper, we propose a novel robust UAV tracker termed LVPTrack, which conducts high quality label-aligned visual prompt tuning to adapt to various challenging weather conditions. Specifically, we first synthesize the sequential foggy and nighttime video frames to assist the model training. A domain adaptive teacher-student network is utilized to distill the hierarchical visual semantic of the target objects in cross-domain scenarios. Then we propose a target-aware pseudo-label voting (PLV) strategy to alleviate the target-level misalignment in the dual domains. Furthermore, we propose a dynamic aggregated prompt (DAP) module to facilitate the appearance variation adaptation of the target object in challenging scenarios. Extensive experiments demonstrate that our tracker achieves superior performance over existing state-of-the-art UAV trackers.

Downloads

Published

2025-04-11

How to Cite

Wu, H., Yao, S., Huang, F., Wang, S., Zhang, L., Zheng, Z., & Ren, W. (2025). LVPTrack: High Performance Domain Adaptive UAV Tracking with Label Aligned Visual Prompt Tuning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(8), 8395–8403. https://doi.org/10.1609/aaai.v39i8.32906

Issue

Section

AAAI Technical Track on Computer Vision VII