Beyond the Horizon: Decoupling Multi-View UAV Action Recognition via Partial Order Transfer

Authors

  • Wenxuan Liu State Key Laboratory for Multimedia Information Processing, Peking University Hubei Key Laboratory of Transportation Internet of Things, Wuhan University of Technology
  • Zhuo Zhou National Engineering Research Center for Multimedia Software, Wuhan University
  • Xuemei Jia National Engineering Research Center for Multimedia Software, Wuhan University
  • Siyuan Yang College of Computing and Data Science, Nanyang Technological University
  • Wenxin Huang Hubei Key Laboratory of Big Data Intelligent Analysis and Application, Hubei University
  • Xian Zhong Hubei Key Laboratory of Transportation Internet of Things, Wuhan University of Technology
  • Chia-Wen Lin Department of Electrical Engineering, National Tsing Hua University

DOI:

https://doi.org/10.1609/aaai.v40i9.37668

Abstract

Action recognition using uncrewed aerial vehicles (UAVs) faces unique challenges due to substantial view variations along the vertical spatial axis. Unlike ground-based scenarios, UAVs capture actions from diverse altitudes, resulting in pronounced appearance discrepancies and reduced recognition robustness. To address this, we introduce a multi-view formulation tailored for UAV altitudes and empirically uncover a distinctive partial order among views, where recognition accuracy consistently declines as altitude increases. This key observation motivates the proposed Aero Partial Order Guided Network (Aerorder), which explicitly models and exploits the hierarchical structure of UAV views to enhance cross-altitude action recognition. Aerorder comprises three main components: (1) a View Partition (VP) module that groups views by altitude using the head-to-body ratio; (2) an Order-aware Feature Decoupling (OFD) module that disentangles action-relevant and view-specific representations under partial order guidance; and (3) an Action Partial Order Guide (APOG) that progressively transfers knowledge from easier (low-altitude) to harder (high-altitude) views. Extensive experiments on Drone-Action, MOD20, and UAV validate the superiority of Aerorder, achieving consistent improvements over state-of-the-art methods, up to 4.7% and 1.3% gains on Drone-Action and MOD20, respectively.

Downloads

Published

2026-03-14

How to Cite

Liu, W., Zhou, Z., Jia, X., Yang, S., Huang, W., Zhong, X., & Lin, C.-W. (2026). Beyond the Horizon: Decoupling Multi-View UAV Action Recognition via Partial Order Transfer. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 7305-7313. https://doi.org/10.1609/aaai.v40i9.37668

Issue

Section

AAAI Technical Track on Computer Vision VI