Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery

Authors

  • Jialu Zhang The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China Cixi Institute of Biomedical Engineering, Ningbo Institute of Industrial Technology, Chinese Academy of Sciences
  • Xiaoying Yang The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China
  • Wentao He The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China
  • Jianfeng Ren The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China Nottingham Ningbo China Beacons of Excellence Research and Innovation Institute, University of Nottingham Ningbo China
  • Qian Zhang The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China Nottingham Ningbo China Beacons of Excellence Research and Innovation Institute, University of Nottingham Ningbo China
  • Yitian Zhao Cixi Institute of Biomedical Engineering, Ningbo Institute of Industrial Technology, Chinese Academy of Sciences
  • Ruibin Bai The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China Nottingham Ningbo China Beacons of Excellence Research and Innovation Institute, University of Nottingham Ningbo China
  • Xiangjian He The Digital Port Technologies Lab, School of Computer Science, University of Nottingham Ningbo China Nottingham Ningbo China Beacons of Excellence Research and Innovation Institute, University of Nottingham Ningbo China
  • Jiang Liu Cixi Institute of Biomedical Engineering, Ningbo Institute of Industrial Technology, Chinese Academy of Sciences Department of Computer Science and Engineering, Southern University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v38i1.27795

Keywords:

APP: Mobility, Driving & Flight, CV: Visual Reasoning & Symbolic Representations

Abstract

Object detection in aerial imagery presents a significant challenge due to large scale variations among objects. This paper proposes an evolutionary reinforcement learning agent, integrated within a coarse-to-fine object detection framework, to optimize the scale for more effective detection of objects in such images. Specifically, a set of patches potentially containing objects are first generated. A set of rewards measuring the localization accuracy, the accuracy of predicted labels, and the scale consistency among nearby patches are designed in the agent to guide the scale optimization. The proposed scale-consistency reward ensures similar scales for neighboring objects of the same category. Furthermore, a spatial-semantic attention mechanism is designed to exploit the spatial semantic relations between patches. The agent employs the proximal policy optimization strategy in conjunction with the evolutionary strategy, effectively utilizing both the current patch status and historical experience embedded in the agent. The proposed model is compared with state-of-the-art methods on two benchmark datasets for object detection on drone imagery. It significantly outperforms all the compared methods. Code is available at https://github.com/UNNC-CV/EvOD/.

Published

2024-03-25

How to Cite

Zhang, J., Yang, X., He, W., Ren, J., Zhang, Q., Zhao, Y., Bai, R., He, X., & Liu, J. (2024). Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery. Proceedings of the AAAI Conference on Artificial Intelligence, 38(1), 410-418. https://doi.org/10.1609/aaai.v38i1.27795

Issue

Section

AAAI Technical Track on Application Domains