Adaptive Interception in Dynamic Domains: Exploration of Hybrid Reinforcement Learning in Pursuit-Evasion Tasks

Authors

  • Matthew Akinmolayan Bowie State University
  • Darsana Josyula Bowie State University
  • David Casbeer Air Force Research Laboratory

DOI:

https://doi.org/10.1609/aaaiss.v8i1.42510

Abstract

This paper investigates a hybrid approach that integrates clas sical interception heuristics with reinforcement learning (RL) for solving real-time pursuit-evasion problems in both single agent and multi-agent scenarios. Drawing inspiration from Weintraub et al.’s range-limited pursuit-evasion formulation, we developed a simulation and learning architecture com bining Proximal Policy Optimization (PPO) with classical prediction-based interception. Through adaptive curriculum learning and an enhanced reward function, the hybrid agent learns to dynamically balance between geometric reason ing and learned behaviors. We first validate our approach in single-agent scenarios, then extend to a multi-agent set ting featuring coordinated pursuer teams defending stations against RL-trained evader flocks using adversarial self-play. Our evaluations reveal that the hybrid method significantly outperforms classical approaches in both settings, achiev ing a 94% capture rate versus 24% in single-agent scenarios and demonstrating robust coordination in multi-agent defense tasks against adaptive adversaries.

Downloads

Published

2026-05-18

How to Cite

Akinmolayan, M., Josyula, D., & Casbeer, D. (2026). Adaptive Interception in Dynamic Domains: Exploration of Hybrid Reinforcement Learning in Pursuit-Evasion Tasks. Proceedings of the AAAI Symposium Series, 8(1), 2–10. https://doi.org/10.1609/aaaiss.v8i1.42510

Issue

Section

Advances in AI-Enabled Tactical Autonomy