AoI-MDP: An AoI Optimized Markov Decision Process Dedicated in the Underwater Task (Student Abstract)

Authors

  • Yimian Ding Massachusetts Institute of Technology
  • Jingzehua Xu Massachusetts Institute of Technology
  • Yiyuan Yang University of Oxford
  • Guanwen Xie Massachusetts Institute of Technology
  • Xinqi Wang Massachusetts Institute of Technology
  • Shuai Zhang New Jersey Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v39i28.35247

Abstract

Ocean exploration places high demands on autonomous underwater vehicles, especially when there's observation delay. We propose age of information optimized Markov decision process (AoI-MDP) to enhance underwater tasks by modeling observation delay as signal delay and including it in the state space. AoI-MDP also introduces wait time in the action space and integrates AoI with reward functions, optimizing information freshness and decision-making using reinforcement learning. Simulations show AoI-MDP outperforms the standard MDP, demonstrating superior performance, feasibility, and generalization in underwater tasks. To accelerate relevant research, we have made the codes available as open-source at https://github.com/Xiboxtg/AoI-MDP.

Downloads

Published

2025-04-11

How to Cite

Ding, Y., Xu, J., Yang, Y., Xie, G., Wang, X., & Zhang, S. (2025). AoI-MDP: An AoI Optimized Markov Decision Process Dedicated in the Underwater Task (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 29348–29350. https://doi.org/10.1609/aaai.v39i28.35247