Interpretable Adversarial Reinforcement Learning

Oliver Chang

doi:10.1609/aaai.v40i48.42145

Interpretable Adversarial Reinforcement Learning

Authors

Oliver Chang University of California, Santa Cruz

DOI:

https://doi.org/10.1609/aaai.v40i48.42145

Abstract

Autonomous driving has shown significant progress in recent years. The combination of advanced sensors, ample data, and machine learning algorithms has led to the deployment of autonomous vehicles (AVs) in cities like Los Angeles, San Francisco, and Phoenix. While not all humans can drive perfectly, AVs should be able to plan, adapt, and react to environmental disturbances, including irrational human drivers. My research focuses on applying reinforcement learning (RL) techniques to validate AV-related cyber-physical systems (CPS) in realistic environments. I develop a custom RL environment that simulates highway driving scenarios with multiple vehicles. This environment includes a CPS model of adaptive cruise control (ACC), a lane-changing model (MOBIL), and an adversarial agent that learns to drive irrationally. My work extends interpretable RL techniques to continuous control tasks like autonomous driving.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

Published

2026-03-14

How to Cite

Chang, O. (2026). Interpretable Adversarial Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41038–41039. https://doi.org/10.1609/aaai.v40i48.42145

Download Citation

Issue

Vol. 40 No. 48: EAAI-26 AI for Education, Model AI Assignments, AAAI-26 Emerging Trends, Doctoral Consortium, Student Abstracts, Undergraduate Consortium and Demonstrations

Section

AAAI Doctoral Consortium Track

Interpretable Adversarial Reinforcement Learning

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information