Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness

Ezgi Korkmaz

doi:10.1609/aaai.v37i7.26009

Authors

Ezgi Korkmaz ---

DOI:

https://doi.org/10.1609/aaai.v37i7.26009

Keywords:

ML: Reinforcement Learning Algorithms, ML: Adversarial Learning & Robustness, ML: Deep Neural Network Algorithms, PEAI: Safety, Robustness & Trustworthiness

Abstract

Learning from raw high dimensional data via interaction with a given environment has been effectively achieved through the utilization of deep neural networks. Yet the observed degradation in policy performance caused by imperceptible worst-case policy dependent translations along high sensitivity directions (i.e. adversarial perturbations) raises concerns on the robustness of deep reinforcement learning policies. In our paper, we show that these high sensitivity directions do not lie only along particular worst-case directions, but rather are more abundant in the deep neural policy landscape and can be found via more natural means in a black-box setting. Furthermore, we show that vanilla training techniques intriguingly result in learning more robust policies compared to the policies learnt via the state-of-the-art adversarial training techniques. We believe our work lays out intriguing properties of the deep reinforcement learning policy manifold and our results can help to build robust and generalizable deep reinforcement learning policies.

Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription