Expert Data Augmentation in Imitation Learning (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v37i13.26970Keywords:
Imitation Learning, Data Augmentation, Behavioral Cloning, Random Network DistillationAbstract
Behavioral Cloning (BC) is a simple and effective imitation learning algorithm, which suffers from compounding error due to covariate shift. One solution is to use enough data for training. However, the amount of expert demonstrations available is usually limited. So we propose an effective method to augment expert demonstrations to alleviate the problem of compounding error in BC. It operates by estimating the similarity of states and filtering out transitions that can go back to the states similar to ones in expert demonstrations during the process of sampling. The data filtered out along with original expert demonstrations are used for training. We evaluate the performance of our method on several Atari tasks and continuous MuJoCo control tasks. Empirically, BC trained with the augmented data significantly outperform BC trained with the original expert demonstrations.Downloads
Published
2023-09-06
How to Cite
Han, F., & Zhang, Z. (2023). Expert Data Augmentation in Imitation Learning (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16220-16221. https://doi.org/10.1609/aaai.v37i13.26970
Issue
Section
AAAI Student Abstract and Poster Program