Third-Person Imitation Learning via Image Difference and Variational Discriminator Bottleneck (Student Abstract)

Chong Jiang; Zongzhang Zhang; Zixuan Chen; Jiacheng Zhu; Junpeng Jiang

doi:10.1609/aaai.v34i10.7181

Third-Person Imitation Learning via Image Difference and Variational Discriminator Bottleneck (Student Abstract)

Authors

Chong Jiang Soochow University
Zongzhang Zhang Nanjing University
Zixuan Chen Soochow University
Jiacheng Zhu Soochow University
Junpeng Jiang Soochow University

DOI:

https://doi.org/10.1609/aaai.v34i10.7181

Abstract

Third-person imitation learning (TPIL) is a variant of generative adversarial imitation learning and can learn an expert-like policy from third-person expert demonstrations. Third-person expert demonstrations usually exist in the form of videos recorded in a third-person perspective, and there is a lack of direct correspondence with samples generated by agent. To alleviate this problem, we improve TPIL by applying image difference and variational discriminator bottleneck. Empirically, our new method has better performance than TPIL on two MuJoCo tasks, Reacher and Inverted Pendulum.

Downloads

Published

2020-04-03

How to Cite

Jiang, C., Zhang, Z., Chen, Z., Zhu, J., & Jiang, J. (2020). Third-Person Imitation Learning via Image Difference and Variational Discriminator Bottleneck (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 34(10), 13819-13820. https://doi.org/10.1609/aaai.v34i10.7181

Download Citation

Issue

Vol. 34 No. 10: Issue 10: AAAI-20 Student Tracks

Section

Student Abstract Track

Third-Person Imitation Learning via Image Difference and Variational Discriminator Bottleneck (Student Abstract)

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription