Object-Oriented Dynamics Learning through Multi-Level Abstraction

Guangxiang Zhu; Jianhao Wang; Zhizhou Ren; Zichuan Lin; Chongjie Zhang

doi:10.1609/aaai.v34i04.6183

Authors

Guangxiang Zhu Tsinghua University
Jianhao Wang Tsinghua University
Zhizhou Ren Tsinghua University
Zichuan Lin Tsinghua University
Chongjie Zhang Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v34i04.6183

Abstract

Object-based approaches for learning action-conditioned dynamics have demonstrated promise for generalization and interpretability. However, existing approaches suffer from structural limitations and optimization difficulties for common environments with multiple dynamic objects. In this paper, we present a novel self-supervised learning framework, called Multi-level Abstraction Object-oriented Predictor (MAOP), which employs a three-level learning architecture that enables efficient object-based dynamics learning from raw visual observations. We also design a spatial-temporal relational reasoning mechanism for MAOP to support instance-level dynamics learning and handle partial observability. Our results show that MAOP significantly outperforms previous methods in terms of sample efficiency and generalization over novel environments for learning environment models. We also demonstrate that learned dynamics models enable efficient planning in unseen environments, comparable to true environment models. In addition, MAOP learns semantically and visually interpretable disentangled representations.
