ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

Authors

  • Kaijun Wang Zhejiang University
  • Liqin Lu Zhejiang University of Technology
  • Mingyu Liu Zhejiang University
  • Jianuo Jiang The Chinese University of Hong Kong, Shenzhen
  • Zeju Li Zhejiang University
  • Bolin Zhang Zhejiang University
  • Wancai Zheng Zhejiang University of Technology
  • Xinyi Yu Zhejiang University of Technology
  • Hao Chen Zhejiang University
  • Chunhua Shen Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v40i22.38927

Abstract

Language-guided long-horizon mobile manipulation has long been a grand challenge in embodied semantic reasoning, generalizable manipulation, and adaptive locomotion. Three fundamental limitations hinder progress: First, although large language models have shown promise in enhancing spatial reasoning and task planning through learned semantic priors, existing implementations remain confined to tabletop scenarios, failing to address the constrained perception and limited actuation ranges characteristic of mobile platforms. Second, current manipulation strategies exhibit insufficient generalization when confronted with the diverse object configurations encountered in open-world environments. Third, while crucial for practical deployment, the dual requirement of maintaining high platform maneuverability alongside precise end-effector control in unstructured settings remains understudied in the literature. In this work, we present ODYSSEY, a unified mobile manipulation framework for agile quadruped robots equipped with manipulators, which seamlessly integrates high-level task planning with low-level whole-body control. To address the challenge of egocentric perception in language-conditioned tasks, we introduce a hierarchical planner powered by a vision-language model, enabling long-horizon instruction decomposition and precise action execution. At the control level, our novel whole-body policy achieves robust coordination of locomotion and manipulation across challenging terrains. We further present the first comprehensive benchmark for long-horizon mobile manipulation, evaluating diverse indoor and outdoor scenarios. Through successful sim-to-real transfer, we demonstrate the system’s generalization and robustness in real-world deployments, underscoring the practicality of legged manipulators in unstructured environments. Our work advances the feasibility of generalized robotic assistants capable of complex, dynamic tasks.

Published

2026-03-14

How to Cite

Wang, K., Lu, L., Liu, M., Jiang, J., Li, Z., Zhang, B., Zheng, W., Yu, X., Chen, H., & Shen, C. (2026). ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18602-18610. https://doi.org/10.1609/aaai.v40i22.38927

Issue

Section

AAAI Technical Track on Intelligent Robotics