Keep On Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training

Authors

  • Yang Zhang MoE Key Lab of Artificial Intelligence and AI Institute, Shanghai Jiao Tong University
  • Zhanxiang Cao MoE Key Lab of Artificial Intelligence and AI Institute, Shanghai Jiao Tong University Shanghai Innovation Institute
  • Buqing Nie MoE Key Lab of Artificial Intelligence and AI Institute, Shanghai Jiao Tong University
  • Haoyang Li MoE Key Lab of Artificial Intelligence and AI Institute, Shanghai Jiao Tong University Shanghai Innovation Institute
  • Zhong Jiangwei Lenovo Research
  • Qiao Sun Lenovo Research
  • Xiaoyi Hu Lenovo Research
  • Xiaokang Yang MoE Key Lab of Artificial Intelligence and AI Institute, Shanghai Jiao Tong University
  • Yue Gao MoE Key Lab of Artificial Intelligence and AI Institute, Shanghai Jiao Tong University Shanghai Innovation Institute

DOI:

https://doi.org/10.1609/aaai.v40i22.38949

Abstract

Humanoid robots are expected to operate reliably over long horizons while executing versatile whole-body skills. Yet Reinforcement Learning (RL) motion policies typically lose stability under prolonged operation, sensor/actuator noise, and real world disturbances. In this work, we propose a Selective Adversarial Attack for Robust Training (SA2RT) to enhance the robustness of motion skills. The adversary is learned to identify and sparsely perturb the most vulnerable states and actions under an attack-budget constraint, thereby exposing true weakness without inducing conservative overfitting. The resulting non-zero sum, alternating optimization continually strengthens the motion policy against the strongest discovered attacks. We validate our approach on the Unitree G1 humanoid robot across perceptive locomotion and whole-body control tasks. Experimental results show that adversarially trained policies improve the terrain traversal success rate by 40%, reduce the trajectory tracking error by 32%, and maintain long horizon mobility and tracking performance. Together, these results demonstrate that selective adversarial attacks are an effective driver for learning robust, long horizon humanoid motion skills.

Published

2026-03-14

How to Cite

Zhang, Y., Cao, Z., Nie, B., Li, H., Jiangwei, Z., Sun, Q., … Gao, Y. (2026). Keep On Going: Learning Robust Humanoid Motion Skills via Selective Adversarial Training. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18800–18808. https://doi.org/10.1609/aaai.v40i22.38949

Issue

Section

AAAI Technical Track on Intelligent Robotics