Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy

Bram Grooten; Patrick MacAlpine; Kaushik Subramanian; Peter Stone; Peter R. Wurman

doi:10.1609/aaai.v40i26.39281

Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy

Authors

Bram Grooten Sony AI TU Eindhoven
Patrick MacAlpine Sony AI
Kaushik Subramanian Sony AI
Peter Stone Sony AI The University of Texas at Austin
Peter R. Wurman Sony AI

DOI:

https://doi.org/10.1609/aaai.v40i26.39281

Abstract

Generalization to unseen environments is a significant challenge in the field of robotics and control. In this work, we focus on contextual reinforcement learning, where agents act within environments with varying contexts, such as self-driving cars or quadrupedal robots that need to operate in different terrains or weather conditions than they were trained for. We tackle the critical task of generalizing to out-of-distribution (OOD) settings, without access to explicit context information at test time. Recent work has addressed this problem by training a context encoder and a history adaptation module in separate stages. While promising, this two-phase approach is cumbersome to implement and train. We simplify the methodology and introduce SPARC: single-phase adaptation for robust control. We test SPARC on varying contexts within the high-fidelity racing simulator Gran Turismo 7 and wind-perturbed MuJoCo environments, and find that it achieves reliable and robust OOD generalization.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

Published

2026-03-14

How to Cite

Grooten, B., MacAlpine, P., Subramanian, K., Stone, P., & Wurman, P. R. (2026). Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy. Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), 21352–21360. https://doi.org/10.1609/aaai.v40i26.39281

Download Citation

Issue

Vol. 40 No. 26: AAAI-26 Technical Tracks 26

Section

AAAI Technical Track on Machine Learning III

Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information