Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
DOI:
https://doi.org/10.1609/aaai.v40i26.39281Abstract
Generalization to unseen environments is a significant challenge in the field of robotics and control. In this work, we focus on contextual reinforcement learning, where agents act within environments with varying contexts, such as self-driving cars or quadrupedal robots that need to operate in different terrains or weather conditions than they were trained for. We tackle the critical task of generalizing to out-of-distribution (OOD) settings, without access to explicit context information at test time. Recent work has addressed this problem by training a context encoder and a history adaptation module in separate stages. While promising, this two-phase approach is cumbersome to implement and train. We simplify the methodology and introduce SPARC: single-phase adaptation for robust control. We test SPARC on varying contexts within the high-fidelity racing simulator Gran Turismo 7 and wind-perturbed MuJoCo environments, and find that it achieves reliable and robust OOD generalization.Downloads
Published
2026-03-14
How to Cite
Grooten, B., MacAlpine, P., Subramanian, K., Stone, P., & Wurman, P. R. (2026). Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy. Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), 21352–21360. https://doi.org/10.1609/aaai.v40i26.39281
Issue
Section
AAAI Technical Track on Machine Learning III