Xiong, G., & Tambe, M. (2026). VORTEX: Aligning Task Utility and Human Preferences Through LLM-Guided Reward Shaping. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27162–27170. https://doi.org/10.1609/aaai.v40i32.39931