Hou, G., Fu, Y., Wu, C., Huang, X., Zheng, Z., Zhang, W., Shen, Y., & Lu, W. (2026). Reality vs Counterfactual: Multi-World Contrastive Reinforcement Learning for Enhancing MLLM’s Theory of Mind in Egocentric Videos. Proceedings of the AAAI Conference on Artificial Intelligence, 40(3), 1828-1836. https://doi.org/10.1609/aaai.v40i3.37162