Tian, B., Si, Y., Wang, J., Li, L., Bao, Z., Zhou, Z., … Qiu, M. (2026). CrossCheck-Bench: Diagnosing Compositional Failures in Multimodal Conflict Resolution. Proceedings of the AAAI Conference on Artificial Intelligence, 40(31), 25887–25895. https://doi.org/10.1609/aaai.v40i31.39788