An, Seungjun, Seonghoon Park, Gyeongnyeon Kim, Jeongyeol Baek, Byeongwon Lee, and Seungryong Kim. “Context Enhanced Transformer for Single Image Object Detection in Video Data.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 2 (March 24, 2024): 682–690. Accessed May 18, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/27825.