Fu, T., Zhao, M., Niu, K., Peng, K., & Li, B. (2026). OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(5), 4031–4039. https://doi.org/10.1609/aaai.v40i5.37406