Fu, Teng, Mengyang Zhao, Ke Niu, Kaixin Peng, and Bin Li. 2026. “OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (5): 4031–39. https://doi.org/10.1609/aaai.v40i5.37406.