Fu, Teng, et al. “OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 5, Mar. 2026, pp. 4031-9, doi:10.1609/aaai.v40i5.37406.