Yin, P., Zeng, G., Wang, J., & Xie, D. (2024). CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 6729-6737. https://doi.org/10.1609/aaai.v38i7.28496