[1]
A. Sinha, D. Reilly, F. Bremond, P. Wang, and S. Das, “SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living”, AAAI, vol. 39, no. 7, pp. 6931–6939, Apr. 2025.