A Knowledge Graph Framework for Interpretable Video-Based Activity Recognition
DOI:
https://doi.org/10.1609/aaaiss.v6i1.36041Abstract
We propose two approaches for human activity recognition in videos that leverage knowledge graph representations. The first method constructs a Positional Encoding Knowledge Graph (PE-KG) by extracting objects and their spatial relationships from video keyframes, which are then analyzed using association rule mining. The second approach, termed Video KG, augments this representation by incorporating semantic cues from image captioning and affective insights from emotion detection with demographic analysis. The approach employs knowledge graph embeddings to capture spatiotemporal and contextual dependencies, leading to improved classification accuracy and enhanced interpretability on benchmarks such as the Kinetics dataset.Downloads
Published
2025-08-01
How to Cite
Mihailescu, R.-C. (2025). A Knowledge Graph Framework for Interpretable Video-Based Activity Recognition. Proceedings of the AAAI Symposium Series, 6(1), 111–118. https://doi.org/10.1609/aaaiss.v6i1.36041
Issue
Section
Context-Awareness in Cyber-Physical Systems