A Knowledge Graph Framework for Interpretable Video-Based Activity Recognition

Authors

  • Radu-Casian Mihailescu Heriot-Watt University

DOI:

https://doi.org/10.1609/aaaiss.v6i1.36041

Abstract

We propose two approaches for human activity recognition in videos that leverage knowledge graph representations. The first method constructs a Positional Encoding Knowledge Graph (PE-KG) by extracting objects and their spatial relationships from video keyframes, which are then analyzed using association rule mining. The second approach, termed Video KG, augments this representation by incorporating semantic cues from image captioning and affective insights from emotion detection with demographic analysis. The approach employs knowledge graph embeddings to capture spatiotemporal and contextual dependencies, leading to improved classification accuracy and enhanced interpretability on benchmarks such as the Kinetics dataset.

Downloads

Published

2025-08-01

How to Cite

Mihailescu, R.-C. (2025). A Knowledge Graph Framework for Interpretable Video-Based Activity Recognition. Proceedings of the AAAI Symposium Series, 6(1), 111–118. https://doi.org/10.1609/aaaiss.v6i1.36041

Issue

Section

Context-Awareness in Cyber-Physical Systems