Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological Measurement

Authors

  • Jae-Ho Choi, Stanford University
  • Ki-Bong Kang, Samsung Electronics
  • Kyung-Tae Kim, Pohang University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v38i2.27898

Keywords:

CV: Multi-modal Vision, APP: Internet of Things, Sensor Networks & Smart Cities, CV: Biometrics, Face, Gesture & Pose, CV: Medical and Biological Imaging, HAI: Applications, HAI: Human-Computer Interaction, ROB: Multimodal Perception & Sensor Fusion

Abstract

Remote physiology, which involves monitoring vital signs without physical contact, has great potential for various applications. Current remote physiology methods rely on only a single camera or radio frequency (RF) sensor to capture the microscopic signatures of vital movements. However, our study shows that fusing deep RGB and RF features from both sensor streams can further improve performance. Because these multimodal features are defined in distinct dimensions and have varying contextual importance, the main challenge in the fusion process lies in aligning the features effectively and integrating them adaptively under dynamic scenarios. To address this challenge, we propose a novel vital sensing model, named Fusion-Vital, that combines the RGB and RF modalities by introducing pairwise input formats and transformer-based fusion strategies. We also perform comprehensive experiments on a newly collected and released remote vital dataset recorded with synchronized video and RF sensors, showing the superiority of the fusion approach over previous single-sensor baselines in various aspects.
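The two difficulties named in the abstract, features living in different dimensions and varying in contextual importance, can be illustrated with a generic cross-attention sketch. This is not the Fusion-Vital architecture, which the abstract does not detail; it is a minimal illustration under stated assumptions. The token counts, feature sizes, random projection matrices, and the function name `cross_attention_fuse` are all hypothetical. Each modality is first projected into a shared dimension (alignment), and softmax attention weights then decide how much each RF token contributes to each RGB token (adaptive integration).

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def random_matrix(rows, cols, rng):
    """Random stand-in for learned weights or extracted features (assumption)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

def cross_attention_fuse(rgb_tokens, rf_tokens, w_q, w_k, w_v):
    """RGB tokens query RF tokens; returns fused tokens and attention weights."""
    q = matmul(rgb_tokens, w_q)  # (n_rgb, d): RGB features in the shared space
    k = matmul(rf_tokens, w_k)   # (n_rf, d): RF features in the shared space
    v = matmul(rf_tokens, w_v)   # (n_rf, d)
    d = len(q[0])
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)      # (n_rgb, n_rf) similarity scores
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    attended = matmul(weights, v)
    # Residual sum: each RGB token plus its weighted mix of RF tokens.
    fused = [[qi + ai for qi, ai in zip(qr, ar)] for qr, ar in zip(q, attended)]
    return fused, weights

rng = random.Random(0)
rgb_tokens = random_matrix(5, 8, rng)  # hypothetical: 5 RGB tokens of size 8
rf_tokens = random_matrix(3, 6, rng)   # hypothetical: 3 RF tokens of size 6
w_q = random_matrix(8, 4, rng)         # project RGB features to shared size 4
w_k = random_matrix(6, 4, rng)         # project RF features to shared size 4
w_v = random_matrix(6, 4, rng)
fused, weights = cross_attention_fuse(rgb_tokens, rf_tokens, w_q, w_k, w_v)
```

In this sketch each row of `weights` sums to 1, so the RF contribution to an RGB token is a convex combination whose mix changes with the input. A trained model would learn the projections rather than draw them at random.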

Published

2024-03-24

How to Cite

Choi, J.-H., Kang, K.-B., & Kim, K.-T. (2024). Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological Measurement. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 1344-1352. https://doi.org/10.1609/aaai.v38i2.27898

Section

AAAI Technical Track on Computer Vision I