MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection

Authors

  • Chuyi Zhong Academy for Engineering and Technology, Fudan University Cognition and Intelligent Technology Laboratory (CIT Lab) Institute of Metaverse & Intelligent Medicine, Fudan University
  • Dingkang Yang Academy for Engineering and Technology, Fudan University Cognition and Intelligent Technology Laboratory (CIT Lab) Institute of Metaverse & Intelligent Medicine, Fudan University
  • Peng Zhai Academy for Engineering and Technology, Fudan University Cognition and Intelligent Technology Laboratory (CIT Lab) Institute of Metaverse & Intelligent Medicine, Fudan University
  • Lihua Zhang Academy for Engineering and Technology, Fudan University Cognition and Intelligent Technology Laboratory (CIT Lab) Institute of Metaverse & Intelligent Medicine, Fudan University Jilin Provincial Key Laboratory of Intelligence Science and Engineering, Changchun, China Engineering Research Center of AI and Robotics, Ministry of Education, Shanghai, China

DOI:

https://doi.org/10.1609/aaai.v39i10.33157

Abstract

As the global population ages and the incidence of chronic diseases increases, the demand for early detection of abnormal medical conditions is increasing. Traditional health monitoring methods often require significant resources and specialized personnel, limiting their widespread use. Leveraging advancements in AI technologies, this study proposes a non-invasive method for detecting abnormal medical conditions from image data. A multimodal perception framework is introduced, integrating features from various modalities, including facial expressions and body postures, to enhance detection accuracy. The framework employs a Cascaded Squeeze-Excitation (CSE) module, consisting of Adaptive and Multi-modal Squeeze-Excitation components, to capture complex feature dependencies and improve cross-modal performance. Extensive experiments demonstrate the effectiveness of this approach, showing improved performance over existing methods. In addition, a new dataset that encompasses a wide range of medical conditions has been released, providing a valuable resource for future research in this domain.

Downloads

Published

2025-04-11

How to Cite

Zhong, C., Yang, D., Zhai, P., & Zhang, L. (2025). MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 39(10), 10653-10661. https://doi.org/10.1609/aaai.v39i10.33157

Issue

Section

AAAI Technical Track on Computer Vision IX