MVCINN: Multi-View Diabetic Retinopathy Detection Using a Deep Cross-Interaction Neural Network

Authors

  • Xiaoling Luo Harbin Institute of Technology, Shenzhen, China
  • Chengliang Liu Harbin Institute of Technology, Shenzhen, China
  • Waikeung Wong The Hong Kong Polytechnic University, Kowloon, Hong Kong Laboratory for Artificial Intelligence in Design, Hong Kong
  • Jie Wen Harbin Institute of Technology, Shenzhen, China
  • Xiaopeng Jin Shenzhen Technology University, Shenzhen, China
  • Yong Xu Harbin Institute of Technology, Shenzhen, China

DOI:

https://doi.org/10.1609/aaai.v37i7.26080

Keywords:

ML: Multi-Instance/Multi-View Learning, ML: Classification and Regression

Abstract

Diabetic retinopathy (DR) is the main cause of irreversible blindness for working-age adults. The previous models for DR detection have difficulties in clinical application. The main reason is that most of the previous methods only use single-view data, and the single field of view (FOV) only accounts for about 13% of the FOV of the retina, resulting in the loss of most lesion features. To alleviate this problem, we propose a multi-view model for DR detection, which takes full advantage of multi-view images covering almost all of the retinal field. To be specific, we design a Cross-Interaction Self-Attention based Module (CISAM) that interfuses local features extracted from convolutional blocks with long-range global features learned from transformer blocks. Furthermore, considering the pathological association in different views, we use the feature jigsaw to assemble and learn the features of multiple views. Extensive experiments on the latest public multi-view MFIDDR dataset with 34,452 images demonstrate the superiority of our method, which performs favorably against state-of-the-art models. To the best of our knowledge, this work is the first study on the public large-scale multi-view fundus images dataset for DR detection.

Downloads

Published

2023-06-26

How to Cite

Luo, X., Liu, C., Wong, W., Wen, J., Jin, X., & Xu, Y. (2023). MVCINN: Multi-View Diabetic Retinopathy Detection Using a Deep Cross-Interaction Neural Network. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 8993-9001. https://doi.org/10.1609/aaai.v37i7.26080

Issue

Section

AAAI Technical Track on Machine Learning II