MVCINN: Multi-View Diabetic Retinopathy Detection Using a Deep Cross-Interaction Neural Network
DOI:
https://doi.org/10.1609/aaai.v37i7.26080Keywords:
ML: Multi-Instance/Multi-View Learning, ML: Classification and RegressionAbstract
Diabetic retinopathy (DR) is the main cause of irreversible blindness for working-age adults. The previous models for DR detection have difficulties in clinical application. The main reason is that most of the previous methods only use single-view data, and the single field of view (FOV) only accounts for about 13% of the FOV of the retina, resulting in the loss of most lesion features. To alleviate this problem, we propose a multi-view model for DR detection, which takes full advantage of multi-view images covering almost all of the retinal field. To be specific, we design a Cross-Interaction Self-Attention based Module (CISAM) that interfuses local features extracted from convolutional blocks with long-range global features learned from transformer blocks. Furthermore, considering the pathological association in different views, we use the feature jigsaw to assemble and learn the features of multiple views. Extensive experiments on the latest public multi-view MFIDDR dataset with 34,452 images demonstrate the superiority of our method, which performs favorably against state-of-the-art models. To the best of our knowledge, this work is the first study on the public large-scale multi-view fundus images dataset for DR detection.Downloads
Published
2023-06-26
How to Cite
Luo, X., Liu, C., Wong, W., Wen, J., Jin, X., & Xu, Y. (2023). MVCINN: Multi-View Diabetic Retinopathy Detection Using a Deep Cross-Interaction Neural Network. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 8993-9001. https://doi.org/10.1609/aaai.v37i7.26080
Issue
Section
AAAI Technical Track on Machine Learning II