Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification

Created by
  • Haebom

Author

Guohao Huo, Zibo Lin, Zitong Wang, Ruiting Dai, Hao Tang

Outline

This paper proposes DMS-Net, a novel deep learning model based on binocular fundus images for retinal disease diagnosis. DMS-Net is based on the Siamese ResNet-152 architecture, which simultaneously processes fundus images from both eyes and considers pathological correlations. The model introduces the OmniPool Spatial Integrator Module (OSIM), which utilizes multi-scale adaptive pooling and spatial attention mechanisms to address unclear lesion boundaries and diffuse pathology distribution. Furthermore, the Calibrated Analogous Semantic Fusion Module (CASFM) is used to enhance the interaction between binocular images and aggregate modality-independent representations. Furthermore, the Cross-Modal Contrastive Alignment Module (CCAM) and the Cross-Modal Integrative Alignment Module (CIAM) enhance the aggregation of discriminatory and lesion-correlated semantic information between the left and right fundus images. When evaluated on the ODIR-5K dataset, DMS-Net achieved state-of-the-art performance with 82.9% accuracy, 84.5% recall, and 83.2% Cohen's kappa coefficient.

Takeaways, Limitations

Takeaways:
We demonstrate that binocular fundus imaging can improve the accuracy of diagnosing retinal diseases.
The proposed DMS-Net effectively addresses the problems of unclear lesions and diffuse pathology through various modules.
It can contribute to clinical decision support by achieving cutting-edge performance.
Code and preprocessed datasets will be released in the future.
Limitations:
Only performance evaluations on the ODIR-5K dataset are presented, so generalization performance on other datasets is uncertain.
A more detailed comparative analysis with other deep learning models is needed.
Further research is needed to determine its applicability and utility in real-world clinical settings.
👍