Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

A Systematic Study of Deep Learning Models and xAI Methods for Region-of-Interest Detection in MRI Scans

Created by
  • Haebom

Author

Justin Yiu, Kushank Arora, Daniel Steinberg, Rohit Ghiya

Outline

This study presents an automatic region-of-interest (ROI) detection system utilizing deep learning and explainable AI (xAI) techniques to improve the efficiency and objectivity of MRI interpretation, which is essential for evaluating knee injuries. Various deep learning architectures, including ResNet50, InceptionV3, Vision Transformers (ViT), and various U-Net variants, were evaluated using supervised and self-supervised learning methods. xAI techniques, such as Grad-CAM and Saliency Maps, were integrated to enhance interpretability. Performance was evaluated using area under the curve (AUC) (classification), PSNR/SSIM (reconstruction quality), and qualitative ROI visualization. ResNet50 demonstrated superior classification and ROI identification performance over Transformer-based models on the MRNet dataset. A combined U-Net + MLP model showed potential for improved reconstruction and interpretability but lower classification performance, while Grad-CAM provided the most clinically meaningful explanations among all architectures. In conclusion, CNN-based transfer learning was most effective on this dataset, and future performance enhancements to Transformer models through large-scale pretraining are expected.

Takeaways, Limitations

Takeaways:
We demonstrate that ResNet50-based CNN transfer learning is effective for knee MRI ROI detection.
Visualize the model's decision-making process and enhance clinical interpretability by utilizing xAI techniques (Grad-CAM).
Presenting the future development potential of the U-Net + MLP structure (improving spatial feature utilization and interpretability).
Limitations:
The relative small size of the MRNet dataset limits the evaluation of the potential of Transformer models.
Further research using large-scale pre-training datasets is needed.
Need to improve classification performance of U-Net + MLP structure.
👍