[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

DualSwinUnet++: An Enhanced Swin-Unet Architecture With Dual Decoders For PTMC Segmentation

Created by
  • Haebom

Author

Maryam Dialameh, Hossein Rajabzadeh, Moslem Sadeghi-Goughari, Jung Suk Sim, Hyock Ju Kwon

Outline

Accurate segmentation of papillary thyroid microcarcinomas (PTMCs) during ultrasound-guided radiofrequency ablation (RFA) is critical for effective treatment, but is challenging due to acoustic artifacts, small lesion size, and anatomical variations. In this study, we propose DualSwinUnet++, a dual-decoder transformer-based architecture designed to improve PTMC segmentation by incorporating thyroid gland context. DualSwinUnet++ uses independent linear projection heads for each decoder and a residual information flow mechanism that transfers intermediate features from the first (thyroid) decoder to the second (PTMC) decoder via concatenation and transformation. These design choices allow the model to explicitly condition tumor prediction on thyroid morphology without shared gradient interference. Trained on a clinical ultrasound dataset with 691 annotated RFA images and evaluated against state-of-the-art models, DualSwinUnet++ achieves excellent Dice and Jaccard scores while maintaining an inference latency of less than 200 ms. The results demonstrate that the model is suitable for near-real-time surgical assistance and is effective in improving segmentation accuracy in challenging PTMC cases.

Takeaways, Limitations

Takeaways:
DualSwinUnet++ presents an effective method to improve accurate segmentation during ultrasound-guided radiofrequency ablation (RFA) of papillary thyroid microcarcinoma (PTMC).
It demonstrates applicability to real-time surgical assistance with a fast inference speed of less than 200 ms.
Achieving high accuracy even in difficult PTMC cases by leveraging thyroid gland context.
It has excellent Dice and Jaccard scores compared to state-of-the-art models.
Limitations:
There is a lack of explicit mention of the size and diversity of the dataset used in the study. Generalization performance validation on larger and more diverse datasets is needed.
Additional validation for clinical application and performance evaluation in real surgical settings are needed.
Further research is needed to determine the interpretability and reliability of the model.
👍