Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

DINOv3 with Test-Time Training for Medical Image Registration

Created by
  • Haebom

Author

Shansong Wang, Mojtaba Safari, Mingzhe Hu, Qiang Li, Chih-Wei Chang, Richard LJ Qiu, Xiaofeng Yang

Outline

This paper proposes a novel training-free pipeline to address the need for large training data, a key issue with existing learning-based methods for medical image registration. It is based on a fixed DINOv3 encoder and optimization of deformation field test time in feature space. It generates accurate and regular deformations on two representative benchmarks (Abdomen MR-CT and ACDC cardiac MRI), outperforming existing methods. In particular, on Abdomen MR-CT, our pipeline achieved the best performance, achieving an average Dice similarity (DSC) of 0.790, a 95th percentile Hausdorff distance (HD95) of 4.9±5.0, and a log-Jacobian standard deviation (SDLogJ) of 0.08±0.02. On ACDC cardiac MRI, we also improved the average DSC to 0.769 and reduced SDLogJ and HD95 to 0.11 and 4.8, respectively. This suggests that operating in a compact underlying feature space at test time could be a practical and general solution for clinical registration without additional training.

Takeaways, Limitations

Takeaways:
We demonstrate that accurate medical image registration is possible without large amounts of training data.
We present the possibility of building an efficient registration pipeline by optimizing test time in a compact feature space.
Presenting a method that requires no training and is advantageous for clinical application.
Demonstrates superior performance compared to existing methods in two benchmarks.
Limitations:
Further research is needed on the generalization performance of the proposed method.
Applicability verification is needed for various medical imaging modalities and clinical environments.
Limitations exist due to dependency on DINOv3 encoder.
The results are limited to a specific benchmark and require evaluation on a wider range of datasets.
👍