Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance

Created by
  • Haebom

Author

Xuesong Li, Dianye Huang, Yameng Zhang, Nassir Navab, Zhongliang Jiang

Outline

This paper proposes a method to generate scene graphs (SGs) of ultrasound images using a transformer-based one-step method to solve the difficulty of interpreting medical ultrasound images. The SGs are generated to describe the content of ultrasound images and guide ultrasound scans without explicit object detection, and a large-scale language model (LLM) is used to refine abstract SG expressions according to user queries to generate explanations that even general users can understand. In addition, the predicted SGs are utilized to find missing anatomical structures in the current image and guide scans, thereby supporting general users to perform more standardized and complete anatomical exploration. The effectiveness of the proposed method was verified through five volunteers targeting images of the left and right neck regions including the carotid artery and thyroid gland, and it shows the possibility of contributing to the popularization of ultrasound by improving the interpretation and usability of ultrasound for general users.

Takeaways, Limitations

Takeaways:
Suggesting the possibility of popularizing ultrasound technology by improving the interpretation of ultrasound images and increasing the convenience of use
Efficient scene graph generation and object detection process omitted through a transformer-based one-step method
Generate user-friendly video descriptions using LLM
SG-based scan guidance allows for more standardized and complete anatomical exploration
Expanding the potential of ultrasound in point-of-care settings by making it easier for non-expert users to use it
Limitations:
Further research on generalizability is needed with validation using a limited dataset (5 volunteers).
Applicability verification for various ultrasound devices and imaging types is required.
Considering the dependency and error potential of LLM performance
Further studies are needed to determine its efficacy and safety in real-world clinical settings.
👍