This paper proposes a method to generate scene graphs (SGs) of ultrasound images using a transformer-based one-step method to solve the difficulty of interpreting medical ultrasound images. The SGs are generated to describe the content of ultrasound images and guide ultrasound scans without explicit object detection, and a large-scale language model (LLM) is used to refine abstract SG expressions according to user queries to generate explanations that even general users can understand. In addition, the predicted SGs are utilized to find missing anatomical structures in the current image and guide scans, thereby supporting general users to perform more standardized and complete anatomical exploration. The effectiveness of the proposed method was verified through five volunteers targeting images of the left and right neck regions including the carotid artery and thyroid gland, and it shows the possibility of contributing to the popularization of ultrasound by improving the interpretation and usability of ultrasound for general users.