Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using GPT-4o: Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework

Created by
  • Haebom

Author

Nanaka Hosokawa, Ryo Takahashi, Tomoya Kitano, Yukihiro Iida, Chisako Muramatsu, Tatsuro Hayashi, Yuta Seino, Xiangrong Zhou, Takeshi Hara, Akitoshi Katsumata, Hiroshi Fujita

Outline

We conducted a study to automatically generate jaw cyst findings from dental panoramic radiographs, leveraging the multimodal capabilities of OpenAI GPT-4o. To improve accuracy, we developed and validated the Self-Correction Loop with Structured Output (SLSO) framework. The SLSO framework, a 10-step process encompassing image input and analysis, structured data generation, tooth number extraction and consistency checking, iterative regeneration in case of discrepancies, finding generation, and subsequent structuring and consistency verification, was applied to 22 jaw cyst cases. We compared the results with the existing Chain-of-Thought (CoT) method on seven evaluation criteria: transparency, internal structure, boundaries, root resorption, tooth movement, relationships to other structures, and tooth number. The SLSO framework improved the output accuracy across multiple criteria, particularly in tooth number (66.9%), tooth movement (33.3%), and root resorption (28.6%). In successful cases, consistent structured output was achieved after up to five regenerations.

Takeaways, Limitations

The SLSO framework contributed to enforcing negative opinion descriptions, suppressing hallucinations, and improving the accuracy of tooth number identification.
The SLSO framework improves accuracy compared to the existing CoT method.
The dataset size was too small to achieve statistical significance.
Accurate identification of extensive lesions spanning multiple teeth is limited.
Further improvements are needed to improve overall performance and develop a practical opinion generation system.
👍