Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

PadChest-GR: A Bilingual Chest X-ray Dataset for Grounded Radiology Report Generation

Created by
  • Haebom

Author

Daniel C. Castro, Aurelia Bustos, Shruthi Bannur, Stephanie L. Hyland, Kenza Bouzid, Maria Teodora Wetscherek, Maria Dolores S anchez-Valverde, Lara Jaques-P erez, Lourdes P erez-Rodr iguez, Kenji Takeda, Jos e Mar ia Salinas, Javier Alvarez-Valle, Joaqu in Galant Herrero, Antonio Pertusa

Outline

PadChest-GR is the first manually annotated dataset designed to train evidence-based radiology report generation (GRRG) models for thoracic X line images. It contains 4,555 thoracic X line studies (3,099 abnormal and 1,456 normal) and provides lists of sentences describing individual positive and negative findings in both English and Spanish. A total of 7,037 positive finding sentences and 3,422 negative finding sentences are included, and each positive finding sentence is associated with up to two independent sets of bounding boxes labeled by different readers, along with categorical labels for finding type, location, and progression. This dataset provides a valuable resource for developing and evaluating GRRG models that understand and interpret radiology images and generated text.

Takeaways, Limitations

Takeaways:
Providing the first manually annotated dataset for training evidence-based radiology report generation (GRRG) models for thoracic X-ray images.
Bilingual support in English and Spanish.
Provides comprehensive annotation and precise location information for positive and negative findings.
Provides useful resources for developing and evaluating GRRG models.
Limitations:
The dataset may be relatively small (4,555 studies).
The dataset access is request-based ( https://bimcv.cipf.es/bimcv-projects/padchest-gr/ ).
Generalizability to other medical imaging modalities may be limited.
👍