PadChest-GR dataset
A dataset called PadChest-GR (Grounded-Reporting), derived from PadChest, which can be used to train radiological reporting models for chest x-rays. We curated a public bilingual dataset of 4,555 CXR studies with evidence-based reports (3,099 abnormal and 1,456 normal), each containing complete lists of sentences describing individual positive and negative findings in English and Spanish. In total, PadChest-GR contains 7,037 sentences of positive findings and 3,422 sentences of negative findings. Each positive finding sentence is associated with up to two independent sets of bounding boxes and has categorical labels for finding type, location, and progression.