[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

GeMix: Conditional GAN-Based Mixup for Improved Medical Image Augmentation

Created by
  • Haebom

Author

Hugo Carlesso, Maria Eliza Patulea, Moncef Garouani, Radu Tudor Ionescu, Josiane Mothe

Outline

GeMix is a new data augmentation technique proposed to overcome the limitations of existing Mixups in fields where realistic image generation is important, such as medical image classification. Instead of the simple pixel-by-pixel interpolation of existing Mixups, it performs learning-based interpolation that considers label information using a class-conditional GAN (StyleGAN2-ADA). Two label vectors are sampled from the Dirichlet distribution, mixed using the Beta distribution coefficient, and then given as a condition to StyleGAN2-ADA to generate visually consistent images. Experimental results using the COVIDx-CT-3 dataset show that it achieves a higher macro-F1 score than existing Mixups, and in particular, it reduces the false negative rate in COVID-19 detection. It can be easily applied to existing learning pipelines, and the code is also open to the public to increase reproducibility.

Takeaways, Limitations

Takeaways:
Solving the unrealistic image generation problem of existing Mixups
More meaningful data augmentation possible using label information
Potential for improved performance in high-risk areas such as medical image classification (reduced false negative rates in COVID-19 detection)
Excellent compatibility with existing learning pipelines
Ease of reproducibility and further research through code disclosure
Limitations:
Because StyleGAN2-ADA is used, significant computing resources may be required for training.
The performance of GeMix may depend on the performance of StyleGAN2-ADA.
Only experimental results for a specific dataset (COVIDx-CT-3) are presented, so generalization performance to other datasets requires further study.
👍