Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

Created by
  • Haebom

Author

Yijun Liang, Shweta Bhardwaj, Tianyi Zhou

Outline

This paper proposes "Diffusion Curriculum (DisCL)," a novel data augmentation method utilizing a text-image-guided diffusion model to address the challenges of training deep neural networks due to low-quality or insufficient data. To overcome the limitation of controlling the proximity between synthetic and original images using text-based induction alone, DisCL generates a variety of intermediate images between synthetic and real images through image induction. DisCL adjusts the level of image induction at each training step to focus on challenging examples and evaluates the effective level of synthetic image induction to improve learning from challenging data. This achieves improved performance on long-tail classification and low-quality data learning tasks. On the iWildCam dataset, it improves OOD and ID macro accuracy by 2.7% and 2.1%, respectively. On the ImageNet-LT dataset, it improves tail class accuracy from 4.4% to 23.64% and overall class accuracy by 4.02%.

Takeaways, Limitations

Takeaways:
Presenting effective data augmentation techniques to address low-quality or insufficient data issues.
Control the quality and diversity of synthetic data by combining text and image derivation.
Demonstrating the effectiveness of the Diffusion Curriculum (DisCL), which adjusts the level of imagery induction at each training stage.
Excellent performance improvement in long-term distribution classification and learning from low-quality data.
Limitations:
Lack of analysis of the computational cost and complexity of the proposed method.
The need to evaluate generalization performance across diverse datasets and tasks.
Further research is needed to optimize strategies for image-guided level control.
👍