Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

EarthSynth: Generating Informative Earth Observation with Diffusion Models

Created by
  • Haebom

Author

Jiancheng Pan, Shiye Lei, Yuqian Fu, Jiahao Li, Yanxing Liu, Yuze Sun, Xiao He, Long Peng, Xiaomeng Huang, Bo Zhao

Outline

EarthSynth is a diffusion-based generative foundational model proposed to address the lack of labeled data, a challenge in remote sensing image interpretation. It synthesizes diverse satellite data to generate labeled Earth observation data for downstream remote sensing image interpretation tasks. Specifically, it is the first to attempt multi-task generation in the remote sensing field, overcoming the generalization limitations of task-oriented synthesis. Trained on the EarthSynth-180K dataset, EarthSynth uses a counterfactual compositional training strategy and a 3D batch sample selection mechanism to enhance training data diversity and strengthen categorical control. Furthermore, it proposes a rule-based method called R-Filter to filter informative synthetic data. We evaluate EarthSynth on scene classification, object detection, and semantic segmentation tasks in open-world scenarios, demonstrating significant performance gains on open vocabulary understanding tasks, providing a practical solution for advancing remote sensing image interpretation.

Takeaways, Limitations

Takeaways:
Contributing to solving the problem of lack of labeling data for remote sensing image interpretation.
Improving remote sensing image interpretation performance through multi-task creation.
Improve your ability to understand open vocabulary in open world scenarios.
Applicability to various remote sensing tasks (scene classification, object detection, semantic segmentation) is presented.
Limitations:
Dependency on the EarthSynth-180K dataset. The quality and size of the dataset can impact performance.
R-Filter's rule-based approach has limitations in generalizing. Further verification is needed to validate its applicability to a wider range of situations.
The results presented may be limited to a specific dataset. Additional experiments on a variety of datasets are needed.
Potential performance degradation due to differences from real-world data.
👍