[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection

Created by
  • Haebom

Author

Hongyang Zhao, Tianyu Liang, Sina Davari, Daeho Kim

Outline

This paper presents a novel image synthesis methodology for construction site worker detection. Using a generative AI platform called Midjourney, we generate 12,000 synthetic images with 3,000 different prompts, which are then manually labeled and used as a DNN training dataset. When evaluated on a real construction image dataset, we achieve an average precision (AP) of 0.937 at an IoU threshold of 0.5 and an AP of 0.642 between 0.5 and 0.95. On the synthetic dataset, we achieve high performance with APs of 0.994 and 0.919, respectively. This demonstrates both the potential and limitations of generative AI to address the lack of DNN training data.

Takeaways, Limitations

Takeaways:
We demonstrate that generative AI (Midjourney) can be used to effectively generate datasets required for detecting construction site workers.
Suggesting that high accuracy can be achieved even on real datasets by training DNNs using synthetic data.
We suggest that generative AI-based data augmentation can contribute to improving DNN performance.
Limitations:
Due to the domain gap between synthetic data and real data, performance on real datasets is relatively lower than on synthetic datasets.
Building a dataset can take significant effort and time, as it requires a manual labeling process.
Due to limitations in generative AI such as Midjourney, there may be limitations in the quality and variety of images generated.
👍