Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Diffusion Models Through a Global Lens: Are They Culturally Inclusive?

Created by
  • Haebom

Author

Zahra Bayramli, Ayhan Suleymanzade, Na Min An, Huzama Ahmad, Eunsu Kim, Junyeong Park, James Thorne, Alice Oh

Outline

This paper questions the ability of state-of-the-art text-to-image diffusion models to accurately represent diverse cultural nuances, and introduces the CultDiff benchmark to evaluate the ability of image generation to include cultural features from 10 countries. We show that the generation of cultural elements such as architecture, clothing, and food, especially from marginalized regions, exhibits deficiencies compared to real images. Through a detailed analysis of various similarity aspects, we find significant differences in cultural relevance, descriptive fidelity, and realism, and develop CultDiff-S, a neural network-based image-to-image similarity metric that predicts human judgments for real and generated images with cultural features based on collected human ratings. In conclusion, we emphasize the need for comprehensive generative AI systems across a wide range of cultures and fair dataset representation.

Takeaways, Limitations

Takeaways:
Presentation of a benchmark (CultDiff) to objectively evaluate the cultural bias of existing text-image diffusion models
The model clearly presents the Limitations through quantitative and qualitative analysis of the generative capacity of cultural elements.
Development of a new image similarity index (CultDiff-S) based on human evaluation
Emphasize the importance of inclusiveness and fair dataset representation in generative AI
Limitations:
The number of countries included in the benchmark (10 countries) may be limited.
Further research is needed on the generalization performance of CultDiff-S
Lack of specific solutions on how to completely eliminate bias against specific cultures
👍