Daily Arxiv

This page curates AI-related papers published worldwide.
All content here is summarized using Google Gemini, and the page is operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Personalized Safety Alignment for Text-to-Image Diffusion Models

Created by
  • Haebom

Authors

Yu Lei, Jinbin Bai, Qingyu Shi, Aosong Feng, Kaidong Yu

Outline

This paper argues that the safety mechanisms of text-to-image diffusion models are limited because they fail to account for individual user preferences, and it proposes a Personalized Safety Alignment (PSA) framework. PSA integrates user profiles into the diffusion process to adapt the model's behavior to each user's safety criteria while maintaining image quality. It captures user-specific safety preferences using a new dataset, Sage, and integrates the profiles through a cross-attention mechanism. Experimental results show that PSA outperforms existing methods in suppressing harmful content, generates content that better aligns with user constraints, and achieves higher Win Rate and Pass Rate scores. The code, data, and models are publicly available.
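To make the idea of integrating a profile through cross-attention more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the additive profile branch, the `profile_scale` parameter, and the omission of learned projection matrices are all assumptions made for illustration. It only shows how image tokens could attend to text tokens and, separately, to user-profile tokens, with the two results combined.

```python
import math
import random

def softmax(row):
    # Numerically stable softmax over one row of attention scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def matmul(a, b):
    # a is n x k, b is k x m; returns n x m.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(r) for r in zip(*a)]

def cross_attention(queries, context):
    # Scaled dot-product attention: each query attends over the context tokens.
    # Assumption: identity projections (no learned W_q, W_k, W_v) for brevity.
    d = len(queries[0])
    scores = matmul(queries, transpose(context))
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(weights, context), weights

def personalized_attention(image_tokens, text_tokens, profile_tokens, profile_scale=1.0):
    # Hypothetical combination: text cross-attention plus a scaled
    # profile cross-attention branch. The paper may combine them differently.
    text_out, _ = cross_attention(image_tokens, text_tokens)
    profile_out, _ = cross_attention(image_tokens, profile_tokens)
    return [
        [t + profile_scale * p for t, p in zip(t_row, p_row)]
        for t_row, p_row in zip(text_out, profile_out)
    ]

def random_tokens(n, d):
    # Stand-in embeddings; a real system would use learned encoders.
    return [[random.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]

if __name__ == "__main__":
    random.seed(0)
    image = random_tokens(3, 4)    # 3 image (latent) tokens
    text = random_tokens(5, 4)     # 5 prompt tokens
    profile = random_tokens(2, 4)  # 2 user-profile tokens
    out = personalized_attention(image, text, profile)
    print(len(out), len(out[0]))
```

Setting `profile_scale` to 0 in this sketch recovers ordinary text-only cross-attention, which is one simple way to see the profile branch as an add-on to the base model's conditioning.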

Takeaways, Limitations

Takeaways:
Presents a novel framework that improves the safety of text-to-image generation models by applying user-customizable safety criteria.
Enables customized safety settings that account for factors such as an individual's age, mental health, and personal beliefs.
Suppresses harmful content more effectively and satisfies user constraints better than existing methods.
Supports reproducibility and scalability of the research through open code, data, and models.
Limitations:
Further validation of the size and diversity of the Sage dataset is needed.
The generalization performance of the PSA framework and its applicability to other models remain to be verified.
The accuracy and reliability of user profiles need to be considered.
Malicious users may manipulate their profiles, a security concern that needs to be addressed.