Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

CopyrightShield: Enhancing Diffusion Model Security against Copyright Infringement Attacks

Created by
  • Haebom

Author

Zhixiang Guo, Siyuan Liang, Aishan Liu, Dacheng Tao

Outline

This paper proposes CopyrightShield, a defense framework against copyright infringement attacks on diffusion models. It focuses on attacks where attackers intentionally inject non-copyrighted images into training data, thereby inducing the generation of copyright-infringing content for specific prompts. CopyrightShield analyzes the memory mechanism of the diffusion model to reveal that the attack exploits overfitting to specific spatial locations and prompts. It then proposes a method for detecting toxic samples using spatial masking and data imputation. Furthermore, it reduces the dependence on copyright infringement features and maintains generation performance through an adaptive optimization strategy that incorporates dynamic penalty terms into the training loss. Experimental results show that CopyrightShield significantly improves toxic sample detection performance under two attack scenarios, achieving an average F1 score of 0.665, a First Attack Era (FAE) delay of 115.2%, and a 56.7% reduction in the Copyright Infringement Rate (CIR). This represents a 25% improvement over the best-performing existing defense.

Takeaways, Limitations

Takeaways:
We present CopyrightShield, an effective defense framework against copyright infringement attacks using diffusion models.
Demonstration of the effectiveness of a method for detecting poisoned samples using spatial masking and data attribution.
We present the possibility of reducing copyright infringement dependency and maintaining generation performance through adaptive optimization strategies.
Confirmed 25% improved defense effect compared to existing best performance defense.
Limitations:
Further research is needed on the generalization performance of the proposed defense.
Robustness assessment against various types of copyright infringement attacks is needed.
It is necessary to explore potential problems and solutions that may arise when applied in real environments.
👍