[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Rethinking Data Protection in the (Generative) Artificial Intelligence Era

Created by
  • Haebom

Author

Yiming Li, Shuo Shao, Yu He, Junfeng Guo, Tianwei Zhang, Zhan Qin, Pin-Yu Chen, Michael Backes, Philip Torr, Dacheng Tao, Kui Ren

Outline

This paper highlights the significant changes in the meaning and value of data in the era of generative artificial intelligence (AI), and points out the inadequacy of existing data protection concepts. As data plays a critical role throughout the AI lifecycle, data protection is required at various stages, including training data, prompts, and outputs. Accordingly, we present a four-level taxonomy of non-usability, privacy, traceability, and erasability to capture the diverse protection requirements that arise in modern generative AI models and systems. This framework provides a structural understanding of the tradeoffs between data usability and control across the entire AI pipeline, including training datasets, model weights, system prompts, and AI-generated content, and analyzes representative technical approaches at each level and identifies regulatory blind spots. Ultimately, it provides a structural framework for aligning future AI technologies and governance with trustworthy data practices, providing timely guidance to developers, researchers, and regulators.

Takeaways, Limitations

Takeaways:
Presenting a new perspective and systematic approach to data protection in the era of generative AI
Provides a structural understanding of the trade-off between data usability and control.
Provides practical guidance on establishing a data protection strategy across the AI pipeline
Identify regulatory blind spots and suggest ways to improve them
Providing useful information to developers, researchers, and regulators alike
Limitations:
The four-level taxonomy presented may not fully cover all types of AI systems and data protection issues.
There is a need for a more in-depth analysis of the technical approach.
Further research is needed on the practical application and effectiveness of the proposed framework.
There may be a lack of consideration for specific regulatory environments.
👍