Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

MEraser: An Effective Fingerprint Erasure Approach for Large Language Models

Created by
  • Haebom

Author

Jingxuan Zhang, Zhenhua Xu, Rui Hu, Wenpeng Xing, Xuhong Zhang, Meng Han

Outline

This paper presents Mismatched Eraser (MEraser), a backdoor-based fingerprinting technique to address concerns about ownership and intellectual property protection in large-scale language models (LLMs). MEraser effectively removes backdoor-based fingerprints while maintaining model performance through a two-stage fine-tuning strategy utilizing mismatched and normal datasets. Through extensive evaluations on various LLM architectures and fingerprinting methods, we demonstrate that MEraser achieves complete fingerprinting while maintaining model performance even with a small training data set of less than 1,000 samples. Furthermore, we introduce a transferable eraser mechanism that enables effective fingerprinting without repetitive training across models. In conclusion, this paper provides a practical solution for fingerprinting in LLMs, exposes vulnerabilities in current fingerprinting techniques, and presents comprehensive evaluation criteria for the development of more robust model protection methods.

Takeaways, Limitations

Takeaways:
Presenting MEraser, an effective method for removing backdoor-based fingerprints.
Achieving high-performance fingerprint removal even with small amounts of data
Development of a fingerprint removal mechanism that can be transferred between models.
Revealing vulnerabilities in existing fingerprint technology and suggesting future research directions.
LLM offers a new approach to intellectual property protection
Limitations:
Further research is needed to determine whether MEraser's effectiveness can be applied equally to all types of backdoor-based fingerprinting.
The effectiveness of MEraser needs to be verified for more sophisticated and powerful fingerprint technologies.
Further research is needed to determine the applicability and stability of MEraser in real-world LLM deployment environments.
👍