Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

PULSE: Practical Evaluation Scenarios for Large Multimodal Model Unlearning

Created by
  • Haebom

Author

Tatsuki Kawakami, Kazuki Egashira, Atsuyuki Miyai, Go Irie, Kiyoharu Aizawa

Outline

This study investigates unlearning, a "forgetting" technique, to address privacy and copyright issues in large-scale language models (LLMs) and large-scale multimodal models (LMMs). Specifically, addressing the lack of a practical evaluation framework for LMM unlearning, we propose the PULSE protocol. This protocol evaluates LMM unlearning in realistic scenarios by introducing two perspectives: (i) unlearning pre-trained knowledge and (ii) assessing long-term sustainability. Our results demonstrate that while existing unlearning techniques can remove knowledge acquired through fine-tuning, they struggle to remove information acquired during pre-training. Furthermore, techniques that successfully unlearn on a single task suffer from performance degradation when sequentially unlearning the same data.

Takeaways, Limitations

Takeaways:
We propose PULSE, a new evaluation protocol for LMM unlearning, and present a realistic unlearning scenario.
The limitations of unlearning techniques are specifically analyzed through pre-learning knowledge unlearning and long-term sustainability assessment.
It reveals that existing unlearning techniques have difficulty removing pre-learned information.
We show that the success of a single unlearning task is not maintained in a sequential unlearning task.
Limitations:
It does not include research on ways to improve the specific unlearning technique itself.
Specific performance analysis results of specific unlearning techniques are not specified in the paper.
Further research is needed to determine the generalizability of the proposed PULSE protocol.
👍