[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Semantic Structure-Aware Generative Attacks for Enhanced Adversarial Transferability

Created by
  • Haebom

Author

Jongoh Jeong, Hunmin Yang, Jaeseok Jeong, Kuk-Jin Yoon

Outline

This paper deals with generative adversarial attacks that train a perturbation generator on a white-box surrogate model for application to a back-box Big Team model. Unlike conventional iterative attacks, generative adversarial attacks have excellent inference time efficiency, scalability, and transferability, but previous studies have failed to fully utilize the expression capabilities of generative models to preserve and utilize semantic information. In this paper, we point out that although the intermediate activations of the generator contain rich semantic features such as object boundaries and rough shapes, they are not fully utilized, which limits the alignment of perturbations with object-related regions. To address this issue, this paper proposes a semantic structure-aware attack framework based on Mean Teacher. The Mean Teacher acts as a temporally smoothed feature reference, which enhances the semantic consistency between the early layer activations of the student model and the semantically rich activations of the Teacher model through feature distillation. Based on experimental results, we anchor the perturbation generation to the semantically important early intermediate blocks in the generator, thereby inducing progressive adversarial perturbations in regions that significantly improve adversarial transferability. Through extensive experiments on various models, domains, and tasks, we demonstrate consistent performance improvements over existing state-of-the-art generative attacks, and comprehensively evaluate them using existing metrics and the newly proposed Accidental Correction Rate (ACR).

Takeaways, Limitations

Takeaways:
A novel framework to improve the transferability of generative adversarial attacks
Improved alignment of perturbation with object-related regions by leveraging semantic information
Verification of the effectiveness of a semantic structure-aware attack framework based on Mean Teacher
Introducing a new evaluation metric, ACR
Presenting extensive experimental results for a variety of models, domains, and tasks
Limitations:
Potential increase in computational cost due to using Mean Teacher
Further research is needed on the general applicability of the ACR index
Possible dependency on a specific production model
Further verification of generalizability is needed due to limitations in the experimental environment.
👍