Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

AnomalyControl: Learning Cross-modal Semantic Features for Controllable Anomaly Synthesis

Created by
  • Haebom

Author

Shidan He, Lei Liu, Xiujun Shu, Bo Wang, Yuanhao Feng, Shen Zhao

Outline

This paper proposes AnomalyControl, a novel anomaly synthesis framework, to overcome the shortcomings of existing text-to-image anomaly synthesis methods. Existing methods rely solely on textual information or coarsely aligned visual features, failing to adequately capture the complex characteristics of anomalies. AnomalyControl utilizes cross-modal semantic features as guidance signals, encoding generalized anomalies from text-to-image reference prompts. Specifically, it utilizes mismatched prompt pairs (text-to-image reference prompts and target text prompts) and leverages the cross-modal semantic modeling (CSM) module and the anomaly-to-semantic-enhanced attention (ASEA) mechanism to focus on subtle visual patterns of anomalies, enhancing the realism and contextual relevance of the generated anomaly features. Finally, the semantic map adapter (SGA) utilizes cross-modal semantic features as prior information to encode effective guidance signals for an appropriate and controllable synthesis process. Experimental results demonstrate that AnomalyControl outperforms existing methods and achieves state-of-the-art results in anomaly synthesis and subsequent tasks.

Takeaways, Limitations

Takeaways:
Effectively combine text and image information to synthesize more realistic and generalized anomalies.
Synthesis is possible by taking into account even the subtle visual patterns of anomalies by utilizing cross-modal semantic features.
Ability to generate high-quality anomaly data that contributes to improved performance of subsequent tasks.
The AnomalyControl framework has high applicability in various anomaly detection fields.
Limitations:
The proposed method may be computationally expensive.
There is a possibility of performance degradation for certain types of anomalies.
Further research is needed on generalization performance on diverse datasets.
Analysis of the relative importance of each module (CSM, ASEA, SGA) that contributes to improving the performance of AnomalyControl is needed.
👍