Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

TMUAD: Enhancing Logical Capabilities in Unified Anomaly Detection Models with a Text Memory Bank

Created by
  • Haebom

Author

Jiawei Liu, Jiahe Hou, Wei Wang, Jinsong Du, Yang Cong, Huijie Fan

Outline

This paper proposes a three-memory framework for unified structural and logical anomaly detection (TMUAD), which improves logical anomaly detection by introducing a text memory bank, unlike existing integrated approaches, to address the challenge of outlier detection due to the lack of normal data. TMUAD builds a class-level text memory bank using a proposed logic-aware text extractor to capture rich logical descriptions of objects in images. It then extracts features from segmented objects to build an object-level image memory bank, and extracts patch-level image features to build a patch-level memory bank. Using these three complementary memory banks, it retrieves and compares the most similar normal images to the query image, computes multi-level outlier scores, and fuses them into a final outlier score. This integration of structural and logical anomaly detection achieves state-of-the-art performance on seven public datasets from industrial and medical fields. The model and code are available at https://github.com/SIA-IDE/TMUAD .

Takeaways, Limitations

Takeaways:
Improving logical outlier detection performance by leveraging text memory banks.
Integrating structural and logical outlier detection to improve comprehensive outlier detection performance.
Achieving state-of-the-art performance on diverse industrial and healthcare datasets.
Open source release for improved accessibility and usability.
Limitations:
Lack of detailed analysis of the performance of the proposed logic-aware text extractor.
Need to evaluate generalization performance for various types of outliers.
Lack of efficiency analysis of the size and management of memory banks.
Potential overfitting to a specific dataset.
👍