This is a page that curates AI-related papers published worldwide. All content here is summarized using Google Gemini and operated on a non-profit basis. Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.
TMUAD: Enhancing Logical Capabilities in Unified Anomaly Detection Models with a Text Memory Bank
Created by
Haebom
Author
Jiawei Liu, Jiahe Hou, Wei Wang, Jinsong Du, Yang Cong, Huijie Fan
Outline
This paper proposes a three-memory framework for unified structural and logical anomaly detection (TMUAD), which improves logical anomaly detection by introducing a text memory bank, unlike existing integrated approaches, to address the challenge of outlier detection due to the lack of normal data. TMUAD builds a class-level text memory bank using a proposed logic-aware text extractor to capture rich logical descriptions of objects in images. It then extracts features from segmented objects to build an object-level image memory bank, and extracts patch-level image features to build a patch-level memory bank. Using these three complementary memory banks, it retrieves and compares the most similar normal images to the query image, computes multi-level outlier scores, and fuses them into a final outlier score. This integration of structural and logical anomaly detection achieves state-of-the-art performance on seven public datasets from industrial and medical fields. The model and code are available at https://github.com/SIA-IDE/TMUAD .