[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Time-RA: Towards Time Series Reasoning for Anomaly with LLM Feedback

Created by
  • Haebom

Author

Yiyuan Yang, Zichuan Liu, Lei Song, Kai Ying, Zhiguang Wang, Tom Bamford, Svitlana Vyetrenko, Jiang Bian, Qingsong Wen

Outline

In this paper, we propose a new generative and inference-oriented task, Time-RA (Time-series Reasoning for Anomalies), using large-scale language models (LLMs) to overcome the limitations of existing binary classification-based time-series anomaly detection. We introduce RATs40K, a multi-modal benchmark dataset containing approximately 40,000 samples from various real-world domains (10), where each sample is annotated with numerical time-series data, contextual text, visual representations, and detailed anomaly types (14 univariate and 6 multivariate) and structured explanatory reasoning. We develop a sophisticated annotation framework utilizing ensemble-generated labels refined through GPT-4-based feedback to ensure accuracy and interpretability. Through extensive benchmarking on LLM and multi-modal LLMs, we demonstrate the performance and limitations of current models and emphasize the importance of fine-tuning based on supervised learning. The dataset and task presented in this study will contribute to the advancement of interpretable time-series anomaly detection and reasoning.

Takeaways, Limitations

Takeaways:
A new generative and inference-oriented approach (Time-RA) is presented, moving away from the traditional binary classification anomaly detection.
RATs40K, a multi-modal benchmark dataset based on real-world data, released
Improved interpretability by providing detailed anomaly type classification and structured explanatory reasoning
Performance evaluation of LLM and multimodal LLM and validation of the importance of supervised learning-based fine-tuning
Contributing to advances in interpretable time series anomaly detection and inference
Limitations:
Further research is needed on the versatility and generalizability of the RATs40K dataset.
High dependence on GPT-4, which may lead to difficulties in reproducibility if GPT-4 accessibility is limited
It is necessary to clearly state the performance limitations of the current model and to provide more specific suggestions for future improvement.
Verification of generality for various time series data features and domains is required.
👍