Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

ChronoGraph: A Real-World Graph-Based Multivariate Time Series Dataset

Created by
  • Haebom

Author

Adrian Catalin Lutu, Ioana Pintilie, Elena Burceanu, Andrei Manolache

Outline

ChronoGraph is a graph-structured, multivariate time series forecasting dataset built from real-world microservices. Each node is a service emitting a multivariate stream of system-level performance metrics that capture CPU, memory, and network usage patterns, while directed edges represent inter-service dependencies. The primary challenge is predicting future values of these signals at the service level. Furthermore, ChronoGraph provides expert-annotated event windows with anomaly detection labels, enabling the evaluation of anomaly detection methods and the robustness of predictions during outages. Compared to existing benchmarks in the industrial control system or transportation and air quality domains, ChronoGraph uniquely combines (i) multivariate time series, (ii) explicit, machine-readable dependency graphs, and (iii) anomaly labels that align with real-world events. It reports baseline results that include prediction models, pretrained time series-based models, and standard anomaly detectors. ChronoGraph provides a realistic benchmark for studying structure-aware prediction and event-aware evaluation in microservice systems.

Takeaways, Limitations

Takeaways:
It enables the development and evaluation of realistic predictive models by including multivariate time series data collected in a real microservice environment and inter-service dependency information.
By providing anomaly detection labels based on real-world events, you can evaluate the robustness of your prediction models and use them to research anomaly detection methods.
We provide a new benchmark for prediction and anomaly detection research that takes into account the structural characteristics of microservice systems.
Limitations:
Because the dataset was collected in a specific microservice environment, its generalizability needs to be examined.
Additional analysis may be required to determine the accuracy and reliability of the provided ideal labels.
Further expansion may be required to increase the size and diversity of the dataset.
👍