Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Common Data Format (CDF): A Standardized Format for Match-Data in Football (Soccer)

Created by
  • Haebom

Author

Gabriel Anzer, Kilian Arnsmeyer, Pascal Bauer, Joris Bekkers, Ulf Brefeld, Jesse Davis, Nicolas Evans, Matthias Kempe, Samuel J Robertson, Joshua Wyatt Smith, Jan Van Haaren

Outline

This paper proposes the Common Data Format (CDF) v1.0.0, a standardized format for soccer match data. Soccer match data (match records, videos, events, tracking data, metadata, etc.) collected by various organizations have varying formats, specifications, and representations, making analysis difficult. To address these issues, CDF defines a minimal schema that ensures data clarity, contextual information (e.g., source), and completeness. This paper details the technical specifications of CDF, the choice of representation for data clarity, and the data delivery method.

Takeaways, Limitations

Takeaways:
Increase the efficiency of soccer match data analysis: Reduce data integration and analysis time and cost through standardized formats.
Promoting data sharing and collaboration: Enhancing interoperability to facilitate data sharing and collaboration across diverse organizations.
Improve data quality: Increase the confidence in your analysis results by providing clear and complete data.
Limitations:
As of version v1.0.0, additional data types and functional improvements may be required in the future.
Adoption by all football data providers is required, and additional effort and persuasion are needed to achieve this.
Further research is needed to verify the practical application and validity of the proposed schema.
👍