Daily Arxiv

This page curates AI-related papers published worldwide.
All content here is summarized using Google Gemini, and the page is operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization

Created by
  • Haebom

Authors

Yassine Taoudi-Benchekroun, Klim Troyan, Pascal Sager, Stefan Gerber, Lukas Tuggener, Benjamin Grewe

Outline

COGITAO is a modular and extensible data generation framework and benchmark designed to systematically study compositionality and generalization in the vision domain. Inspired by the ARC-AGI problem setting, it constructs rule-based tasks that apply a set of transformations to objects in a grid environment. It supports composing a set of 28 interoperable transformations at adjustable depth and offers extensive control over grid parameters and object properties. This flexibility allows the generation of millions of unique task rules (many times more than existing datasets) with varying difficulty and virtually unlimited sample generation per rule. Baseline experiments with state-of-the-art vision models show that, despite strong in-domain performance, these models consistently fail to generalize to novel combinations of familiar elements. COGITAO is fully open-source, including all code and datasets, to support ongoing research in this area.
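The idea of composing grid transformations at a chosen depth, and of holding out some compositions for testing, can be illustrated with a minimal sketch. This is not COGITAO's actual API: the three transformations, the names `TRANSFORMS`, `apply_rule`, `count_rules`, and `split_rules`, and the tuple-of-tuples grid encoding are all assumptions made for illustration. The rule count treats a rule as an ordered sequence of transformations with repetition allowed, which is only a rough upper bound and may differ from how the authors count rules.

```python
from itertools import product

# A grid is a tuple of rows; each row is a tuple of integer cell values.
# (Hypothetical encoding, chosen so grids are hashable and comparable.)

def rotate90(grid):
    """Rotate the grid 90 degrees clockwise."""
    return tuple(zip(*grid[::-1]))

def flip_horizontal(grid):
    """Mirror the grid left to right."""
    return tuple(row[::-1] for row in grid)

def flip_vertical(grid):
    """Mirror the grid top to bottom."""
    return tuple(grid[::-1])

# Illustrative stand-in for a library of interoperable transformations.
TRANSFORMS = {
    "rotate90": rotate90,
    "flip_h": flip_horizontal,
    "flip_v": flip_vertical,
}

def apply_rule(grid, rule):
    """Apply a rule, i.e. a sequence of transformation names, in order."""
    for name in rule:
        grid = TRANSFORMS[name](grid)
    return grid

def count_rules(n_transforms, depth):
    """Count ordered sequences of `depth` transformations (repetition allowed)."""
    return n_transforms ** depth

def split_rules(depth, held_out):
    """Split all depth-`depth` rules into training rules and held-out test rules."""
    all_rules = list(product(TRANSFORMS, repeat=depth))
    train = [rule for rule in all_rules if rule not in held_out]
    test = [rule for rule in all_rules if rule in held_out]
    return train, test
```

Under these assumptions, `count_rules(28, 5)` already exceeds 17 million sequences, which is consistent in spirit with the "millions of unique task rules" claim even before grid and object parameters are varied. Calling `split_rules(2, {("rotate90", "flip_h")})` keeps one composition out of training while both of its primitives still appear in other training rules; this is the kind of "novel combination of familiar elements" the baseline experiments probe.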

Takeaways, Limitations

Takeaways:
Provides a new benchmark and data generation framework for studying compositionality and generalization.
Generates rule-based tasks that are far more numerous and diverse than those in existing datasets.
Clearly exposes the limits of state-of-the-art vision models in compositionality and generalization.
Released as open source, which helps stimulate related research.
Limitations:
Research is currently limited to the visual domain.
Because it relies heavily on rule-based operations, its findings may not transfer directly to complex real-world problems.
The presented baseline experiments alone are not enough to evaluate the full range of what COGITAO can be used for.