Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Intuition emerges in Maximum Caliber models at criticality

Created by
  • Haebom

Author

Llu is Arola-Fern andez

Outline

We begin by addressing the lack of a physical explanation for whether large predictive models simply mimic training data or generate genuine insights. This study reports a primitive form of intuition, which emerges as a metastable phase of learning that critically balances next-token prediction and future path entropy. The intuition mechanism is discovered through mind-tuning, a minimal principle that imposes maximum caliber on predictive models using a control temperature-like parameter $\lambda$. Random walk learning in deterministic mazes exhibits a rich phase diagram that includes imitation (low $\lambda$), rule-breaking hallucinations (high $\lambda$), and a vulnerable intermediate window where the model spontaneously discovers new goal-oriented strategies, exhibiting strong protocol dependence (hysteresis) and multistability. These findings are captured in an effective low-dimensional theory, framing intuition as a property emerging from a critical balance between present memory and future curiosity.

Takeaways, Limitations

Takeaways:
It suggests that intuition can be explained as a metastable stage that appears in the learning process.
We demonstrate that the learning characteristics of a model can be adjusted using mind-tuning techniques.
We discovered a phenomenon in which models spontaneously discover new strategies and analyzed this using a phase diagram.
This suggests that a balance between “present memory” and “curiosity about the future” is important in forming intuition.
Limitations:
The applicability to real-world problems was not specifically presented.
Further research is needed to determine the generalizability of the proposed mechanism.
It is important to verify that the experimental results obtained in a simple maze environment can be applied equally to complex problems.
There is a lack of explanation of the specific content and limitations of low-dimensional theory.
👍