Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models

Created by
  • Haebom

Author

Keyon Vafa, Peter G. Chang, Ashesh Rambachan, Sendhil Mullainathan

Outline

This paper presents a novel technique to validate the premise that a foundation model can reveal deep domain understanding through sequential predictions—similar to how Kepler's predictions of planetary motion led to the discovery of Newtonian mechanics. This technique analyzes how the foundation model adapts to synthetic datasets generated from a hypothesized world model, measuring how well the model's inductive biases match the world model. Experiments across a variety of domains reveal that while the foundation model excels on its training task, it fails to adequately develop the inductive biases underlying the underlying world model when adapting to new tasks. Specifically, we find that foundation models trained on orbital trajectories tend to fail to apply Newtonian mechanics to new physics tasks, due to their reliance on task-specific heuristics that fail to generalize.

Takeaways, Limitations

Takeaways: We present a new criterion (inductive bias probe) for evaluating the performance of baseline models and provide a method for assessing whether baseline models understand a deep world model, rather than simply learning superficial patterns. By revealing the limitations of baseline models' generalization ability, we suggest directions for future research.
Limitations: The proposed evaluation technique depends on the assumed world model and may not be a universal evaluation criterion applicable to all domains. Furthermore, the results may be affected by the design and generation of the synthetic dataset used in the analysis. Further research using a wider variety of world models and datasets is needed.
👍