Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Understanding World or Predicting Future? A Comprehensive Survey of World Models

Created by
  • Haebom

Author

Jingtao Ding, Yunke Zhang, Yu Shang, Yuheng Zhang, Zefang Zong, Jie Feng, Yuan Yuan, Hongyuan Su, Nian Li, Nicholas Sukiennik, Fengli Xu, Yong Li

Outline

This paper provides a comprehensive review of the World Model, which has been gaining attention due to the advancement of multimodal large-scale language models such as GPT-4 and Sora. The World Model is generally considered as a tool for understanding the current state of the world or predicting future dynamics. In this paper, we systematically categorize the World Model into two major functions: (1) constructing internal representations to understand the mechanisms of the world, and (2) predicting future states to simulate and guide decision-making, and examine recent research trends in each category. We examine applications of the World Model in major fields such as autonomous driving, robotics, and social simulacra, and analyze how the two functions of the World Model are utilized in each field. Finally, we provide insights into major challenges and future research directions, and summarize related papers and code repositories.

Takeaways, Limitations

Takeaways:
We clearly distinguish between the two main functions of world models (constructing internal representations and predicting future states) and emphasize the importance of each function, providing a systematic understanding of world model research.
We demonstrate practical applicability by analyzing application cases of world models in various fields such as autonomous driving, robotics, and social simulacra.
It can contribute to the development of future world model research by suggesting future research directions.
Increase research accessibility by providing repositories of relevant papers and code.
Limitations:
The type and scope of world models covered in a paper may be limited.
Analysis for certain areas may be more detailed than for others.
There may be a lack of discussion of the ethical and social implications of the world model.
👍