Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Gemini 2.5 Pro Capable of Winning Gold at IMO 2025

Created by
  • Haebom

Author

Yichen Huang, Lin F. Yang

Outline

This paper presents the results of solving the 2025 International Mathematics Olympiad (IMO) problem using Google Gemini 2.5 Pro, a large-scale language model (LLM). The IMO problem is a unique problem that requires deep insight, creativity, and formal reasoning, and traditional LLMs are known to struggle with it. To avoid data contamination, we use a new IMO problem, and through pipeline design and prompt engineering, we achieve the correct answer for five out of six problems (although one problem requires discussion). This suggests that finding the optimal way to use a powerful model is important.

Takeaways, Limitations

Takeaways:
A strong LLM demonstrates that appropriate prompt engineering and pipeline design can successfully solve challenging mathematical problems.
It emphasizes the importance of studying optimal use of LLM to improve mathematical problem-solving skills.
The development of the LLM suggests that it may offer new possibilities for mathematics teaching and research.
Limitations:
One of the six questions requires further discussion on deriving the correct answer.
Further research is needed to determine the generalizability of the LLM and prompt engineering strategies used.
Further experiments are needed to evaluate the performance of LLM on different types of mathematical problems.
👍