Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Gemini 2.5 Pro Capable of Winning Gold at IMO 2025

Created by
  • Haebom

Author

Yichen Huang, Lin F. Yang

Outline

This paper presents the results of solving the 2025 International Mathematics Olympiad (IMO) problem using Google’s Gemini 2.5 Pro. Considering that existing large-scale language models (LLMs) perform well on mathematical benchmarks but struggle with IMO-level problems, we achieve the correct answer for 5 out of 6 problems through careful prompt design and a self-verification pipeline while avoiding data contamination. This highlights the importance of developing optimal strategies to fully leverage the potential of powerful LLMs for complex inference tasks.

Takeaways, Limitations

Takeaways: Demonstrates that strong LLM can significantly improve the performance of solving difficult mathematical problems with appropriate strategies and prompt engineering. Suggests the need for research on improving the reasoning ability of LLM and how to best utilize it.
Limitations: 1 out of 6 problems were not solved. Further research is needed on optimal prompt design and self-verification pipeline for solving IMO problems. Further verification is needed on whether it can be generalized to all types of IMO problems.
👍