Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Aryabhata: An exam-focused language model for JEE Math

Created by
  • Haebom

Author

Ritvik Rastogi, Sachin Dharashivkar, Sandeep Varma

Outline

Aryabhata 1.0 is a 7 billion-parameter small mathematical inference model optimized for the Indian entrance exam, JEE. While existing large-scale language models (LLMs) are often inadequate for training, Aryabhata 1.0 combines powerful open-weighted inference models and was developed through supervised learning fine-tuning (SFT) and curriculum learning using proven course of thought (CoT) tracking. It further improves performance by applying novel exploration strategies, such as Reinforcement Learning with Verifiable Rewards (RLVR) using the A2C objective and group relative advantage estimation, as well as adaptive group sizing and temperature control. It outperforms existing models in accuracy and efficiency on in-distribution benchmarks such as JEE Main 2025 and out-of-distribution benchmarks such as MATH and GSM8K, and provides educationally useful step-by-step inference. Aryabhata 1.0 is released as a foundational model for developing open-source, test-focused small language models.

Takeaways, Limitations

Takeaways:
Demonstrates the potential of a small LLM suitable for an exam-oriented learning environment.
Demonstrates educational utility through providing powerful reasoning capabilities and step-by-step reasoning.
It is released as open source, ensuring the possibility of community contribution and development.
Performance enhancement through novel exploration strategies such as A2C target and group relative advantage estimation, adaptive group sizing, and temperature control.
Limitations:
It is specialized for the JEE exam and may have limited generalizability to other types of math problems or subjects.
Because the model is small, its ability to solve complex problems may be limited.
Currently, it is specific to the JEE exam in India and its applicability to other countries or education systems requires further study.
👍