Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Created by
  • Haebom

Author

Khalil Hennara, Muhammad Hreden, Mohamed Motaism Hamed, Zeina Aldallal, Sara Chrouf, Safwan AlModhayan

Outline

Mutarjim is a compact yet powerful language model for bidirectional Arabic-English translation. Based on Kuwain-1.5B, it is significantly smaller than larger language models, yet outperforms larger models on multiple benchmarks thanks to an optimized two-stage learning approach and a carefully selected, high-quality training dataset. Furthermore, to overcome the limitations of existing Arabic-English benchmark datasets (narrow domain, short sentence length, and English source bias), we present a new benchmark, Tarjama-25, consisting of 5,000 expert-reviewed sentence pairs. Mutarjim achieves state-of-the-art performance on the Tarjama-25 English-Arabic translation task, outperforming large proprietary models such as GPT-4o mini. The Tarjama-25 dataset is publicly available.

Takeaways, Limitations

Takeaways:
We demonstrate that small-scale language models can achieve competitive translation performance compared to large-scale models.
Significantly reduces computational costs and learning requirements.
We present a new benchmark, Tarjama-25, that overcomes the limitations of existing Arabic-English translation evaluation datasets.
Contributing to the advancement of Arabic-English translation research through the release of the Tarjama-25 dataset.
Limitations:
Lack of specific Limitations or performance degradation cases for the Mutarjim model.
The Tarjama-25 dataset may be relatively small compared to other large-scale benchmarks (5,000 sentence pairs may be a relatively small amount of data).
Lack of detailed description of the Kuwain-1.5B model.
👍