Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

ALLoyM: A large language model for alloy phase diagram prediction

Created by
  • Haebom

Author

Yuna Oikawa, Guillaume Deffrennes, Taichi Abe, Ryo Tamura, Koji Tsuda

Outline

This paper introduces aLLoyM, a large-scale language model (LLM) applicable to materials science. aLLoyM is an LLM specifically tuned for alloy composition, temperature, and corresponding phase information. It was developed by curating question-and-answer (Q&A) pairs for binary and ternary phase diagrams based on the open-source Computational Phase Diagram Database (CPDDB) and the Calculation of PHAse Diagrams (CALPHAD). We fine-tuned Mistral, an open-source pre-trained LLM, in two Q&A formats: multiple-choice and short-answer. Benchmark evaluation results demonstrate that fine-tuning significantly improves performance on multiple-choice phase diagram questions. Furthermore, aLLoyM's short-answer model demonstrates its ability to generate novel phase diagrams based solely on constituent elements, highlighting its potential to accelerate the discovery of previously unexplored material systems. To encourage further research and adoption, we have released the short-answer fine-tuned version of aLLoyM and the complete benchmarking Q&A dataset on Hugging Face.

Takeaways, Limitations

Takeaways:
We developed a model that can be effectively applied to solving problems in materials science by utilizing open-source LLM.
ALLoyM performs well on both multiple-choice and short-answer questions, and the short-answer model in particular demonstrates its ability to generate novel phase diagrams.
We encourage further research and use by making the developed models and datasets public.
It offers the potential to accelerate the discovery of new material systems.
Limitations:
Currently, it is trained only using data for binary and ternary phase diagrams, so its applicability to more complex systems may be limited.
Further evaluation and validation of the model's accuracy and generalization performance are needed.
There is a dependency on the accuracy and completeness of data reliant on CPDDB and CALPHAD.
👍