Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Chemist-aligned retrosynthesis by ensembling diverse inductive bias models

Created by
  • Haebom

Author

Krzysztof Maziarz, Guoqing Liu, Hubert Misztela, Austin Tripp, Junren Li, Aleksei Kornev, Piotr Gainski , Holger Hoefling, Mike Fortunato, Rishi Gupta, Marwin Segler

Outline

This paper proposes a novel retrosynthetic model, RetroChimera, to overcome the limitations of AI-based synthetic planning models. RetroChimera is based on two newly developed components with complementary inductive biases, combined using a novel framework that integrates predictions from multiple sources via a learning-based ensemble strategy. Experimental results demonstrate that RetroChimera significantly outperforms existing leading models, exhibits robust performance outside the training data, and demonstrates, for the first time, the ability to learn even with a very small number of examples for each reaction class. Furthermore, industrial organic chemists prefer RetroChimera's predictions to reactions from the training data, demonstrating a high degree of consistency. Finally, zero-shot transfer on an internal dataset from a major pharmaceutical company demonstrates robust generalization under distributional shift.

Takeaways, Limitations

Takeaways:
Effectively solves the rare response and incorrect prediction problems of existing AI-based synthetic planning models.
Learning is possible even with small amounts of data, increasing data efficiency.
Validation of the model's reliability and practicality through the preferences of industry experts.
Improved generalization ability and broad applicability through zero-shot transfer.
Robust performance across a variety of data sizes and partitioning strategies.
Presenting the possibility of accelerating the development of more accurate models through an ensemble framework.
Limitations:
The performance of the RetroChimera model presented in this paper may be limited to specific datasets and conditions.
Further research is needed to assess the interpretability and reliability of the model's prediction results.
Further validation of applicability and scalability in real industrial environments is required.
👍