Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Chemical knowledge-informed framework for privacy-aware retrosynthesis learning

Created by
  • Haebom

Author

Guikun Chen, Xu Zhang, Xiaolin Hu, Yong Liu, Yi Yang, Wenguan Wang

Outline

In this paper, we present a privacy-preserving approach for machine learning-based retrosynthetic model training, the Chemical Knowledge-Based Framework (CKIF). Traditional retrosynthetic model training involves aggregating reaction data from multiple sources into a single point, which increases the risk of corporate confidentiality leaks. CKIF enables distributed training through iterative chemical knowledge-based aggregation of model parameters without disclosing proprietary reaction data from individual companies. The chemical properties of predicted reactants are used to quantitatively evaluate the observable behavior of individual models, which is then used to determine the weights used for model aggregation. CKIF significantly outperforms several strong baseline models on a variety of reaction datasets.

Takeaways, Limitations

Takeaways:
We present a novel retrosynthetic model learning framework that enables distributed learning while protecting confidential corporate response data.
Leverage chemical knowledge to improve the efficiency and accuracy of model parameter aggregation.
It demonstrates superior performance over existing methods on a variety of datasets.
Limitations:
It is possible that the performance gains from CKIF are limited to specific datasets. Additional experiments are needed on different types and scales of chemical reactions.
Further research is needed to optimize how model parameters are aggregated and how chemical knowledge is utilized.
Validation of applicability and scalability in real industrial environments is required.
👍