Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Potential Score Matching: Debiasing Molecular Structure Sampling with Potential Energy Guidance

Created by
  • Haebom

Author

Liya Guo, Zun Wang, Chang Liu, Junzhe Li, Pipi Hu, Yi Zhu

Outline

This paper addresses the fact that the ensemble average of a molecule's physical properties is closely related to its molecular structure distribution, and sampling this distribution is a fundamental challenge in physics and chemistry. Conventional methods, such as molecular dynamics (MD) simulations and Markov Chain Monte Carlo (MCMC) sampling, can be time-consuming and expensive. To overcome the limitations of diffusion models, which have emerged as efficient alternatives by learning the distribution of training data, we propose a potential score matching (PSM) method that utilizes potential energy gradients to guide generative models. PSM does not require an exact energy function and can eliminate bias in sample distributions even when trained with limited and biased data. We demonstrate that PSM outperforms existing state-of-the-art (SOTA) models on the commonly used toy model, the Lennard-Jones (LJ) potential, and on the high-dimensional MD17 and MD22 datasets. We also demonstrate that the molecular distribution generated by PSM closely approximates the Boltzmann distribution compared to conventional diffusion models.

Takeaways, Limitations

Takeaways:
We demonstrate that effective molecular structure distribution sampling is possible even with limited and biased data.
A new method (PSM) is presented to overcome the limitations of existing diffusion models.
It also shows excellent performance in high-dimensional problems.
It suggests the possibility of reducing computational costs by eliminating the need for an accurate energy function.
Limitations:
Only evaluation results for the Lennard-Jones potential and the MD17 and MD22 datasets are presented, so further research is needed on generalizability.
Additional applications and performance evaluations for real complex molecular systems are needed.
Lack of analysis of the computational complexity and scalability of PSM.
👍