Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models

Created by
  • Haebom

Author

Runqian Wang, Yilun Du

Outline

Equilibrium Matching (EqM) is a generative modeling framework built from the perspective of equilibrium dynamics. It abandons the non-equilibrium, time-conditional dynamics of conventional diffusion- and flow-based generative models and instead learns the equilibrium gradient of an implicit energy landscape. This approach allows for an optimization-based sampling process at inference time, using variable step sizes, an adaptive optimizer, and adaptive operations to obtain samples via gradient descent over the learned landscape. EqM achieves a FID of 1.90 on ImageNet 256×256, outperforming the generative performance of diffusion/flow models. Furthermore, EqM is theoretically justified by learning and sampling from a data manifold. Beyond generative models, EqM is a flexible framework that naturally addresses tasks such as partially noisy image denoising, OOD detection, and image synthesis. By replacing time-conditional velocities with an integrated equilibrium landscape, EqM provides a closer connection between flow and energy-based models, offering a straightforward path to optimization-based inference.

Takeaways, Limitations

Takeaways:
Achieves superior generative performance over diffusion/flow models (FID 1.90 on ImageNet 256x256).
A theoretical basis for learning and sampling from data manifolds is presented.
Capable of handling various tasks such as partially noisy image denoising, OOD detection, and image synthesis.
Strengthening the connection between flow models and energy-based models.
A simple method for optimization-based inference is presented.
Limitations:
There is no Limitations specified in the paper.
👍