Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Improved DDIM Sampling with Moment Matching Gaussian Mixtures

Created by
  • Haebom

Author

Prasad Gabbur

Outline

This paper proposes using a Gaussian Mixture Model (GMM) as a back-transition operator (kernel) within the widely used Denoising Diffusion Implicit Models (DDIM) framework for accelerated sampling in Denoising Diffusion Probabilistic Models (DDPM). Specifically, we constrain the parameters of the GMM to match the first and second central moments of the DDPM forward marginal probabilities. We demonstrate that moment matching is sufficient to obtain samples of equivalent or better quality compared to conventional DDIMs using a Gaussian kernel. We present experimental results for text-to-image generation using Stable Diffusion v2.1 on the COYO700M dataset, with unconditional models trained on CelebAHQ and FFHQ, conditional models trained on ImageNet, and the experimental results suggest that using the GMM kernel with a small number of sampling steps significantly improves the quality of generated samples, as measured by the FID and IS metrics. For example, using 10 sampling steps on ImageNet 256x256, the GMM kernel achieves an FID of 6.94 and an IS of 207.85, while the Gaussian kernel achieves an FID of 10.15 and an IS of 196.73. Furthermore, we derive a novel SDE sampler for the commutative flow matching model and conduct experiments using the proposed approach. We observe improvements in both the one-commutative flow and two-commutative flow models.

Takeaways, Limitations

Takeaways:
We demonstrate that using the GMM kernel in DDIM allows for high-quality sample generation even with fewer sampling steps.
Using the GMM kernel, we achieve performance improvements over the conventional Gaussian kernel-based DDIM in FID and IS metrics.
We also confirmed the effectiveness of using the GMM kernel in the stationary flow matching model.
Limitations:
This paper presents only experimental results for a specific dataset and model; therefore, further research is needed to determine generalizability to other datasets and models.
There is a lack of detailed discussion on parameter setting and optimization of GMM.
Research may be needed on other methods of constraining GMM parameters besides moment matching.
👍