[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Improved DDIM Sampling with Moment Matching Gaussian Mixtures

Created by
  • Haebom

Author

Prasad Gabbur

Outline

In this paper, we propose to use Gaussian Mixture Model (GMM) as a back-transition operator (kernel) within the framework of Denoising Diffusion Implicit Model (DDIM), which is one of the widely used methods for accelerated sampling from pre-trained models in Denoising Diffusion Probabilistic Model (DDPM). Specifically, we constrain the parameters of GMM to match the first and second central moments of the DDPM forward marginal probabilities. Experimental results show that moment matching is sufficient to obtain samples with equivalent or better quality than the conventional DDIM using Gaussian kernel. Experimental results are presented using unconditional models trained on CelebAHQ and FFHQ and conditional models trained on ImageNet dataset. The results suggest that using GMM kernel with a small number of sampling steps significantly improves the quality of generated samples as measured by FID and IS metrics. For example, when using 10 sampling steps on ImageNet 256x256, we achieved FID 6.94 and IS 207.85 using the GMM kernel, while 10.15 and 196.73, respectively, when using the Gaussian kernel.

Takeaways, Limitations

Takeaways:
We demonstrate that using GMM as a back-transition operator in DDIM enables high-quality sample generation even with fewer sampling steps.
By effectively connecting the forward marginal probability of DDPM and GMM through the moment matching technique, we achieved improved performance.
We validate the superiority of the proposed method through experiments on large-scale datasets such as ImageNet.
Limitations:
Constraining the parameters of GMM may only be applicable to certain problems.
Further studies are needed to investigate the generalization performance of the proposed method.
Additional experiments on different datasets and models are needed.
👍