[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Sampling Decisions

Created by
  • Haebom

Author

Michael Chertkov, Sungsoo Ahn, Hamidreza Behjoo

Outline

In this paper, we present a novel Decision Flow (DF) framework that integrates additional guidance from the original sampler when sampling from the target distribution. DF can be viewed as an AI-based algorithmic reincarnation of the Markov Decision Process (MDP) approach in probabilistic optimal control. It extends the continuous-space, continuous-time path integral diffusion sampling technique of [Behjoo, Chertkov 2025] to discrete time and space, while generalizing the generative flow network (GFN) framework of [Bengio, et al 2021]. In its most basic form, DF exploits the linear solvability of the underlying MDP [Todorov, 2007] to adjust the transition probabilities of the original sampler, using an explicit formulation that does not require a neural network (NN). The resulting Markov process is expressed as a convolution of the inverse-time Green’s function of the original sampler and the target distribution. We demonstrate the DF framework with an example of sampling in an easing model, compare DF with Metropolis-Hastings to quantify its efficiency, discuss potential NN-based extensions, and provide an overview of how DF can improve guided sampling in various application areas.

Takeaways, Limitations

Takeaways: Present a new DF framework that improves existing sampling techniques, a generalized approach that integrates MDP and GFN frameworks, suggests the possibility of efficient sampling without using neural networks, verifies efficiency through easing model experiments, and suggests the possibility of expanding to various application areas.
Limitations: Currently, only limited experiments on the easing model are presented, specifics on NN-based extensions are lacking, and further research is needed on practical performance and generalization performance in various application areas.
👍