Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Adaptive Inference-Time Scaling via Cyclic Diffusion Search

Created by
  • Haebom

Author

Gyubin Lee, Truong Nhat Nguyen Bao, Jaesik Yoon, Dongwoo Lee, Minsu Kim, Yoshua Bengio, Sungjin Ahn

Outline

In this paper, we propose an adaptive inference time scaling method that dynamically adjusts the computational load during the inference process to solve the inference time scaling problem of diffusion models. Unlike the existing methods that rely on a fixed noise removal schedule, we propose a new framework called Adaptive Bi-directional Cyclic Diffusion (ABCD), which improves the output through bidirectional diffusion cycles and adaptively controls the search depth and termination time. ABCD consists of three components: cyclic diffusion search, automatic exploration-exploitation balance, and adaptive thinking time. We experimentally demonstrate that it improves performance while maintaining computational efficiency on a variety of tasks.

Takeaways, Limitations

Takeaways:
We present a novel method to improve the computational efficiency of diffusion models through adaptive inference time scaling.
The amount of calculation can be dynamically adjusted depending on the difficulty of the task or specific requirements.
Ensured improved performance and maintained computational efficiency across a variety of tasks.
Bidirectional diffusion cycles and adaptive control provide the potential for more sophisticated output generation.
Limitations:
Difficulty in implementation and application due to the complexity of the ABCD framework.
Further research is needed on generalization performance across different tasks.
Further research is needed to determine how to set optimal parameters for specific tasks.
Further verification of the practical applicability and scalability of the proposed method is needed.
👍