Sign In

DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Sankarshana Venugopal (Seoul National University), Mohammad Mostafavi (Seoul National University), Jonghyun Choi (Seoul National University)

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ํ™•์‚ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ์ด๋ฏธ์ง€-๋Œ€-์ด๋ฏธ์ง€ ๋ณ€ํ™˜(I2I)์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋А๋ฆฐ ์ƒ˜ํ”Œ๋ง ์†๋„ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ํ™•์‚ฐ ๋ชจ๋ธ์˜ ๊ทผ๋ณธ์ ์ธ ํ™•๋ฅ  ๋ฏธ๋ถ„ ๋ฐฉ์ •์‹(SDE) ๋ฐ ์ƒ๋ฏธ๋ถ„ ๋ฐฉ์ •์‹(ODE)์˜ ์ค€์„ ํ˜• ๊ตฌ์กฐ๋ฅผ ํ™œ์šฉํ•˜๋Š” ํ•™์Šต ์—†๋Š” ์ƒ˜ํ”Œ๋Ÿฌ์ธ DBMSolver๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. DBMSolver๋Š” ์ง€์ˆ˜ ์ ๋ถ„๊ธฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ 1์ฐจ ๋ฐ 2์ฐจ ์†”๋ฃจ์…˜์„ ํšจ์œจ์ ์œผ๋กœ ๋„์ถœํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ํ•จ์ˆ˜ ํ‰๊ฐ€ ํšŸ์ˆ˜(NFE)๋ฅผ ์ตœ๋Œ€ 5๋ฐฐ๊นŒ์ง€ ์ค„์ด๋ฉด์„œ๋„ ์ด๋ฏธ์ง€ ํ’ˆ์งˆ์„ ํ–ฅ์ƒ์‹œํ‚ต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
ํ™•์‚ฐ ๋ชจ๋ธ ๊ธฐ๋ฐ˜ I2I ๋ณ€ํ™˜์—์„œ ์ƒ˜ํ”Œ๋ง ํšจ์œจ์„ฑ์„ ํš๊ธฐ์ ์œผ๋กœ ๊ฐœ์„ ํ•˜๊ณ  ๊ณ ํ’ˆ์งˆ ๊ฒฐ๊ณผ ์ƒ์„ฑ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ•™์Šต ๊ณผ์ • ์—†์ด ๊ธฐ์กด DBM ๋ชจ๋ธ์— ์ ์šฉ ๊ฐ€๋Šฅํ•˜์—ฌ ์‹ค์šฉ์„ฑ์„ ๋†’์ด๋ฉฐ, DIODE ๋ฐ์ดํ„ฐ์…‹์—์„œ 20 NFE๋กœ ๊ธฐ์กด 2์ฐจ ๊ธฐ์ค€์„  ๋Œ€๋น„ FID ์ ์ˆ˜๋ฅผ 53% ๋‚ฎ์ถ”๋Š” ๋“ฑ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์‹œ๋œ ํšจ์œจ์ ์ธ ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฒ•์€ ํ–ฅํ›„ ๊ณ ํ•ด์ƒ๋„ ๋ฐ ๋ณต์žกํ•œ I2I ๋ณ€ํ™˜ ์ž‘์—…์— ๋Œ€ํ•œ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํŠน์ • DBM์˜ SDE/ODE ๊ตฌ์กฐ์— ๋Œ€ํ•œ ์˜์กด์„ฑ์ด ์กด์žฌํ•˜๋ฉฐ, ๋” ๋ณต์žกํ•˜๊ฑฐ๋‚˜ ๋น„์„ ํ˜•์ ์ธ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์ง„ DBM์— ๋Œ€ํ•œ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ ๋ฐ ์„ฑ๋Šฅ ํ–ฅ์ƒ ์—ฌ๋ถ€๋Š” ์ถ”๊ฐ€ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘