Sign In

Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Yihong Dong, Zhaoyu Ma, Xue Jiang, Zhiyuan Fan, Jiaru Qian, Yongmin Li, Jianha Xiao, Zhi Jin, Rongyu Cao, Binhua Li, Fei Huang, Yongbin Li, Ge Li

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์ฝ”๋“œ ์ƒ์„ฑ๊ณผ ๊ฐ™์ด ๊ตฌ์กฐ์  ์ œ์•ฝ์ด ๊ฐ•ํ•œ ์ž‘์—…์—์„œ ํ™•์‚ฐ ์–ธ์–ด ๋ชจ๋ธ(DLM)์˜ ์ถ”๋ก  ์†๋„์™€ ์ถœ๋ ฅ ํ’ˆ์งˆ ๊ฐ„์˜ ์„ฑ๋Šฅ ์ €ํ•˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด Saber๋ผ๋Š” ์ƒˆ๋กœ์šด ํ›ˆ๋ จ ์—†๋Š” ์ƒ˜ํ”Œ๋ง ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. Saber๋Š” ์ฝ”๋“œ ๋ฌธ๋งฅ์ด ํ™•๋ฆฝ๋จ์— ๋”ฐ๋ผ ์ ์‘์ ์œผ๋กœ ๊ฐ€์†ํ™”ํ•˜๊ณ , ์ƒ์„ฑ๋œ ํ† ํฐ์„ ๋˜๋Œ๋ฆฌ๊ธฐ ์œ„ํ•œ ๋ฐฑํŠธ๋ž˜ํ‚น ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋„์ž…ํ•˜์—ฌ DLM์˜ ์ƒ์„ฑ ๊ณผ์ •์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋‘ ๊ฐ€์ง€ ํ•ต์‹ฌ ํ†ต์ฐฐ์„ ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด Saber๋Š” ๊ธฐ์กด DLM ์ƒ˜ํ”Œ๋ง ๋ฐฉ๋ฒ• ๋Œ€๋น„ ์ฝ”๋“œ ์ƒ์„ฑ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋ฉด์„œ ์ถ”๋ก  ์†๋„๋ฅผ ํฌ๊ฒŒ ๊ฐœ์„ ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
Saber๋Š” ์ฝ”๋“œ ์ƒ์„ฑ๊ณผ ๊ฐ™์€ ์ œ์•ฝ ์กฐ๊ฑด์ด ๊ฐ•ํ•œ DLM ์ž‘์—…์—์„œ ์ถ”๋ก  ์†๋„์™€ ์„ฑ๋Šฅ ์‚ฌ์ด์˜ ๊ท ํ˜•์„ ํšจ๊ณผ์ ์œผ๋กœ ๋งž์ถฅ๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ ์ ์‘์  ๊ฐ€์†ํ™” ๋ฐ ๋ฐฑํŠธ๋ž˜ํ‚น ๊ธฐ๋ฒ•์€ DLM์˜ ๋ณธ์งˆ์ ์ธ ์žฅ์ ์„ ํ™œ์šฉํ•˜์—ฌ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ก ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
Saber๋Š” DLM๊ณผ ์ž๊ธฐํšŒ๊ท€ ๋ชจ๋ธ ๊ฐ„์˜ ์ฝ”๋“œ ์ƒ์„ฑ ์„ฑ๋Šฅ ๊ฒฉ์ฐจ๋ฅผ ์ขํžˆ๋Š” ๋ฐ ๊ธฐ์—ฌํ•˜์ง€๋งŒ, ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•์˜ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ ๋ฐ ๋‹ค๋ฅธ ๋ณต์žกํ•œ ๋‹ค์šด์ŠคํŠธ๋ฆผ ์ž‘์—…์—์„œ์˜ ์„ฑ๋Šฅ ๊ฒ€์ฆ์€ ์ถ”๊ฐ€ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘