Sign In

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

Created by
  • Haebom
Category
Empty

์ €์ž

Joshua Ong Jun Leang, Yu Zhao, Mihaela C\u{a}t\u{a}lina Stoian, Wenda Li, Shay B. Cohen, Eleonora Giunchiglia

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ Masked Diffusion Models (MDMs)์˜ ๊ณ„ํš ๋ฐ ์ฑ„์šฐ๊ธฐ(plan-and-infill) ๋””์ฝ”๋”ฉ์—์„œ ์Šฌ๋กฏ ์ฑ„์šฐ๊ธฐ ์ˆœ์„œ์˜ ๋ฏผ๊ฐ๋„ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์Šฌ๋กฏ ์„ ํƒ์„ ์˜์‚ฌ ๊ฒฐ์ • ๋ฌธ์ œ๋กœ ๊ฐ„์ฃผํ•˜๊ณ , Monte Carlo Tree Search (MCTS)๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ฑ„์šฐ๊ธฐ ์ˆœ์„œ๋ฅผ ์ตœ์ ํ™”ํ•˜๋Š” McDiffuSE ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•์€ MBPP์™€ MATH500 ๋ฐ์ดํ„ฐ์…‹์—์„œ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ก  ๋Œ€๋น„ ์ƒ๋‹นํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
MDMs์˜ ์Šฌ๋กฏ ์ฑ„์šฐ๊ธฐ ์ˆœ์„œ ์ตœ์ ํ™”๋ฅผ ์œ„ํ•ด MCTS ๊ธฐ๋ฐ˜ ๊ณ„ํš ๋ฐฉ๋ฒ•์ด ํšจ๊ณผ์ ์ž„์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ˆœ์ฐจ์ ์ธ ์ฑ„์šฐ๊ธฐ ์ˆœ์„œ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ๋น„์ˆœ์ฐจ์ ์ธ ์ƒ์„ฑ ์ „๋žต์˜ ์ค‘์š”์„ฑ์„ ํ™•์ธํ•˜์˜€์œผ๋ฉฐ, ๋ชจ๋ธ์˜ ํ™•์‹  ํŽธํ–ฅ์„ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํšŸ์ˆ˜๋ณด๋‹ค ํƒ์ƒ‰ ์ƒ์ˆ˜๋ฅผ ๋†’์ด๋Š” ๊ฒƒ์ด ๋” ์ค‘์š”ํ•จ์„ ๋ฐํ˜”์Šต๋‹ˆ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ๋Š” MCTS ๊ธฐ๋ฐ˜ ๊ณ„ํš์ด MDMs์˜ ์ƒ์„ฑ ํ’ˆ์งˆ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์œ ๋งํ•œ ์ ‘๊ทผ ๋ฐฉ์‹์ž„์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ–ฅํ›„ ์—ฐ๊ตฌ์—์„œ๋Š” ๋” ๋ณต์žกํ•œ ์ถ”๋ก  ํƒœ์Šคํฌ์— ๋Œ€ํ•œ McDiffuSE์˜ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ๊ณผ ๋‹ค์–‘ํ•œ MCTS ๋ณ€ํ˜• ๊ธฐ๋ฒ•์˜ ํšจ๊ณผ๋ฅผ ํƒ๊ตฌํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘