Sign In

Emergent Slow Thinking in LLMs as Inverse Tree Freezing

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Sihan Hu, Xiansheng Cai, Yuan Huang, Zhiyuan Yao, Linfeng Zhang, Pan Zhang, Youjin Deng, Kun Chen

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๊ฐ•ํ™”ํ•™์Šต ๊ธฐ๋ฐ˜์˜ ๊ฒ€์ฆ ๊ฐ€๋Šฅํ•œ ๋ณด์ƒ(RLVR)์ด ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์˜ ๋‹ค๋‹จ๊ณ„ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ์–ด๋–ป๊ฒŒ ์ด๋Œ์–ด๋‚ด๋Š”์ง€ ํ†ต๊ณ„๋ฌผ๋ฆฌํ•™์  ๊ด€์ ์—์„œ ์„ค๋ช…ํ•ฉ๋‹ˆ๋‹ค. LLM์˜ ์ œํ•œ๋œ ์šฉ๋Ÿ‰์ด ๋ณต์žกํ•œ ์ ‘๋‘์‚ฌ ๊ณต๊ฐ„์„ ์˜ˆ์ธก ์ƒํƒœ์˜ ๋งˆ๋ฅด์ฝ”ํ”„ ๋„คํŠธ์›Œํฌ๋กœ ์••์ถ•ํ•˜๊ณ , RLVR์€ ์ด ๋„คํŠธ์›Œํฌ์—์„œ ๊ฒฝ๋กœ ๋ณ‘ํ•ฉ๊ณผ ๊ฒฝ์Ÿ์„ ํ†ตํ•ด '๊ฐœ๋… ๋„คํŠธ์›Œํฌ(CoNet)'๋ฅผ ํ˜•์„ฑํ•˜๋ฉฐ ์ ์ง„์ ์œผ๋กœ ์ถ”๋ก ์„ ๋ฐœ์ „์‹œํ‚ต๋‹ˆ๋‹ค. ์ตœ์ข…์ ์œผ๋กœ ์ด๋Š” ๋‹ค์ค‘ ์ž…๋ ฅ, ๋‹จ์ผ ์ถœ๋ ฅ์˜ ์—ญ๋ฐฉํ–ฅ ํŠธ๋ฆฌ(inverse tree) ๊ตฌ์กฐ๋กœ ๊ณ ์ •๋˜๋ฉฐ, ๋ณธ ์—ฐ๊ตฌ๋Š” ์ด๋Ÿฌํ•œ ํ•™์Šต ๋™์—ญํ•™์„ ์žฌํ˜„ํ•˜๊ณ  ์ƒˆ๋กœ์šด ํ›ˆ๋ จ ๋ฐฉ๋ฒ•๋ก ์ธ Annealed-RLVR์„ ์ œ์•ˆํ•˜์—ฌ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
RLVR์ด LLM์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ '์—ญ๋ฐฉํ–ฅ ํŠธ๋ฆฌ ๊ณ ์ •(inverse tree freezing)'์ด๋ผ๋Š” ํ†ต๊ณ„๋ฌผ๋ฆฌํ•™์  ๊ณผ์ •์œผ๋กœ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์ด๋Š” ์ถ”๋ก  ๊ณผ์ •์˜ ๊ธฐํ•˜ํ•™์  ๊ตฌ์กฐ์™€ ๊ด€๋ จ์ด ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
ํ›ˆ๋ จ ์‹œ์ ์˜ '์ขŒ์ ˆ(frustration)'์„ ์ด์šฉํ•œ ์งง์€ SFT ๊ฐœ์ž…(Annealed-RLVR)์ด ํ‘œ์ค€ RLVR๋ณด๋‹ค ์„ฑ๋Šฅ์ด ์šฐ์ˆ˜ํ•˜๋ฉฐ, ํŠนํžˆ ๊ณ ๋น„์šฉ ์ƒ˜ํ”Œ๋ง ํ™˜๊ฒฝ์—์„œ LLM์˜ ๋ถ•๊ดด๋ฅผ ๋ฐฉ์ง€ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
SFT์˜ ์ ์šฉ ์‹œ์ ์ด ์ถ”๋ก  ๋Šฅ๋ ฅ ์œ ์ง€์— ๊ฒฐ์ •์ ์ธ ์—ญํ• ์„ ํ•˜๋ฉฐ, ํŠธ๋ฆฌ ๊ณ ์ • ์ดํ›„์˜ SFT๋Š” ์˜คํžˆ๋ ค 'ํŒŒ๊ตญ์  ๋ง๊ฐ'์„ ์œ ๋ฐœํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ˜„์žฌ ๋ชจ๋ธ์€ LLM์˜ ์—ญ๋ฐฉํ–ฅ ํŠธ๋ฆฌ ๊ณ ์ • ๊ณผ์ •์„ ๋ช…ํ™•ํ•˜๊ฒŒ ๋ณด์—ฌ์ฃผ์ง€๋งŒ, ๋‹ค์–‘ํ•œ LLM ์•„ํ‚คํ…์ฒ˜ ๋ฐ ๋ณต์žกํ•œ ์ถ”๋ก  ์ž‘์—…์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ ๊ฒ€์ฆ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘