Sign In

LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Songtao Wei, Yi Li, Zhikai Li, Xu Hu, Yuede Ji, Guanpeng Li, Feng Chen, Carl Yang, Zhichun Guo, Bingzhe Li

๐Ÿ’ก ๊ฐœ์š”

๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์€ ์ถ”๋ก  ๋Šฅ๋ ฅ์ด ํ–ฅ์ƒ๋ ์ˆ˜๋ก ๋ถˆํ•„์š”ํ•˜๊ฒŒ ๊ธด ์ถ”๋ก  ๊ณผ์ •์„ ์ƒ์„ฑํ•˜์—ฌ ์ปดํ“จํŒ… ์ž์› ๋‚ญ๋น„๋ฅผ ์ดˆ๋ž˜ํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ๊ธธ์ด ๊ธฐ๋ฐ˜ ํšจ์œจ์„ฑ ๋ณด์ƒ์€ ํ•™์Šต ๊ณผ์ • ์ค‘ ์ตœ์ ์˜ ์ •ํ™•๋„-ํšจ์œจ์„ฑ ๊ท ํ˜•์ด ๋ณ€๋™ํ•˜๊ณ  ๋ฌธ์ œ๋ณ„ ์ถ”๋ก  ์˜ˆ์‚ฐ์ด ๋‹ฌ๋ผ์ง€๋Š” ๊ทผ๋ณธ์ ์ธ ๋ฌธ์ œ์— ์ง๋ฉดํ•ด ์žˆ์Šต๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์ด๋Ÿฌํ•œ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ์˜จ๋ผ์ธ, ์ž๊ฐ€ ์ ์‘ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ๋„์ž…ํ•˜๋Š” LEAD(Length-Efficient Adaptive and Dynamic reasoning)๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. LEAD๋Š” ์ž ์žฌ๋ ฅ ์Šค์ผ€์ผ๋ง ๋ถˆ์•ˆ์ •์„ฑ์„ ์ด์šฉํ•˜์—ฌ ๊ฐ ๋‹จ๊ณ„๋ณ„ ์ •ํ™•๋„-ํšจ์œจ์„ฑ ์ ˆ์ถฉ์ ์„ ๋™์ ์œผ๋กœ ์กฐ์ •ํ•˜๊ณ , ๋ชจ๋ธ ์ž์ฒด์˜ ์˜ฌ๋ฐ”๋ฅธ ์ถ”๋ก  ๊ฒฐ๊ณผ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋ฌธ์ œ๋ณ„ ๋ชฉํ‘œ ๊ธธ์ด๋ฅผ ์˜จ๋ผ์ธ์œผ๋กœ ์ถ”์ •ํ•˜์—ฌ ๊ณผ๋„ํ•œ ์ƒ๊ฐ๊ณผ ๊ณผ๋„ํ•œ ์••์ถ•์„ ๋ชจ๋‘ ํŽ˜๋„ํ‹ฐํ•˜๋Š” ๋Œ€์นญ์  ํšจ์œจ์„ฑ ๋ณด์ƒ์„ ์ ์šฉํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋™์ ์ด๊ณ  ์ ์‘์ ์ธ ํšจ์œจ์„ฑ ์ œ์–ด: ํ•™์Šต ๊ณผ์ • ์ค‘ ๋ณ€ํ™”ํ•˜๋Š” ์ •ํ™•๋„-ํšจ์œจ์„ฑ ๊ท ํ˜•๊ณผ ๋ฌธ์ œ๋ณ„ ์ถ”๋ก  ์˜ˆ์‚ฐ์„ ์‹ค์‹œ๊ฐ„์œผ๋กœ ํŒŒ์•…ํ•˜์—ฌ ์ตœ์ ์˜ ์ ˆ์ถฉ์ ์„ ๋™์ ์œผ๋กœ ์ฐพ์•„๋ƒ…๋‹ˆ๋‹ค.
โ€ข
์ •ํ™•๋„ ๋ฐ ํšจ์œจ์„ฑ ๋™์‹œ ๋‹ฌ์„ฑ: ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•๋ก ์€ ๊ธฐ์กด์˜ ํšจ์œจ์ ์ธ ์ถ”๋ก  ๊ธฐ๋ฒ•์— ๋น„ํ•ด ๋†’์€ ์ •ํ™•๋„์™€ ํšจ์œจ์„ฑ ์ ์ˆ˜๋ฅผ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ์›๋ณธ ๋ชจ๋ธ ๋Œ€๋น„ ํ˜„์ €ํžˆ ์งง์€ ๊ธธ์ด์˜ ์ถ”๋ก  ๊ฒฐ๊ณผ๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ตœ์  ๋ชฉํ‘œ ๊ธธ์ด ์ถ”์ •์˜ ์–ด๋ ค์›€: ๋ชจ๋ธ์ด ์ž์ฒด์ ์œผ๋กœ ์ถ”๋ก  ๊ฒฐ๊ณผ๋ฅผ ์ƒ์„ฑํ•˜๋ฉด์„œ๋„ ์ •ํ™•๋„๋ฅผ ์œ ์ง€ํ•˜๊ธฐ ์œ„ํ•œ ์ตœ์ ์˜ ๋ชฉํ‘œ ๊ธธ์ด๋ฅผ ์˜จ๋ผ์ธ์œผ๋กœ ์ถ”์ •ํ•˜๋Š” ๊ฒƒ์€ ์—ฌ์ „ํžˆ ๋ณต์žกํ•œ ๋ฌธ์ œ์ž…๋‹ˆ๋‹ค.
๐Ÿ‘