Sign In

Self-Consistency Is Losing Its Edge: Diminishing Returns and Rising Costs in Modern LLMs

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Chiyan Loo

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ์—ฐ๊ตฌ๋Š” ์ž๊ธฐ์ผ๊ด€์„ฑ(self-consistency) ๊ธฐ๋ฒ•์ด ์ตœ์‹  ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์— ๊ธฐ์—ฌํ•˜๋Š” ์ •๋„๊ฐ€ ๊ฐ์†Œํ•˜๊ณ  ์˜คํžˆ๋ ค ๋น„์šฉ๋งŒ ์ฆ๊ฐ€์‹œํ‚ค๋Š” ํ˜„์ƒ์„ ๋ถ„์„ํ•ฉ๋‹ˆ๋‹ค. Gemini 2.5 ๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ์‹คํ—˜ ๊ฒฐ๊ณผ, ์ƒ˜ํ”Œ๋ง ํšŸ์ˆ˜๋ฅผ ๋Š˜๋ ค๋„ ์ •ํ™•๋„ ํ–ฅ์ƒ์€ ๋ฏธ๋ฏธํ•œ ๋ฐ˜๋ฉด, ํ† ํฐ ๋น„์šฉ์€ ๊ฑฐ์˜ ์„ ํ˜•์ ์œผ๋กœ ์ฆ๊ฐ€ํ•˜๋ฉฐ, ๊ฒฝ์šฐ์— ๋”ฐ๋ผ์„œ๋Š” ์„ฑ๋Šฅ์ด ์ €ํ•˜๋˜๋Š” ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ์Šต๋‹ˆ๋‹ค. ์ด๋Š” ์ตœ์‹  LLM์ด ์ด๋ฏธ ๋ฌธ์ œ๋ฅผ ์•ˆ์ •์ ์œผ๋กœ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์ถ”๊ฐ€์ ์ธ ์ถ”๋ก  ๊ฒฝ๋กœ๊ฐ€ ๋…ธ์ด์ฆˆ๋ฅผ ๋ฐœ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
์ž๊ธฐ์ผ๊ด€์„ฑ์€ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์ด ๋‚ฎ์„ ๋•Œ ํšจ๊ณผ์ ์ด์—ˆ์œผ๋‚˜, ์ตœ์‹  LLM์—๋Š” ๋น„ํšจ์œจ์ ์ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
๋†’์€ ์ƒ˜ํ”Œ๋ง ํšŸ์ˆ˜๋Š” ์˜คํžˆ๋ ค ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ์ €ํ•˜์‹œํ‚ค๊ณ  ๋ถˆํ•„์š”ํ•œ ๋น„์šฉ์„ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
๋ชจ๋ธ์˜ ๋‹จ์ผ ํ†ต๊ณผ(single-pass) ์‹ ๋ขฐ๋„๊ฐ€ ์ž…์ฆ๋œ ๋ฌธ์ œ์— ๋Œ€ํ•ด์„œ๋Š” ๋‹ค์ค‘ ๊ฒฝ๋กœ ์ƒ˜ํ”Œ๋ง์„ ์‹ ์ค‘ํ•˜๊ฒŒ ์ ์šฉํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ์˜ ๊ฒฐ๊ณผ๋Š” ํŠน์ • ๋ชจ๋ธ(Gemini 2.5) ๋ฐ ๋ฐ์ดํ„ฐ์…‹(HotpotQA, MATH-500)์— ๊ตญํ•œ๋  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ๋‹ค๋ฅธ ๋ชจ๋ธ์ด๋‚˜ ๋ฌธ์ œ ์œ ํ˜•์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ๊ฒ€์ฆ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘