Sign In

Large Language Model Reasoning Failures

Created by
  • Haebom
Category
Empty

์ €์ž

Peiyang Song, Pengrui Han, Noah Goodman

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์˜ ํ˜„์ €ํ•œ ๋ฐœ์ „์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ์ง€์†๋˜๋Š” ๋‹ค์–‘ํ•œ ์ถ”๋ก  ์‹คํŒจ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ์ดํ•ดํ•˜๊ณ  ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ์ฒซ ๋ฒˆ์งธ ํฌ๊ด„์ ์ธ ์กฐ์‚ฌ ๊ฒฐ๊ณผ๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค. ์ €์ž๋“ค์€ ์ถ”๋ก ์„ ๋‚ด์žฌ์ (embodied) ๋ฐ ๋น„๋‚ด์žฌ์ (non-embodied)์œผ๋กœ ๊ตฌ๋ถ„ํ•˜๊ณ , ๋น„๋‚ด์žฌ์  ์ถ”๋ก ์€ ๋‹ค์‹œ ๋น„๊ณต์‹์ (์ง๊ด€์ ) ๋ฐ ๊ณต์‹์ (๋…ผ๋ฆฌ์ )์œผ๋กœ ์„ธ๋ถ„ํ™”ํ•˜๋Š” ์ƒˆ๋กœ์šด ๋ถ„๋ฅ˜ ์ฒด๊ณ„๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ LLM ์•„ํ‚คํ…์ฒ˜ ์ž์ฒด์˜ ๊ทผ๋ณธ์ ์ธ ์‹คํŒจ, ํŠน์ • ์‘์šฉ ๋ถ„์•ผ์—์„œ์˜ ์ œํ•œ์ , ๋ฏธ์„ธํ•œ ๋ณ€ํ™”์—๋„ ๋ถˆ์•ˆ์ •ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” ๊ฐ•๊ฑด์„ฑ ๋ฌธ์ œ ๋“ฑ ์„ธ ๊ฐ€์ง€ ์œ ํ˜•์œผ๋กœ ์ถ”๋ก  ์‹คํŒจ๋ฅผ ๋ถ„๋ฅ˜ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
LLM์˜ ์ถ”๋ก  ์‹คํŒจ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๋ถ„๋ฅ˜ํ•˜๊ณ  ๋ถ„์„ํ•จ์œผ๋กœ์จ, ๋‹ค์–‘ํ•œ ์—ฐ๊ตฌ ๋…ธ๋ ฅ์„ ํ†ตํ•ฉํ•˜๊ณ  LLM์˜ ๊ตฌ์กฐ์  ์•ฝ์ ์— ๋Œ€ํ•œ ๋ช…ํ™•ํ•œ ๊ด€์ ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ถ”๋ก  ์‹คํŒจ์˜ ๊ทผ๋ณธ ์›์ธ ๋ถ„์„๊ณผ ์™„ํ™” ์ „๋žต ์ œ์‹œ๋ฅผ ํ†ตํ•ด, ๋” ๊ฐ•๋ ฅํ•˜๊ณ  ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ๊ฐ•๊ฑดํ•œ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ๊ฐ–์ถ˜ LLM ๊ฐœ๋ฐœ์„ ์œ„ํ•œ ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ˜„์žฌ ์—ฐ๊ตฌ๋“ค์€ LLM์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋ ค๋Š” ๋…ธ๋ ฅ์„ ๊ธฐ์šธ์ด๊ณ  ์žˆ์œผ๋‚˜, ํŠนํžˆ ๋ณต์žกํ•˜๊ฑฐ๋‚˜ ์ƒˆ๋กœ์šด ์œ ํ˜•์˜ ๋…ผ๋ฆฌ์  ์ถ”๋ก  ์‹คํŒจ์— ๋Œ€ํ•œ ๊ทผ๋ณธ์ ์ธ ํ•ด๊ฒฐ์ฑ…์€ ์•„์ง ๋ถ€์กฑํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์•ˆ๋œ ๋ถ„๋ฅ˜ ์ฒด๊ณ„๊ฐ€ LLM ์ถ”๋ก  ์‹คํŒจ๋ฅผ ์ดํ•ดํ•˜๋Š” ๋ฐ ์œ ์šฉํ•˜์ง€๋งŒ, ์‹ค์ œ ๋‹ค์–‘ํ•œ LLM ๋ชจ๋ธ๊ณผ ํƒœ์Šคํฌ์— ์ ์šฉํ•˜๊ณ  ๊ฒ€์ฆํ•˜๋Š” ์ถ”๊ฐ€์ ์ธ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘