To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

Created by
  • Haebom

Authors

Nanxu Gong, Haotian Li, Sixun Dong, Jianxun Lian, Yanjie Fu, Xing Xie

💡 Overview

This study investigates whether the step-by-step reasoning ability that large reasoning models (LRMs) have shown in domains such as mathematics and coding extends to social cognition, specifically Theory of Mind (ToM) tasks. Comparing reasoning and non-reasoning models across nine state-of-the-art LLMs, the authors find that reasoning models do not always perform better and in some cases perform worse. This suggests that the general reasoning ability of LRMs does not transfer directly to social reasoning tasks such as ToM.

🔑 Implications and Limitations

• The step-by-step reasoning ability of LRMs does not provide a consistent advantage on ToM tasks, and performance degrades as reasoning length increases, a phenomenon the paper calls "slow thinking collapse".
• Limiting reasoning length, or adapting it dynamically, may help improve performance on ToM tasks.
• Current LRMs tend to rely on an "option matching shortcut", matching against the multiple-choice options rather than performing genuine deductive reasoning.
• Because the formal reasoning ability of LRMs does not fully transfer to social reasoning, robust ToM ability requires developing distinct capabilities that go beyond existing reasoning approaches.
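
As a rough illustration of what limiting reasoning length could look like, the sketch below truncates a reasoning trace to a fixed budget. It is not the paper's method: the function name `cap_reasoning` and the `max_tokens` parameter are hypothetical, and whitespace-separated words stand in for real model tokens.

```python
def cap_reasoning(trace: str, max_tokens: int) -> str:
    """Keep at most `max_tokens` whitespace-separated tokens of a reasoning trace.

    Assumption: whitespace splitting is a stand-in for a model tokenizer;
    a real system would count tokens with the model's own tokenizer.
    """
    if max_tokens < 0:
        raise ValueError("max_tokens must be non-negative")
    tokens = trace.split()
    # Slicing past the end is safe, so short traces are returned unchanged
    # (apart from whitespace normalization).
    return " ".join(tokens[:max_tokens])
```

A dynamic variant could choose `max_tokens` per question, for example from an estimate of question difficulty, rather than using one fixed budget for every input.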