Sign In

MetaMind: General and Cognitive World Models in Multi-Agent Systems by Meta-Theory of Mind

Created by
  • Haebom
Category
Empty

์ €์ž

Lingyi Wang, Rashed Shelim, Walid Saad, Naren Ramakrishna

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์ค‘์•™ ์ง‘์ค‘์‹ ๊ฐ๋…์ด๋‚˜ ๋ช…์‹œ์ ์ธ ํ†ต์‹  ์—†์ด๋„ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์—์„œ ์ƒํ˜ธ ์˜์กด์ ์ธ ์—์ด์ „ํŠธ์˜ ์—ญํ•™ ๊ด€๊ณ„๋ฅผ ์ดํ•ดํ•˜๊ณ , ์žฅ๊ธฐ์ ์ธ ์ง‘๋‹จ์  ์ธ์‹์„ ๋ฐ”ํƒ•์œผ๋กœ ๊ณ„ํšํ•˜๋Š” ๊ณผ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด MetaMind๋ผ๋Š” ์ผ๋ฐ˜์ ์ด๊ณ  ์ธ์ง€์ ์ธ ์„ธ๊ณ„ ๋ชจ๋ธ์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. MetaMind๋Š” ์ƒˆ๋กœ์šด ๋ฉ”ํƒ€-์ด๋ก ์  ์‚ฌ๊ณ (Meta-ToM) ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๊ฐ ์—์ด์ „ํŠธ๊ฐ€ ์ž์‹ ์˜ ์‹ ๋…์— ๋Œ€ํ•œ ์˜ˆ์ธก ๋ฐ ๊ณ„ํš๋ฟ๋งŒ ์•„๋‹ˆ๋ผ, ์ž์‹ ์˜ ํ–‰๋™ ๊ถค์ ์œผ๋กœ๋ถ€ํ„ฐ ๋ชฉํ‘œ์™€ ์‹ ๋…์„ ์—ญ์œผ๋กœ ์ถ”๋ก ํ•˜๋„๋ก ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค. ์ด ์ž๊ธฐ ์„ฑ์ฐฐ์ ์ด๊ณ  ์–‘๋ฐฉํ–ฅ์ ์ธ ์ถ”๋ก  ๋ฃจํ”„๋ฅผ ํ†ตํ•ด ์—์ด์ „ํŠธ๋Š” ์ž๊ธฐ ์ง€๋„ ๋ฐฉ์‹์œผ๋กœ ๋ฉ”ํƒ€์ธ์ง€ ๋Šฅ๋ ฅ์„ ํ•™์Šตํ•˜๋ฉฐ, ์ด๋ฅผ ๋‹ค์‹œ ์•„๋‚ ๋กœ์ง€ ์ถ”๋ก ์„ ํ†ตํ•ด 1์ธ์นญ์—์„œ 3์ธ์นญ์œผ๋กœ ์ผ๋ฐ˜ํ™”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋ช…์‹œ์ ์ธ ํ†ต์‹  ์—†์ด ์ œํ•œ์ ์ธ ๊ด€์ฐฐ๋งŒ์œผ๋กœ๋„ ๋‹ค๋ฅธ ์—์ด์ „ํŠธ์˜ ๋ชฉํ‘œ์™€ ์‹ ๋…์„ ์ œ๋กœ์ƒท(zero-shot)์œผ๋กœ ์ถ”๋ก ํ•  ์ˆ˜ ์žˆ๋Š” ๋Šฅ๋ ฅ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ž๊ธฐ ์ง€๋„ ํ•™์Šต์„ ํ†ตํ•ด ๋ฉ”ํƒ€์ธ์ง€ ๋Šฅ๋ ฅ์„ ์Šต๋“ํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ๋ณต์žกํ•œ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ™˜๊ฒฝ์—์„œ ๋‚˜ํƒ€๋‚˜๋Š” ์ง‘๋‹จ์  ์˜๋„์— ์ ์‘ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
์ ์€ ์–‘์˜ ๋ฐ์ดํ„ฐ๋กœ๋„ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚˜, ์ ์€ ์ƒ˜ํ”Œ ํ•™์Šต(few-shot learning) ์‹œ๋‚˜๋ฆฌ์˜ค์— ์œ ์šฉํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ˜„์žฌ ์—ฐ๊ตฌ๋Š” ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ฒฐ๊ณผ์— ๊ธฐ๋ฐ˜ํ•˜๊ณ  ์žˆ์œผ๋ฉฐ, ์‹ค์ œ ๋ณต์žกํ•˜๊ณ  ๋™์ ์ธ ํ™˜๊ฒฝ์—์„œ์˜ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ๊ณผ ํšจ์œจ์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ๊ฒ€์ฆ์ด ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘