Sign In

BEAGLE: Behavior-Enforced Agent for Grounded Learner Emulation

Created by
  • Haebom
Category
Empty

์ €์ž

Hanchen David Wang, Clayton Cohn, Zifan Xu, Siyuan Guo, Gautam Biswas, Meiyi Ma

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๊ฐœ๋ฐฉํ˜• ๋ฌธ์ œ ํ•ด๊ฒฐ ํ™˜๊ฒฝ์—์„œ ํ•™์Šต์ž์˜ ํ–‰๋™์„ ๋ชจ๋ฐฉํ•˜๋Š” ๋ฐ ๋ฐœ์ƒํ•˜๋Š” LLM์˜ ์—ญ๋Ÿ‰ ํŽธํ–ฅ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด BEAGLE์ด๋ผ๋Š” ์‹ ๊ฒฝ-์ƒ์ง•์  ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. BEAGLE์€ ์ž๊ธฐ ์กฐ์ ˆ ํ•™์Šต(SRL) ์ด๋ก ์„ ํ†ตํ•ฉํ•˜์—ฌ ์ธ์ง€ ํ–‰๋™๊ณผ ๋ฉ”ํƒ€์ธ์ง€ ํ–‰๋™์˜ ์‹œ์  ๋ฐ ์ „ํ™˜์„ ์ œ์–ดํ•˜๋Š” ๋ฐ˜๋งˆ๋ฅด์ฝ”ํ”„ ๋ชจ๋ธ, ํ˜„์‹ค์ ์ธ ์ง€์‹ ๊ฒฉ์ฐจ์™€ '์•Œ๋ ค์ง€์ง€ ์•Š์€ ๋ฏธ์ง€'๋ฅผ ๊ฐ•์ œํ•˜๋Š” ๊ฒฐํ•จ ์ฃผ์ž…์„ ํฌํ•จํ•œ ๋ฒ ์ด์ง€์•ˆ ์ง€์‹ ์ถ”์ , ๊ทธ๋ฆฌ๊ณ  ๊ณ ์˜์ ์ธ ์˜ค๋ฅ˜๋ฅผ ์ž์ฒด ์ˆ˜์ •ํ•˜๋Š” ๊ฒƒ์„ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•œ ๋ถ„๋ฆฌ๋œ ์—์ด์ „ํŠธ ์„ค๊ณ„๋ฅผ ํŠน์ง•์œผ๋กœ ํ•ฉ๋‹ˆ๋‹ค. BEAGLE์€ ์‹ค์ œ ํ•™์Šต์ž ๋ฐ์ดํ„ฐ๋ฅผ ์„ฑ๊ณต์ ์œผ๋กœ ์žฌํ˜„ํ•˜๋ฉฐ, ์ธ๊ฐ„ ํŠœ๋ง ํ…Œ์ŠคํŠธ์—์„œ ์‹ค์ œ ๋ฐ์ดํ„ฐ์™€ ๊ตฌ๋ณ„๋˜์ง€ ์•Š๋Š” ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
LLM์˜ ์—ญ๋Ÿ‰ ํŽธํ–ฅ์„ ๊ทน๋ณตํ•˜๊ณ  ํ•™์Šต์ž์˜ ์‹ค์ œ ํ•™์Šต ๊ณผ์ •์„ ๋ณด๋‹ค ์‚ฌ์‹ค์ ์œผ๋กœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•  ์ˆ˜ ์žˆ๋Š” ์ƒˆ๋กœ์šด ์ ‘๊ทผ ๋ฐฉ์‹ ์ œ์‹œ.
โ€ข
๊ต์œก ์—ฐ๊ตฌ, ์ ์‘ํ˜• ํŠœํ„ฐ๋ง ์‹œ์Šคํ…œ ํ›ˆ๋ จ, ๊ต์œก์  ๊ฐœ์ž… ํ…Œ์ŠคํŠธ ๋“ฑ ๋‹ค์–‘ํ•œ ๊ต์œก ๋ถ„์•ผ์— ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์ด ๋†’์Œ.
โ€ข
ํ˜„์žฌ๋Š” Python ํ”„๋กœ๊ทธ๋ž˜๋ฐ ์ž‘์—…์— ๋Œ€ํ•ด์„œ๋งŒ ํ‰๊ฐ€๋˜์—ˆ์œผ๋ฏ€๋กœ, ๋‹ค๋ฅธ ๋„๋ฉ”์ธ์ด๋‚˜ ๋” ๋ณต์žกํ•œ ๋ฌธ์ œ์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ ๊ฒ€์ฆ ํ•„์š”.
๐Ÿ‘