Sign In

Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation

Created by
  • Haebom
Category
Empty

์ €์ž

Kai Gobel, Pierrick Lorang, Patrik Zips, Tobias Gluck

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ์—ฐ๊ตฌ๋Š” ์ž์œจ ๋กœ๋ด‡ ์‹œ์Šคํ…œ์˜ ํ•ต์‹ฌ ์—ญ๋Ÿ‰์ธ ์ž‘์—… ๊ณ„ํš(task planning) ๋ถ„์•ผ์—์„œ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์˜ ๊ฐ€๋Šฅ์„ฑ์„ ํƒ๊ตฌํ•ฉ๋‹ˆ๋‹ค. LLM์ด PDDL(Planning Domain Definition Language) ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์—”์ง„๊ณผ ์ƒํ˜ธ์ž‘์šฉํ•˜์—ฌ ๋‹จ๊ณ„๋ณ„๋กœ ํ–‰๋™์„ ์„ ํƒํ•˜๊ณ  ์ƒํƒœ๋ฅผ ๊ด€์ฐฐํ•˜๋Š” '์—์ด์ „ํŠธ์  LLM ๊ณ„ํš(agentic LLM planning)' ๋ฐฉ์‹์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์‹คํ—˜ ๊ฒฐ๊ณผ, ์—์ด์ „ํŠธ์  LLM ๊ณ„ํš์€ ์ง์ ‘์ ์ธ LLM ๊ณ„ํš๋ณด๋‹ค ์•ฝ๊ฐ„ ๋†’์€ ์„ฑ๊ณต๋ฅ ์„ ๋ณด์˜€์œผ๋ฉฐ, ํŠนํžˆ ๋‚œ์ด๋„ ์žˆ๋Š” ๋ฌธ์ œ์—์„œ ๋” ์งง์€ ๊ณ„ํš์„ ์ƒ์„ฑํ•˜๋Š” ๊ฒฝํ–ฅ์„ ํ™•์ธํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
LLM์ด PDDL ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์—”์ง„๊ณผ ๊ฒฐํ•ฉํ•˜์—ฌ ๊ธฐ์กด ๊ธฐํ˜ธ ๊ณ„ํš ๋ฐฉ์‹๊ณผ ๊ฒฝ์Ÿํ•  ์ˆ˜ ์žˆ๋Š” ์ž ์žฌ๋ ฅ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
โ€ข
์—์ด์ „ํŠธ์  ์ ‘๊ทผ ๋ฐฉ์‹์€ ๋‹จ๊ณ„๋ณ„ ํ™˜๊ฒฝ ํ”ผ๋“œ๋ฐฑ์„ ํ†ตํ•ด LLM์˜ ๊ณ„ํš ๋Šฅ๋ ฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
โ€ข
PDDL๊ณผ ๊ฐ™์ด ์ž์ฒด ํ‰๊ฐ€์ ์ธ ํ”ผ๋“œ๋ฐฑ๋งŒ์œผ๋กœ๋Š” ์—์ด์ „ํŠธ์  ํ•™์Šต์˜ ์‹ค์งˆ์ ์ธ ์ด์ ์„ ์ œํ•œํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์ด๋Š” ์™ธ๋ถ€์ ์œผ๋กœ ๊ฒ€์ฆ ๊ฐ€๋Šฅํ•œ ์‹ ํ˜ธ์˜ ์ค‘์š”์„ฑ์„ ๊ฐ•์กฐํ•ฉ๋‹ˆ๋‹ค.
โ€ข
LLM ๊ธฐ๋ฐ˜ ๊ณ„ํš์€ ํ˜„์žฌ๊นŒ์ง€๋Š” ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์˜ ๊ธฐ์–ต์— ์˜์กดํ•˜๋Š” ๊ฒฝํ–ฅ์ด ๊ฐ•ํ•˜๋ฉฐ, ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅํ•œ ๊ณ„ํš ๋Šฅ๋ ฅ ํ™•๋ณด๊ฐ€ ํ–ฅํ›„ ๊ณผ์ œ์ž…๋‹ˆ๋‹ค.
โ€ข
์—์ด์ „ํŠธ์  LLM ๊ณ„ํš์€ ํ† ํฐ ๋น„์šฉ์ด ๋” ๋†’๋‹ค๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘