Sign In

CoEvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification

Created by
  • Haebom
Category
Empty

์ €์ž

Hanrong Zhang, Shicheng Fan, Henry Peng Zou, Yankai Chen, Zhenting Wang, Jiayu Zhou, Chengze Li, Wei-Chieh Huang, Yifei Yao, Kening Zheng, Xue Liu, Xiaoxiao Li, Philip S. Yu

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๊ธฐ์กด LLM ์—์ด์ „ํŠธ์˜ ๋‹จ์ˆœํ•œ ๋„๊ตฌ ํ˜ธ์ถœ ๋ฐฉ์‹์œผ๋กœ๋Š” ํ•ด๊ฒฐํ•˜๊ธฐ ์–ด๋ ค์šด ๋ณต์žกํ•œ ์ „๋ฌธ ์—…๋ฌด๋ฅผ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์œ„ํ•ด '์Šคํ‚ฌ'์ด๋ผ๋Š” ๊ฐœ๋…์„ ์ œ์•ˆํ•œ๋‹ค. ํ˜„์žฌ ์Šคํ‚ฌ ์ƒ์„ฑ์€ ์ˆ˜๋™ ์ž‘์„ฑ์œผ๋กœ ์ธํ•œ ๋†’์€ ๋ ˆ์ด๋ธ” ๋น„์šฉ๊ณผ ์ธ๊ฐ„-๊ธฐ๊ณ„ ์ธ์ง€ ๋ถˆ์ผ์น˜๋กœ ์ธํ•œ ์„ฑ๋Šฅ ์ €ํ•˜ ๋ฌธ์ œ๋ฅผ ๊ฒช๊ณ  ์žˆ๋‹ค. ์ด์— ๋ณธ ๋…ผ๋ฌธ์€ CoEvoSkills๋ผ๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•˜์—ฌ ์—์ด์ „ํŠธ๊ฐ€ ๋ณต์žกํ•œ ๋‹ค์ค‘ ํŒŒ์ผ ์Šคํ‚ฌ ํŒจํ‚ค์ง€๋ฅผ ์ž์œจ์ ์œผ๋กœ ์ƒ์„ฑํ•˜๋„๋ก ํ•œ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
CoEvoSkills๋Š” ์ž์œจ์ ์ธ ์Šคํ‚ฌ ์ƒ์„ฑ์„ ํ†ตํ•ด LLM ์—์ด์ „ํŠธ์˜ ๋ณต์žกํ•œ ์ž‘์—… ์ˆ˜ํ–‰ ๋Šฅ๋ ฅ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•๋ก ์€ ๋ณ„๋„์˜ ์ •๋‹ต ๋ฐ์ดํ„ฐ ์—†์ด๋„ ์œ ์ตํ•˜๊ณ  ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ํ”ผ๋“œ๋ฐฑ์„ ์ œ๊ณตํ•˜๋Š” ๋™๋ฐ˜ ๊ฒ€์ฆ๊ธฐ(Surrogate Verifier)๋ฅผ ํ†ตํ•ด ํšจ์œจ์„ฑ์„ ๋†’์ธ๋‹ค.
โ€ข
CoEvoSkills๋Š” ๋‹ค์–‘ํ•œ LLM์— ๋Œ€ํ•œ ๊ฐ•๋ ฅํ•œ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋ฉฐ, SkillsBench ๋ฒค์น˜๋งˆํฌ์—์„œ ๊ธฐ์กด ๊ธฐ๋ฒ• ๋Œ€๋น„ ์ตœ๊ณ ์˜ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค.
โ€ข
ํ–ฅํ›„ ์—ฐ๊ตฌ์—์„œ๋Š” ์Šคํ‚ฌ ์ƒ์„ฑ ๋ฐ ๊ฒ€์ฆ ๊ณผ์ •์—์„œ์˜ ํŽธํ–ฅ์„ฑ ์™„ํ™” ๋ฐ ๋” ๋„“์€ ๋ฒ”์œ„์˜ ์ž‘์—…์— ๋Œ€ํ•œ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ ํƒ์ƒ‰์ด ํ•„์š”ํ•˜๋‹ค.
๐Ÿ‘