Sign In

AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents

Created by
  • Haebom
Category
Empty

์ €์ž

Haotian Chen, Xin Cong, Shengda Fan, Yuyang Fu, Ziqin Gong, Yaxi Lu, Yishan Li, Boye Niu, Chengjun Pan, Zijun Song, Huadong Wang, Yesai Wu, Yueying Wu, Zihao Xie, Yukun Yan, Zhong Zhang, Yankai Lin, Zhiyuan Liu, Maosong Sun

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ 40์–ต ๊ฐœ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ๊ทœ๋ชจ์˜ ์—ฃ์ง€ ๋””๋ฐ”์ด์Šค์—์„œ๋„ ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ ์†Œํ˜• LLM ์—์ด์ „ํŠธ๋ฅผ ํ›ˆ๋ จํ•˜์—ฌ ์žฅ๊ธฐ์ ์ธ ํƒ์ƒ‰ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” 'AgentCPM-Explore'๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์—ฃ์ง€ ์Šค์ผ€์ผ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ์ €ํ•ดํ•˜๋Š” ์„ธ ๊ฐ€์ง€ ์ฃผ์š” ๋ณ‘๋ชฉ ํ˜„์ƒ(์น˜๋ช…์  ๋ง๊ฐ, ๋ณด์ƒ ์‹ ํ˜ธ ๋…ธ์ด์ฆˆ ๋ฏผ๊ฐ์„ฑ, ์žฅ๊ธฐ ์ปจํ…์ŠคํŠธ์˜ ์ค‘๋ณต ์ •๋ณด๋กœ ์ธํ•œ ์ถ”๋ก  ์ €ํ•˜)์„ ํŒŒ๋ผ๋ฏธํ„ฐ ๊ณต๊ฐ„ ์œตํ•ฉ, ๋ณด์ƒ ์‹ ํ˜ธ ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ, ์ปจํ…์ŠคํŠธ ์ •๋ณด ์ •์ œ๋ผ๋Š” ํ›ˆ๋ จ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ํ•ด๊ฒฐํ–ˆ์Šต๋‹ˆ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ, 40์–ต ํŒŒ๋ผ๋ฏธํ„ฐ ํด๋ž˜์Šค์—์„œ ์ตœ๊ณ  ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ–ˆ์œผ๋ฉฐ, 80์–ต ํŒŒ๋ผ๋ฏธํ„ฐ ๋ฐ ๋” ํฐ ๋ชจ๋ธ๋“ค๊ณผ๋„ ๊ฒฝ์Ÿ๋ ฅ ์žˆ๋Š” ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
์—ฃ์ง€ ๋””๋ฐ”์ด์Šค์™€ ๊ฐ™์ด ์ œํ•œ๋œ ์ปดํ“จํŒ… ์ž์›์—์„œ๋„ LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ๊ฐ€ ๋ณต์žกํ•œ ์žฅ๊ธฐ ๊ณผ์ œ๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Œ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.
โ€ข
์†Œํ˜• ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ ํ•œ๊ณ„๋Š” ๋ชจ๋ธ ํฌ๊ธฐ ์ž์ฒด๋ณด๋‹ค๋Š” ํ›ˆ๋ จ ๋ฐฉ์‹๊ณผ ์•ˆ์ •์„ฑ ๋ฌธ์ œ์— ์žˆ์Œ์„ ์‹œ์‚ฌํ•˜๋ฉฐ, ์ƒˆ๋กœ์šด ํ›ˆ๋ จ ๋ฐฉ๋ฒ•๋ก ์˜ ์ค‘์š”์„ฑ์„ ๊ฐ•์กฐํ•ฉ๋‹ˆ๋‹ค.
โ€ข
AgentCPM-Explore๋Š” ํŠน์ • ๋ฒค์น˜๋งˆํฌ์—์„œ ๋Œ€๊ทœ๋ชจ ๋ชจ๋ธ์„ ๋Šฅ๊ฐ€ํ•˜๋Š” ์„ฑ๊ณผ๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋‚˜, ์‹ค์ œ ๋‹ค์–‘ํ•œ ์—ฃ์ง€ ํ™˜๊ฒฝ์—์„œ์˜ ์‹ค์‹œ๊ฐ„ ์„ฑ๋Šฅ ๋ฐ ํšจ์œจ์„ฑ ๊ฒ€์ฆ์€ ์ถ”๊ฐ€์ ์ธ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘