
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Created by
  • Haebom

Authors

Kaijie Zhu, Yuzhou Nie, Yijiang Li, Yiming Huang, Jialian Wu, Jiang Liu, Ximeng Sun, Zhenfei Yin, Lun Wang, Zicheng Liu, Emad Barsoum, William Yang Wang, Wenbo Guo

💡 Overview

This paper addresses two main obstacles that LLMs face when carrying out complex terminal tasks: a shortage of high-quality, executable training environments, and a shortage of training data that reproduces the mistakes human experts make. To address both, it proposes TermiGen, a new pipeline that synthesizes verifiable environments together with expert trajectories rich in error-correction cycles. Models trained with TermiGen outperform existing models, reaching a 31.3% success rate on the TerminalBench benchmark.
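To make the two ingredients named above more concrete, here is a minimal Python sketch of what a trajectory containing an error-correction cycle, paired with a check on the final state, might look like. It is not taken from the paper: the names `Step`, `Trajectory`, `error_correction_cycles`, and `verify`, as well as the rule that a task counts as solved when its last command exits with status 0, are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Step:
    """One agent action and what the environment returned (hypothetical schema)."""
    command: str      # shell command the agent issued
    exit_code: int    # exit status observed in the environment
    output: str = ""  # captured stdout/stderr


@dataclass
class Trajectory:
    """A task plus the ordered steps taken to solve it (hypothetical schema)."""
    task: str
    steps: List[Step] = field(default_factory=list)

    def error_correction_cycles(self) -> int:
        """Count failed steps that are immediately followed by a successful step."""
        return sum(
            1
            for prev, nxt in zip(self.steps, self.steps[1:])
            if prev.exit_code != 0 and nxt.exit_code == 0
        )


def verify(trajectory: Trajectory) -> bool:
    """Toy verifier (an assumption): solved if the final step exited with 0."""
    return bool(trajectory.steps) and trajectory.steps[-1].exit_code == 0


# Illustrative trajectory: a typo in the filename, a recovery step, then success.
demo = Trajectory(
    task="count lines in data.txt",
    steps=[
        Step("wc -l dat.txt", 1, "wc: dat.txt: No such file or directory"),
        Step("ls", 0, "data.txt"),
        Step("wc -l data.txt", 0, "3 data.txt"),
    ],
)

print(demo.error_correction_cycles(), verify(demo))
```

A real verifier would presumably inspect the resulting environment state rather than a single exit code. The sketch only shows how the mistake and its recovery can be kept in the training record instead of being filtered out.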

🔑 Implications and Limitations

• High-quality training data generation: TermiGen effectively generates training data that covers diverse real-world environments and errors, which can substantially improve an LLM's ability to carry out terminal tasks.
• Better performance for open-weight models: the proposed methodology demonstrates that open-weight models can match or surpass proprietary models, contributing to the democratization of LLM research and development.
• Scalability and generalization: the diversity of TermiGen-generated data and its inclusion of realistic errors give some assurance of model robustness and of generalization across scenarios, but complete coverage of complex, unexpected real-world error situations remains an open challenge.