Daily Arxiv

์ „ ์„ธ๊ณ„์—์„œ ๋ฐœ๊ฐ„๋˜๋Š” ์ธ๊ณต์ง€๋Šฅ ๊ด€๋ จ ๋…ผ๋ฌธ์„ ์ •๋ฆฌํ•˜๋Š” ํŽ˜์ด์ง€ ์ž…๋‹ˆ๋‹ค.
๋ณธ ํŽ˜์ด์ง€๋Š” Google Gemini๋ฅผ ํ™œ์šฉํ•ด ์š”์•ฝ ์ •๋ฆฌํ•˜๋ฉฐ, ๋น„์˜๋ฆฌ๋กœ ์šด์˜ ๋ฉ๋‹ˆ๋‹ค.
๋…ผ๋ฌธ์— ๋Œ€ํ•œ ์ €์ž‘๊ถŒ์€ ์ €์ž ๋ฐ ํ•ด๋‹น ๊ธฐ๊ด€์— ์žˆ์œผ๋ฉฐ, ๊ณต์œ  ์‹œ ์ถœ์ฒ˜๋งŒ ๋ช…๊ธฐํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค.

Training-Time Action Conditioning for Efficient Real-Time Chunking

Created by
  • Haebom
Category
Empty

์ €์ž

Kevin Black, Allen Z. Ren, Michael Equi, Sergey Levine

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์‹ค์‹œ๊ฐ„ ์ฒญํ‚น(RTC) ๊ธฐ๋ฒ•์„ ์‚ฌ์šฉํ•˜์—ฌ ๋น„์ „-์–ธ์–ด-ํ–‰๋™ ๋ชจ๋ธ(VLA)์˜ ํšจ์œจ์ ์ธ ์‹ค์‹œ๊ฐ„ ๋กœ๋ด‡ ์ œ์–ด๋ฅผ ์œ„ํ•œ ์ƒˆ๋กœ์šด ํ›ˆ๋ จ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ œ์•ˆํ•˜๋Š” ๋ฐฉ๋ฒ•์€ ์ถ”๋ก  ์‹œ ์ง€์—ฐ์„ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•˜๊ณ  ํ›ˆ๋ จ ์ค‘์— ํ–‰๋™ ์ ‘๋‘์‚ฌ๋ฅผ ์ง์ ‘ ์กฐ๊ฑดํ™”ํ•˜์—ฌ ์ถ”๋ก  ์‹œ ์˜ค๋ฒ„ํ—ค๋“œ๋ฅผ ์ œ๊ฑฐํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ฐฉ๋ฒ•์€ ๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜๋‚˜ ๋กœ๋ด‡ ๋Ÿฐํƒ€์ž„์— ๋Œ€ํ•œ ์ˆ˜์ • ์—†์ด ๊ตฌํ˜„ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ ์‹ค์ œ ์‹คํ—˜์„ ํ†ตํ•ด ๊ธฐ์กด RTC ๊ธฐ๋ฒ•๊ณผ ๋™๋“ฑํ•œ ์„ฑ๋Šฅ์„ ์œ ์ง€ํ•˜๋ฉด์„œ ๊ณ„์‚ฐ ๋น„์šฉ์„ ์ ˆ๊ฐํ•จ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
ํ›ˆ๋ จ ์‹œ๊ฐ„ ์•ก์…˜ ์กฐ๊ฑดํ™”๋Š” ์ถ”๋ก  ์‹œ๊ฐ„ ์˜ค๋ฒ„ํ—ค๋“œ ์—†์ด ์‹ค์‹œ๊ฐ„ ๋กœ๋ด‡ ์ œ์–ด๋ฅผ ์œ„ํ•œ ํšจ์œจ์ ์ธ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•˜์—ฌ, ๊ธฐ์กด ์ธํŽ˜์ธํŒ… ๊ธฐ๋ฐ˜ RTC ๊ธฐ๋ฒ•์„ ๋Œ€์ฒดํ•  ์ˆ˜ ์žˆ๋Š” ์‹ค์šฉ์ ์ธ ๋Œ€์•ˆ์ž„์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ ๋ฐฉ๋ฒ•์€ VLA ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ์œ ์ง€ํ•˜๋ฉด์„œ ๊ณ„์‚ฐ ๋น„์šฉ์„ ์ ˆ๊ฐํ•˜์—ฌ, ์‹ค์ œ ๋กœ๋ด‡ ์‹œ์Šคํ…œ์˜ ์‹ค์‹œ๊ฐ„ ์ œ์–ด ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ๋†’์ž…๋‹ˆ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ๋Š” ํŠน์ • VLA ๋ชจ๋ธ($\pi_{0.6}$)๊ณผ ๋‘ ๊ฐ€์ง€ ๋กœ๋ด‡ ์ž‘์—…(์ƒ์ž ์Œ“๊ธฐ, ์—์Šคํ”„๋ ˆ์†Œ ๋งŒ๋“ค๊ธฐ)์— ๋Œ€ํ•œ ์‹คํ—˜์„ ํ†ตํ•ด ๊ฒ€์ฆ๋˜์—ˆ์œผ๋ฏ€๋กœ, ๋‹ค๋ฅธ VLA ๋ชจ๋ธ ๋ฐ ๋‹ค์–‘ํ•œ ์ž‘์—… ํ™˜๊ฒฝ์œผ๋กœ์˜ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์„ ์ถ”๊ฐ€์ ์œผ๋กœ ์—ฐ๊ตฌํ•  ํ•„์š”๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘