Sign In

PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

Created by
  • Haebom
Category
Empty

์ €์ž

Maciej K. Wozniak, Lianhang Liu, Yixi Cai, Patric Jensfelt

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์นด๋ฉ”๋ผ ๋ฐ์ดํ„ฐ๋งŒ์œผ๋กœ ์ตœ์ข… ๋ชฉ์ ์ง€๊นŒ์ง€ ์ฃผํ–‰ ๊ณ„ํš์„ ์ƒ์„ฑํ•˜๋Š” ํšจ์œจ์ ์ธ ์ข…๋‹จ๊ฐ„(end-to-end) ์ž์œจ์ฃผํ–‰ ๋ชจ๋ธ์ธ PRIX๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. PRIX๋Š” BEV(Bird's-Eye View) ํ‘œํ˜„์ด๋‚˜ LiDAR ์„ผ์„œ ์—†์ด๋„ raw ํ”ฝ์…€ ์ž…๋ ฅ์œผ๋กœ๋ถ€ํ„ฐ ์ง์ ‘ ์•ˆ์ „ํ•œ ์ฃผํ–‰ ๊ถค์ ์„ ์ƒ์„ฑํ•˜๋ฉฐ, ํŠนํžˆ ๋‹ค์ค‘ ๋ ˆ๋ฒจ ์‹œ๊ฐ ํŠน์ง•์„ ๊ฐ•ํ™”ํ•˜๋Š” Context-aware Recalibration Transformer (CaRT) ๋ชจ๋“ˆ์„ ํ•ต์‹ฌ์œผ๋กœ ํ•ฉ๋‹ˆ๋‹ค. ์‹คํ—˜ ๊ฒฐ๊ณผ, PRIX๋Š” ๊ธฐ์กด์˜ ํฌ๊ณ  ๋ณต์žกํ•œ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ชจ๋ธ์— ํ•„์ ํ•˜๋Š” ์„ฑ๋Šฅ์„ ๋ณด์ด๋ฉด์„œ๋„ ์ถ”๋ก  ์†๋„์™€ ๋ชจ๋ธ ํฌ๊ธฐ ๋ฉด์—์„œ ํ›จ์”ฌ ํšจ์œจ์ ์ž„์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
์นด๋ฉ”๋ผ ์„ผ์„œ๋งŒ์œผ๋กœ ๊ณ ์„ฑ๋Šฅ์˜ ์ž์œจ์ฃผํ–‰ ๊ณ„ํš์ด ๊ฐ€๋Šฅํ•จ์„ ๋ณด์—ฌ์ฃผ๋ฉฐ, LiDAR ์„ผ์„œ ์˜์กด๋„๋ฅผ ๋‚ฎ์ถœ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
๊ธฐ์กด์˜ ๋ณต์žกํ•˜๊ณ  ์ž์› ์†Œ๋ชจ๊ฐ€ ํฐ ๋ชจ๋ธ์— ๋น„ํ•ด ํšจ์œจ์ ์ธ ์•„ํ‚คํ…์ฒ˜๋ฅผ ํ†ตํ•ด ๋Œ€์ค‘์ ์ธ ์ฐจ๋Ÿ‰์— ์ ์šฉ ๊ฐ€๋Šฅํ•œ ์‹ค์งˆ์ ์ธ ํ•ด๊ฒฐ์ฑ…์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์‹ค์ œ ๋ณต์žกํ•˜๊ณ  ์˜ˆ์ธก ๋ถˆ๊ฐ€๋Šฅํ•œ ๋‹ค์–‘ํ•œ ๋„๋กœ ํ™˜๊ฒฝ์—์„œ์˜ ๊ฐ•๊ฑด์„ฑ ๋ฐ ์•ˆ์ „์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ๊ฒ€์ฆ์ด ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘