Sign In

Lance: Unified Multimodal Modeling by Multi-Task Synergy

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Fengyi Fu, Mengqi Huang, Shaojin Wu, Yunsheng Jiang, Yufei Huo, Hao Li, Yinghang Song, Fei Ding, Jianzhu Guo, Qian He, Zheren Fu, Zhendong Mao, Yongdong Zhang

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์ด๋ฏธ์ง€์™€ ๋น„๋””์˜ค๋ฅผ ๋ชจ๋‘ ์ง€์›ํ•˜๋Š” ๊ฒฝ๋Ÿ‰์˜ ํ†ตํ•ฉ ๋‹ค์ค‘๋ชจ๋‹ฌ ๋ชจ๋ธ์ธ Lance๋ฅผ ์ œ์•ˆํ•œ๋‹ค. Lance๋Š” ๋ชจ๋ธ ์šฉ๋Ÿ‰ ํ™•์žฅ์ด๋‚˜ ํ…์ŠคํŠธ-์ด๋ฏธ์ง€ ์ค‘์‹ฌ ์„ค๊ณ„ ๋Œ€์‹ , ํ˜‘๋ ฅ์ ์ธ ๋ฉ€ํ‹ฐํƒœ์Šคํฌ ํ›ˆ๋ จ์„ ํ†ตํ•ด ์‹ค์šฉ์ ์ธ ํ†ตํ•ฉ ๋‹ค์ค‘๋ชจ๋‹ฌ ๋ชจ๋ธ๋ง ํŒจ๋Ÿฌ๋‹ค์ž„์„ ํƒ๊ตฌํ•œ๋‹ค. ํ•ต์‹ฌ ์›์น™์€ ํ†ตํ•ฉ๋œ ์ปจํ…์ŠคํŠธ ๋ชจ๋ธ๋ง๊ณผ ๋ถ„๋ฆฌ๋œ ๋Šฅ๋ ฅ ๊ฒฝ๋กœ์ด๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์ดํ•ด์™€ ์ƒ์„ฑ์„ ํšจ๊ณผ์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•œ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋ชจ๋ธ ์šฉ๋Ÿ‰ ํ™•์žฅ์ด๋‚˜ ํŠน์ • ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ(ํ…์ŠคํŠธ-์ด๋ฏธ์ง€)์— ์น˜์šฐ์นœ ๊ธฐ์กด ๋ฐฉ์‹์—์„œ ๋ฒ—์–ด๋‚˜, ํ˜‘๋ ฅ์ ์ธ ๋ฉ€ํ‹ฐํƒœ์Šคํฌ ํ›ˆ๋ จ์„ ํ†ตํ•ด ํšจ์œจ์ ์ธ ํ†ตํ•ฉ ๋‹ค์ค‘๋ชจ๋‹ฌ ๋ชจ๋ธ์„ ๊ตฌ์ถ•ํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ค€๋‹ค.
โ€ข
์ด๋ฏธ์ง€ ๋ฐ ๋น„๋””์˜ค ์ƒ์„ฑ์—์„œ ๊ธฐ์กด ์˜คํ”ˆ์†Œ์Šค ํ†ตํ•ฉ ๋ชจ๋ธ ๋Œ€๋น„ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜๋ฉด์„œ๋„ ๊ฐ•๋ ฅํ•œ ๋‹ค์ค‘๋ชจ๋‹ฌ ์ดํ•ด ๋Šฅ๋ ฅ์„ ์œ ์ง€ํ•˜๋Š” ์‹ค์šฉ์ ์ธ ์ ‘๊ทผ ๋ฐฉ์‹์„ ์ œ์‹œํ•œ๋‹ค.
โ€ข
์ œ์•ˆ๋œ ๋ชจ๋ธ์˜ ๋ถ„๋ฆฌ๋œ ๋Šฅ๋ ฅ ๊ฒฝ๋กœ์™€ ์ ์‘ํ˜• ๋ฐ์ดํ„ฐ ์Šค์ผ€์ค„๋ง ์ „๋žต์ด ๋‹ค์–‘ํ•œ ๋‹ค์ค‘๋ชจ๋‹ฌ ์ž‘์—…์—์„œ์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์— ๊ธฐ์—ฌํ•œ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์‹œ๋œ ๋ฉ€ํ‹ฐํƒœ์Šคํฌ ์‹œ๋„ˆ์ง€ ํšจ๊ณผ๋ฅผ ๋”์šฑ ํ™•์žฅํ•˜๊ณ , ๋‹ค์–‘ํ•œ ์œ ํ˜•์˜ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ(์˜ˆ: ์˜ค๋””์˜ค, 3D ๋ฐ์ดํ„ฐ)๋ฅผ ํ†ตํ•ฉํ•˜๋Š” ๋ฐ ์žˆ์–ด ๋ณธ ๋ชจ๋ธ์˜ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ํƒ๊ตฌํ•  ํ•„์š”๊ฐ€ ์žˆ๋‹ค.
๐Ÿ‘