Daily Arxiv

This page collects artificial intelligence papers published around the world.
Summaries on this page are generated with Google Gemini, and the page is run on a non-profit basis.
Copyright for each paper belongs to its authors and their institutions; when sharing, simply cite the source.

IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation

Created by
  • Haebom

Authors

Yuan Yin, Shashanka Venkataramanan, Tuan-Hung Vu, Andrei Bursuc, Matthieu Cord

💡 Overview

This paper proposes IPA, a new parameter-efficient fine-tuning (PEFT) method for adapting pretrained models efficiently. Unlike LoRA, IPA uses a feature-based projection that preserves input information, which addresses a performance bottleneck. On language and vision benchmarks, IPA outperforms LoRA and DoRA, with accuracy 1.5 points higher on commonsense reasoning and 2.3 points higher on VTAB-1k.
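For context, the LoRA-style update that IPA is contrasted with can be sketched as follows. This is a minimal pure-Python illustration of the standard LoRA formulation (a frozen weight `W` plus a low-rank path `B · A`, with `A` randomly initialized and `B` initialized to zero), not code from the paper. The helper names `matvec` and `adapter_forward`, the dimensions, and the `scale` value are assumptions chosen for illustration.

```python
import random


def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, x)) for row in M]


def adapter_forward(W, A, B, x, scale=1.0):
    """Compute y = W x + scale * B (A x).

    W is the frozen pretrained weight (d_out x d_in).
    A is the down-projection (r x d_in); B is the up-projection (d_out x r).
    """
    base = matvec(W, x)
    z = matvec(A, x)        # r-dimensional compressed view of the input
    delta = matvec(B, z)    # low-rank correction mapped back to d_out
    return [b + scale * d for b, d in zip(base, delta)]


rng = random.Random(0)
d_in, d_out, r = 6, 4, 2    # illustrative sizes only

W = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
A = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(r)]  # random init
B = [[0.0] * r for _ in range(d_out)]                           # zero init
x = [rng.gauss(0, 1) for _ in range(d_in)]

# With B at zero, the adapted layer starts out identical to the frozen layer.
print(adapter_forward(W, A, B, x) == matvec(W, x))
```

The vector `A x` is the only view of the input that the low-rank path sees, so how `A` is chosen determines how much input information survives the projection. That is the step the summary describes IPA as replacing with a feature-based projection.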

🔑 Implications and Limitations

• IPA offers an effective alternative that improves on existing PEFT methods such as LoRA and contributes to efficient adaptation of pretrained models.
• By using a projection that preserves input information, IPA minimizes information loss and maximizes performance.
• In the linear case, IPA uses an algorithm that approximates the top principal components, which leaves room for extension to nonlinear cases.
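To make the idea of a projection built from top principal components concrete, here is a minimal sketch. It assumes plain power iteration on synthetic features and a rank-1 projection for brevity; it is not the algorithm from the paper, and all names (`top_principal_direction`, `reconstruction_error`) and data are hypothetical. It compares how well inputs can be reconstructed after projecting onto a data-derived direction versus a random one.

```python
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def top_principal_direction(X, iters=200, seed=0):
    """Approximate the top principal direction of the rows of X by power iteration."""
    n, d = len(X), len(X[0])
    mean = [sum(row[j] for row in X) / n for j in range(d)]
    Xc = [[row[j] - mean[j] for j in range(d)] for row in X]
    rng = random.Random(seed)
    v = [rng.gauss(0, 1) for _ in range(d)]
    for _ in range(iters):
        # w = Xc^T (Xc v), which is proportional to (covariance matrix) @ v
        scores = [dot(row, v) for row in Xc]
        w = [sum(s * row[j] for s, row in zip(scores, Xc)) for j in range(d)]
        norm = dot(w, w) ** 0.5
        v = [c / norm for c in w]
    return mean, v


def reconstruction_error(X, mean, v):
    """Mean squared error after projecting centered rows onto unit vector v and back."""
    total = 0.0
    for row in X:
        c = [a - m for a, m in zip(row, mean)]
        coef = dot(c, v)
        total += sum((a - coef * b) ** 2 for a, b in zip(c, v))
    return total / len(X)


# Synthetic 4-dimensional "features" with one dominant direction plus small noise.
rng = random.Random(1)
X = []
for _ in range(200):
    t = rng.gauss(0, 1)
    X.append([3 * t + rng.gauss(0, 0.1), t + rng.gauss(0, 0.1),
              rng.gauss(0, 0.1), rng.gauss(0, 0.1)])

mean, v_pca = top_principal_direction(X)

# A random unit direction, standing in for a data-agnostic projection.
raw = [rng.gauss(0, 1) for _ in range(4)]
raw_norm = dot(raw, raw) ** 0.5
v_rand = [c / raw_norm for c in raw]

err_pca = reconstruction_error(X, mean, v_pca)
err_rand = reconstruction_error(X, mean, v_rand)
print(f"principal direction error: {err_pca:.4f}")
print(f"random direction error:    {err_rand:.4f}")
```

Among all rank-1 linear projections of the centered data, the top principal direction gives the lowest reconstruction error, so the first printed value should not exceed the second. A rank-r version would keep the top r directions instead of one.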