Sign In

Leave it to the Specialist: Repair Sparse LLMs with Sparse Fine-Tuning via Sparsity Evolution

Created by
  • Haebom
Category
Empty

์ €์ž

Qiao Xiao, Alan Ansell, Boqian Wu, Lu Yin, Mykola Pechenizkiy, Shiwei Liu, Decebal Constantin Mocanu

๐Ÿ’ก ๊ฐœ์š”

๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์€ ๋†’์€ ๊ณ„์‚ฐ ์š”๊ตฌ๋Ÿ‰์œผ๋กœ ์ธํ•ด ๋ฐฐํฌ์— ์–ด๋ ค์›€์„ ๊ฒช์Šต๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ์‚ฌํ›„ ํ•™์Šต ๊ฐ€์ง€์น˜๊ธฐ ๋ฐฉ๋ฒ•์€ ๋†’์€ ํฌ์†Œ๋„์—์„œ ์„ฑ๋Šฅ ์ €ํ•˜ ๋ฌธ์ œ๋ฅผ ๊ฒช์ง€๋งŒ, ์ œ์•ˆ๋œ Sparsity Evolution Fine-Tuning (SEFT)์€ ํฌ์†Œ๋„ ์œ ์ง€ ๋ฐ ๊ฐ€์ค‘์น˜ ๋“œ๋กญ-์•ค-๊ทธ๋กœ ์ „๋žต์„ ํ†ตํ•ด ํฌ์†Œ LLM์˜ ํ† ํด๋กœ์ง€๋ฅผ ๋™์ ์œผ๋กœ ์ง„ํ™”์‹œ์ผœ ์„ฑ๋Šฅ ์ €ํ•˜๋ฅผ ์™„ํ™”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
SEFT๋Š” ํฌ์†Œ LLM์— ํŠนํ™”๋œ ํŒŒ์ธํŠœ๋‹ ๋ฐฉ๋ฒ•์„ ์ œ๊ณตํ•˜์—ฌ, ๊ธฐ์กด ๊ฐ€์ง€์น˜๊ธฐ ๋ฐฉ๋ฒ•์˜ ์„ฑ๋Šฅ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ์‹ค์ œ ๋ฐฐํฌ ๊ฐ€๋Šฅํ•œ ํฌ์†Œ LLM ๊ตฌ์ถ•์— ๊ธฐ์—ฌํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๊ฐ€์ค‘์น˜ ๋“œ๋กญ-์•ค-๊ทธ๋กœ ์ „๋žต๊ณผ ๋ฏผ๊ฐ๋„ ๊ธฐ๋ฐ˜ ๊ฐ€์ง€์น˜๊ธฐ ๊ธฐ์ค€์„ ํ†ตํ•ด ๋ชจ๋ธ์ด ๋ฐ์ดํ„ฐ์…‹์— ์ ์‘ํ•˜๋ฉฐ ํฌ์†Œ์„ฑ์„ ์œ ์ง€ํ•˜๋„๋ก ํ•˜์—ฌ, ํšจ์œจ์„ฑ๊ณผ ์„ฑ๋Šฅ์„ ๋™์‹œ์— ๊ฐœ์„ ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ SEFT๋Š” ๋‹ค์–‘ํ•œ LLM ๋ฐ ๋ฒค์น˜๋งˆํฌ์—์„œ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ๊ณผ ํšจ์œจ์„ฑ์„ ์ž…์ฆํ–ˆ์ง€๋งŒ, ํŠน์ • ๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜๋‚˜ ๋ณต์žกํ•œ ๋‹ค์šด์ŠคํŠธ๋ฆผ ์ž‘์—…์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ๊ฒ€์ฆ์ด ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘