Sign In

MemFly: On-the-Fly Memory Optimization via Information Bottleneck

Created by
  • Haebom
Category
Empty

์ €์ž

Zhenyuan Zhang, Xianzhang Jia, Zhiqin Yang, Zhenbo Song, Wei Xue, Sirui Han, Yike Guo

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ์—ฐ๊ตฌ๋Š” ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM) ์—์ด์ „ํŠธ์˜ ์žฅ๊ธฐ ๊ธฐ์–ต ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ์ •๋ณด ๋ณ‘๋ชฉ(Information Bottleneck) ์›๋ฆฌ์— ๊ธฐ๋ฐ˜ํ•œ MemFly ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. MemFly๋Š” ์••์ถ• ์‹œ ๋ฐœ์ƒํ•˜๋Š” ์ •๋ณด ์†์‹ค์„ ์ตœ์†Œํ™”ํ•˜๊ณ  ๊ฒ€์ƒ‰ ์‹œ ๊ด€๋ จ์„ฑ ์ •๋ณด๋ฅผ ์ตœ๋Œ€ํ™”ํ•˜๋„๋ก ์„ค๊ณ„๋˜์—ˆ์œผ๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์˜จ๋””๋งจ๋“œ(on-the-fly) ๋ฉ”๋ชจ๋ฆฌ ์ตœ์ ํ™”๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์ œ์•ˆ๋œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๊ฒ€์ƒ‰ ๋ฉ”์ปค๋‹ˆ์ฆ˜์€ ์˜๋ฏธ๋ก ์ , ๊ธฐํ˜ธ์ , ์œ„์ƒํ•™์  ์ •๋ณด๋ฅผ ํ†ตํ•ฉํ•˜๊ณ  ๋ฐ˜๋ณต์ ์ธ ์ •์ œ๋ฅผ ํ†ตํ•ด ๋ณต์žกํ•œ ๋‹ค๋‹จ๊ณ„ ์ฟผ๋ฆฌ์— ํšจ๊ณผ์ ์œผ๋กœ ๋Œ€์‘ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
LLM์˜ ์žฅ๊ธฐ ๊ธฐ์–ต ๊ด€๋ฆฌ์—์„œ ๋ฐœ์ƒํ•˜๋Š” ์ •๋ณด ์••์ถ•๊ณผ ์ •ํ™•ํ•œ ๊ฒ€์ƒ‰ ๊ฐ„์˜ ์ƒ์ถฉ ๊ด€๊ณ„๋ฅผ ์ •๋ณด ๋ณ‘๋ชฉ ์ด๋ก ์œผ๋กœ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ๋Š” ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
MemFly๋Š” ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ก  ๋Œ€๋น„ ๋ฉ”๋ชจ๋ฆฌ ์ผ๊ด€์„ฑ, ์‘๋‹ต ์ถฉ์‹ค๋„, ์ •ํ™•๋„ ์ธก๋ฉด์—์„œ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋ฉฐ, ๋ณต์žกํ•œ ๋‹ค๋‹จ๊ณ„ ์ฟผ๋ฆฌ ์ฒ˜๋ฆฌ ๋Šฅ๋ ฅ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ต๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ํ•™์Šต ๋ฐ ์ถ”๋ก  ์‹œ ๊ณ„์‚ฐ ๋ณต์žก์„ฑ์ด๋‚˜ ํ™•์žฅ์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ๋ถ„์„์ด ํ•„์š”ํ•˜๋ฉฐ, ์‹ค์ œ ์ ์šฉ ํ™˜๊ฒฝ์—์„œ์˜ ํšจ์œจ์„ฑ ๊ฒ€์ฆ์ด ์š”๊ตฌ๋ฉ๋‹ˆ๋‹ค.
๐Ÿ‘