Sign In

S-PRESSO: Ultra Low Bitrate Sound Effect Compression With Diffusion Autoencoders And Offline Quantization

Created by
  • Haebom
Category
Empty

์ €์ž

Zineb Lahrichi (IP Paris), Gaetan Hadjeres (IP Paris), Gael Richard (IP Paris), Geoffroy Peeters (IP Paris)

๐Ÿ’ก ๊ฐœ์š”

์ด ๋…ผ๋ฌธ์€ ์ดˆ์ €๋น„ํŠธ์œจ์—์„œ๋„ ๊ณ ํ’ˆ์งˆ์˜ ์‚ฌ์šด๋“œ ํšจ๊ณผ ์••์ถ•์„ ๋‹ฌ์„ฑํ•˜๋Š” S-PRESSO๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์‚ฌ์šด๋“œ ํšจ๊ณผ ์••์ถ•์—์„œ ๊ธฐ์กด ๋ชจ๋ธ๋“ค์ด ๊ฒช๋Š” ์ €ํ•ด์ƒ๋„ ์˜ค๋””์˜ค ๋ฐ ๊ณ ์••์ถ• ์‹œ ๋ฐœ์ƒํ•˜๋Š” ์Œ์งˆ ์ €ํ•˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์‚ฌ์ „ ํ•™์Šต๋œ ํ™•์‚ฐ ๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ์ธ์ฝ”๋”-๋””์ฝ”๋” ๊ตฌ์กฐ์™€ ์˜คํ”„๋ผ์ธ ์–‘์žํ™” ๊ธฐ๋ฒ•์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด 1Hz์˜ ๋งค์šฐ ๋‚ฎ์€ ํ”„๋ ˆ์ž„ ์†๋„์—์„œ๋„ ์‹ค์ œ์™€ ์œ ์‚ฌํ•œ ์‚ฌ์šด๋“œ ํšจ๊ณผ๋ฅผ ๋ณต์›ํ•˜๋ฉฐ, ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ก  ๋Œ€๋น„ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
0.096 kbps์™€ ๊ฐ™์€ ์ดˆ์ €๋น„ํŠธ์œจ์—์„œ๋„ ์Œํ–ฅ์  ์œ ์‚ฌ์„ฑ๊ณผ ๋ณต์› ์ธก๋ฉด์—์„œ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” ์‚ฌ์šด๋“œ ํšจ๊ณผ ์••์ถ• ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ™•์‚ฐ ๋ชจ๋ธ์˜ ์ƒ์„ฑ์  ์‚ฌ์ „ ์ง€์‹์„ ํ™œ์šฉํ•˜์—ฌ ๋†’์€ ์••์ถ•๋ฅ ์—์„œ๋„ ์‚ฌ์‹ค์ ์ธ ์‚ฌ์šด๋“œ ํšจ๊ณผ๋ฅผ ๋ณต์›ํ•˜๋Š” ์ƒˆ๋กœ์šด ์ ‘๊ทผ ๋ฐฉ์‹์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•์€ ์ •ํ™•ํ•œ ์ถฉ์‹ค๋„(exact fidelity)๋ณด๋‹ค๋Š” ์‚ฌ์‹ค์„ฑ(realism)์— ์ดˆ์ ์„ ๋งž์ถ”๊ณ  ์žˆ์œผ๋ฏ€๋กœ, ์›๋ณธ๊ณผ์˜ ์™„๋ฒฝํ•œ ์ผ์น˜๋ฅผ ์š”๊ตฌํ•˜๋Š” ๊ฒฝ์šฐ์—๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘