Sign In

Why Deep Jacobian Spectra Separate: Depth-Induced Scaling and Singular-Vector Alignment

Created by
  • Haebom
Category
Empty

์ €์ž

Nathanael Haas, Franc\c{c}ois Gatine, Augustin M Cosse, Zied Bouraoui

๐Ÿ’ก ๊ฐœ์š”

์ด ๋…ผ๋ฌธ์€ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ ํ•™์Šต ์‹œ ๋‚˜ํƒ€๋‚˜๋Š” ๊ทธ๋ž˜๋””์–ธํŠธ ๊ธฐ๋ฐ˜ ํ•™์Šต์˜ ์•”๋ฌต์  ํŽธํ–ฅ์„ ์ดํ•ดํ•˜๊ธฐ ์œ„ํ•ด ๋”ฅ ์ œ์ด์ฝ”๋น„์•ˆ ์ŠคํŽ™ํŠธ๋ผ์— ์ฃผ๋ชฉํ•ฉ๋‹ˆ๋‹ค. ์—ฐ๊ตฌ์ง„์€ ๊นŠ์ด์— ๋”ฐ๋ฅธ ํŠน์ด๊ฐ’์˜ ์ง€์ˆ˜์  ์Šค์ผ€์ผ๋ง๊ณผ ๊ฐ•๋ ฅํ•œ ์ŠคํŽ™ํŠธ๋Ÿผ ๋ถ„๋ฆฌ๋ผ๋Š” ๋‘ ๊ฐ€์ง€ ํŠน์„ฑ์„ ์ œ์•ˆํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์ค‘๊ฐ„ ์ œ์ด์ฝ”๋น„์•ˆ ํ–‰๋ ฌ๋“ค์ด ๊ทผ์‚ฌ์ ์œผ๋กœ ๋‚ฎ์€ ๋žญํฌ๋ฅผ ๊ฐ–๊ฒŒ ๋จ์„ ๋ณด์ž…๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ œ์ด์ฝ”๋น„์•ˆ ๊ตฌ์กฐ๊ฐ€ ๋”ฅ๋Ÿฌ๋‹์˜ ์•”๋ฌต์  ํŽธํ–ฅ์„ ์„ค๋ช…ํ•˜๋Š” ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์˜ ๊นŠ์ด๊ฐ€ ๊นŠ์–ด์งˆ์ˆ˜๋ก ์ œ์ด์ฝ”๋น„์•ˆ์˜ ํŠน์ด๊ฐ’์ด ์ง€์ˆ˜์ ์œผ๋กœ ์Šค์ผ€์ผ๋ง๋˜๊ณ  ์ŠคํŽ™ํŠธ๋Ÿผ์ด ๋ถ„๋ฆฌ๋˜๋Š” ํ˜„์ƒ์ด ์•”๋ฌต์  ํŽธํ–ฅ์„ ์„ค๋ช…ํ•˜๋Š” ์ค‘์š”ํ•œ ๋‹จ์„œ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
โ€ข
์ œ์ด์ฝ”๋น„์•ˆ์˜ ์ŠคํŽ™ํŠธ๋Ÿผ ๋ถ„๋ฆฌ๋Š” ํ–‰๋ ฌ ๊ณฑ์…ˆ์—์„œ ํŠน์ด ๋ฒกํ„ฐ์˜ ์ •๋ ฌ์„ ์œ ๋„ํ•˜์—ฌ, ์ค‘๊ฐ„ ์ œ์ด์ฝ”๋น„์•ˆ๋“ค์ด ๊ทผ์‚ฌ์ ์œผ๋กœ ์œ ์‚ฌํ•œ ํŠน์ด ๋ฒกํ„ฐ ๊ธฐ์ €๋ฅผ ๊ณต์œ ํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ ๋ถ„์„์€ ๊ณ ์ •๋œ ๊ฒŒ์ดํŠธ(fixed-gates)๋ฅผ ๊ฐ€์ง„ piecewise-linear ๋„คํŠธ์›Œํฌ์— ๊ธฐ๋ฐ˜ํ•˜๋ฏ€๋กœ, ์‹ค์ œ ๋ณต์žกํ•œ ์‹ ๊ฒฝ๋ง์—์„œ์˜ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์œ ํ•œ ๊นŠ์ด์—์„œ์˜ ๋ณด์ •(finite-depth corrections) ๋ฐ ์‹ค์ œ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์— ๋Œ€ํ•œ ์‹คํ—˜์  ๊ฒ€์ฆ์„ ํ™•์žฅํ•˜๋Š” ๊ฒƒ์ด ํ–ฅํ›„ ๊ณผ์ œ์ž…๋‹ˆ๋‹ค.
๐Ÿ‘