Sign In

StableGrad: Backward Scale Control without Batch Normalization

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Jose I. Mestre, Alberto Fernandez-Hernandez, Cristian Perez-Corral, Manuel F. Dolz, Enrique S. Quintana-Orti

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๋ฐฐ์น˜ ์ •๊ทœํ™”(Batch Normalization) ์—†์ด ๊นŠ์€ ์‹ ๊ฒฝ๋ง ํ›ˆ๋ จ ์‹œ ๋ฐœ์ƒํ•˜๋Š” ๋ถˆ์•ˆ์ •์„ฑ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด StableGrad๋ผ๋Š” ์ƒˆ๋กœ์šด ์ตœ์ ํ™”๊ธฐ ๋ ˆ๋ฒจ์˜ ์Šค์ผ€์ผ ์ œ์–ด ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. StableGrad๋Š” ์ˆœ๋ฐฉํ–ฅ ๋ชจ๋ธ์„ ์ˆ˜์ •ํ•˜์ง€ ์•Š๊ณ  ์—ญ์ „ํŒŒ ํ›„ ๊ฐ€์ค‘์น˜-๊ทธ๋ž˜๋””์–ธํŠธ์˜ ๋ถˆ๊ท ํ˜•์„ ์ˆ˜์ •ํ•˜์—ฌ, ํŠนํžˆ ๋ฐฐ์น˜ ์ •๊ทœํ™”๊ฐ€ ๋ถ€์ ํ•ฉํ•œ ๋ฌผ๋ฆฌ ์ •๋ณด ์‹ ๊ฒฝ๋ง(PINN) ๋ฐ ๋ฐฐ์น˜ ์ •๊ทœํ™”๋ฅผ ์ œ๊ฑฐํ•œ CNN์—์„œ ํ›ˆ๋ จ ์•ˆ์ •์„ฑ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋ฐฐ์น˜ ์ •๊ทœํ™”์™€ ๊ฐ™์€ ์ˆœ๋ฐฉํ–ฅ ์ •๊ทœํ™” ๊ธฐ๋ฒ• ์—†์ด๋„ ๊นŠ์€ ์‹ ๊ฒฝ๋ง์˜ ํ›ˆ๋ จ ์•ˆ์ •์„ฑ์„ ํ™•๋ณดํ•  ์ˆ˜ ์žˆ๋Š” ์ƒˆ๋กœ์šด ์ ‘๊ทผ ๋ฐฉ์‹์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋ฌผ๋ฆฌ ์ •๋ณด ์‹ ๊ฒฝ๋ง(PINN) ๋ถ„์•ผ์—์„œ ๋ฐฐ์น˜ ์ •๊ทœํ™”์˜ ๋ถ€์ ํ•ฉ์„ฑ์„ ํ•ด๊ฒฐํ•˜๊ณ , ๋” ๊นŠ์€ ๋ชจ๋ธ์˜ ์ •ํ™•๋„์™€ ์‹ ๋ขฐ์„ฑ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๋ฐ ๊ธฐ์—ฌํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋ฐฐ์น˜ ์ •๊ทœํ™”๋ฅผ ์ œ๊ฑฐํ–ˆ์„ ๋•Œ ํ›ˆ๋ จ ๋ถ•๊ดด๊ฐ€ ๋ฐœ์ƒํ•˜๋Š” ResNet, EfficientNet๊ณผ ๊ฐ™์€ ์ผ๋ฐ˜์ ์ธ CNN์—์„œ๋„ ํ›ˆ๋ จ์„ ์•ˆ์ •ํ™”์‹œํ‚ค๋Š” ํšจ๊ณผ๋ฅผ ๋ณด์—ฌ, ๊ด‘๋ฒ”์œ„ํ•œ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
โ€ข
StableGrad์˜ ์ด๋ก ์  ๋ถ„์„์„ ํ†ตํ•ด ํšจ๊ณผ์ ์ธ ํ›ˆ๋ จ ์—ญํ•™์„ ์„ค๋ช…ํ•˜์ง€๋งŒ, ํŠน์ • ์•„ํ‚คํ…์ฒ˜๋‚˜ ๋ฌธ์ œ ์œ ํ˜•์— ๋Œ€ํ•œ ์ตœ์ ์˜ ์Šค์ผ€์ผ๋ง ์ „๋žต ํƒ์ƒ‰, ๋˜๋Š” ์ถ”๊ฐ€์ ์ธ ๋ณต์žกํ•œ ๋ฌธ์ œ์— ๋Œ€ํ•œ ํ™•์žฅ์„ฑ ๊ฒ€์ฆ์€ ํ–ฅํ›„ ์—ฐ๊ตฌ ๊ณผ์ œ๋กœ ๋‚จ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘