Sign In

Margin and Consistency Supervision for Calibrated and Robust Vision Models

Created by
  • Haebom
Category
Empty

์ €์ž

Salim Khazem

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๋”ฅ๋Ÿฌ๋‹ ๋น„์ „ ๋ถ„๋ฅ˜ ๋ชจ๋ธ์˜ ๋ถˆ์ถฉ๋ถ„ํ•œ ๋ณด์ •(calibration)๊ณผ ์ž‘์€ ๋ถ„ํฌ ๋ณ€ํ™”์—๋„ ์ทจ์•ฝํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด Margin and Consistency Supervision (MaCS)๋ผ๋Š” ์ƒˆ๋กœ์šด ๊ทœ์ œ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. MaCS๋Š” ๋กœ์ง“(logit) ๊ณต๊ฐ„์—์„œ์˜ ๋ถ„๋ฆฌ ๊ฐ•ํ™”์™€ ๊ตญ์†Œ์  ์˜ˆ์ธก ์•ˆ์ •์„ฑ์„ ๋™์‹œ์— ํ™•๋ณดํ•จ์œผ๋กœ์จ, ํ‘œ์ค€์ ์ธ ๊ต์ฐจ ์—”ํŠธ๋กœํ”ผ ์†์‹ค ํ•จ์ˆ˜๋ฅผ ๋ณด์™„ํ•œ๋‹ค. ์ด ๋ฐฉ๋ฒ•์€ ์ถ”๊ฐ€ ๋ฐ์ดํ„ฐ๋‚˜ ์•„ํ‚คํ…์ฒ˜ ์ˆ˜์ • ์—†์ด๋„ ๋ณด์ • ์„ฑ๋Šฅ๊ณผ ๊ฐ•๊ฑด์„ฑ(robustness)์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ํšจ๊ณผ๋ฅผ ๋ณด์ธ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋ณด์ •๊ณผ ๊ฐ•๊ฑด์„ฑ ํ–ฅ์ƒ: MaCS๋Š” ๋ชจ๋ธ์˜ ์˜ˆ์ธก์ด ์–ผ๋งˆ๋‚˜ ์ •ํ™•ํ•œ์ง€์— ๋Œ€ํ•œ ์‹ ๋ขฐ๋„๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ๋ณด์ • ์„ฑ๋Šฅ๊ณผ, ๋ฐ์ดํ„ฐ ๋ถ„ํฌ๊ฐ€ ๋ณ€ํ•ด๋„ ์˜ˆ์ธก์ด ์•ˆ์ •์ ์œผ๋กœ ์œ ์ง€๋˜๋Š” ๊ฐ•๊ฑด์„ฑ์„ ๋™์‹œ์— ๊ฐœ์„ ํ•œ๋‹ค.
โ€ข
์ด๋ก ์  ๊ธฐ๋ฐ˜ ์ œ๊ณต: ๋ถ„๋ฅ˜ ๋งˆ์ง„ ์ฆ๊ฐ€์™€ ๊ตญ์†Œ์  ๋ฏผ๊ฐ๋„ ๊ฐ์†Œ๊ฐ€ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ ํ–ฅ์ƒ ๋ฐ ๊ฐ•๊ฑด์„ฑ ๋ฐ˜๊ฒฝ ํ™•์žฅ์— ๊ธฐ์—ฌํ•จ์„ ์ด๋ก ์ ์œผ๋กœ ๋ถ„์„ํ•˜๊ณ , ์ด๋Š” ๋งˆ์ง„ ๋Œ€๋น„ ๋ฏผ๊ฐ๋„ ๋น„์œจ์— ๋”ฐ๋ผ ํ™•์žฅ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์ธ๋‹ค.
โ€ข
์‹ค์šฉ์ ์ธ ์ ์šฉ ์šฉ์ด์„ฑ: ์ถ”๊ฐ€ ๋ฐ์ดํ„ฐ, ์•„ํ‚คํ…์ฒ˜ ๋ณ€๊ฒฝ, ์ถ”๋ก  ์˜ค๋ฒ„ํ—ค๋“œ ์—†์ด ๊ธฐ์กด ํ•™์Šต ๋ชฉํ‘œ๋ฅผ ๋Œ€์ฒดํ•  ์ˆ˜ ์žˆ๋Š” ๋“œ๋กญ์ธ(drop-in) ์†”๋ฃจ์…˜์œผ๋กœ, CNN ๋ฐ Vision Transformer๋ฅผ ํฌํ•จํ•œ ๋‹ค์–‘ํ•œ ๋ฐฑ๋ณธ ๋ชจ๋ธ์— ์ ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค.
โ€ข
์ œํ•œ๋œ ์‹คํ—˜ ๋ฒ”์œ„: ์ฃผ๋กœ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ๋ฒค์น˜๋งˆํฌ์—์„œ์˜ ์„ฑ๋Šฅ์„ ๊ฒ€์ฆํ•˜์˜€์œผ๋ฉฐ, ๋‹ค๋ฅธ ๋น„์ „ ํƒœ์Šคํฌ(์˜ˆ: ๊ฐ์ฒด ํƒ์ง€, ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜)์—์„œ์˜ ํšจ๊ณผ๋Š” ์ถ”๊ฐ€์ ์ธ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•  ์ˆ˜ ์žˆ๋‹ค.
๐Ÿ‘