
Human Supervision as an Information Bottleneck: A Unified Theory of Error Floors in Human-Guided Learning

Created by
  • Haebom

Author

Alejandro Rodriguez Dominguez

💡 Overview

This study presents a unified theory of the errors that persist in large language models even though they are trained on human-generated data and feedback. It attributes these errors to structural limitations of the human supervision channel itself rather than to model scale or optimization. The paper proves mathematically that when the human supervision channel does not fully capture the latent evaluation target, the channel acts as an information compressor and necessarily imposes an excess-risk floor on the learner.
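The idea of a floor that no learner can beat can be illustrated with a toy calculation. The sketch below is not the paper's construction: it assumes a hypothetical latent target `Y` that is uniform over four values and a hypothetical annotator who reports only a coarse bucket (`Y // 2`). It then computes, by exact enumeration, the best achievable 0-1 error from each supervision signal; the gap between the two is the excess-risk floor in this toy setting.

```python
from collections import Counter, defaultdict
from fractions import Fraction


def bayes_error(pairs):
    """Smallest achievable 0-1 error for predicting y from signal s.

    `pairs` is a list of equally likely (s, y) outcomes.
    """
    by_signal = defaultdict(Counter)
    for s, y in pairs:
        by_signal[s][y] += 1
    # For each signal value, the best rule guesses the most common y.
    correct = sum(max(counts.values()) for counts in by_signal.values())
    return 1 - Fraction(correct, len(pairs))


# Hypothetical latent evaluation target Y, uniform over four values.
latent_targets = [0, 1, 2, 3]

# Idealized channel: the supervision signal reveals Y exactly.
full_channel = [(y, y) for y in latent_targets]

# Lossy human channel (assumption): the annotator reports only a coarse bucket.
lossy_channel = [(y // 2, y) for y in latent_targets]

floor = bayes_error(lossy_channel) - bayes_error(full_channel)
print(floor)  # 1/2
```

Because `bayes_error` already takes the best possible prediction rule for each signal value, a larger model or a better optimizer cannot push the error under the lossy channel below this value in the toy setting.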

🔑 Implications and Limitations

• The information-compressing nature of the human supervision channel creates an intrinsic error floor that is independent of model size and optimization method, which suggests a fundamental problem that scaling alone cannot solve.
• Non-human auxiliary signals such as retrieval, program execution, and tool use can increase the effective capacity of the supervision channel and thereby lower or even eliminate this error floor.
• The theory provides a structural account of situations in which the latent evaluation target is not fully conveyed through the human supervision channel, such as noise in human feedback, preference distortion, and semantic compression.
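The point about auxiliary signals can be sketched with a small information-theoretic toy. This is an illustration under stated assumptions, not the paper's model: the latent target `Y` is taken to be uniform over four values, the human signal is a hypothetical coarse bucket (`Y // 2`), and the tool signal is a hypothetical check that reveals `Y % 2`. The conditional entropy H(Y | S) measures how many bits of the target the supervision signal S fails to convey.

```python
import math
from collections import Counter


def conditional_entropy_bits(pairs):
    """H(Y | S) in bits for a list of equally likely (s, y) outcomes."""
    n = len(pairs)
    joint = Counter(pairs)
    marginal = Counter(s for s, _ in pairs)
    # H(Y | S) = sum over (s, y) of p(s, y) * log2(p(s) / p(s, y)).
    return sum(
        (count / n) * math.log2(marginal[s] / count)
        for (s, _), count in joint.items()
    )


# Hypothetical latent target Y, uniform over {0, 1, 2, 3}.
targets = range(4)

# Human supervision alone (assumption): only a coarse bucket is reported.
human_only = [(y // 2, y) for y in targets]

# Human supervision plus a hypothetical tool signal that reveals y % 2.
human_plus_tool = [((y // 2, y % 2), y) for y in targets]

print(conditional_entropy_bits(human_only))       # 1.0
print(conditional_entropy_bits(human_plus_tool))  # 0.0
```

In this toy setting the human signal alone leaves one bit of the target unresolved, while the combined signal determines the target completely, so the residual uncertainty behind the error floor disappears.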