Sign In

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Created by
  • Haebom
Category
Empty

์ €์ž

Jiaxin Zhang, Wendi Cui, Zhuohang Li, Lifu Huang, Bradley Malin, Caiming Xiong, Chien-Sheng Wu

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์˜ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™”(Uncertainty Quantification, UQ)๊ฐ€ ๋‹จ์ˆœํ•œ ์ง„๋‹จ ์ง€ํ‘œ๋ฅผ ๋„˜์–ด ๋ชจ๋ธ์˜ ์‹ค์‹œ๊ฐ„ ํ–‰๋™์„ ์ œ์–ดํ•˜๋Š” ๋Šฅ๋™์ ์ธ ์‹ ํ˜ธ๋กœ ๋ฐœ์ „ํ•˜๋Š” ๊ณผ์ •์„ ํƒ๊ตฌํ•ฉ๋‹ˆ๋‹ค. LLM์˜ ๋ถˆํ™•์‹ค์„ฑ์€ ๊ณ ๊ธ‰ ์ถ”๋ก , ์ž์œจ ์—์ด์ „ํŠธ, ๊ฐ•ํ™” ํ•™์Šต ๋ถ„์•ผ์—์„œ ๊ณ„์‚ฐ ์ตœ์ ํ™”, ์ž๊ธฐ ๊ต์ •, ๋„๊ตฌ ์‚ฌ์šฉ ๊ฒฐ์ •, ์ •๋ณด ํƒ์ƒ‰, ๋ณด์ƒ ํ•ดํ‚น ์™„ํ™”, ๋‚ด์žฌ์  ๋ณด์ƒ์„ ํ†ตํ•œ ์ž๊ธฐ ๊ฐœ์„  ๋“ฑ์— ์ ๊ทน์ ์œผ๋กœ ํ™œ์šฉ๋ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™”๋Š” LLM์˜ ์‹ ๋ขฐ์„ฑ์„ ๋†’์—ฌ ๊ณ ์œ„ํ—˜ ๋„๋ฉ”์ธ์—์„œ์˜ ๋ฐฐํฌ ์žฅ๋ฒฝ์„ ๋‚ฎ์ถœ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
๋ถˆํ™•์‹ค์„ฑ์€ ๊ณ ๊ธ‰ ์ถ”๋ก , ์ž์œจ ์—์ด์ „ํŠธ, ๊ฐ•ํ™” ํ•™์Šต ๋“ฑ ๋‹ค์–‘ํ•œ LLM ์‘์šฉ ๋ถ„์•ผ์—์„œ ๋Šฅ๋™์ ์ธ ์ œ์–ด ์‹ ํ˜ธ๋กœ ํ™œ์šฉ๋  ์ž ์žฌ๋ ฅ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
โ€ข
Bayesian ๋ฐฉ๋ฒ•๋ก  ๋ฐ Conformal Prediction๊ณผ ๊ฐ™์€ ์ด๋ก ์  ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ๋ถˆํ™•์‹ค์„ฑ ๊ธฐ๋ฐ˜ LLM์˜ ๋ฐœ์ „๊ณผ ์ดํ•ด๋ฅผ ์œ„ํ•œ ํ†ต์ผ๋œ ๊ด€์ ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋ณธ ๋…ผ๋ฌธ์˜ ํ•œ๊ณ„์ ์€ ํ˜„์žฌ๊นŒ์ง€ ๋ณด๊ณ ๋œ ๋ถˆํ™•์‹ค์„ฑ ์ •๋Ÿ‰ํ™” ๊ธฐ์ˆ ์˜ ์‹ค์งˆ์ ์ธ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ๊ณผ ํ™•์žฅ์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•˜๋‹ค๋Š” ์ ์ž…๋‹ˆ๋‹ค.
๐Ÿ‘