Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Models

Posted by
  • Haebom

Authors

Mingda Li, Rundong Lv, Xinyu Li, Weinan Zhang, Ting Liu

💡 Overview

This paper proposes an uncertainty quantification (UQ) method aimed at making large language models (LLMs) more reliable. To address the high computational cost and variance of existing sampling-based UQ approaches, the authors develop SemGrad, which they describe as the first gradient-based UQ method built on the stability of the output distribution under semantically equivalent changes to the input. SemGrad uses gradients computed in semantic space and gives efficient, effective uncertainty estimates, particularly in settings where more than one answer can be correct.
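
The core intuition, that a prediction is less trustworthy when the output likelihood reacts strongly to small changes in an input embedding, can be illustrated with a toy model. The sketch below is not the paper's SemGrad implementation: the softmax classifier, the weight matrix `W`, the example embeddings, and the helper `grad_norm_score` are all hypothetical and chosen only to show a gradient taken with respect to an embedding rather than with respect to model parameters.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D array of logits."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def grad_norm_score(W, emb, target):
    """Return (norm, grad) of d log p(target | emb) / d emb.

    Toy assumption: p = softmax(W @ emb). For this model the gradient
    has the closed form W^T (onehot(target) - p), so no autodiff is needed.
    """
    p = softmax(W @ emb)
    onehot = np.zeros_like(p)
    onehot[target] = 1.0
    grad = W.T @ (onehot - p)
    return float(np.linalg.norm(grad)), grad

# Hypothetical 3-class model over a 2-D embedding space.
W = np.array([[2.0, 0.0],
              [0.0, 2.0],
              [-1.0, -1.0]])

confident_emb = np.array([3.0, 0.0])  # logits strongly favor class 0
ambiguous_emb = np.array([0.1, 0.1])  # logits nearly tied across classes

confident_score, _ = grad_norm_score(W, confident_emb, target=0)
ambiguous_score, _ = grad_norm_score(W, ambiguous_emb, target=0)

print(f"confident input: {confident_score:.4f}")
print(f"ambiguous input: {ambiguous_score:.4f}")
```

In this toy setting the gradient norm is much larger for the ambiguous input than for the confident one, which is the kind of signal a gradient-based uncertainty score relies on. How SemGrad defines its semantic space and selects embeddings is specified in the paper and is not reproduced here.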

🔑 Implications and Limitations

• The work extends gradient-based UQ from parameter space to semantic space, offering a new way to measure LLM uncertainty efficiently.
• SemGrad and HybridGrad substantially improve computational efficiency and performance over sampling-based methods, with especially strong UQ performance on generation tasks that are ambiguous or admit multiple correct answers.
• The paper introduces the 'Semantic Preservation Score (SPS)', which identifies and selects embeddings that preserve semantics well, improving the accuracy of the gradient computation.
• The generalization performance of the proposed method and its applicability to a range of LLM architectures require further study.