Sign In

Real-Time Trust Verification for Safe Agentic Actions using TrustBench

Created by
  • Haebom
Category
Empty

์ €์ž

Tavishi Sharma, Vinayak Sharma, Pragya Sharma

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์ด ๋Œ€ํ™”ํ˜• ๋น„์„œ์—์„œ ์ž์œจ ์—์ด์ „ํŠธ๋กœ ์ง„ํ™”ํ•จ์— ๋”ฐ๋ผ, ์‚ฌํ›„ ํ‰๊ฐ€์—์„œ ์‹ค์‹œ๊ฐ„ ํ–‰๋™ ๊ฒ€์ฆ์œผ๋กœ์˜ ์ „ํ™˜์ด ํ•„์š”ํ•จ์„ ๊ฐ•์กฐํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด ํ”„๋ ˆ์ž„์›Œํฌ๋“ค์ด ๊ณผ์ œ ์™„๋ฃŒ๋‚˜ ์ถœ๋ ฅ ํ’ˆ์งˆ ํ‰๊ฐ€์— ์ง‘์ค‘ํ•˜๋Š” ๋ฐ˜๋ฉด, ๋ณธ ๋…ผ๋ฌธ์€ ์—์ด์ „ํŠธ ์‹คํ–‰ ์ค‘์— ์œ ํ•ดํ•œ ํ–‰๋™์„ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•œ 'TrustBench'๋ผ๋Š” ์‹ค์‹œ๊ฐ„ ์‹ ๋ขฐ์„ฑ ๊ฒ€์ฆ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. TrustBench๋Š” ์—์ด์ „ํŠธ๊ฐ€ ํ–‰๋™์„ ์‹คํ–‰ํ•˜๊ธฐ ์ง์ „์— ์•ˆ์ „์„ฑ๊ณผ ์‹ ๋ขฐ์„ฑ์„ ๊ฒ€์ฆํ•˜์—ฌ ์œ ํ•ดํ•œ ํ–‰๋™์„ ํš๊ธฐ์ ์œผ๋กœ ๊ฐ์†Œ์‹œํ‚ต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
์ž์œจ ์—์ด์ „ํŠธ์˜ ์•ˆ์ „์„ฑ๊ณผ ์‹ ๋ขฐ์„ฑ์„ ๋ณด์žฅํ•˜๊ธฐ ์œ„ํ•ด ํ–‰๋™ ์‹คํ–‰ ์ „์— ์‹ค์‹œ๊ฐ„์œผ๋กœ ๊ฒ€์ฆํ•˜๋Š” ์ƒˆ๋กœ์šด ์ ‘๊ทผ ๋ฐฉ์‹์˜ ์ค‘์š”์„ฑ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋„๋ฉ”์ธ๋ณ„ ํ”Œ๋Ÿฌ๊ทธ์ธ์„ ํ™œ์šฉํ•˜์—ฌ ํŠน์ • ๋ถ„์•ผ์˜ ์•ˆ์ „ ์š”๊ตฌ์‚ฌํ•ญ์„ ์ถฉ์กฑ์‹œํ‚ค๊ณ , ์ด๋ฅผ ํ†ตํ•ด ์œ ํ•ด ํ–‰๋™ ๊ฐ์†Œ์œจ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ TrustBench๋Š” ๊ธฐ์กด ๋ฐฉ์‹ ๋Œ€๋น„ ํ˜„์ €ํžˆ ๋‚ฎ์€ ์ง€์—ฐ ์‹œ๊ฐ„์œผ๋กœ ์‹ค์ œ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ์ž…์ฆํ•˜๋ฉฐ, ์ž์œจ ์—์ด์ „ํŠธ์˜ ์•ˆ์ „ํ•œ ์ž‘๋™์„ ์œ„ํ•œ ์‹ค์งˆ์ ์ธ ํ•ด๊ฒฐ์ฑ…์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ˜„์žฌ ์—ฐ๊ตฌ๋Š” TrustBench์˜ ํšจ๊ณผ์„ฑ์„ ๋‹ค์–‘ํ•œ ์—์ด์ „ํŠธ ์ž‘์—…์—์„œ ์ž…์ฆํ•˜์˜€์ง€๋งŒ, ๋ชจ๋“  ์ž ์žฌ์ ์ธ ์œ ํ•ด ํ–‰๋™ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ํฌ๊ด„ํ•˜์ง€ ๋ชปํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ํ–ฅํ›„ ์—ฐ๊ตฌ์—์„œ๋Š” ๋”์šฑ ๋ณต์žกํ•˜๊ณ  ์˜ˆ์ธก ๋ถˆ๊ฐ€๋Šฅํ•œ ํ™˜๊ฒฝ์—์„œ์˜ ๊ฒ€์ฆ ๋Šฅ๋ ฅ ๊ฐ•ํ™” ๋ฐ ์ƒˆ๋กœ์šด ์œ ํ˜•์˜ ์œ ํ•ด ํ–‰๋™์— ๋Œ€ํ•œ ๋Œ€์‘ ๋ฐฉ์•ˆ ๋งˆ๋ จ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘