Sign In

FormalJudge: A Neuro-Symbolic Paradigm for Agentic Oversight

Created by
  • Haebom
Category
Empty

์ €์ž

Jiayi Zhou, Yang Sheng, Hantao Lou, Yaodong Yang, Jie Fu

๐Ÿ’ก ๊ฐœ์š”

LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ์˜ ์•ˆ์ „์„ฑ ํ™•๋ณด๋Š” ์ค‘์š”ํ•œ ๊ณผ์ œ์ด๋ฉฐ, ๊ธฐ์กด์˜ LLM-as-a-Judge ๋ฐฉ์‹์€ ํ™•๋ฅ ์  ์‹œ์Šคํ…œ์˜ ํ•œ๊ณ„๋ฅผ ๋‹ต์Šตํ•ฉ๋‹ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์ž์—ฐ์–ด ์š”๊ตฌ์‚ฌํ•ญ์„ ํ˜•์‹ ๋ช…์„ธ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ๋ณ‘๋ชฉ ํ˜„์ƒ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์‹ ๊ฒฝ-๊ธฐํ˜ธ์  ํ”„๋ ˆ์ž„์›Œํฌ์ธ FormalJudge๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. FormalJudge๋Š” LLM์„ ํ™œ์šฉํ•˜์—ฌ ์ธ๊ฐ„์˜ ์˜๋„๋ฅผ ๊ฒ€์ฆ ๊ฐ€๋Šฅํ•œ ์ œ์•ฝ ์กฐ๊ฑด์œผ๋กœ ๋ถ„ํ•ดํ•˜๊ณ , Dafny ๋ฐ Z3์™€ ๊ฐ™์€ ๋„๊ตฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ˆ˜ํ•™์  ๋ณด์žฅ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ์˜ ์•ˆ์ „์„ฑ์„ ๋ณด์žฅํ•˜๊ธฐ ์œ„ํ•œ ํšจ๊ณผ์ ์ธ ์‹ ๊ฒฝ-๊ธฐํ˜ธ์  ์ ‘๊ทผ ๋ฐฉ์‹์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๊ธฐ์กด LLM-as-a-Judge ๋Œ€๋น„ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ ํ–ฅ์ƒ๊ณผ ์ทจ์•ฝํ•œ ๋ชจ๋ธ๋กœ๋ถ€ํ„ฐ ๊ฐ•ํ•œ ๋ชจ๋ธ์„ ๊ฐ์ง€ํ•˜๋Š” ๋Šฅ๋ ฅ์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ž์—ฐ์–ด ์š”๊ตฌ์‚ฌํ•ญ์„ ํ˜•์‹ ๋ช…์„ธ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ๊ณผ์ •์˜ ๋ณต์žก์„ฑ๊ณผ ์ƒˆ๋กœ์šด ๋„๊ตฌ ๋ฐ ๊ธฐ์ˆ ์˜ ํ•™์Šต ํ•„์š”์„ฑ์ด ํ•œ๊ณ„๋กœ ์ž‘์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘