Hallucination-Resistant Security Planning with a Large Language Model

Created by
  • Haebom

Authors

Kim Hammar, Tansu Alpcan, Emil Lupu

💡 Overview

This paper proposes a framework that supports security-management decision-making by addressing the hallucination problem of large language models (LLMs). The framework checks whether the candidate actions generated by the LLM are consistent with the system constraints and with predictions. When consistency is low, it refines the candidate actions using external feedback (for example, evaluation in a digital twin). This approach controls the risk of hallucination and reduced recovery time by up to 30%.
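The check-then-refine loop described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's algorithm: `consistency`, `plan_step`, `fake_llm`, `fake_twin`, the action names, and the 0.6 threshold are all hypothetical, and the agreement-based score is only one plausible way to measure consistency.

```python
from collections import Counter
from typing import Callable, List, Optional, Set, Tuple


def consistency(candidates: List[str], allowed: Set[str]) -> Tuple[float, Optional[str]]:
    """Hypothetical consistency score in [0, 1] (an assumption, not the paper's metric).

    Returns the share of all candidates that both satisfy the constraints
    (membership in `allowed`) and agree with the most common valid action,
    together with that action.
    """
    valid = [a for a in candidates if a in allowed]
    if not valid:
        return 0.0, None
    action, votes = Counter(valid).most_common(1)[0]
    return votes / len(candidates), action


def plan_step(
    propose: Callable[[List[str]], List[str]],
    evaluate_in_twin: Callable[[str], str],
    allowed: Set[str],
    threshold: float = 0.6,
    max_rounds: int = 3,
) -> Optional[str]:
    """Return an action once the candidates are consistent enough, else None."""
    feedback: List[str] = []
    for _ in range(max_rounds):
        candidates = propose(feedback)  # stands in for sampling the LLM
        score, action = consistency(candidates, allowed)
        if score >= threshold:
            return action
        # Low consistency: abstain and collect external feedback instead.
        for candidate in sorted(set(candidates)):
            feedback.append(f"{candidate}: {evaluate_in_twin(candidate)}")
    return None  # still inconsistent after max_rounds; defer to an operator


def fake_llm(feedback: List[str]) -> List[str]:
    # Toy stand-in: disagrees at first, converges once feedback is available.
    if not feedback:
        return ["isolate_host", "reboot_router", "delete_logs"]
    return ["isolate_host", "isolate_host", "patch_service"]


def fake_twin(action: str) -> str:
    # Toy stand-in for a digital-twin evaluation.
    return "ok" if action == "isolate_host" else "failed"


allowed = {"isolate_host", "patch_service", "reboot_router"}
chosen = plan_step(fake_llm, fake_twin, allowed)
```

In this toy run the first round scores 1/3, which is below the threshold, so the planner abstains and gathers feedback. The second round scores 2/3 and the majority action is returned. Raising `threshold` makes the planner abstain more often, trading extra feedback rounds for a lower chance of acting on an inconsistent proposal.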

🔑 Implications and Limitations

• By mitigating the limited reliability of LLMs, the work shows that LLMs can be used effectively in real security-management tasks.
• Tuning the consistency threshold and refining actions through external feedback could become an important method for improving the safety of LLM-based decision-making systems.
• The study establishes an upper bound on the regret of in-context learning (ICL) under certain assumptions, but how well the approach generalizes to real, complex, and dynamic security environments still needs further validation.