Sign In

AST-PAC: AST-guided Membership Inference for Code

Created by
  • Haebom
Category
Empty

์ €์ž

Roham Koohestani, Ali Al-Kaswan, Jonathan Katzy, Maliheh Izadi

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์†Œ์Šค ์ฝ”๋“œ์— ๋Œ€ํ•œ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(Code LLM)์ด ์ €์ž‘๊ถŒ ๋ฌธ์ œ๊ฐ€ ์žˆ๋Š” ์ฝ”๋“œ๋ฅผ ํ•™์Šตํ•  ๊ฐ€๋Šฅ์„ฑ์— ์ฃผ๋ชฉํ•˜๋ฉฐ, ์ด๋ฅผ ๊ฐ์‚ฌํ•˜๊ธฐ ์œ„ํ•œ ๋ฉค๋ฒ„์‹ญ ์ถ”๋ก  ๊ณต๊ฒฉ(MIA) ๋ฐฉ๋ฒ•๋ก ์„ ํƒ๊ตฌํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด PAC(Polarized Augmentation Calibration) ๋ฐฉ๋ฒ•์ด ์ฝ”๋“œ์˜ ๋ฌธ๋ฒ•์  ์ œ์•ฝ์„ ๋ฌด์‹œํ•˜์—ฌ ์„ฑ๋Šฅ์ด ์ €ํ•˜๋˜๋Š” ๋ฌธ์ œ์ ์„ ๋ฐœ๊ฒฌํ•˜๊ณ , ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ์ถ”์ƒ ๊ตฌ๋ฌธ ํŠธ๋ฆฌ(AST) ๊ธฐ๋ฐ˜์˜ ๋ฌธ๋ฒ•์ ์œผ๋กœ ์œ ํšจํ•œ ๋ณ€ํ˜•์„ ํ™œ์šฉํ•˜๋Š” AST-PAC๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. AST-PAC๋Š” ์ฝ”๋“œ ๋ชจ๋ธ์˜ ์ถœ์ฒ˜ ๊ฐ์‚ฌ(provenance auditing)์— ๋Œ€ํ•œ ํ–ฅํ›„ ์—ฐ๊ตฌ์˜ ๊ธฐ๋ฐ˜์„ ๋งˆ๋ จํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
Code LLM์—์„œ ํ•™์Šต๋œ ๋ฐ์ดํ„ฐ์˜ ์ถœ์ฒ˜๋ฅผ ๊ฐ์‚ฌํ•˜๋Š” ๋ฐ MIA๊ฐ€ ํšจ๊ณผ์ ์ธ ๋„๊ตฌ๊ฐ€ ๋  ์ˆ˜ ์žˆ์Œ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
โ€ข
AST-PAC๋Š” ๊ธฐ์กด PAC ๋ฐฉ๋ฒ•๋ณด๋‹ค ์ฝ”๋“œ ๋ฌธ๋ฒ•์„ ๊ณ ๋ คํ•˜์—ฌ ์„ฑ๋Šฅ์„ ๊ฐœ์„ ํ•˜๋ฉฐ, ํŠนํžˆ ๋ณต์žกํ•œ ์ฝ”๋“œ ํŒŒ์ผ์—์„œ ์œ ์šฉ์„ฑ์„ ๋ณด์ž…๋‹ˆ๋‹ค.
โ€ข
AST-PAC๋Š” ์ž‘์€ ํŒŒ์ผ์— ๋Œ€ํ•œ ๊ณผ๋„ํ•œ ๋ณ€ํ˜•์„ ์œ ๋ฐœํ•˜๊ฑฐ๋‚˜, ์˜์ˆซ์ž(alphanumeric)๊ฐ€ ํ’๋ถ€ํ•œ ์ฝ”๋“œ์—์„œ ์„ฑ๋Šฅ ์ €ํ•˜๋ฅผ ๋ณด์ด๋Š” ํ•œ๊ณ„์ ์„ ๊ฐ€์ง‘๋‹ˆ๋‹ค.
๐Ÿ‘