
Shaping the Prior: How Synthetic Task Distributions Determine Tabular Foundation Model Quality

Written by
  • Haebom

Authors

Mohamed Bouadi, Nassim Bouarour, Varun Kulkarni, Shivam Dubey, Aditya Tanna, Vinay Kumar Sankarapu

💡 Overview

This paper argues that the design of the synthetic distribution used for pretraining is a key determinant of tabular foundation model quality. Observing that existing approaches generate overly idealized data distributions, which leaves models insufficiently robust in real-world settings, the authors propose O'Prior, a new method for generating synthetic pretraining distributions. O'Prior produces synthetic data that reflects the complexity and irregularity of real data through four parts: a hierarchical SCM-based meta-generator; a modular realism engine covering heterogeneous variables, missingness, and target transformations; a stress module that injects confounders and support-query mismatch; and a curriculum-based safe generation protocol.
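To make the pipeline described above more concrete, here is a minimal sketch of how one synthetic task might be drawn from a random structural causal model (SCM) and then perturbed. It is not the authors' implementation. The function name `sample_task`, the tanh mechanisms, the completely-at-random missingness, the mean shift applied to query rows, and the median binarization of the target are all assumptions chosen for illustration, and the hierarchy, confounder injection, and curriculum are left out.

```python
import math
import random


def sample_task(n_rows=200, n_features=5, missing_rate=0.1, shift=1.0, seed=0):
    """Draw one toy tabular task from a random SCM (illustrative only)."""
    rng = random.Random(seed)
    n_nodes = n_features + 1  # the last node serves as the raw target

    # Random DAG over a fixed topological order: node j may have any
    # earlier node i < j as a parent, each edge kept with probability 0.5.
    weights = [
        [rng.gauss(0.0, 1.0) if i < j and rng.random() < 0.5 else 0.0
         for i in range(n_nodes)]
        for j in range(n_nodes)
    ]

    n_support = n_rows // 2
    table = []
    for r in range(n_rows):
        # Support-query mismatch (assumed form): query rows draw their
        # feature noise with a shifted mean; the target noise is unshifted.
        mu = 0.0 if r < n_support else shift
        values = []
        for j in range(n_nodes):
            parents = sum(weights[j][i] * values[i] for i in range(j))
            noise_mean = mu if j < n_features else 0.0
            values.append(math.tanh(parents) + rng.gauss(noise_mean, 1.0))
        table.append(values)

    # Target transform (assumed form): binarize at the support-set median.
    support_targets = sorted(row[-1] for row in table[:n_support])
    threshold = support_targets[n_support // 2]
    labels = [1 if row[-1] > threshold else 0 for row in table]

    # Missingness (assumed form): mask each feature cell independently.
    features = [
        [None if rng.random() < missing_rate else v for v in row[:-1]]
        for row in table
    ]
    return {
        "X_support": features[:n_support], "y_support": labels[:n_support],
        "X_query": features[n_support:], "y_query": labels[n_support:],
    }


task = sample_task(seed=42)
print(len(task["X_support"]), len(task["X_query"]))
```

Calling `sample_task` with different seeds would yield a stream of tasks with different causal graphs, which is the general idea behind pretraining on a synthetic prior. The specific knobs here (`missing_rate`, `shift`) are stand-ins for the much richer realism and stress modules the paper describes.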

🔑 Implications and Limitations

• The performance of tabular foundation models depends heavily on how the synthetic pretraining distribution is designed; including realistic irregularities and failure modes is especially important.
• The proposed O'Prior methodology combines several components to mimic the complexity of real data, which improves accuracy and robustness on downstream tasks.
• Each component of O'Prior (mechanism diversity, realism composition, shift-aware stress) contributes independently to the performance gains and plays a distinct role that the others cannot substitute for.