Sign In

Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination

Created by
  • Haebom
Category
Empty

์ €์ž

Hyuntae Park, Yeachan Kim, SangKeun Lee

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜ ์‚ฌ์ „ ํ•™์Šต ์–ธ์–ด ๋ชจ๋ธ(PLM)์ด ๊ฐ–๋Š” ์ธ๊ฐ„ ๋ณด๊ณ  ํŽธํ–ฅ์œผ๋กœ ์ธํ•œ ์ œ๋กœ์ƒท ์ƒ์‹ ์ถ”๋ก ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ๊ธฐ๊ณ„์  ์ƒ์ƒ๋ ฅ์„ ํ™œ์šฉํ•œ ์ƒˆ๋กœ์šด ํ”„๋ ˆ์ž„์›Œํฌ 'Imagine'์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. Imagine์€ ํ…์ŠคํŠธ ์ž…๋ ฅ์— ๊ธฐ๊ณ„๊ฐ€ ์ƒ์„ฑํ•œ ์ด๋ฏธ์ง€๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ์‹œ๊ฐ์  ์‹ ํ˜ธ๋ฅผ ์ถ”๊ฐ€ํ•จ์œผ๋กœ์จ PLM์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ๊ฐ•ํ™”ํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์ถ• ๋ฐ ์ข…ํ•ฉ์ ์ธ ํ‰๊ฐ€๋ฅผ ํ†ตํ•ด ๊ธฐ์กด ์ œ๋กœ์ƒท ์ ‘๊ทผ ๋ฐฉ์‹๊ณผ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ๋ณด๋‹ค ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๊ธฐ๊ณ„์  ์ƒ์ƒ๋ ฅ์„ ํ†ตํ•œ ์‹œ๊ฐ ์ •๋ณด ํ†ตํ•ฉ์€ ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์˜ ๋ณด๊ณ  ํŽธํ–ฅ์„ ์™„ํ™”ํ•˜๊ณ  ์ƒ์‹ ์ถ”๋ก ์˜ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
์‹œ๊ฐ์  ๋งฅ๋ฝ์„ ํšจ๊ณผ์ ์œผ๋กœ ํ™œ์šฉํ•˜๊ธฐ ์œ„ํ•œ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ๊ตฌ์ถ• ์ „๋žต์€ ํ–ฅํ›„ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ƒ์‹ ์ถ”๋ก  ์—ฐ๊ตฌ์— ์ค‘์š”ํ•œ ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๊ธฐ๊ณ„๊ฐ€ ์ƒ์„ฑํ•œ ์ด๋ฏธ์ง€์˜ ํ’ˆ์งˆ๊ณผ ๊ด€๋ จ์„ฑ์ด ์ถ”๋ก  ์„ฑ๋Šฅ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ ๋ฐ ์‹ค์ œ ์„ธ๊ณ„์˜ ๋ณต์žกํ•œ ์‹œ๊ฐ์  ์ •๋ณด๋ฅผ ์–ด๋–ป๊ฒŒ ๋” ํšจ๊ณผ์ ์œผ๋กœ ํ†ตํ•ฉํ•  ์ˆ˜ ์žˆ์„์ง€์— ๋Œ€ํ•œ ์ถ”๊ฐ€ ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘