
How does longer temporal context enhance multimodal narrative video processing in the brain?

Posted by
  • Haebom

Authors

Prachi Jindal, Anant Khandelwal, Manish Gupta, Bapi S. Raju, Subba Reddy Oota, Tanmoy Chakraborty

💡 Overview

This study investigates whether multimodal large language models (MLLMs) align better with brain activity when they use longer temporal context. Comparing brain imaging (fMRI) recordings with model features showed that MLLM brain alignment improved substantially as clip length increased, whereas unimodal video models showed no such improvement. This suggests that processing long temporal context increases the correspondence between the higher-order integrative representations of MLLMs and the analogous regions of the brain.
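The comparison between fMRI recordings and model features can be made concrete with a small scoring sketch. This summary does not state which alignment metric the authors used, so the following assumes a common choice: the Pearson correlation between model-predicted and measured voxel responses, averaged over voxels. The function names (`pearson`, `mean_alignment`) and all numbers are hypothetical and serve only as illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # constant signal: treat as no alignment
    return cov / math.sqrt(var_x * var_y)


def mean_alignment(predicted, measured):
    """Average per-voxel correlation.

    Each argument is a list of voxel time series (one list per voxel).
    """
    scores = [pearson(p, m) for p, m in zip(predicted, measured)]
    return sum(scores) / len(scores)


# Made-up toy data: two voxels, four time points each.
measured = [[1.0, 2.0, 3.0, 4.0], [4.0, 3.0, 2.0, 1.0]]
# Hypothetical predictions from features extracted with short vs. long clips.
pred_short = [[1.0, 3.0, 2.0, 4.0], [4.0, 2.0, 3.0, 1.0]]
pred_long = [[1.1, 2.0, 2.9, 4.2], [3.9, 3.1, 2.0, 0.8]]

print("short-context alignment:", mean_alignment(pred_short, measured))
print("long-context alignment:", mean_alignment(pred_long, measured))
```

In this toy setup the long-context predictions track the measured responses more closely, so they receive the higher score. In a real analysis the predictions would come from an encoding model fitted on held-out data, which is omitted here.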

🔑 Implications and Limitations

• Longer temporal context helps MLLMs align more closely with brain activity during human narrative comprehension.
• The layer hierarchy of MLLMs mirrors the cortical hierarchy of the brain: short contexts align with early processing regions, while long contexts align with higher-order integrative regions.
• Narrative task prompts affect brain alignment patterns in a task- and region-dependent way, and induce context-dependent tuning shifts in higher-order regions.