Sign In

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation

Created by
  • Haebom
Category
Empty

์ €์ž

Shuo Lu, Jianjie Cheng, Yinuo Xu, Yongcan Yu, Lijun Sheng, Peijie Wang, Siru Jiang, Yongguan Hu, Run Ling, Yihua Shao, Ao Ma, Wei Feng, Lingxiao He, Meng Wang, Qianlong Xie, Xingxing Wang, Ran He, Jian Liang

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ์—ฐ๊ตฌ๋Š” ํ˜„์žฌ์˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(MLLM)์ด 2D ๋ฐ 3D ๊ณต๊ฐ„ ๊ด€๊ณ„๋ฅผ ์ดํ•ดํ•˜๊ณ  ์ฒ˜๋ฆฌํ•˜๋Š” ์ˆ˜ํ•™์  ๊ณต๊ฐ„ ์ถ”๋ก  ๋Šฅ๋ ฅ์ด ๋ถ€์กฑํ•˜๋‹ค๋Š” ์ ์„ ์ง€์ ํ•ฉ๋‹ˆ๋‹ค. ์ธ๊ฐ„์˜ ๋†’์€ ์ •ํ™•๋„์™€ ๋Œ€์กฐ์ ์œผ๋กœ MLLM์€ 60% ์ดํ•˜์˜ ๋‚ฎ์€ ์„ฑ๋Šฅ์„ ๋ณด์˜€์Šต๋‹ˆ๋‹ค. ์ด์— ๋ณธ ๋…ผ๋ฌธ์€ MLLM์˜ ๊ณต๊ฐ„ ์ถ”๋ก ์„ ํ‰๊ฐ€ํ•˜๊ณ  ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•œ ํ†ตํ•ฉ ํ”„๋ ˆ์ž„์›Œํฌ์ธ MathSpatial์„ ์ œ์•ˆํ•˜๋ฉฐ, ์ด๋Š” ๋ฒค์น˜๋งˆํฌ, ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์…‹, ๊ทธ๋ฆฌ๊ณ  ์ถ”๋ก  ๊ณผ์ •์„ ๋ชจ๋ธ๋งํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ์„ ํฌํ•จํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
MLLM์€ ์ธ์ง€์  ์ž‘์—…์—๋Š” ๊ฐ•์ ์„ ๋ณด์ด์ง€๋งŒ, ์ˆ˜ํ•™์  ๊ณต๊ฐ„ ์ถ”๋ก ์—์„œ๋Š” ๊ทผ๋ณธ์ ์ธ ์•ฝ์ ์„ ๊ฐ€์ง€๊ณ  ์žˆ์œผ๋ฉฐ, ์ด๋Š” ํ˜„์žฌ ๋ชจ๋ธ์˜ ํ•œ๊ณ„๋ฅผ ๋“œ๋Ÿฌ๋ƒ…๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ MathSpatial ํ”„๋ ˆ์ž„์›Œํฌ๋Š” ์ถ”๋ก  ์–ด๋ ค์›€๊ณผ ์ง€๊ฐ ๋…ธ์ด์ฆˆ๋ฅผ ๋ถ„๋ฆฌํ•˜์—ฌ MLLM์˜ ์ˆ˜ํ•™์  ๊ณต๊ฐ„ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ์ •๋ฐ€ํ•˜๊ฒŒ ์ธก์ •ํ•˜๊ณ  ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋Š” ์ฒซ ๋Œ€๊ทœ๋ชจ ์ž์›์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
MathSpatial์„ ํ™œ์šฉํ•œ ๋ฏธ์„ธ ์กฐ์ • ์‹คํ—˜์€ ๋ชจ๋ธ์˜ ์ •ํ™•๋„๋ฅผ ๋†’์ด๊ณ  ํ† ํฐ ์‚ฌ์šฉ๋Ÿ‰์„ ์ค„์ด๋Š” ํšจ๊ณผ๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋ฉฐ, ์ด๋Š” ํ–ฅํ›„ MLLM์˜ ๊ณต๊ฐ„ ์ถ”๋ก  ๋Šฅ๋ ฅ ํ–ฅ์ƒ์— ๊ธฐ์—ฌํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
โ€ข
๋ณธ ์—ฐ๊ตฌ๋Š” MLLM์ด ์‹ค์ œ๋กœ ๊ณต๊ฐ„ ๊ด€๊ณ„๋ฅผ ์–ผ๋งˆ๋‚˜ ๊นŠ์ด ์ดํ•ดํ•˜๋Š”์ง€์— ๋Œ€ํ•œ ์งˆ๋ฌธ์„ ์ œ๊ธฐํ•˜๋ฉฐ, ํ–ฅํ›„ ์—ฐ๊ตฌ๋Š” ๋‹จ์ˆœํ•œ ํŒจํ„ด ์ธ์‹์„ ๋„˜์–ด์„  ์ง„์ •ํ•œ ๊ณต๊ฐ„์  ์ดํ•ด๋ฅผ ๋‹ฌ์„ฑํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ๋‚˜์•„๊ฐ€์•ผ ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘