RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty

Created by
  • Haebom

Authors

Ziqian Zhang, Xingjian Hu, Yue Huang, Kai Zhang, Ruoxi Chen, Yixin Liu, Qingsong Wen, Kaidi Xu, Xiangliang Zhang, Neil Zhenqiang Gong, Lichao Sun

💡 Overview

Existing LLM benchmarks do not account for question difficulty, which limits how finely they can assess a model's abilities. To address this, the paper proposes RankLLM, a framework that measures question difficulty and model competency at the same time. In RankLLM, a model's competency score rises when it answers a question correctly, and a question's difficulty score rises when it makes a model struggle. This mutual interaction enables fine-grained evaluation.
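The interaction described above can be illustrated with a minimal sketch. This is not the paper's algorithm, which this summary does not specify. It is a hypothetical mutual-reinforcement loop under stated assumptions: answers are graded as simply right or wrong, a model's competency is the total difficulty of the questions it solved, and a question's difficulty is the total competency of the models it stumped. The function name `rank_sketch`, the normalization, and the fixed number of rounds are all illustrative choices, and the loop is not guaranteed to converge.

```python
def normalize(values):
    """Scale scores to sum to 1; fall back to uniform if all are zero."""
    total = sum(values)
    if total == 0:
        return [1.0 / len(values)] * len(values)
    return [v / total for v in values]


def rank_sketch(correct, rounds=50):
    """Hypothetical illustration, not the RankLLM method itself.

    correct[m][q] is True when model m answered question q correctly.
    Returns (competency per model, difficulty per question).
    """
    n_models = len(correct)
    n_questions = len(correct[0])
    difficulty = [1.0 / n_questions] * n_questions  # start uniform
    competency = [1.0 / n_models] * n_models

    for _ in range(rounds):
        # Assumption: solving a harder question earns more competency.
        competency = normalize([
            sum(difficulty[q] for q in range(n_questions) if correct[m][q])
            for m in range(n_models)
        ])
        # Assumption: stumping a stronger model signals more difficulty.
        difficulty = normalize([
            sum(competency[m] for m in range(n_models) if not correct[m][q])
            for q in range(n_questions)
        ])
    return competency, difficulty


# Toy data: models A and B both answer 2 of 3 questions correctly,
# but B solves the question that most models miss.
correct = [
    [True, True, False],   # model A
    [True, False, True],   # model B
    [True, True, False],   # model C
    [True, False, False],  # model D
]
competency, difficulty = rank_sketch(correct)
```

In this toy data, plain accuracy ties models A and B, while the difficulty-weighted scores separate them because B answers the question that three of the four models miss.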

🔑 Implications and Limitations

• RankLLM offers a new method that uses question difficulty as its core criterion, allowing LLM capabilities to be compared and evaluated more precisely.
• The proposed framework shows 90% agreement with human judgments and delivers better performance, stability, and efficiency than existing methods.
• It is practical for large-scale LLM evaluation, but further research may be needed on possible bias across specific domains or question types.