PruneTIR: Inference-Time Tool Call Pruning for Effective yet Efficient Tool-Integrated Reasoning

Posted by
  • Haebom

Authors

Luan Zhang, Dandan Song, Zhijing Wu, Zhengyu Chen, Chen Zhang, Yuhang Tian, Huipeng Ma, Chenhao Li, Changzhi Zhou, Xudong Li, Shuhao Zhang

💡 Overview

๋ณธ ๋…ผ๋ฌธ์€ ์ด๋ฏธ ๋„๊ตฌ ์‚ฌ์šฉ ๋Šฅ๋ ฅ์„ ๊ฐ–์ถ˜ LLM์˜ ์ถ”๋ก  ์‹œ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” PruneTIR ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. PruneTIR์€ ์ถ”๋ก  ๊ณผ์ •์—์„œ ๋ฐœ์ƒํ•˜๋Š” ์˜ค๋ฅ˜๊ฐ€ ๋งŽ์€ ๋„๊ตฌ ํ˜ธ์ถœ์„ ์‹๋ณ„ํ•˜๊ณ , ์ด๋ฅผ ๊ฐ€์ง€์น˜๊ธฐ(pruning)ํ•˜๋ฉฐ, ๋„๊ตฌ ์‚ฌ์šฉ์„ ์ผ์‹œ ์ค‘๋‹จํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์ž‘๋™ํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด ์˜ค๋ฅ˜๋กœ ์ธํ•œ ๋ถ€์ •์  ์˜ํ–ฅ์„ ์™„ํ™”ํ•˜๊ณ  ๋ฐ˜๋ณต์ ์ธ ์‹คํŒจ ํ•ด๊ฒฐ ์‹œ๋„๋ฅผ ๋ฐฉ์ง€ํ•˜์—ฌ LLM์˜ ๋ฌธ์ œ ํ•ด๊ฒฐ ๋Šฅ๋ ฅ๊ณผ ํšจ์œจ์„ฑ์„ ๋†’์ž…๋‹ˆ๋‹ค.

🔑 Implications and Limitations

• Inference-time optimization: An LLM's tool-integrated reasoning ability can be improved effectively at inference time, with no additional training.
• Improved efficiency: Pruning erroneous tool calls and cutting unnecessary attempts shortens the working context and raises efficiency.
• Error-resolution strategy: The paper presents a new approach that manages errors effectively when they occur and keeps the LLM from getting stuck in an unrecoverable state.