Sign In

Higress-RAG: A Holistic Optimization Framework for Enterprise Retrieval-Augmented Generation via Dual Hybrid Retrieval, Adaptive Routing, and CRAG

Created by
  • Haebom
Category
Empty

์ €์ž

Weixi Lin

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๊ธฐ์—… ํ™˜๊ฒฝ์—์„œ RAG ์‹œ์Šคํ…œ์˜ ๋‚ฎ์€ ๊ฒ€์ƒ‰ ์ •ํ™•๋„, ์ƒ์„ฑ ์‹œ ํ™˜๊ฐ ๋ฌธ์ œ, ๋†’์€ ์ง€์—ฐ ์‹œ๊ฐ„์ด๋ผ๋Š” ์„ธ ๊ฐ€์ง€ ์ฃผ์š” ๊ณผ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด Higress RAG MCP Server๋ผ๋Š” ํฌ๊ด„์ ์ธ ์ตœ์ ํ™” ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ด ์‹œ์Šคํ…œ์€ ์ ์‘ํ˜• ๋ผ์šฐํŒ…, ์‹œ๋งจํ‹ฑ ์บ์‹ฑ, ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๊ฒ€์ƒ‰, CRAG(Corrective RAG)๋ฅผ ํฌํ•จํ•˜๋Š” ๊ณ„์ธต์  ์•„ํ‚คํ…์ฒ˜๋ฅผ ํ†ตํ•ด ๊ฒ€์ƒ‰ ์ˆ˜๋ช… ์ฃผ๊ธฐ ์ „๋ฐ˜์„ ์ตœ์ ํ™”ํ•ฉ๋‹ˆ๋‹ค. ์‹คํ—˜ ๊ฒฐ๊ณผ, Higress RAG๋Š” ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ AI ๋ฐฐํฌ์— ํ™•์žฅ ๊ฐ€๋Šฅํ•˜๊ณ  ํ™˜๊ฐ์— ๊ฐ•ํ•œ ์†”๋ฃจ์…˜์„ ์ œ๊ณตํ•จ์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๊ธฐ์—… ํ™˜๊ฒฝ์— ํŠนํ™”๋œ RAG ์‹œ์Šคํ…œ์˜ ์‹ค์งˆ์ ์ธ ๋ฌธ์ œ์ ๋“ค์„ ์‹ฌ์ธต์ ์œผ๋กœ ๋ถ„์„ํ•˜๊ณ , ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ํ†ตํ•ฉ์ ์ธ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๊ฒ€์ƒ‰ ์ •ํ™•๋„ ํ–ฅ์ƒ, ํ™˜๊ฐ ๊ฐ์†Œ, ์ง€์—ฐ ์‹œ๊ฐ„ ๋‹จ์ถ•์„ ์œ„ํ•œ ๋‹ค๊ฐ์ ์ธ ๊ธฐ์ˆ ์  ํ˜์‹ (๊ตฌ์กฐ ์ธ์‹ ๋ฐ์ดํ„ฐ ๋ถ„ํ• , RRF ๊ธฐ๋ฐ˜ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๊ฒ€์ƒ‰, ๋™์  ์ž„๊ณ„๊ฐ’ ๊ธฐ๋ฐ˜ ์‹œ๋งจํ‹ฑ ์บ์‹ฑ ๋“ฑ)์„ ๊ตฌ์ฒด์ ์œผ๋กœ ์†Œ๊ฐœํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ Higress RAG ์‹œ์Šคํ…œ์€ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ AI ๋ฐฐํฌ์˜ ์ƒ์šฉํ™” ๊ฐ€๋Šฅ์„ฑ์„ ๋†’์ด๋Š” ๋ฐ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋…ผ๋ฌธ์˜ ํ•œ๊ณ„์ ์€ ๋ช…์‹œ์ ์œผ๋กœ ์–ธ๊ธ‰๋˜์ง€ ์•Š์•˜์œผ๋‚˜, ์ œ์•ˆ๋œ ํ”„๋ ˆ์ž„์›Œํฌ์˜ ๋ณต์žก์„ฑ๊ณผ ๋‹ค์–‘ํ•œ ๊ธฐ์—… ํ™˜๊ฒฝ์—์„œ์˜ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ, ๊ทธ๋ฆฌ๊ณ  ์‹ค์ œ ์šด์˜ ํ™˜๊ฒฝ์—์„œ์˜ ์žฅ๊ธฐ์ ์ธ ์„ฑ๋Šฅ ๋ฐ ์•ˆ์ •์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ๊ฒ€์ฆ์ด ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘