Sign In

From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning

Created by
  • Haebom
Category
Empty

์ €์ž

Chen Zhan, Xiaoyu Tan, Gengchen Ma, Yu-Jie Xiong, Xiaoyan Jiang, Xihe Qiu

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์ž„์ƒ ์ง„๋‹จ์—์„œ LLM์˜ ๋ถˆํˆฌ๋ช…ํ•˜๊ณ  ์‹ ๋ขฐํ•  ์ˆ˜ ์—†๋Š” ์ถ”๋ก  ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ํŠœ๋ฏผ ๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ์ƒˆ๋กœ์šด ํ›ˆ๋ จ ํ”„๋ ˆ์ž„์›Œํฌ์ธ Curriculum Goal-Conditioned Learning (CGCL)์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. CGCL์€ 3๋‹จ๊ณ„ ์ปค๋ฆฌํ˜๋Ÿผ์„ ํ†ตํ•ด LLM์ด ์‚ฌ์‹ค ์ถ”์ถœ, ๊ฐ€์„ค ๊ฒ€์ฆ ๋ฐ ๋ฐ˜๋ฐ•, ๊ฒฐ๋ก  ๋„์ถœ ๋“ฑ ํŠœ๋ฏผ ๋ชจ๋ธ ๊ตฌ์กฐ์— ๋”ฐ๋ฅธ ๋ช…ํ™•ํ•œ ์ง„๋‹จ ๋…ผ์ฆ์„ ์ƒ์„ฑํ•˜๋„๋ก ์ ์ง„์ ์œผ๋กœ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด ๊ธฐ์กด์˜ ๊ณ ๋น„์šฉ RL ๋ฐฉ์‹๊ณผ ๋™๋“ฑํ•œ ์ˆ˜์ค€์˜ ์ง„๋‹จ ์ •ํ™•๋„์™€ ์ถ”๋ก  ํ’ˆ์งˆ์„ ๋‹ฌ์„ฑํ•˜๋ฉด์„œ๋„ ๋” ์•ˆ์ •์ ์ด๊ณ  ํšจ์œจ์ ์ธ ํ›ˆ๋ จ์ด ๊ฐ€๋Šฅํ•จ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
LLM ๊ธฐ๋ฐ˜ ์ž„์ƒ ์ง„๋‹จ ์‹œ์Šคํ…œ์˜ ์‹ ๋ขฐ์„ฑ ๋ฐ ํˆฌ๋ช…์„ฑ ํ™•๋ณด๋ฅผ ์œ„ํ•œ ์‹ค์งˆ์ ์ธ ๋ฐฉ๋ฒ•๋ก ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํŠœ๋ฏผ ๋ชจ๋ธ์ด๋ผ๋Š” ๊ฒ€์ฆ๋œ ๋…ผ์ฆ ๊ตฌ์กฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ LLM์˜ ์ถ”๋ก  ๊ณผ์ •์„ ๋ช…ํ™•ํ•˜๊ณ  ์ดํ•ด ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“ญ๋‹ˆ๋‹ค.
โ€ข
๊ณ ๋น„์šฉ์˜ RL ๋ฐฉ์‹ ๋Œ€๋น„ ํšจ์œจ์ ์ด๊ณ  ์•ˆ์ •์ ์ธ ํ›ˆ๋ จ ํŒŒ์ดํ”„๋ผ์ธ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
T-Eval๊ณผ ๊ฐ™์€ ์ •๋Ÿ‰์  ํ‰๊ฐ€ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ œ์•ˆ ๋ฐฉ๋ฒ•๋ก ์˜ ํšจ๊ณผ๋ฅผ ๊ฒ€์ฆํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์•„์ง ์‹ค์ œ ์ž„์ƒ ํ™˜๊ฒฝ์—์„œ์˜ ๊ด‘๋ฒ”์œ„ํ•œ ๊ฒ€์ฆ ๋ฐ ์ ์šฉ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํŠœ๋ฏผ ๋ชจ๋ธ์˜ ๋ณต์žก์„ฑ์ด LLM ํ›ˆ๋ จ์— ๋ฏธ์น˜๋Š” ์ถ”๊ฐ€์ ์ธ ์˜ํ–ฅ์— ๋Œ€ํ•œ ์‹ฌ์ธต ๋ถ„์„์ด ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘