Sign In

Towards Autonomous Railway Operations: A Semi-Hierarchical Deep Reinforcement Learning Approach to the Vehicle Rescheduling Problem

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Alberto Castagna, Stefan Zahlner, Adrian Egli, Christian Eichenberger, Daniel Boos, Manuel Meyer, Anton Fuxjager

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ๋ณต์žกํ•˜๊ณ  ์ฆ๊ฐ€ํ•˜๋Š” ์ฒ ๋„ ๊ตํ†ต๋Ÿ‰์œผ๋กœ ์ธํ•ด ๋ฐœ์ƒํ•˜๋Š” ์—ด์ฐจ ์šดํ–‰ ์ฐจ์งˆ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์šด์˜ ์ œ์•ฝ ์กฐ๊ฑด์„ ๊ณ ๋ คํ•œ ์ค€๊ณ„์ธต์  ์‹ฌ์ธต ๊ฐ•ํ™”ํ•™์Šต(Semi-Hierarchical Deep Reinforcement Learning) ๊ธฐ๋ฒ•์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ฐฉ๋ฒ•์€ ์—ด์ฐจ ๋ฐฐ์ฐจ์™€ ๊ฒฝ๋กœ ์„ค์ •์„ ๋ถ„๋ฆฌํ•˜์—ฌ ๊ฐ ๊ฒฐ์ • ๋ฒ”์œ„์— ํŠนํ™”๋œ ์ •์ฑ…์„ ํ•™์Šต์‹œํ‚ค๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ก ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ์‹ค์ œ ์ฒ ๋„ ์šด์˜ ํ™˜๊ฒฝ์—์„œ์˜ ํšจ์œจ์„ฑ๊ณผ ์•ˆ์ •์„ฑ์„ ๋†’์ด๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ, ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•์€ ๊ธฐ์กด ํœด๋ฆฌ์Šคํ‹ฑ ๋ฐ ๋‹จ์ผ ๊ณ„์ธต ๊ฐ•ํ™”ํ•™์Šต ๋ฐฉ์‹ ๋Œ€๋น„ ์—ด์ฐจ ๋„์ฐฉ๋ฅ ์„ ๋‘ ๋ฐฐ ๊ฐ€๊นŒ์ด ๋†’์ด๊ณ  ๋ฐ๋“œ๋ฝ ๋ฐœ์ƒ๋ฅ ์„ 5% ์ดํ•˜๋กœ ์œ ์ง€ํ•˜๋Š” ๋“ฑ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
์ฒ ๋„ ์šด์˜์˜ ๋ณต์žก์„ฑ ํ•ด๊ฒฐ์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด AI ์ ‘๊ทผ๋ฒ• ์ œ์‹œ: ์‹ค์ œ ์ฒ ๋„ ์šด์˜์˜ ์ œ์•ฝ ์กฐ๊ฑด์„ ๋ฐ˜์˜ํ•œ ์ค€๊ณ„์ธต์  ๊ฐ•ํ™”ํ•™์Šต ๋ชจ๋ธ์€ ๊ธฐ์กด OR ๋ฐฉ๋ฒ•๋ก ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ์‹ค์‹œ๊ฐ„ ๋ฌธ์ œ ํ•ด๊ฒฐ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
โ€ข
๋ถ„์‚ฐ๋œ ์˜์‚ฌ๊ฒฐ์ •์˜ ํšจ๊ณผ์„ฑ ์ž…์ฆ: ๋ฐฐ์ฐจ์™€ ๊ฒฝ๋กœ ์„ค์ •์„ ๋ถ„๋ฆฌํ•˜์—ฌ ๊ฐ ์ •์ฑ…์ด ํŠน์ • ์˜์‚ฌ๊ฒฐ์ • ๋ฒ”์œ„์— ์ง‘์ค‘ํ•˜๊ฒŒ ํ•จ์œผ๋กœ์จ, ๋ณต์žกํ•œ ๋„คํŠธ์›Œํฌ์—์„œ๋„ ํšจ์œจ์ ์ธ ํ˜‘์—… ๋ฐ ์ž์› ํ™œ์šฉ์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค.
โ€ข
๋Œ€๊ทœ๋ชจ ๋ฐ ๋ฐ€์ง‘ ๋„คํŠธ์›Œํฌ์—์„œ์˜ ํ™•์žฅ์„ฑ ๋ฐ ์•ˆ์ •์„ฑ: ์ œ์•ˆ๋œ ๋ฐฉ๋ฒ•์€ ๋‹ค์–‘ํ•œ ๊ทœ๋ชจ์™€ ๋‚œ์ด๋„์˜ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ™˜๊ฒฝ์—์„œ ์œ ์˜๋ฏธํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์—ฌ, ์‹ค์ œ ๋Œ€๊ทœ๋ชจ ์ฒ ๋„ ๋„คํŠธ์›Œํฌ์—์„œ์˜ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ˜„์‹ค ์„ธ๊ณ„ ์ ์šฉ์„ ์œ„ํ•œ ์ถ”๊ฐ€ ์—ฐ๊ตฌ ํ•„์š”: Flatland-RL ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์—์„œ์˜ ๊ฒ€์ฆ์€ ๊ธ์ •์ ์ด์ง€๋งŒ, ์‹ค์ œ ์ฒ ๋„ ์‹œ์Šคํ…œ์˜ ๋™์ ์ธ ๋ณ€ํ™”, ์˜ˆ์ธก ๋ถˆ๊ฐ€๋Šฅํ•œ ์ด๋ฒคํŠธ, ์ธ๊ฐ„ ์šด์˜์ž์™€์˜ ์ƒํ˜ธ์ž‘์šฉ ๋“ฑ ํ˜„์‹ค์ ์ธ ์š”์†Œ๋“ค์„ ๋” ๋งŽ์ด ํ†ตํ•ฉํ•˜๋Š” ์—ฐ๊ตฌ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
๐Ÿ‘