Sign In

Large Language Models Can Help Mitigate Barren Plateaus in Quantum Neural Networks

Created by
  • Haebom
Category
Empty

์ €์ž

Jun Zhuang, Chaowen Guan

๐Ÿ’ก ๊ฐœ์š”

์–‘์ž ์‹ ๊ฒฝ๋ง(QNN) ํ•™์Šต์˜ ์ฃผ์š” ๋‚œ์ œ์ธ '๋ฐ”๋ ˆ์ธ ๊ณ ์›(barren plateaus)' ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ๋ณธ ๋…ผ๋ฌธ์€ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์„ ํ™œ์šฉํ•œ ์ƒˆ๋กœ์šด ์ดˆ๊ธฐํ™” ๋ฐฉ๋ฒ•๋ก ์ธ 'AdaInit'์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. AdaInit์€ LLM์˜ ๋ถ€๋ถ„๋งˆํ‹ด๊ฒŒ์ผ(submartingale) ์†์„ฑ์„ ์ด์šฉํ•˜์—ฌ QNN์˜ ์ดˆ๊ธฐ ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ๋ฐ˜๋ณต์ ์œผ๋กœ ์ƒ์„ฑํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ๊ธฐ์šธ๊ธฐ ๋ถ„์‚ฐ์ด ์†Œ์‹ค๋˜๋Š” ํ˜„์ƒ์„ ์™„ํ™”ํ•ฉ๋‹ˆ๋‹ค. ๋‹ค์–‘ํ•œ QNN ๊ทœ๋ชจ์—์„œ ๊ธฐ์กด ๋ฐฉ๋ฒ•๋ก ๋ณด๋‹ค ๋†’์€ ๊ธฐ์šธ๊ธฐ ๋ถ„์‚ฐ์„ ์œ ์ง€ํ•จ์œผ๋กœ์จ AdaInit์ด ๋ฐ”๋ ˆ์ธ ๊ณ ์›์„ ํšจ๊ณผ์ ์œผ๋กœ ์™„ํ™”ํ•จ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜์—ฌ ์–‘์ž ์‹ ๊ฒฝ๋ง ํ•™์Šต์˜ ํ•ต์‹ฌ ๋ฌธ์ œ์ธ ๋ฐ”๋ ˆ์ธ ๊ณ ์›์„ ํšจ๊ณผ์ ์œผ๋กœ ์™„ํ™”ํ•  ์ˆ˜ ์žˆ๋Š” ์ƒˆ๋กœ์šด ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
๋ฐ์ดํ„ฐ์…‹ ํŠน์„ฑ๊ณผ ๊ธฐ์šธ๊ธฐ ํ”ผ๋“œ๋ฐฑ์„ ๋ฐ˜์˜ํ•˜๋Š” ๋™์ ์ด๊ณ  ์ ์‘์ ์ธ ์ดˆ๊ธฐํ™” ์ „๋žต์„ ํ†ตํ•ด ๊ธฐ์กด์˜ ์ •์ ์ธ ์ดˆ๊ธฐํ™” ๋ฐฉ๋ฒ•๋ก ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ด๋ก ์  ์ˆ˜๋ ด ๋ณด์žฅ๊ณผ ์‹ค์ฆ์  ๊ฒ€์ฆ์„ ํ†ตํ•ด ์ œ์•ˆ๋œ AdaInit ๋ฐฉ๋ฒ•๋ก ์˜ ํšจ๊ณผ์„ฑ์„ ์ž…์ฆํ–ˆ์Šต๋‹ˆ๋‹ค.
โ€ข
AdaInit์˜ LLM ํ”„๋กฌํ”„ํŠธ ์„ค๊ณ„ ๋ฐ LLM ์ž์ฒด์˜ ๊ณ„์‚ฐ ๋ณต์žก์„ฑ์ด ์‹ค์ œ QNN ํ•™์Šต์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์ถ”๊ฐ€์ ์œผ๋กœ ์—ฐ๊ตฌํ•  ํ•„์š”๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘