Sign In

Screen, Cache, and Match: A Training-Free Causality-Consistent Reference Frame Framework for Human Animation

Created by
  • Haebom
Category
Empty

์ €์ž

Jianan Wang, Nailei Hei, Li He, Huanzhen Wang, Aoxing Li, Yingkai Zhao, Yuxuan Lin, Haofen Wang, Chunyang Wang, Yan Wang, Wenqiang Zhang

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ๋…ผ๋ฌธ์€ ์žฅ๊ธฐ์ ์ธ ์‹œ๊ฐ„ ์ผ๊ด€์„ฑ๊ณผ ์‹œ๊ฐ์  ์•ˆ์ •์„ฑ์„ ๊ฐ–์ถ˜ ์ธ๊ฐ„ ์• ๋‹ˆ๋ฉ”์ด์…˜ ์ƒ์„ฑ์ด๋ผ๋Š” ๊ณผ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ๊ณผ๊ฑฐ ํ”„๋ ˆ์ž„ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜๋Š” ์ธ๊ฐ„์˜ ๋Šฅ๋ ฅ์„ ๋ชจ๋ฐฉํ•œ ํ•™์Šต ์—†๋Š”(training-free) ์ธ๊ณผ๋ก ์  ์ผ๊ด€์„ฑ์„ ๊ฐ–์ถ˜ ์ฐธ์กฐ ํ”„๋ ˆ์ž„ ํ”„๋ ˆ์ž„์›Œํฌ์ธ FrameCache๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. FrameCache๋Š” Screen-Cache-Match(SCM) ์ „๋žต๊ณผ Trajectory-Aware Autoregressive Generation(TAAG) ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ํ†ตํ•ด ๊ณผ๊ฑฐ ์ƒ์„ฑ ๊ฒฐ๊ณผ๋ฌผ์„ ์ธ๊ณผ์  ์ง€์นจ์œผ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ, ์ •์ฒด์„ฑ ๋“œ๋ฆฌํ”„ํŠธ(identity drift)๋ฅผ ์ค„์ด๊ณ  ๋น„๋””์˜ค ์ฒญํฌ ๊ฐ„์˜ ๋…ธ์ด์ง• ๊ถค์ ์„ ์ •๋ ฌํ•ฉ๋‹ˆ๋‹ค. ์‹คํ—˜ ๊ฒฐ๊ณผ, FrameCache๋Š” ๋‹ค์–‘ํ•œ ํ™•์‚ฐ ๋ชจ๋ธ(diffusion baselines)๊ณผ ํ†ตํ•ฉ๋˜์–ด ์‹œ๊ฐ„์  ์ผ๊ด€์„ฑ๊ณผ ์‹œ๊ฐ์  ์•ˆ์ •์„ฑ์„ ์ง€์†์ ์œผ๋กœ ํ–ฅ์ƒ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
์‹œ์‚ฌ์  1: ํ•™์Šต ์—†์ด๋„ ์žฅ๊ธฐ์ ์ธ ์‹œ๊ฐ„ ์ผ๊ด€์„ฑ๊ณผ ์ •์ฒด์„ฑ ์•ˆ์ •์„ฑ์„ ์œ ์ง€ํ•˜๋Š” ์ธ๊ฐ„ ์• ๋‹ˆ๋ฉ”์ด์…˜ ์ƒ์„ฑ์˜ ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์‹œ์‚ฌ์  2: ๊ณผ๊ฑฐ ์ •๋ณด๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ฐธ์กฐํ•˜๊ณ  ํ™œ์šฉํ•˜๋Š” Screen-Cache-Match(SCM) ๋ฐ Trajectory-Aware Autoregressive Generation(TAAG) ๋ฉ”์ปค๋‹ˆ์ฆ˜์€ ํ–ฅํ›„ ์• ๋‹ˆ๋ฉ”์ด์…˜ ์ƒ์„ฑ ๋ชจ๋ธ ์„ค๊ณ„์— ์œ ์šฉํ•œ ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
โ€ข
ํ•œ๊ณ„์  ๋˜๋Š” ํ–ฅํ›„ ๊ณผ์ œ: ๋ณต์žกํ•˜๊ณ  ๋™์ ์ธ ์žฅ๋ฉด์—์„œ์˜ ์„ธ๋ถ€์ ์ธ ์ƒํ˜ธ์ž‘์šฉ์ด๋‚˜ ๊ธ‰๊ฒฉํ•œ ๋ณ€ํ™”์— ๋Œ€ํ•œ ์ถ”๊ฐ€์ ์ธ ์„ฑ๋Šฅ ๊ฒ€์ฆ ๋ฐ ์ตœ์ ํ™”๊ฐ€ ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘