ViTok-v2: Scaling Native Resolution Auto-Encoders to 5 Billion Parameters
Posted by
Haebom
Category
Empty
Authors
Philippe Hansen-Estruch, Jiahui Chen, Vivek Ramanujan, Orr Zohar, Yan Ping, Animesh Sinha, Markos Georgopoulos, Edgar Schoenfeld, Ji Hou, Felix Juefei-Xu, Sriram Vishwanath, Ali Thabet