[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

FlexiTex: Enhancing Texture Generation via Visual Guidance

Created by
  • Haebom

Author

DaDong Jiang, Xianghui Yang, Zibo Zhao, Sheng Zhang, Jiaao Yu, Zeqiang Lai, Shaoxiong Yang, Chunchao Guo, Xiaobo Zhou, Zhihui Ke

Outline

This paper points out that although recent texture generation methods have achieved remarkable results thanks to the powerful generative dictionary exploited in large-scale text-to-image diffusion models, abstract text prompts have limitations in providing global texture or shape information, resulting in blurry or inconsistent patterns. To address this issue, this paper presents FlexiTex, which embeds rich information through visual guidance to generate high-quality textures. The core of FlexiTex is the Visual Guidance Enhancement module, which reduces the ambiguity of text prompts by incorporating more specific information in the visual guidance and preserves high-frequency details. To further enhance the visual guidance, this paper introduces the Direction-Aware Adaptation module, which automatically designs direction prompts according to different camera poses to avoid the Janus problem and maintain semantic global consistency. With the advantage of visual guidance, FlexiTex produces quantitatively and qualitatively excellent results, showing the potential to advance texture generation for real-world applications.

Takeaways, Limitations

Takeaways:
A novel method to improve the quality of texture generation by using visual guidance is presented.
Reduce ambiguity in text prompts and maintain high-frequency detail and global consistency with the Visual Guidance Enhancement module and Direction-Aware Adaptation module.
Contributing to the advancement of texture generation technology for real-world applications
Limitations:
Further research is needed on the generalization performance of the presented method and its applicability to different texture types.
Need to analyze the computational cost and efficiency of the Visual Guidance Enhancement module and the Direction-Aware Adaptation module
Experimental results using large-scale datasets are needed, and further performance evaluations in various environments are needed.
👍