[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting

Created by
  • Haebom

Author

Ruijie Zhu, Mulin Yu, Linning Xu, Lihan Jiang, Yixuan Li, Tianzhu Zhang, Jiangmiao Pang, Bo Dai

Outline

ObjectGS is an object recognition framework that integrates semantic understanding for object-level recognition while maintaining the high-quality reconstruction and real-time new view synthesis capabilities of 3D Gaussian Splatting. Unlike conventional 3D Gaussian Splatting that processes the scene as a whole, ObjectGS models individual objects as local anchors to generate neural Gaussians and share object IDs to enable accurate object-level reconstruction. During training, these anchors are dynamically generated or removed, features are optimized, and explicit semantic constraints are enforced via one-hot ID encoding and classification losses. Experimental results show that ObjectGS outperforms state-of-the-art methods on open-vocabulary and full-view segmentation tasks, and integrates seamlessly with applications such as mesh extraction and scene editing.

Takeaways, Limitations

Takeaways:
Object-level recognition is possible while maintaining the advantages of 3D Gaussian Splatting and adding semantic understanding.
Achieving SOTA performance on open vocabulary and full-view segmentation tasks.
Seamless integration with a variety of applications such as mesh extraction and scene editing.
Efficient object modeling possible through dynamic anchor management.
Limitations:
The paper does not mention specific __T6648_____. Additional experiments and analyses are needed to identify __T6649_____ (e.g., potential performance degradation for specific types of objects or complex scenes, computational costs, etc.).
👍