Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes

Created by
  • Haebom

Author

Seunghyeok Back, Joosoon Lee, Kangmin Kim, Heeseon Rho, Geonhyup Lee, Raeyoung Kang, Sangbeom Lee, Sangjun Noh, Youngjin Lee, Taeyeop Lee, Kyoobin Lee

Outline

GraspClutter6D is a large-scale real-world grasping dataset designed to address the problem of robust object grasping in cluttered environments for robots. It overcomes the simple scenes, low occlusion rates, and lack of diversity of existing datasets. It features 1,000 densely cluttered scenes (average 14.1 objects per scene, 62.6% occlusion rate), 200 objects and 75 environment configurations (boxes, shelves, tables), and multi-viewpoint captures using four RGB-D cameras. Rich annotations are provided, including 736K 6D object poses and 9.3B possible robot grasps for 52K RGB-D images. We benchmark existing state-of-the-art segmentation, object pose estimation, and grasp detection methods to analyze the task in cluttered environments, and demonstrate that a grasping network trained on GraspClutter6D outperforms networks trained on existing datasets in both simulations and real-world experiments. The dataset, toolkit, and annotation tools are publicly available.

Takeaways, Limitations

Takeaways:
Providing a large-scale, real-world, cluttered environment phage dataset that overcomes the limitations of existing datasets.
Provides important insights into the task of phage in cluttered environments.
Demonstrating the superior performance of a learning network based on GraspClutter6D (simulated and real-world environments)
Enabling research through the release of datasets, toolkits, and annotation tools.
Limitations:
Although the dataset is large, it may not fully encompass all the messy environments of the real world.
There is a possibility of bias towards certain types of objects or environments.
Consideration must be given to noise and errors that may occur during the data collection process.
👍