Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes

Created by
  • Haebom

Author

Seunghyeok Back, Joosoon Lee, Kangmin Kim, Heeseon Rho, Geonhyup Lee, Raeyoung Kang, Sangbeom Lee, Sangjun Noh, Youngjin Lee, Taeyeop Lee, Kyoobin Lee

Outline

GraspClutter6D is a large-scale real-world grasp dataset designed to address the problem of robust object grasping in complex environments for robots. To overcome the simple scenes and lack of diversity of existing datasets, it contains 1,000 dense (14.1 objects/scene, 62.6% occlusion) complex scenes, 52,000 RGB-D images of 200 objects captured from various angles in 75 different environmental configurations (boxes, shelves, and tables), 736,000 6D object poses, and 9.3 billion possible robot grasps. In this paper, we use this dataset to evaluate the performance of state-of-the-art segmentation, object pose estimation, and grasp detection methods, and demonstrate that grasp networks trained on GraspClutter6D outperform networks trained on existing datasets in both simulations and field experiments. The dataset, toolkit, and annotation tools are publicly available.

Takeaways, Limitations

Takeaways:
Providing a large-scale real-world complex environmental phage dataset that overcomes the limitations of existing datasets.
Contributes to the study of realistic phage problems, including diverse environments and objects, and high occlusion ratios.
Experimentally demonstrated that learning using GraspClutter6D yields superior retention performance over existing datasets.
Increase reproducibility and scalability of research through open datasets, toolkits, and annotation tools.
Limitations:
Despite the size of the dataset, it may not perfectly reflect all the complex environments of the real world.
There is a possibility that there may be a bias towards certain types of objects or environments.
Consideration must be given to noise and errors that may occur during the data collection process.
👍