Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

SCIZOR: A Self-Supervised Approach to Data Curation for Large-Scale Imitation Learning

Created by
  • Haebom

Author

Yu Zhang, Yuqi Xie, Huihan Liu, Rutav Shah, Michael Wan, Linxi Fan, Yuke Zhu

Outline

This paper proposes SCIZOR, a self-supervised learning-based data cleaning framework, to address the problem of poor quality in large-scale datasets in imitation learning, which trains robots to perform diverse behaviors. SCIZOR addresses two sources of poor data quality: suboptimal data (lack of task progression) and redundant patterns. Suboptimal data is removed using a self-supervised learning-based task progress predictor, and redundant data is removed using a deduplication module for the joint state-action representation. Experimental results demonstrate that SCIZOR achieves high-performance imitation learning policies even with limited data, achieving an average performance improvement of 15.4% across various benchmarks.

Takeaways, Limitations

Takeaways:
An efficient robot data purification method using self-supervised learning is presented.
Compared to the existing crude refinement methods at the dataset or trajectory level, precise refinement at the state-action pair level is possible.
High-performance imitation learning policies can be trained with little data.
Improve data quality by simultaneously processing suboptimal and duplicate data.
Limitations:
The performance of SCIZOR may depend on the performance of the self-supervised learning-based task progress predictor and deduplication module.
Further research is needed to determine the generalization performance of the proposed method. Applicability to various robotic tasks and datasets should be verified.
Additional explanation and analysis of the design and parameter settings of the task progress predictor and deduplication module are needed.
👍