Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

MLLM-Fabric: Multimodal Large Language Model-Driven Robotic Framework for Fabric Sorting and Selection

Created by
  • Haebom

Author

Liman Wang, Hanyang Zhong, Tianyuan Wang, Shan Luo, Jihong Zhu

Outline

This paper proposes MLLM-Fabric, a robotics framework that leverages a multimodal large-scale language model (MLLM), which plays a crucial role in selecting suitable fibers to meet functional and quality requirements in robotic textile manufacturing, apparel production, and smart retail. This system is trained to rank fiber attributes using supervised learning fine-tuning and explanation-based distillation. Furthermore, we release a dataset of 220 diverse fibers, synchronized with RGB images and visual-tactile and pressure data. Fabric-Llama-90B consistently outperforms pretrained vision-language-based models in both attribute ranking and selection confidence.

Takeaways, Limitations

Proposing a novel robotic framework for fiber classification and selection using MLLM.
Provides a public dataset containing RGB images, visual-tactile and pressure data for various fibers.
Demonstrated superior performance over existing vision-language-based models in attribute ranking and selection confidence.
(Limitations is not specified in the paper)
👍