Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

RUMI: Rummaging Using Mutual Information

Created by
  • Haebom

Author

Sheng Zhong, Nima Fazeli, Dmitry Berenson

Outline

This paper presents RUMI (Rummaging Using Mutual Information), a method for generating robot action sequences online to localize known movable objects in visually occluded environments. Focusing on high-contact rummaging, our approach exploits the mutual information between object position distributions and robot trajectories for action planning. From observed partial point clouds, RUMI infers compatible object position distributions and approximates the mutual information with task space occupancy in real time. Based on this, we develop information gain cost functions and reachability cost functions to keep objects within the robot's reach. These are integrated into a model predictive control (MPC) framework using a probabilistic dynamic model to update the position distribution in a closed loop. The main contributions include a novel belief framework for object position estimation, an efficient information gain computation strategy, and a robust MPC-based control scheme. RUMI demonstrates superior performance on both simulated and real tasks compared to baseline methods.

Takeaways, Limitations

Takeaways:
We present a novel method for effectively locating moving objects in visually occluded environments.
Development of an efficient information acquisition strategy using mutual information and a robust MPC-based control system.
Excellent performance verified through simulation and actual experiments.
Effectively applicable to high-contact, back-and-forth work.
Limitations:
Applies only to known movable objects.
Prior information about the shape and physical properties of the object is required.
Further research is needed on real-time processing speed and computational cost.
Verification of generalization performance for various environments and objects is required.
👍