Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Created by
  • Haebom

Author

Daniel Fl ogel, Marcos G omez Villafa ne, Joshua Ransiek, S oren Hohmann

Outline

In this paper, we present a novel approach for safe autonomous robot navigation in pedestrian-dense environments. We integrate uncertainty estimation into robot actions based on deep reinforcement learning (DRL), considering alleatoric, epistemic, and prediction uncertainties. We integrate observation-dependent variance (ODV) and dropout into the PPO algorithm, and compare and analyze deep ensembles with Monte Carlo dropout (MC-dropout) to estimate policy uncertainty. In uncertain decision-making situations, we shift the robot's social behavior to conservative collision avoidance. Experimental results demonstrate improved learning performance of PPO with ODV and dropout, and the impact of learning scenarios on generalization. MC-dropout is shown to be more sensitive to perturbations and better correlates uncertainty types with perturbations. Through safe action selection, the robot can navigate with fewer collisions in perturbed environments.

Takeaways, Limitations

Takeaways:
Integrating uncertainty estimation into DRL-based robot navigation to improve safety potential.
Improved learning performance confirmed by improving the PPO algorithm using ODV and dropout.
The possibility of more sophisticated uncertainty estimation is presented through correlation analysis of MC-dropout's perturbation sensitivity and uncertainty types.
Validation of the feasibility of safe robot navigation through conservative collision avoidance strategy.
Limitations:
Further verification of generalizability is needed due to the special nature of the experimental environment.
Further research is needed on robustness to different types of perturbations and pedestrian interactions.
Evaluation of computational costs and real-time processing performance that may occur when applying to real environments is required.
👍