[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

ASMA: An Adaptive Safety Margin Algorithm for Vision-Language Drone Navigation via Scene-Aware Control Barrier Functions

Created by
  • Haebom

Author

Sourav Sanyal, Kaushik Roy

Outline

This paper addresses the problem of ensuring the safety of physical agents in visual-language navigation (VLN). In particular, we focus on drone navigation based on human-computer interaction, which must understand natural language commands, perceive the environment, and avoid dangers in real time. To this end, we propose a novel scene recognition CBF that utilizes egocentric observation information from RGB-D cameras by utilizing control barrier function (CBF) and model predictive control (MPC). The baseline system, which does not use the existing CBF, plans the path using a visual-language encoder and an object detection model. In addition, we propose an adaptive safety margin algorithm (ASMA) to track moving objects and perform scene recognition CBF evaluation in real time, which is utilized as an additional constraint within the MPC framework. When applied to a Parrot Bebop2 quadrotor in a Gazebo environment, we confirm that the success rate increases by 64%-67% compared to the baseline system, and the path length increases by only 1.4%-5.8%.

Takeaways, Limitations

Takeaways:
Presenting an effective method for safe autonomous navigation of drones in visual-language navigation (VLN).
Demonstration of the effectiveness of scene recognition CBF and adaptive safety margin algorithm (ASMA) using RGB-D camera information.
Verification of the possibility of safe and efficient path planning through the combination of MPC and CBF.
Presentation of experimental results using an actual drone platform (Parrot Bebop2).
Limitations:
Only experimental results in the Gazebo simulation environment are presented, performance verification in a real environment is required.
Further analysis of ASMA's computational complexity and real-time processing performance is needed.
Lack of generalization performance assessment across diverse environments and complex situations.
Lack of safety validation for various types of risk factors.
👍