Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario

Created by
  • Haebom

Authors

Yinsong Chen, Kaifeng Wang, Xiaoqiang Meng, Xueyuan Li, Zirui Li, Xin Gao

Outline

This paper proposes a Red-Team Multi-Agent Reinforcement Learning framework for studying decision-making in safety-critical situations. Existing studies rely either on inefficient data-driven scenario generation or on specific modeling approaches, and both fail to capture real-world corner cases. The framework treats background vehicles that are able to interfere as adversarial agents (red-team agents), which actively interfere and explore to discover corner cases outside the data distribution. A Constraint Graph Representation Markov Decision Process ensures that the red-team agents comply with safety rules while continuously interfering with autonomous vehicles (AVs). In addition, a policy threat zone model quantifies the threat that the red-team agents pose to AVs and induces more extreme behaviors that raise the risk level of the scenario. Experimental results show that the proposed framework significantly affects the safety of AVs' decision-making and generates a variety of corner cases. The method offers a new direction for research on safety-critical scenarios.
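The summary does not give the paper's formulas, so the following is only a rough illustration of the general idea: a red-team agent is rewarded for threatening the AV and penalized for breaking a safety rule. Everything in it is an assumption made for illustration, including the names `Vehicle`, `threat_score`, `violates_safety_rule`, and `red_team_reward`, the rectangular zone shape, and the numeric defaults. It is not the authors' policy threat zone model or their constrained MDP.

```python
from dataclasses import dataclass


@dataclass
class Vehicle:
    x: float      # longitudinal position in meters
    y: float      # lateral position in meters
    speed: float  # longitudinal speed in m/s


def threat_score(av: Vehicle, red: Vehicle,
                 zone_length: float = 30.0, lane_width: float = 3.5) -> float:
    """Hypothetical threat measure in [0, 1].

    Assumes a rectangular zone ahead of the AV. The score grows as the
    red-team vehicle gets closer in front of the AV and more aligned
    with it laterally, and is 0 outside the zone.
    """
    gap = red.x - av.x
    lateral = abs(red.y - av.y)
    if gap <= 0 or gap >= zone_length or lateral >= lane_width:
        return 0.0
    return (1 - gap / zone_length) * (1 - lateral / lane_width)


def violates_safety_rule(red: Vehicle, speed_limit: float = 33.0) -> bool:
    """Hypothetical safety rule: no reversing and no speeding."""
    return red.speed < 0 or red.speed > speed_limit


def red_team_reward(av: Vehicle, red: Vehicle, penalty: float = 1.0) -> float:
    """Threat reward minus a fixed penalty when the safety rule is broken."""
    reward = threat_score(av, red)
    if violates_safety_rule(red):
        reward -= penalty
    return reward


av = Vehicle(x=0.0, y=0.0, speed=20.0)
print(red_team_reward(av, Vehicle(x=15.0, y=0.0, speed=20.0)))  # rule-abiding cut-in
print(red_team_reward(av, Vehicle(x=15.0, y=0.0, speed=40.0)))  # same position, speeding
```

In this toy version the safety rule is a soft penalty on the reward. A constrained formulation, as the summary describes, would instead restrict the behaviors available to the red-team agent, so the sketch should be read as a simplification.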

Takeaways, Limitations

Takeaways:
Contributes to improving the safety of autonomous driving decision-making in safety-critical situations.
Can discover and generate corner cases outside the data distribution.
Offers a novel approach through the Red-Team Multi-Agent Reinforcement Learning framework.
Limitations:
The proposed framework still needs to be validated in real-world environments.
Further research is needed on the accuracy and generalization performance of the policy threat zone model.
Adaptability to different types of interference and threats still needs to be assessed.