Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

A Value Based Parallel Update MCTS Method for Multi-Agent Cooperative Decision Making of Connected and Automated Vehicles

Created by
  • Haebom

Author

Ye Han, Lijun Zhang, Dejian Meng, Zhuang Zhang, Xingyu Hu, Songyu Weng

Outline

This paper proposes a Monte Carlo Tree Search (MCTS) method with parallel updates for a multi-agent Markov game with a limited horizon and time discounting setting to solve lateral and longitudinal collaborative decision-making problems in multi-vehicle cooperative driving of Connected and Automated Vehicles (CAVs). By analyzing parallel behaviors in the multi-vehicle collaborative action space under partially steady-state traffic flow, the parallel update method increases search depth without sacrificing search breadth by quickly excluding potentially risky actions. The proposed method is tested on multiple randomly generated traffic flows, and experimental results demonstrate excellent robustness and outperform state-of-the-art reinforcement learning algorithms and heuristic methods. The vehicle driving strategy using the proposed algorithm demonstrates rationality superior to that of human drivers, and it improves traffic efficiency and safety in coordination zones.

Takeaways, Limitations

Takeaways:
We present an MCTS-based parallel update method that is effective for multi-agent Markov games with limited horizon and time discount settings.
Demonstrated superior performance and robustness over existing reinforcement learning algorithms and heuristic methods.
Presenting rational vehicle driving strategies that surpass human drivers and improving traffic efficiency and safety.
Limitations:
Lack of verification of the proposed algorithm's application to real road environments.
Verification of generalization performance is required for various traffic situations and complex road environments.
Further research is needed on the algorithm's computational complexity and real-time processing potential.
👍