Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving

Created by
  • Haebom

Author

Xuewen Luo, Fengze Yang, Fan Ding, Xiangbo Gao, Shuo Xing, Yang Zhou, Zhengzhong Tu, Chenxi Liu

Outline

This paper proposes the V2X-UniPool framework, which integrates vehicle-to-everything (V2X) communication and language model-based reasoning to overcome the limitations of autonomous driving (AD). The framework transforms V2X data into structured language-based knowledge, stores it in a time-indexed knowledge pool for time-series consistent reasoning, and uses Retrieval-Augmented Generation (RAG) to make real-time context-based decisions. Experimental results using the real-world DAIR-V2X dataset demonstrate that V2X-UniPool achieves state-of-the-art planning accuracy and safety while reducing communication costs by more than 80%.

Takeaways, Limitations

Takeaways:
Integrating V2X and language models to improve autonomous driving performance (planning accuracy and safety)
Reduced communication costs (over 80%)
Validation on real dataset (DAIR-V2X)
Limitations:
There is no Limitations specified in the paper (however, since this is a summary of the paper, the detailed Limitations is omitted)
👍