This page organizes papers related to artificial intelligence published around the world. This page is summarized using Google Gemini and is operated on a non-profit basis. The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.
V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving
Created by
Haebom
Author
Xuewen Luo, Fengze Yang, Fan Ding, Xiangbo Gao, Shuo Xing, Yang Zhou, Zhengzhong Tu, Chenxi Liu
Outline
This paper proposes the V2X-UniPool framework, which integrates vehicle-to-everything (V2X) communication and language model-based reasoning to overcome the limitations of autonomous driving (AD). The framework transforms V2X data into structured language-based knowledge, stores it in a time-indexed knowledge pool for time-series consistent reasoning, and uses Retrieval-Augmented Generation (RAG) to make real-time context-based decisions. Experimental results using the real-world DAIR-V2X dataset demonstrate that V2X-UniPool achieves state-of-the-art planning accuracy and safety while reducing communication costs by more than 80%.
Takeaways, Limitations
•
Takeaways:
◦
Integrating V2X and language models to improve autonomous driving performance (planning accuracy and safety)
◦
Reduced communication costs (over 80%)
◦
Validation on real dataset (DAIR-V2X)
•
Limitations:
◦
There is no Limitations specified in the paper (however, since this is a summary of the paper, the detailed Limitations is omitted)