Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems

Created by
  • Haebom

Author

Sanjeda Akter, Ibne Farabi Shihab, Anuj Sharma

Outline

This paper systematically surveys the impact of integrating large-scale language models (LLMs) with computer vision on perceptual tasks such as image segmentation. Focusing specifically on Intelligent Transportation Systems (ITS), we present the applications, challenges, and future directions of LLM-based image segmentation in ITS, where accurate scene understanding is crucial for safety and efficiency. We categorize various LLM-based image segmentation approaches based on their prompting mechanisms and core architectures, and highlight innovations that enhance road scene understanding for autonomous driving, traffic surveillance, and infrastructure maintenance. Finally, we identify key challenges such as real-time performance and safety-critical reliability, and present a perspective on explainable, human-centered AI as essential for the successful deployment of this technology in next-generation transportation systems.

Takeaways, Limitations

Takeaways:
Innovative advancements in image segmentation technology in the ITS field are presented through the integration of LLM and computer vision.
Exploring the application of LLM-based image segmentation to various ITS applications, including autonomous driving, traffic monitoring, and infrastructure maintenance.
Providing a systematic classification and analysis of LLM-based image segmentation approaches.
Presenting a direction for the development of explainable and human-centered AI.
Limitations:
Difficulties in ensuring real-time performance and safety-critical reliability
Lack of concrete methodologies for developing explainable and human-centered AI.
👍