Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

OC-SOP: Enhancing Vision-Based 3D Semantic Occupancy Prediction by Object-Centric Awareness

Created by
  • Haebom

Author

Helin Cao, Sven Behnke

Outline

This paper presents the Semantic Occupancy Prediction (SOP) task, which simultaneously infers scene geometry and semantic labels from images to address the challenges of object occlusion and incomplete scene data in autonomous driving perception systems. To address the shortcomings of existing camera-based methods, which treat all categories equally and rely primarily on local features, resulting in poor prediction performance, especially for dynamic foreground objects, we propose an Object-Centric SOP (OC-SOP) framework. OC-SOP significantly improves foreground object prediction accuracy by incorporating high-level object-centric cues extracted through detection branches into the semantic occupancy prediction pipeline, achieving state-of-the-art performance across all categories on the SemanticKITTI dataset.

Takeaways, Limitations

Takeaways:
We improved the accuracy of semantic occupancy prediction, especially dynamic foreground object prediction, by incorporating object-centric cues.
Achieved state-of-the-art performance on the SemanticKITTI dataset.
It can contribute to improving the performance of autonomous driving perception systems.
Limitations:
The performance of the proposed method may be limited to a specific dataset (SemanticKITTI).
Further validation of generalization performance in real-world environments is needed.
Since it may depend on the performance of the detection branch, an error in the detection branch may affect the overall system performance.
👍