This is a page that curates AI-related papers published worldwide. All content here is summarized using Google Gemini and operated on a non-profit basis. Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.
This paper aims to improve the navigation and interaction capabilities of mobile robots to meet human needs in unknown, unstructured environments. To overcome the generalization limitations of existing data-driven demand-driven navigation (DDN) methods, we propose CogDDN, a VLM-based framework that mimics human cognitive and learning mechanisms. CogDDN integrates fast and slow thinking systems and selectively identifies key objects essential for satisfying user needs, thereby identifying appropriate target objects. It utilizes a dual decision-making module comprised of heuristic and analytical processes to achieve efficient decision-making, improve performance through past error analysis and knowledge base accumulation, and enhances the decision-making process through Chain of Traits (CoT) inference. Experimental results using the AI2Thor simulator and the ProcThor dataset demonstrate that CogDDN demonstrates 15% improved navigation accuracy and adaptability compared to existing single-camera approaches.
Takeaways, Limitations
•
Takeaways:
◦
CogDDN, a novel DDN framework that mimics human cognitive processes, is presented.
◦
Efficient and accurate decision-making through integration of fast and slow thinking systems
◦
Continuous performance improvement through learning from past errors and accumulating a knowledge base.
◦
15% improved navigation performance compared to the existing single-camera method
•
Limitations:
◦
Dependency on the AI2Thor simulator and ProcThor dataset. Further validation is required for real-world application.
◦
Further research is needed to determine the scalability of the current system and its ability to generalize to diverse environments.
◦
Optimization research is needed on the ratio and interaction between heuristic and analytical processes.