Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Inside Knowledge: Graph-based Path Generation with Explainable Data Augmentation and Curriculum Learning for Visual Indoor Navigation

Created by
  • Haebom

Author

Daniel Airinei, Elena Burceanu, Marius Leordeanu

Outline

This paper presents an efficient, real-time, and deployable deep learning-based direction prediction method using only visual input from a mobile device to address indoor navigation challenges where GPS access is challenging. Specifically, we automate, enhance, and robust the data collection, annotation, and training processes through a novel graph-based path generation method, explainable data augmentation, and curriculum learning. Using a large-scale indoor shopping mall video dataset, we annotate each frame with the correct direction to the target location. Direction prediction is achieved solely using visual information, without the need for specialized sensors, additional path markers, scene map information, or internet access. We will develop and release a user-friendly Android application, along with the data and code.

Takeaways, Limitations

Takeaways:
We present a novel, efficient, real-time, and deployable deep learning-based solution for indoor navigation problems.
It can operate without additional sensors or infrastructure using only visual information.
Improve development efficiency through automated data collection, annotation, and training processes.
Improving research reproducibility and accessibility through the release of a large-scale indoor environment dataset and Android application.
Limitations:
Further evaluation of the performance and generalization ability of the currently proposed method is needed.
Verification of generalized performance for various indoor environments is necessary.
Possible degradation of generalization performance due to the use of datasets in a specific environment, such as a shopping mall.
Additional validation of the real-world usability and stability of Android applications is needed.
👍