Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Without Paired Labeled Data: End-to-End Self-Supervised Learning for Drone-view Geo-Localization

Created by
  • Haebom

Author

Zhongwei Chen, Zhao-Xu Yang, Hai-Jun Rong, Guoqi Li

Outline

This paper proposes a novel end-to-end self-supervised learning-based method, dynamic memory-driven and neighborhood information learning (DMNIL), to overcome the high annotation costs and limited transferability of existing supervised learning-based methods for drone view geolocation (DVGL). DMNIL generates pseudo labels using a clustering algorithm and learns discriminative within-view representations via a dual-path contrastive learning framework. Furthermore, it includes a dynamic hierarchical memory learning (DHML) module that combines short- and long-term memories to enhance within-view feature consistency and discriminability, and an information consistency evolutionary learning (ICEL) module that captures implicit inter-view semantic correlations using a neighborhood-based dynamic constraint mechanism to improve feature alignment across views. By stabilizing and enhancing the self-supervised learning process through a pseudo-label refinement strategy, it outperforms existing self-supervised learning methods and several state-of-the-art supervised learning methods on three public benchmark datasets.

Takeaways, Limitations

Takeaways:
Contributing to solving the annotation cost and transferability problems of drone positioning through self-supervised learning.
Achieving performance that surpasses existing supervised learning-based methods.
Improved feature consistency and alignment across viewpoints via DHML and ICEL modules.
Improving the stability and performance of self-supervised learning through a pseudo-label improvement strategy.
Increased research reproducibility and usability through open source code provision.
Limitations:
There is a possibility that the performance of the proposed method may be biased on certain datasets.
Further validation of generalization performance in various environments and conditions is needed.
Performance may be affected by the accuracy of the physician labels.
Real-time performance evaluation in actual drone operation environments is required.
👍