Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

GalaxAlign: Mimicking Citizen Scientists' Multimodal Guidance for Galaxy Morphology Analysis

Created by
  • Haebom

Author

Ruoqi Wang, Haitao Wang, Qiong Luo

Outline

GalaxAlign is a novel multimodal approach for galaxy morphology analysis. To overcome the high cost or low accuracy of existing methods, it was inspired by the way citizen scientists identify galaxies using text descriptions and schematic symbols. GalaxAlign uses a trimodal alignment framework that aligns three types of data—schematic symbols, text labels, and galaxy images—during the fine-tuning process. This enables effective fine-tuning without expensive pretraining, and demonstrates performance improvements for galaxy classification and similarity retrieval tasks.

Takeaways, Limitations

Takeaways:
Effectively fine-tuning common pre-trained models to astronomical tasks in a cost-effective manner.
Improve accuracy by leveraging multimodal information (schematic symbols, text, images).
Presenting effective learning strategies by mimicking the approaches of citizen scientists.
Shows improved performance in galaxy classification and similarity search tasks.
Limitations:
Lack of detailed analysis of how the proposed method compares to other multi-modal approaches.
Lack of performance analysis for specific types of galaxies or specific image qualities.
Further research is needed on the generalizability of the schematic symbols and text labels used.
Generalization performance needs to be verified on datasets other than real astronomical datasets.
👍