Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

A Multimodal Conversational Assistant for the Characterization of Agricultural Plots from Geospatial Open Data

Created by
  • Haebom

Author

Juan Ca nada, Raul Alonso, Julio Molleda, Fidel D iez

Outline

This paper addresses the growing availability of open Earth observations (EO) and agricultural datasets, which hold great potential for supporting sustainable land management, but high technical barriers to entry limit their accessibility to non-expert users. To address this, this study presents an open-source conversational assistant that integrates multimodal retrieval and a large-scale language model (LLM). The proposed architecture combines orthoimagery, Sentinel-2 vegetation indices, and user-provided documents via augmented retrieval generation (RAG), allowing the system to flexibly utilize multimodal evidence, textual knowledge, or both to construct answers. To assess response quality, we employ an LLM-as-a-judge methodology using Qwen3-32B in a zero-shot, unsupervised setting, scoring directly within a multidimensional quantitative evaluation framework. Preliminary results demonstrate that the system can generate clear, relevant, and context-aware responses to agricultural questions, and is reproducible and scalable across geographic regions. Key contributions include an architecture that fuses multimodal EO and textual knowledge sources, a demonstration of lowering barriers to accessing expert agricultural information through natural language interaction, and an open and reproducible design.

Takeaways, Limitations

Takeaways:
A novel architecture that combines multimodal data and LLM to improve accessibility to agricultural information is presented.
Presenting the possibility of developing an agricultural data analysis system based on natural language processing.
Ensuring the reliability and scalability of research results through open and reproducible research design.
The possibility of objective response quality assessment through the LLM-as-a-judge methodology is presented.
Limitations:
Only preliminary results are presented, requiring further verification of the system's actual performance and generalizability.
Limitations of assessment using Qwen3-32B and the need for additional assessment using other LLMs
Further research is needed to determine the system's applicability to diverse agricultural environments and languages.
Consideration needs to be given to the computing resources and costs required to process large datasets.
👍