Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Hespi: A pipeline for automatically detecting information from hebarium specimen sheets

Created by
  • Haebom

Author

Robert Turnbull, Emily Fitzgerald, Karen Thompson, Joanne L. Birch

Outline

Hespi (HErbarium Specimen sheet PIpeline) is a pipeline that extracts pre-catalog data from the primary specimen labels of herbarium specimens using computer vision technology. It integrates two object detection models that detect components and fields of specimen labels, classifies the type of labels (printed, typed, handwritten, mixed), and extracts text using OCR and HTR. The extracted text is proofread based on an authoritative taxonomic database and improved using a multi-modal LLM. Hespi accurately detects and extracts text from international herbarium specimen sheets, and its modular design allows for training and integrating custom models.

Takeaways, Limitations

Takeaways:
Dramatically improves the efficiency of extracting Herbarium sample data.
Save time and money by minimizing human intervention.
Universal applicability to international herbarium specimens.
Modular design allows for development of customized models.
Improving text extraction accuracy by leveraging multi-mode LLM.
Limitations:
Dependency on the amount and quality of data required to train the model.
Limitations of generalization performance for different label formats and handwritings.
Potential performance degradation or errors due to LLM dependency.
The need for optimization of specimen characteristics of a specific herbarium.
👍