Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response

Created by
  • Haebom

Author

Mollie Shichman, Claire Bonial, Austin Blodgett, Taylor Hudson, Francis Ferraro, Rachel Rudinger

Outline

This paper presents the potential of leveraging the physical reasoning capabilities of large-scale language models (LLMs) for human-robot interaction (HRI) in disaster relief situations. To address the size constraints of existing large LLMs, we propose a dataset and pipeline for generating a Field Reasoning and Instruction Decoding Agent (FRIDA) model. Combining the knowledge of domain experts and linguists, we generate high-quality, few-shot prompts, which are then used to fine-tune a small, instruction-tuned model using synthetic data. We experimentally demonstrate that a FRIDA model trained solely on object physical state and feature data outperforms models trained entirely on synthetic data and baseline models, demonstrating the ability to instill physical common sense with minimal data.

Takeaways, Limitations

Takeaways:
Suggesting the possibility of using small LLMs to empower disaster relief robots with physical reasoning capabilities.
Presenting a method for generating high-quality data and training efficient models through collaboration between domain experts and linguists.
We reveal that the physical state and functional data of objects are important factors in improving physical inference performance.
We demonstrate that effective LLM fine-tuning is possible with minimal data.
Limitations:
Further research is needed to evaluate the generalization performance of the currently proposed FRIDA pipeline and its applicability to various disaster situations.
Potential performance degradation due to limitations in the quality and quantity of synthetic data used.
Lack of experimental validation in real disaster environments.
Performance evaluation is needed for more diverse and complex physical reasoning tasks.
👍