Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics

Created by
  • Haebom

Author

Kai Yin, Bo Li, Chengkai Liu, Ali Mostafavi, Xia Hu

Outline

This paper presents a novel approach that overcomes the limitations of existing single-label classification models by considering the multifaceted and dynamic nature of disaster-related social media data. We introduce a directive fine-tuning technique that utilizes a large-scale language model (LLM) to perform multi-label classification of disaster-related tweets. We build a comprehensive directive dataset from disaster-related tweets and use it to fine-tune an open-source LLM to incorporate disaster-specific knowledge. The fine-tuned model simultaneously classifies information across multiple dimensions, including disaster type, informational value, and humanitarian intervention, significantly enhancing the utility of social media data for disaster situational awareness. Experimental results demonstrate that this approach improves the classification of critical information in social media posts, making it more effective for situational awareness in emergency situations. This contributes to the development of more advanced, adaptive, and robust disaster management tools that enhance real-time situational awareness and response strategies.

Takeaways, Limitations

Takeaways:
Multi-label classification enables comprehensive analysis of various aspects of disaster-related social media data.
Effectively embedding disaster-specific knowledge into models through LLM-based directive fine-tuning techniques.
Presenting the possibility of developing effective disaster management tools that contribute to improving disaster situation awareness and response strategies.
Efficient extraction of information necessary for real-time situational awareness and rapid response
Limitations:
Lack of specific details about the size and quality of the dataset used.
Absence of comparative analysis with other multi-label classification models
Lack of real-time application and performance evaluation in actual disaster situations
The possibility of using datasets biased towards specific languages or regions, and the resulting potential for poor generalization performance.
👍