Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

PromptAug: Fine-grained Conflict Classification Using Data Augmentation

Created by
  • Haebom

Author

Oliver Warke, Joemon M. Jose, Faegheh Hasibi, Jan Breitsohl

Outline

In this paper, we propose PromptAug, a novel data augmentation technique based on a large-scale language model (LLM) to improve the performance of conflict detection models on social media. Considering the difficulties in securing high-quality data and the ethical constraints of LLM, PromptAug was developed and showed a 2% improvement in accuracy and F1-score on conflict and emotion-related datasets. We rigorously evaluated the performance of PromptAug through various data-insufficient scenarios, quantitative diversity analysis, and qualitative topic analysis, and identified four issues in augmented texts: linguistic fluency, ambiguity of humor, ambiguity of augmented content, and misunderstanding of augmented content. We demonstrate the effectiveness of PromptAug through a comprehensive evaluation that combines natural language processing and social science methodologies.

Takeaways, Limitations

Takeaways:
We present the possibility of improving the performance of conflict detection models on social media using PromptAug, an LLM-based data augmentation technique.
Presenting an effective data augmentation method even in limited data environments.
Presenting an integrated research approach that combines natural language processing and social science methodology.
Limitations:
There are issues with augmented data, such as linguistic fluency, ambiguity of humor, ambiguity of augmented content, and misunderstanding of augmented content.
A 2% performance improvement may be relatively small.
Further research is needed on the generalization performance of PromptAug.
👍