[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Persona-Based Synthetic Data Generation Using Multi-Stage Conditioning with Large Language Models for Emotion Recognition

Created by
  • Haebom

Author

Keito Inoshita, Rushia Harada

Outline

In this paper, we present PersonaGen, a novel emotion-rich text generation framework using large-scale language models (LLMs) to address the lack of high-quality and diverse emotion datasets in the field of emotion recognition. PersonaGen constructs hierarchical virtual personas by combining demographic attributes, sociocultural backgrounds, and detailed situational contexts to drive emotion expression generation. We perform comprehensive evaluations, including semantic diversity assessment via clustering and distribution metrics, human-likeness assessment via LLM-based quality scores, realism assessment via comparison with real emotion corpora, and practicality assessment for downstream emotion classification tasks. Experimental results show that PersonaGen significantly outperforms baseline methods in generating diverse, consistent, and discriminative emotion expressions, demonstrating its potential as a powerful alternative to complement or replace real emotion datasets.

Takeaways, Limitations

Takeaways:
A novel methodology for generating diverse and realistic emotional datasets using LLM is presented.
Contributes to solving the problem of insufficient existing emotional datasets
Contributes to improving the performance of downstream emotion classification tasks
Contribute to the advancement of emotion research and application fields
Limitations:
Difficulty in ensuring perfect realism of generated data (differences exist with actual emotional data)
LLM bias may affect the generated data
There are limitations in perfectly reflecting complex factors such as socio-cultural background and personal characteristics.
Ethical issues (ethical considerations needed in creating and utilizing virtual personas)
👍