Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation

Created by
  • Haebom

Author

Xinran Li, Xiujuan Xu, Jiaqi Qiao

Outline

In this paper, we address the practical yet challenging task of emotion recognition (ERC) in conversation, and propose a novel multimodal approach, the short-range graph neural network (LSDGNN). Based on the directed acyclic graph (DAG), we construct a long-range graph neural network and a short-range graph neural network to obtain multimodal features of distant and adjacent utterances, respectively. We use a differential regularizer to make long-range and short-range features as distinct as possible in expressions while allowing mutual influence between the two modules, and integrate bilinear modules to facilitate feature interaction. We also propose an improved curriculum learning (ICL) to address the data imbalance problem. We design a “weighted emotion change” index that emphasizes changes in similar emotions by calculating the similarity between different emotions, and develop a difficulty measure to enable a learning process that learns easy samples first. Experimental results on the IEMOCAP and MELD datasets show that the proposed model outperforms existing benchmarks.

Takeaways, Limitations

Takeaways:
We present LSDGNN, a novel multimodal approach to the problem of emotion recognition in conversations, which achieves performance improvements over existing methods.
We propose a novel architecture that effectively utilizes long-range and short-range contextual information.
We propose an improved curriculum learning method to address the data imbalance problem.
The importance of emotional changes was effectively reflected through the “weighted emotional change” indicator.
Limitations:
The computational complexity of the proposed model may be high.
Further experiments are needed to determine whether performance improvements for a specific dataset will generalize to other datasets.
Further research is needed to investigate the generality of the “weighted emotional change” index and its applicability to other emotion recognition tasks.
There is a lack of evaluation of the model's generalization performance across diverse linguistic and cultural backgrounds.
👍