[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation

Created by
  • Haebom

Author

Xinran Li, Xiujuan Xu, Jiaqi Qiao

Outline

To address the practical yet challenging task of emotion recognition (ERC) in conversation, this paper proposes a novel multimodal approach, the long-range graph neural network (LSDGNN). Based on the directed acyclic graph (DAG), the long-range graph neural network and the short-range graph neural network are constructed to obtain multimodal features of distant and adjacent utterances, respectively. To enable the mutual influence between the two modules and to make the long-range and short-range features in the expression as distinct as possible, we use the differential regularizer and incorporate the bi-linear module (BiAffine Module) to facilitate feature interaction. In addition, we propose the improved curriculum learning (ICL) to address the data imbalance problem. By calculating the similarity between different emotions and emphasizing the changes in similar emotions, we design the “weighted emotion change” metric and develop a difficulty measure to enable the learning process to learn easy samples first. Experimental results on the IEMOCAP and MELD datasets demonstrate that the proposed model outperforms existing benchmarks.

Takeaways, Limitations

Takeaways:
We present a novel model (LSDGNN) that effectively utilizes multi-modal features to improve emotion recognition performance in conversations.
More accurate emotion recognition possible by considering both long-range and short-range speech information.
Presenting an improved curriculum learning (ICL) strategy to address data imbalance issues.
Presenting an efficient learning strategy using the "weighted emotional change" indicator.
Demonstrates superior performance over existing models on IEMOCAP and MELD datasets.
Limitations:
Further experiments are needed to evaluate the generalization performance of the proposed model.
Performance evaluation on various types of conversation data is required.
Analysis of the model's complexity and computational cost is needed.
A more detailed explanation of the design and application of the “weighted affective change” index is needed.
👍