[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

University of Indonesia at SemEval-2025 Task 11: Evaluating State-of-the-Art Encoders for Multi-Label Emotion Detection

Created by
  • Haebom

Author

Ikhlasul Akmal Hanif, Eryawan Presma Yulianrifat, Jaycent Gunawan Ongris, Eduardus Tjitrahardja, Muhammad Falensi Azmi, Rahmat Bryan Naufal, Alfan Farizki Wicaksono

Outline

This paper presents our approach for SemEval 2025 Task 11 Track A (Multi-Label Sentiment Classification across 28 Languages). We explore two main strategies: fully fine-tuning a Transformer model and classifier-only training, and evaluate different settings including fine-tuning strategy, model architecture, loss function, encoder, and classifier. We find that training a classifier on top of a prompt-based encoder such as mE5 and BGE yields significantly better results than fully fine-tuning XLMR and mBERT. The best-performing model in the final leaderboard is an ensemble combining multiple BGE models, using CatBoost as a classifier, and applying different configurations. This ensemble achieves an average F1-macro score of 56.58 across all languages.

Takeaways, Limitations

Takeaways: We demonstrate that classifier training using prompt-based encoders (mE5, BGE) performs well on multilingual sentiment classification. We show that ensemble techniques can achieve better performance. We present an effective approach for multi-label sentiment classification across multiple languages.
Limitations: Focusing on optimizing a specific encoder and classifier combination may lack exploration of other approaches. Presenting average performance for 28 languages, but may lack analysis of performance differences across individual languages. Further research is needed to achieve higher F1-macro scores.
👍