Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

IRONIC: Coherence-Aware Reasoning Chains for Multi-Modal Sarcasm Detection

Created by
  • Haebom

Author

Aashish Anantha Ramakrishnan, Aadarsh Anantha Ramakrishnan, Dongwon Lee

Outline

This paper addresses the challenge of interpreting sarcasm in multimodal input. We highlight that existing Chain-of-Thought approaches fail to effectively leverage the cognitive processes humans use to identify sarcasm. We present IRONIC, a context-based learning framework that leverages multimodal coherence relations to analyze referential, analogical, and pragmatic image-text connections. Experimental results demonstrate that IRONIC achieves state-of-the-art performance in zero-shot multimodal sarcasm detection over various baseline models. This highlights the need to integrate linguistic and cognitive insights into the design of multimodal inference strategies.

Takeaways, Limitations

Takeaways: We demonstrate that a context-based learning framework leveraging multimodal consistency relations achieves state-of-the-art performance in zero-shot multimodal sarcasm detection, highlighting the importance of integrating linguistic and cognitive insights into multimodal inference strategies.
Limitations: Further research is needed to evaluate the generalization performance and robustness of the IRONIC model presented in this paper to various satire types. Furthermore, further analysis is needed to determine its dependence on specific datasets and its interpretability.
👍