Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

From Intents to Conversations: Generating Intent-Driven Dialogues with Contrastive Learning for Multi-Turn Classification

Created by
  • Haebom

Author

Junhua Liu, Yong Keat Tan, Bin Fu, Kwan Hui Lim

Outline

To address the challenges of generating large-scale, domain-specific multilingual conversation datasets, this paper presents Chain-of-Intent, a novel framework that integrates Hidden Markov Models (HMMs) and Large-Scale Language Models (LLMs). Chain-of-Intent extracts domain-specific intent transition patterns from real-world e-commerce chat logs and leverages them to model round-by-round dynamics and intent sequences. It then parameterizes the HMM's emission probabilities using LLMs to generate natural and consistent utterances that align with the predicted intent and conversation context. Furthermore, we propose MINT-CL, a multi-task contrastive learning framework that improves performance while reducing reliance on large annotated datasets. Experimental results demonstrate that the proposed method outperforms competing baseline models in both dialogue generation quality and classification accuracy, particularly in multilingual environments. Finally, we release MINT-E, a comprehensive multilingual, intent-aware multi-round conversation corpus derived from the e-commerce domain, for future research.

Takeaways, Limitations

Takeaways:
Combining HMM and LLM to solve the problem of generating large-scale datasets for training effective multilingual, multi-pass intent classification models.
Achieving reduced reliance on large annotated datasets and improved performance with MINT-CL.
Encouraging future research through the release of the multilingual, multi-session conversation corpus MINT-E.
It suggests applicability to various domains beyond e-commerce.
Limitations:
Because of the high reliance on actual e-commerce chat logs, performance can be affected by the quality of the log data.
It depends on the performance of LLM, and the limitations of LLM may affect the performance of Chain-of-Intent.
Further review of the size and diversity of the MINT-E dataset is needed.
Further experiments are needed to determine generalization performance to other domains.
👍