Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

MIRROR: Multimodal Cognitive Reframing Therapy for Rolling with Resistance

Created by
  • Haebom

Author

Subin Kim, Hoonrae Kim, Jihyun Lee, Yejin Jeon, Gary Geunbae Lee

Outline

This paper proposes a multimodal approach that integrates nonverbal cues to address the client resistance problem encountered in text-based cognitive behavioral therapy (CBT) models in the field of psychotherapy utilizing large-scale language models (LLMs). Specifically, we introduce a novel synthetic dataset, Mirror (Multimodal Interactive Rolling with Resistance), which combines client speech and facial images. Visual language models (VLMs) trained on this dataset analyze facial cues to infer emotions and generate empathic responses. This model measures the strength of the therapeutic alliance in situations of client resistance and outperforms existing text-based CBT approaches.

Takeaways, Limitations

Takeaways:
A multimodal approach utilizing nonverbal cues can address client resistance and contribute to the formation of a therapeutic alliance.
The new synthetic dataset Mirror is an effective resource for training visual language models.
The proposed model outperforms existing text-based CBT approaches.
The effectiveness of the model was verified through expert evaluation.
Limitations:
Further research is needed to determine how well synthetic datasets reflect real data.
Research is needed on the generalizability of the model and its applicability to various psychological situations.
Sufficient consideration must be given to ethical and safety issues.
👍