Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

HumAine-Chatbot: Real-Time Personalized Conversational AI via Reinforcement Learning

Created by
  • Haebom

Author

Georgios Makridis, George Fragiadakis, Jorge Oliveira, Tomaz Saraiva, Philip Mavrepis, Georgios Fatouros, Dimosthenis Kyriazis

Outline

HumAIne-chatbot is an AI-based conversational agent that provides personalized conversations based on user characteristics. It pre-trains various GPT-generated virtual personas to gain prior knowledge of a wide range of user types. During real-time interactions, it refines the user model by combining implicit signals (typing speed, emotions, engagement time) with explicit feedback (likes/dislikes) through a reinforcement learning agent. This profile is dynamically applied to the chatbot's conversational policies, adjusting content and style in real time. Controlled experiments with 50 synthetic personas showed that enabling personalization improved user satisfaction, personalization accuracy, and task accomplishment. Statistical analysis confirmed significant differences between personalized and non-personalized conditions.

Takeaways, Limitations

Takeaways:
Demonstrating the effectiveness of a personalized conversation system through AI-based user profiling.
Demonstrating the effectiveness of a personalized strategy based on online reinforcement learning that combines implicit and explicit feedback.
We have verified real performance improvements through increased user satisfaction, personalization accuracy, and task accomplishment.
Establishing a strong foundation for future validation in real-world environments.
Limitations:
Limited experiment size using 50 synthetic personas. Validation with real users is required.
Reliance on GPT-generated virtual personas. The need to ensure diversity and representativeness of real-world user data.
Because synthetic personas were used, there is a possibility of performance differences in interactions with real users.
Lack of consideration of the bias or errors that may arise in personalized models over long periods of use.
👍