HumAIne-Chatbot: Real-Time Personalized Conversational AI via Reinforcement Learning

📅 2025-09-04
🤖 AI Summary
Contemporary conversational AI systems do not explicitly model differences between individual users, which hinders dynamic adaptation of both content and style. To address this, we propose a personalized dialogue management framework based on reinforcement learning. First, we use GPT to generate diverse synthetic personas for policy pre-training. Second, we construct an online user profile that fuses implicit behavioral signals (typing speed, dwell time, and affective responses) with explicit feedback, and we optimize the dialogue policy in real time via multi-step reinforcement learning. Third, we introduce a fine-grained dynamic policy model that adapts along two dimensions: style and content. Evaluated across 50 synthetic personas, the system improves user satisfaction (+28.6%), personalization accuracy (+31.4%), and task completion rate (+24.9%), with large effect sizes and high statistical significance (p < 0.001).

📝 Abstract
Current conversational AI systems often provide generic, one-size-fits-all interactions that overlook individual user characteristics and lack adaptive dialogue management. To address this gap, we introduce HumAIne-chatbot, an AI-driven conversational agent that personalizes responses through a novel user profiling framework. The system is pre-trained on a diverse set of GPT-generated virtual personas to establish a broad prior over user types. During live interactions, an online reinforcement learning agent refines per-user models by combining implicit signals (e.g., typing speed, sentiment, engagement duration) with explicit feedback (e.g., likes and dislikes). This profile dynamically informs the chatbot's dialogue policy, enabling real-time adaptation of both content and style. To evaluate the system, we performed controlled experiments with 50 synthetic personas in multiple conversation domains. The results showed consistent improvements in user satisfaction, personalization accuracy, and task achievement when personalization features were enabled. Statistical analysis confirmed significant differences between personalized and non-personalized conditions, with large effect sizes across key metrics. These findings highlight the effectiveness of AI-driven user profiling and provide a strong foundation for future real-world validation.
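As a rough illustration of how implicit signals and explicit feedback might be combined into a single learning signal, the sketch below blends them into one scalar reward. It is not the authors' formulation: the `TurnSignals` fields, the normalization constants (5 characters per second, 60 seconds), and the `explicit_weight` parameter are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TurnSignals:
    """Signals observed for one user turn (hypothetical field names)."""
    typing_speed_cps: float       # characters per second typed by the user
    sentiment: float              # sentiment score, expected in [-1, 1]
    engagement_seconds: float     # time the user spent on this turn
    explicit_feedback: Optional[int] = None  # +1 like, -1 dislike, None if absent


def clamp(x: float, lo: float, hi: float) -> float:
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def fused_reward(s: TurnSignals, explicit_weight: float = 0.6) -> float:
    """Blend implicit and explicit signals into a reward in [-1, 1]."""
    # Assumed normalizations: 5 cps and 60 s are treated as "high" values.
    speed = clamp(s.typing_speed_cps / 5.0, 0.0, 1.0)
    engagement = clamp(s.engagement_seconds / 60.0, 0.0, 1.0)
    sentiment = clamp(s.sentiment, -1.0, 1.0)

    # Rescale speed and engagement from [0, 1] to [-1, 1], then average
    # the three implicit components with equal weight.
    implicit = (sentiment + (2 * speed - 1) + (2 * engagement - 1)) / 3.0

    # Without explicit feedback, fall back to the implicit estimate alone.
    if s.explicit_feedback is None:
        return implicit
    return explicit_weight * s.explicit_feedback + (1 - explicit_weight) * implicit
```

A reward of this kind could then be fed to whatever online learner maintains the per-user model. Weighting explicit feedback more heavily than implicit signals is a design choice of this sketch, not something stated in the abstract.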
Problem

Research questions and friction points this paper is trying to address.

Personalizing conversational AI interactions for individual users
Overcoming generic one-size-fits-all chatbot responses
Adapting dialogue management using real-time reinforcement learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning refines user models dynamically
GPT-generated personas establish broad user type priors
Dynamic profile adapts content and style in real time
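The ideas listed above (a prior from synthetic personas that is refined per user during live interaction) can be sketched with a deliberately simplified learner. The example below uses an epsilon-greedy bandit over response styles, which is a stand-in technique and not the reinforcement learning agent described in the paper. The class name `StyleBandit`, the style labels, and the `prior_weight` parameter are hypothetical.

```python
import random


class StyleBandit:
    """Epsilon-greedy choice among response styles for one user.

    Value estimates start from a prior (for example, averages learned
    from synthetic personas) and are refined by observed rewards.
    """

    def __init__(self, prior_values, prior_weight=5.0, epsilon=0.1, seed=None):
        # Copy the prior so different users do not share state.
        self.values = dict(prior_values)
        # Treat the prior as if it came from `prior_weight` pseudo-observations.
        self.counts = {style: prior_weight for style in prior_values}
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def choose(self):
        """Pick a random style with probability epsilon, else the best one."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(sorted(self.values))
        return max(self.values, key=self.values.get)

    def update(self, style, reward):
        """Move the running mean for `style` toward the observed reward."""
        self.counts[style] += 1
        self.values[style] += (reward - self.values[style]) / self.counts[style]


# Usage: start from a persona-derived prior, then adapt to one user.
bandit = StyleBandit({"concise": 0.2, "detailed": 0.1}, prior_weight=1.0, epsilon=0.0)
first_choice = bandit.choose()
bandit.update("detailed", 1.0)   # this user responded well to a detailed reply
second_choice = bandit.choose()
```

A larger `prior_weight` makes the persona prior harder to override, which trades adaptation speed for stability early in a conversation.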
Georgios Makridis
University of Piraeus
neural networks, data science, machine learning, information theory, anomaly detection
Georgios Fragiadakis
Department of Informatics and Telematics, Harokopio University of Athens
Jorge Oliveira
HEI-Lab, University Lusofona/Immersive Lives
Tomaz Saraiva
Immersive Lives, Lisbon, Portugal
Philip Mavrepis
Department of Digital Systems, University of Piraeus
Georgios Fatouros
Department of Digital Systems, University of Piraeus
Dimosthenis Kyriazis
University of Piraeus
Distributed Computing, Data Management & Analytics, IoT