🤖 AI Summary
This study addresses the lack of personalized, real-time, emotionally adaptive self-dialogue tools in mental health interventions by proposing the first voice-cloned self-speech system for emotion regulation. The method integrates end-to-end text-to-speech (TTS), large language model (LLM)-driven empathic dialogue generation, and dynamic prosodic feature modulation to deliver psychologically supportive feedback in the user’s own voice. Its key contribution is pioneering the use of self-voice deepfake technology at the intersection of human–computer interaction (HCI) and mental health, enabling immersive “self-to-self” positive self-talk interventions. A user study (N=62) showed statistically significant improvements: increased willingness to self-disclose (p<0.01), reduced intensity of negative thinking (Cohen’s d=0.82), and a 27.3% average increase in emotional well-being scores (p<0.001). These results empirically validate self-voice-based emotion regulation as a novel therapeutic paradigm.
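The summary does not name the components behind this pipeline, so the following is a minimal sketch of one way to wire an LLM to a voice-cloning TTS model. The library choices (OpenAI client, Coqui TTS with XTTS v2), the model name, the system prompt, and all file names are illustrative assumptions, not details from the paper.

```python
# Hypothetical sketch of the described pipeline: an LLM generates an
# empathic reply, which a voice-cloning TTS model speaks in the user's
# own voice. Library and model choices are assumptions for illustration.
from openai import OpenAI
from TTS.api import TTS

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2")  # voice-cloning TTS

SYSTEM_PROMPT = (  # hypothetical prompt; the paper's actual prompt is not given
    "You are the user's supportive inner voice. Respond in the first person "
    "with brief, empathic statements that reframe negative thoughts."
)

def self_talk_turn(user_text: str, voice_sample: str, out_wav: str) -> str:
    """Generate an empathic reply and synthesize it in the user's cloned voice."""
    resp = client.chat.completions.create(
        model="gpt-4o",  # placeholder model name
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": user_text},
        ],
    )
    reply = resp.choices[0].message.content
    # XTTS clones the speaker's timbre from a short reference recording.
    tts.tts_to_file(text=reply, speaker_wav=voice_sample,
                    language="en", file_path=out_wav)
    return reply

print(self_talk_turn("I keep thinking I'll fail tomorrow.",
                     voice_sample="my_voice_sample.wav",
                     out_wav="inner_self_reply.wav"))
```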
📝 Abstract
One's own voice is one of the most frequently heard voices. Studies found that hearing and talking to oneself have positive psychological effects. However, the design and implementation of self-voice for emotional regulation in HCI have yet to be explored. In this paper, we introduce InnerSelf, an innovative voice system based on speech synthesis technologies and the Large Language Model. It allows users to engage in supportive and empathic dialogue with their deepfake voice. By manipulating positive self-talk, our system aims to promote self-disclosure and regulation, reshaping negative thoughts and improving emotional well-being.