🤖 AI Summary
Aphasia exhibits high heterogeneity in communication impairments, posing significant challenges for designing effective augmentative and alternative communication (AAC) systems.
Method: This study proposes an AI-capability-centered “research-through-design” two-phase paradigm, developing four multimodal AI-augmented AAC prototypes. Technical approaches integrate text/image/speech generation, real-time automatic speech recognition (ASR) and text-to-speech (TTS), personalized language model fine-tuning, and user-intent co-calibration. We introduce four novel interaction mechanisms: visual verification, syntactic construction support, error correction, and cognitive load reduction.
Contribution/Results: In a user study with 11 persons with aphasia (PWAs), the prototypes significantly improved communicative intelligibility and perceived autonomy. Key challenges—including AI-user intent misalignment—were identified, yielding 12 evidence-based design principles for clinically deployable AAC. The core innovation lies in deeply embedding AI-native capabilities into a human–AI collaborative communication loop, transforming AAC from a passive output tool into a cognitively supportive conversational partner capable of pre-utterance preparation and real-time adaptation.
📝 Abstract
AI offers key advantages such as instant generation, multi-modal support, and personalized adaptability - potential that can address the highly heterogeneous communication barriers faced by people with aphasia (PWAs). We designed AI-enhanced communication tools and used them as design probes to explore how AI's real-time processing and generation capabilities - across text, image, and audio - can align with PWAs' needs in real-time communication and preparation for future conversations respectively. Through a two-phase"Research through Design"approach, eleven PWAs contributed design insights and evaluated four AI-enhanced prototypes. These prototypes aimed to improve communication grounding and conversational agency through visual verification, grammar construction support, error correction, and reduced language processing load. Despite some challenges, such as occasional mismatches with user intent, findings demonstrate how AI's specific capabilities can be advantageous in addressing PWAs' complex needs. Our work contributes design insights for future Augmentative and Alternative Communication (AAC) systems.