Improving Phishing Resilience with AI-Generated Training: Evidence on Prompting, Personalization, and Duration

📅 2025-12-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional phishing defense training is labor-intensive and difficult to scale. Method: This study proposes an LLM-based automated training content generation framework, employing a lightweight user-feature embedding prompting strategy—bypassing complex personalization modeling—to efficiently produce customized training materials. A controlled experiment (N=480) rigorously evaluates efficacy. Results: AI-generated training significantly improves users’ phishing detection accuracy; extending training duration yields only marginal gains in performance; and simple prompts achieve near-optimal outcomes, demonstrating that fine-grained user profiling is unnecessary for effective LLM-driven security education. The approach is empirically validated to be effective, highly scalable, and deployment-friendly—establishing a novel paradigm for AI-augmented cybersecurity education.

Technology Category

Application Category

📝 Abstract
Phishing remains a persistent cybersecurity threat; however, developing scalable and effective user training is labor-intensive and challenging to maintain. Generative Artificial Intelligence offers an interesting opportunity, but empirical evidence on its instructional efficacy remains scarce. This paper provides an experimental validation of Large Language Models (LLMs) as autonomous engines for generating phishing resilience training. Across two controlled studies (N=480), we demonstrate that AI-generated content yields significant pre-post learning gains regardless of the specific prompting strategy employed. Study 1 (N=80) compares four prompting techniques, finding that even a straightforward "direct-profile" strategy--simply embedding user traits into the prompt--produces effective training material. Study 2 (N=400) investigates the scalability of this approach by testing personalization and training duration. Results show that complex psychometric personalization offers no measurable advantage over well-designed generic content, while longer training duration provides a modest boost in accuracy. These findings suggest that organizations can leverage LLMs to generate high-quality, effective training at scale without the need for complex user profiling, relying instead on the inherent capabilities of the model.
Problem

Research questions and friction points this paper is trying to address.

AI-generated phishing training effectiveness validation
Comparing prompting strategies for personalized content creation
Evaluating personalization and duration impact on learning gains
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-generated phishing training using LLMs
Simple prompting with user traits effective
Generic content scalable without complex personalization
🔎 Similar Papers
No similar papers found.
F
Francesco Greco
Department of Computer Science, University of Bari Aldo Moro, Via E. Orabona 4, 70125, Italy
Giuseppe Desolda
Giuseppe Desolda
University of Bari Aldo Moro
Novel Interaction TechniquesInternet of ThingsUsable Security
C
Cesare Tucci
Department of Computer Science, University of Bari Aldo Moro, Via E. Orabona 4, 70125, Italy
A
Andrea Esposito
Department of Computer Science, University of Bari Aldo Moro, Via E. Orabona 4, 70125, Italy
Antonio Curci
Antonio Curci
Department of Computer Science, University of Bari Aldo Moro, Via E. Orabona 4, 70125, Italy
Antonio Piccinno
Antonio Piccinno
University of Bari "Aldo Moro" - Italy
End-User DevelopmentHuman-Computer InteractionVisual Interactive SystemsTheory of Visual LanguagesWWW Interfaces