Towards Leveraging LLMs to Generate Abstract Penetration Test Cases from Software Architecture

๐Ÿ“… 2026-03-24
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the critical gap in current software architecture design practicesโ€”the absence of effective security assessment mechanisms, which often allows vulnerabilities to persist into later development stages or even production. To bridge this gap, the paper introduces a novel Abstract Penetration Testing Case (APTC) metamodel that enables, for the first time, the automatic generation of security evaluation and penetration testing cases directly from software architecture models. By integrating large language models with tailored prompting strategies, the proposed approach autonomously produces APTCs with high practical utility (93%) and correctness (86%). This capability empowers architects to proactively identify security flaws at the design level and provides actionable guidance for subsequent penetration testing activities.

Technology Category

Application Category

๐Ÿ“ Abstract
Software architecture models capture early design decisions that strongly influence system quality attributes, including security. However, architecture-level security assessment and feedback are often absent in practice, allowing security weaknesses to propagate into later phases of the software development lifecycle and, in some cases, to remain undiscovered, ultimately leading to vulnerable systems. In this paper, we bridge this gap by proposing the generation of Abstract Penetration Test Cases (APTCs) from software architecture models as an input to support architecture-level security assessment. We first introduce a metamodel that defines the APTC concept, and then investigate the use of large language models with different prompting strategies to generate meaningful APTCs from architecture models. To design the APTC metamodel, we analyze relevant standards and state of the art using two criteria: (i) derivability from software architecture, and (ii) usability for both architecture security assessment and subsequent penetration testing. Building on this metamodel, we then proceed to generate APTCs from software architecture models. Our evaluation shows promising results, achieving up to 93\% usefulness and 86\% correctness, indicating that the generated APTCs can substantially support both architects (by highlighting security-critical design decisions) and penetration testers (by providing actionable testing guidance).
Problem

Research questions and friction points this paper is trying to address.

software architecture
security assessment
penetration testing
LLMs
abstract penetration test cases
Innovation

Methods, ideas, or system contributions that make the work stand out.

Abstract Penetration Test Cases
Software Architecture
Large Language Models
Security Assessment
Metamodel
๐Ÿ”Ž Similar Papers
No similar papers found.
M
Mahdi Jafari
Karlsruhe Institute of Technology (KIT), Germany
R
Rahul Sharma
Karlsruhe Institute of Technology (KIT), Germany
S
Sami Naim
Karlsruhe Institute of Technology (KIT), Germany
C
Christopher Gerking
Karlsruhe Institute of Technology (KIT), Germany
Ralf Reussner
Ralf Reussner
Professor for Software Engineering at KIT
Software ArchitecturePerformance EngineeringComponent Based Software EngineeringNon-functional