Sample-Efficient LLM-Based Detection of Malicious Web Server Logs with Forensically Explainable Reasoning

📅 2026-06-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of achieving both high-accuracy detection and legally compliant interpretability in forensic analysis of web server logs—a balance that traditional methods struggle to attain. The authors propose CEF-Log, a novel approach that integrates a structured five-step reasoning template into few-shot prompts for large language models, thereby embedding expert knowledge to guide the model toward traceable, multi-step inference rather than reliance on memorized patterns. To evaluate performance in realistic attack scenarios, they introduce a new dataset, ForenWebLog. On the CSIC 2010 benchmark, CEF-Log achieves an F1 score of 0.99 using only four examples, demonstrating a tenfold improvement in sample efficiency over existing prompting methods. The generated explanations are both forensically traceable and suitable for inclusion in legal documentation.
📝 Abstract
Forensic analysis of web server logs demands both accurate detection and human-readable explanations that can satisfy legal requirements. We present CEF-Log, a context-enhanced few-shot chain-of-thought prompting strategy for Large Language Models that addresses this dual requirement. CEF-Log embeds expert investigative methodology through a structured five-step reasoning template, enabling the model to learn \textit{how} to analyze logs rather than \textit{what} patterns to memorize. Experimental evaluation demonstrates that CEF-Log achieves an F1-score of 0.99 on the CSIC 2010 dataset using only four examples while providing a $10\times$ improvement in sample efficiency compared to other prompting-based methods. We also introduce ForenWebLog, a new dataset that incorporates real-world attacks and multi-step attack sequences for comprehensive evaluation. Qualitative analysis confirms that CEF-Log generates traceable, accurate explanations suitable for forensic documentation, addressing the critical "black-box" limitation of traditional machine learning approaches.
Problem

Research questions and friction points this paper is trying to address.

malicious web server logs
forensic analysis
sample efficiency
explainable reasoning
large language models
Innovation

Methods, ideas, or system contributions that make the work stand out.

sample-efficient
chain-of-thought prompting
forensically explainable
large language models
web server log analysis
🔎 Similar Papers