Intermittent File Encryption in Ransomware: Measurement, Modeling, and Detection

📅 2025-10-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the evasion of conventional detection by intermittent ransomware (e.g., BlackCat), this work conducts a byte-level empirical study, systematically analyzing the statistical impact of partial encryption on diverse file structures and constructing the first benchmark dataset for intermittent encryption detection. We propose a hybrid probabilistic model based on KL divergence to theoretically characterize the detectability upper bounds across file formats. Furthermore, we design a block-level convolutional neural network (Block-CNN) that enables fine-grained modeling of local structural anomalies under realistic ransomware configurations. Experiments demonstrate that Block-CNN significantly outperforms global detection methods, achieving an average accuracy improvement of 12.7% on mainstream formats—including PDF, DOCX, and JPEG—while exhibiting strong robustness and cross-format generalization. This work establishes a reproducible theoretical framework and an efficient, practical solution for intermittent encryption detection.

Technology Category

Application Category

📝 Abstract
File encrypting ransomware increasingly employs intermittent encryption techniques, encrypting only parts of files to evade classical detection methods. These strategies, exemplified by ransomware families like BlackCat, complicate file structure based detection techniques due to diverse file formats exhibiting varying traits under partial encryption. This paper provides a systematic empirical characterization of byte level statistics under intermittent encryption across common file types, establishing a comprehensive baseline of how partial encryption impacts data structure. We specialize a classical KL divergence upper bound on a tailored mixture model of intermittent encryption, yielding filetype specific detectability ceilings for histogram-based detectors. Leveraging insights from this analysis, we empirically evaluate convolutional neural network (CNN) based detection methods using realistic intermittent encryption configurations derived from leading ransomware variants. Our findings demonstrate that localized analysis via chunk level CNNs consistently outperforms global analysis methods, highlighting their practical effectiveness and establishing a robust baseline for future detection systems.
Problem

Research questions and friction points this paper is trying to address.

Characterizing byte-level statistics under intermittent encryption across file types
Establishing detectability ceilings for histogram-based ransomware detectors
Evaluating CNN-based detection against realistic intermittent encryption configurations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Characterizes byte statistics under intermittent encryption
Specializes KL divergence bound on tailored mixture model
Evaluates CNN detection using realistic encryption configurations
🔎 Similar Papers
2023-01-26arXiv.orgCitations: 8
Y
Ynes Ineza
Texas Tech University, Lubbock, TX 79409
G
Gerald Jackson
Texas Tech University, Lubbock, TX 79409
P
Prince Niyonkuru
Texas Tech University, Lubbock, TX 79409
J
Jaden Kevil
Texas Tech University, Lubbock, TX 79409
Abdul Serwadda
Abdul Serwadda
Associate Professor, Computer Science, Texas Tech University
CybersecurityArtificial Intelligence