🤖 AI Summary
To address the challenges of excessive model size, high inference latency, and deployment difficulty for deep learning–based intrusion detection systems (IDS) on resource-constrained IoT edge devices, this paper proposes a SHAP interpretability-driven feature pruning framework integrated with Kronecker-structured knowledge distillation. Our method jointly leverages SHAP-based feature importance analysis, sparse feature selection, lightweight network design via Kronecker-product parameterization, and soft-label knowledge distillation. Evaluated on the TON_IoT dataset, the student model achieves ~1000× compression in parameter count, attains a macro-F1 score of 0.986, and reduces single-inference latency to the millisecond level. Unlike conventional approaches, our work is the first to deeply couple model interpretability—via SHAP—with structured model compression, thereby significantly improving detection accuracy, energy efficiency, and scalability for edge deployment.
📝 Abstract
The widespread deployment of Internet of Things (IoT) devices requires intrusion detection systems (IDS) with high accuracy while operating under strict resource constraints. Conventional deep learning IDS are often too large and computationally intensive for edge deployment. We propose a lightweight IDS that combines SHAP-guided feature pruning with knowledge-distilled Kronecker networks. A high-capacity teacher model identifies the most relevant features through SHAP explanations, and a compressed student leverages Kronecker-structured layers to minimize parameters while preserving discriminative inputs. Knowledge distillation transfers softened decision boundaries from teacher to student, improving generalization under compression. Experiments on the TON_IoT dataset show that the student is nearly three orders of magnitude smaller than the teacher yet sustains macro-F1 above 0.986 with millisecond-level inference latency. The results demonstrate that explainability-driven pruning and structured compression can jointly enable scalable, low-latency, and energy-efficient IDS for heterogeneous IoT environments.