M3S-UPD: Efficient Multi-Stage Self-Supervised Learning for Fine-Grained Encrypted Traffic Classification with Unknown Pattern Discovery

📅 2025-05-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Encrypted traffic analysis faces dual challenges: fine-grained classification of known applications and detection of previously unseen traffic patterns. Method: This paper proposes a four-stage self-supervised collaborative framework that jointly models classification and anomaly detection. It introduces the first self-supervised unknown-pattern detection mechanism—requiring neither synthetic samples nor prior knowledge—and designs a concept-drift-resilient continual learning architecture. The architecture integrates multi-stage self-supervised learning, probabilistic embedding generation, clustering-driven structural discovery, distribution-alignment-based anomaly identification, and confidence-aware model updating. Contribution/Results: The framework achieves state-of-the-art performance on few-shot classification and zero-shot unknown traffic discovery. It delivers a deployable, robust, and scalable solution for real-world network management, significantly outperforming existing methods in both accuracy and adaptability under dynamic network conditions.

Technology Category

Application Category

📝 Abstract
The growing complexity of encrypted network traffic presents dual challenges for modern network management: accurate multiclass classification of known applications and reliable detection of unknown traffic patterns. Although deep learning models show promise in controlled environments, their real-world deployment is hindered by data scarcity, concept drift, and operational constraints. This paper proposes M3S-UPD, a novel Multi-Stage Self-Supervised Unknown-aware Packet Detection framework that synergistically integrates semi-supervised learning with representation analysis. Our approach eliminates artificial segregation between classification and detection tasks through a four-phase iterative process: 1) probabilistic embedding generation, 2) clustering-based structure discovery, 3) distribution-aligned outlier identification, and 4) confidence-aware model updating. Key innovations include a self-supervised unknown detection mechanism that requires neither synthetic samples nor prior knowledge, and a continuous learning architecture that is resistant to performance degradation. Experimental results show that M3S-UPD not only outperforms existing methods on the few-shot encrypted traffic classification task, but also simultaneously achieves competitive performance on the zero-shot unknown traffic discovery task.
Problem

Research questions and friction points this paper is trying to address.

Accurate classification of known encrypted traffic applications
Reliable detection of unknown encrypted traffic patterns
Overcoming data scarcity and concept drift in real-world deployment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-stage self-supervised learning framework
Probabilistic embedding and clustering discovery
Continuous learning without synthetic samples
🔎 Similar Papers
No similar papers found.
Yali Yuan
Yali Yuan
University of Göttingen
Intelligent Internet of Thingsnetwork intrusion and attack detectionprivacy protectionnetwork localization and security
Y
Yu Huang
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China
X
Xingjian Zeng
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China
H
Hantao Mei
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China
G
Guang Cheng
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China