M3S-UPD: Efficient Multi-Stage Self-Supervised Learning for Fine-Grained Encrypted Traffic Classification with Unknown Pattern Discovery

📅 2025-05-27

📈 Citations: 0

✨ Influential: 0

career value

238K/year

🤖 AI Summary

Encrypted traffic analysis faces dual challenges: fine-grained classification of known applications and detection of previously unseen traffic patterns. Method: This paper proposes a four-stage self-supervised collaborative framework that jointly models classification and anomaly detection. It introduces the first self-supervised unknown-pattern detection mechanism—requiring neither synthetic samples nor prior knowledge—and designs a concept-drift-resilient continual learning architecture. The architecture integrates multi-stage self-supervised learning, probabilistic embedding generation, clustering-driven structural discovery, distribution-alignment-based anomaly identification, and confidence-aware model updating. Contribution/Results: The framework achieves state-of-the-art performance on few-shot classification and zero-shot unknown traffic discovery. It delivers a deployable, robust, and scalable solution for real-world network management, significantly outperforming existing methods in both accuracy and adaptability under dynamic network conditions.

Technology Category

Application Category

📝 Abstract

The growing complexity of encrypted network traffic presents dual challenges for modern network management: accurate multiclass classification of known applications and reliable detection of unknown traffic patterns. Although deep learning models show promise in controlled environments, their real-world deployment is hindered by data scarcity, concept drift, and operational constraints. This paper proposes M3S-UPD, a novel Multi-Stage Self-Supervised Unknown-aware Packet Detection framework that synergistically integrates semi-supervised learning with representation analysis. Our approach eliminates artificial segregation between classification and detection tasks through a four-phase iterative process: 1) probabilistic embedding generation, 2) clustering-based structure discovery, 3) distribution-aligned outlier identification, and 4) confidence-aware model updating. Key innovations include a self-supervised unknown detection mechanism that requires neither synthetic samples nor prior knowledge, and a continuous learning architecture that is resistant to performance degradation. Experimental results show that M3S-UPD not only outperforms existing methods on the few-shot encrypted traffic classification task, but also simultaneously achieves competitive performance on the zero-shot unknown traffic discovery task.

Problem

Research questions and friction points this paper is trying to address.

Accurate classification of known encrypted traffic applications

Reliable detection of unknown encrypted traffic patterns

Overcoming data scarcity and concept drift in real-world deployment

Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-stage self-supervised learning framework

Probabilistic embedding and clustering discovery

Continuous learning without synthetic samples

🔎 Similar Papers

No similar papers found.