🤖 AI Summary
Traditional kernel-based One-Class SVM (OCSVM) suffers from high computational overhead, elevated false negative rates, and poor adaptability to distributional drift in single-pass non-stationary streaming data.
Method: We propose SONAR, a computationally efficient online OCSVM solver that leverages stochastic gradient descent (SGD) and strongly convex regularization, designed for lifelong learning. It integrates ensemble learning and changepoint detection to enhance robustness against both benign and adversarial non-stationarity.
Contribution/Results: SONAR establishes the first tight theoretical bounds on Type I and Type II errors for OCSVM-type methods. Its lifelong learning mechanism enables adaptive model evolution under evolving data distributions. Empirically, SONAR significantly reduces both false positive and false negative rates on synthetic and real-world benchmarks, while achieving a 10–100× speedup over kernel OCSVM.
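The summary above describes an SGD-based online OCSVM with strongly convex regularization. The paper's exact update rule is not reproduced here; the sketch below is only a minimal, hedged illustration of how such a one-pass solver can look for a *linear* one-class SVM with an ℓ2 (strongly convex) regularizer. The function name `sgd_ocsvm`, the hyperparameters `nu` and `lam`, and the 1/(λt) step-size schedule are illustrative assumptions, not SONAR itself.

```python
import numpy as np

rng = np.random.default_rng(0)

def sgd_ocsvm(stream, dim, nu=0.1, lam=0.1):
    """Illustrative one-pass SGD for a linear one-class SVM (not SONAR itself).

    Assumed objective, strongly convex in w via the (lam/2)*||w||^2 term:
        (lam/2)*||w||^2 - rho + (1/nu) * E[ max(0, rho - <w, x>) ]
    A point x is flagged as an outlier when <w, x> < rho.
    """
    w = np.zeros(dim)
    rho = 0.0
    for t, x in enumerate(stream, start=1):
        eta = 1.0 / (lam * t)            # standard step size for strongly convex SGD
        margin = rho - w @ x             # hinge is active when margin > 0
        g_w = lam * w - (x / nu if margin > 0 else 0.0)
        g_rho = -1.0 + (1.0 / nu if margin > 0 else 0.0)
        w -= eta * g_w
        rho -= eta * g_rho
    return w, rho

# Toy stream: inliers clustered around (2, 2); score new points afterwards.
inliers = rng.normal(loc=2.0, scale=0.3, size=(2000, 2))
w, rho = sgd_ocsvm(inliers, dim=2)
is_outlier = lambda x: float(w @ x) < rho
```

At equilibrium the hinge is active on roughly a `nu` fraction of points, so `rho` settles near the `nu`-quantile of the scores `w @ x`, which is what makes the false-positive rate controllable.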
📄 Abstract
We study outlier (a.k.a. anomaly) detection for single-pass non-stationary streaming data. In the well-studied offline or batch outlier detection problem, traditional methods such as kernel One-Class SVM (OCSVM) are both computationally heavy and prone to large false-negative (Type II) errors under non-stationarity. To remedy this, we introduce SONAR, an efficient SGD-based OCSVM solver with strongly convex regularization. We establish novel theoretical guarantees on the Type I/II errors of SONAR, superior to those known for OCSVM, and further prove that SONAR enjoys favorable lifelong learning guarantees under benign distribution shifts. In the more challenging setting of adversarially non-stationary data, we show that SONAR can be used within an ensemble method, equipped with changepoint detection, to achieve adaptive guarantees that ensure small Type I/II errors on each phase of the data. We validate our theoretical findings on synthetic and real-world datasets.
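The abstract couples SONAR with an ensemble and changepoint detection to handle adversarial non-stationarity. As a hedged illustration of the changepoint-detection ingredient only (the class name, window statistic, and alarm threshold below are assumptions for illustration, not the paper's test), one simple detector raises an alarm when the recent outlier rate drifts far above the target rate ν:

```python
from collections import deque

class OutlierRateMonitor:
    """Hypothetical sliding-window changepoint detector (not the paper's test).

    Under a stationary distribution the base learner should flag roughly a
    nu fraction of points; a sustained, much higher flag rate suggests the
    data distribution has shifted and the model should be reset or replaced.
    """

    def __init__(self, nu=0.1, window=100, factor=3.0):
        self.nu = nu                       # expected outlier fraction
        self.factor = factor               # alarm when rate > factor * nu
        self.flags = deque(maxlen=window)  # recent outlier decisions

    def update(self, flagged):
        """Record one outlier decision; return True if a changepoint is suspected."""
        self.flags.append(bool(flagged))
        if len(self.flags) < self.flags.maxlen:
            return False                   # wait until the window fills
        rate = sum(self.flags) / len(self.flags)
        return rate > self.factor * self.nu
```

In an ensemble, such an alarm could trigger spawning a fresh learner on post-change data while older learners are down-weighted or retired, which is one generic way to obtain per-phase error control.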