🤖 AI Summary
The exponential growth of network monitoring data has rendered conventional query discovery and pattern analysis costly and inefficient. To address this, we propose OFCnetLLM—a novel network monitoring system leveraging open-source large language models (LLMs) within a multi-agent collaborative framework—designed to overcome fundamental limitations of traditional rule-based and statistical approaches in anomaly interpretation, root-cause localization, and automated response generation. OFCnetLLM integrates generative AI for log parsing, semantic pattern recognition, and natural-language interactive diagnostics. It has been preliminarily deployed and validated on the production network of the Optical Fiber Communication (OFC) Conference. Experimental results demonstrate that OFCnetLLM significantly reduces operational labor costs while improving both anomaly detection accuracy and the timeliness of root-cause analysis. The system establishes a new paradigm for explainable, adaptive, and fully automated intelligent network operations and maintenance.
📝 Abstract
The rapid evolution of network infrastructure is bringing new challenges and opportunities for efficient network management, optimization, and security. With very large monitoring databases becoming expensive to explore, the use of AI and Generative AI can help reduce costs of managing these datasets. This paper explores the use of Large Language Models (LLMs) to revolutionize network monitoring management by addressing the limitations of query finding and pattern analysis. We leverage LLMs to enhance anomaly detection, automate root-cause analysis, and automate incident analysis to build a well-monitored network management team using AI. Through a real-world example of developing our own OFCNetLLM, based on the open-source LLM model, we demonstrate practical applications of OFCnetLLM in the OFC conference network. Our model is developed as a multi-agent approach and is still evolving, and we present early results here.