Archi: Agentic Operations at the CMS Experiment

📅 2026-06-03
📈 Citations: 0
Influential: 0
📄 PDF

career value

212K/year
🤖 AI Summary
This work addresses the challenges of integrating heterogeneous, multi-source data and enabling efficient query-based analysis in the computing operations of the CMS experiment. We propose and implement an end-to-end open-source framework that deploys, for the first time in high-energy physics, a private, scalable agent system. The framework unifies documentation, historical logs, and real-time monitoring data, leveraging locally executed open-source large language models within a configurable agent architecture to support fully on-premises processing of sensitive data and efficient retrieval-augmented inference. Since its deployment in February 2026, the system has operated stably in a production environment, effectively responding to operational queries and receiving positive feedback from operators. A hybrid evaluation methodology further demonstrates the competitiveness and practical utility of open-source models in domain-specific tasks.
📝 Abstract
We present Archi, an open-source, end-to-end framework for scientific collaborations that combines the systematic ingestion and organization of heterogeneous data sources with the deployment of configurable, private, and extensible agents that retrieve and reason over them. An instance of Archi has been deployed for the Computing Operations team of the CMS experiment at CERN's LHC since February 2026 as a support agent for technical operators, offering retrieval and analysis capabilities by combining documentation, historical data, and live monitoring systems. We evaluate the system on operator feedback and a question set collected from production usage, graded by human and automated panels. The system proves effective at operational tasks, resolving real-world queries posed by CMS operators. We also observe that locally-hosted, open-weight models perform competitively, enabling fully private management of sensitive data.
Problem

Research questions and friction points this paper is trying to address.

scientific collaborations
heterogeneous data
computing operations
information retrieval
operational support
Innovation

Methods, ideas, or system contributions that make the work stand out.

agentic operations
heterogeneous data integration
private LLM deployment
scientific collaboration framework
operational AI
P
Pietro Lugato
Massachusetts Institute of Technology, Cambridge, MA, USA; CMS Collaboration, CERN, Geneva, Switzerland
L
Luca Lavezzo
Massachusetts Institute of Technology, Cambridge, MA, USA; CMS Collaboration, CERN, Geneva, Switzerland
Jason Mohoney
Jason Mohoney
University of Wisconsin-Madison
SystemsDatabasesGraph LearningVector SearchIndexing
H
Hasan Ozturk
CMS Collaboration, CERN, Geneva, Switzerland
M
Muhammad Hassan Ahmed
CMS Collaboration, CERN, Geneva, Switzerland
J
Juan Pablo Salas
CMS Collaboration, CERN, Geneva, Switzerland; University of Wisconsin-Madison, Madison, WI, USA
V
Viphava Ohm
CMS Collaboration, CERN, Geneva, Switzerland; University of Wisconsin-Madison, Madison, WI, USA
K
Krittin Phornsiricharoenphant
CMS Collaboration, CERN, Geneva, Switzerland
G
Gabriele Benelli
CMS Collaboration, CERN, Geneva, Switzerland; Fermi National Accelerator Laboratory, Batavia, IL, USA; Brown University, Providence, RI, USA
M
Mariarosaria D'Alfonso
Massachusetts Institute of Technology, Cambridge, MA, USA; CMS Collaboration, CERN, Geneva, Switzerland
M
Manasvita Joshi
Harvard University, Cambridge, MA, USA
W
Warren Nam
Massachusetts Institute of Technology, Cambridge, MA, USA
A
Aron Soha
CMS Collaboration, CERN, Geneva, Switzerland; Fermi National Accelerator Laboratory, Batavia, IL, USA
S
Samantha Sunnarborg
CMS Collaboration, CERN, Geneva, Switzerland; Brown University, Providence, RI, USA
A
Austin Swinney
Harvard University, Cambridge, MA, USA
J
Jack Tucker
Massachusetts Institute of Technology, Cambridge, MA, USA
D
Dmytro Kovalskyi
Massachusetts Institute of Technology, Cambridge, MA, USA; CMS Collaboration, CERN, Geneva, Switzerland
Tim Kraska
Tim Kraska
MIT
Systems for MLML for Systems
C
Christoph Paus
Massachusetts Institute of Technology, Cambridge, MA, USA; CMS Collaboration, CERN, Geneva, Switzerland