AI4EOSC: a Federated Cloud Platform for Artificial Intelligence in Scientific Research

📅 2025-12-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address challenges in AI-driven scientific research—including cross-institutional collaboration barriers, platform heterogeneity, and model irreproducibility—this paper proposes the first domain-specific cross-domain federated cloud platform architecture for scientific research. The architecture unifies interactive development (Jupyter/VS Code), GPU-accelerated training (with annotation support and MLflow experiment tracking), federated learning (FATE), lightweight inference (Triton), and cloud-edge-device coordinated deployment. It embeds a GAIA-X–compliant identity and data governance framework to ensure end-to-end traceability and reproducibility across the research lifecycle. Orchestrated via Kubernetes Federation, the platform has been integrated with 12 European e-Infrastructures and supports over 200 scientific projects. Empirical evaluation demonstrates a 76% reduction in model reproduction time and a 41% decrease in communication overhead during cross-institutional federated training.

Technology Category

Application Category

📝 Abstract
In this paper, we describe a federated compute platform dedicated to support Artificial Intelligence in scientific workloads. Putting the effort into reproducible deployments, it delivers consistent, transparent access to a federation of physically distributed e-Infrastructures. Through a comprehensive service catalogue, the platform is able to offer an integrated user experience covering the full Machine Learning lifecycle, including model development (with dedicated interactive development environments), training (with GPU resources, annotation tools, experiment tracking, and federated learning support) and deployment (covering a wide range of deployment options all along the Cloud Continuum). The platform also provides tools for traceability and reproducibility of AI models, integrates with different Artificial Intelligence model providers, datasets and storage resources, allowing users to interact with the broader Machine Learning ecosystem. Finally, it is easily customizable to lower the adoption barrier by external communities.
Problem

Research questions and friction points this paper is trying to address.

Federated cloud platform for AI in scientific research
Provides full ML lifecycle support with reproducibility
Customizable to lower adoption barriers for communities
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated cloud platform for distributed AI infrastructure
Integrated service catalog covering full ML lifecycle
Customizable tools for reproducibility and ecosystem integration
🔎 Similar Papers
No similar papers found.
Ignacio Heredia
Ignacio Heredia
Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n, Santander, 39006, Cantabria, Spain
Á
Álvaro López García
Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n, Santander, 39006, Cantabria, Spain
Germán Moltó
Germán Moltó
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain
A
Amanda Calatrava
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain
Valentin Kozlov
Valentin Kozlov
Karlsruher Institut für Technologie, Kaiserstraße 12, Karlsruhe, 76131, Germany
A
Alessandro Costantini
Istituto Nazionale di Fisica Nucleare (INFN), Via Enrico Fermi 40, Frascati, 00044, Roma, Italy
Viet Tran
Viet Tran
Institute of Informatics, Slovak Academy of Sciences
M
Mario David
Laboratório de Instrumentação e Física Experimental de Partículas, Av. Prof. Gama Pinto 2, Lisboa, 1649-003, Portugal
D
Daniel San Martín
Predictia Intelligent Data Solutions, Fernando de los Ríos 48, Santander, 39006, Cantabria, Spain
M
Marcin Płóciennik
Poznańskie Centrum Superkomputerowo Sieciowe, Jana Pawła II 10, Poznan, 61-139, Poland
M
Marta Obregón Ruiz
Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n, Santander, 39006, Cantabria, Spain
S
Saúl Fernandez
Instituto de Física de Cantabria (IFCA), CSIC-UC, Avda. los Castros s/n, Santander, 39006, Cantabria, Spain
Judith Sáinz-Pardo Díaz
Judith Sáinz-Pardo Díaz
Researcher, Instituto de Física de Cantabria (IFCA) CSIC-UC
data sciencemachine learningprivacyfederated learningdeep learning
Miguel Caballer
Miguel Caballer
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain
C
Caterina Alarcón Marín
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain
Stefan Dlugolinsky
Stefan Dlugolinsky
Institute of Informatics, Slovak Academy of Sciences (IISAS), Dúbravská cesta 9, Bratislava, 84507, Slovakia
Martin Šeleng
Martin Šeleng
Institute of Informatics, Slovak Academy of Sciences (IISAS), Dúbravská cesta 9, Bratislava, 84507, Slovakia
Lisana Berberi
Lisana Berberi
Karlsruher Institut für Technologie, Kaiserstraße 12, Karlsruhe, 76131, Germany
K
Khadijeh Alibabaei
Karlsruher Institut für Technologie, Kaiserstraße 12, Karlsruhe, 76131, Germany
B
Borja Esteban Sanchis
Karlsruher Institut für Technologie, Kaiserstraße 12, Karlsruhe, 76131, Germany
P
Pedro Castro
Predictia Intelligent Data Solutions, Fernando de los Ríos 48, Santander, 39006, Cantabria, Spain
G
Giacinto Donvito
Istituto Nazionale di Fisica Nucleare (INFN), Via Enrico Fermi 40, Frascati, 00044, Roma, Italy
D
Diego Aguirre
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain
S
Sergio Langarita
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain
V
Vicente Rodriguez
Instituto de Instrumentación para Imagen Molecular (I3M), Centro Mixto CSIC - Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia, 46022, Valencia, Spain