Exploring the Potential of Carbon-Aware Execution for Scientific Workflows

📅 2025-03-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Scientific workflows are energy-intensive and carbon-heavy, yet their delay tolerance, interruptibility, and scalability present unique opportunities for carbon-aware computing. This paper proposes a carbon-aware execution framework for scientific workflows, integrating time-aware scheduling, dynamic pause/resume, and elastic resource scaling. Leveraging a linear power model and real-world regional marginal carbon intensity (MCI) data—not average grid intensity—the framework enables fine-grained temporal carbon optimization within the Nextflow engine. It constitutes the first systematic evaluation of carbon-aware strategies for scientific workflows, quantifying their emission-reduction potential and identifying time-shifting and elastic scaling as key levers for carbon footprint reduction. Empirical evaluation across two geographically distinct regions demonstrates significant reductions in both end-to-end workflow and per-task carbon emissions, validating the effectiveness and generalizability of MCI-driven scheduling.

Technology Category

Application Category

📝 Abstract
Scientific workflows are widely used to automate scientific data analysis and often involve processing large quantities of data on compute clusters. As such, their execution tends to be long-running and resource intensive, leading to significant energy consumption and carbon emissions. Meanwhile, a wealth of carbon-aware computing methods have been proposed, yet little work has focused specifically on scientific workflows, even though they present a substantial opportunity for carbon-aware computing because they are inherently delay tolerant, efficiently interruptible, and highly scalable. In this study, we demonstrate the potential for carbon-aware workflow execution. For this, we estimate the carbon footprint of two real-world Nextflow workflows executed on cluster infrastructure. We use a linear power model for energy consumption estimates and real-world average and marginal CI data for two regions. We evaluate the impact of carbon-aware temporal shifting, pausing and resuming, and resource scaling. Our findings highlight significant potential for reducing emissions of workflows and workflow tasks.
Problem

Research questions and friction points this paper is trying to address.

Reducing carbon emissions in scientific workflows
Optimizing energy consumption in data-intensive workflows
Implementing carbon-aware methods for workflow execution
Innovation

Methods, ideas, or system contributions that make the work stand out.

Carbon-aware temporal shifting reduces emissions.
Pausing and resuming workflows cuts carbon footprint.
Resource scaling optimizes energy use efficiently.
🔎 Similar Papers
No similar papers found.
K
Kathleen West
University of Glasgow, United Kingdom
Fabian Lehmann
Fabian Lehmann
Ph.D. candidate, Humboldt-Universität zu Berlin
adaptive scheduling of large workflows
V
Vasilis Bountris
Humboldt-Universität zu Berlin, Germany
U
Ulf Leser
Humboldt-Universität zu Berlin, Germany
Yehia Elkhatib
Yehia Elkhatib
University of Glasgow, UK
Systems of SystemsCloud ManagementIntent Based NetworkingEdge ComputingQoE
L
L. Thamsen
University of Glasgow, United Kingdom