Domain-driven Metrics for Reinforcement Learning: A Case Study on Epidemic Control using Agent-based Simulation

📅 2025-08-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Evaluating reinforcement learning (RL) policies for optimizing agent-based epidemiological models (ABMs/RABMs) remains challenging due to system complexity, stochasticity, and the absence of domain-informed evaluation metrics aligned with public health objectives. Method: This paper proposes a domain-driven, integrated evaluation framework that translates key epidemiological goals—such as mask-wearing adherence, vaccination coverage, and lockdown intensity—into quantifiable metrics. These are jointly modeled with conventional RL metrics (e.g., cumulative reward, convergence speed) and augmented with assessments of dynamic responsiveness under resource constraints (e.g., fluctuating mask supply). Contribution/Results: Experiments across diverse epidemic scenarios demonstrate that the proposed metrics significantly improve policy discriminability and enhance alignment between RL-driven decisions and real-world public health outcomes. The framework establishes the first interpretable, verifiable, and epidemiology-semantics-aware evaluation paradigm for RL-optimized RABMs.

Technology Category

Application Category

📝 Abstract
For the development and optimization of agent-based models (ABMs) and rational agent-based models (RABMs), optimization algorithms such as reinforcement learning are extensively used. However, assessing the performance of RL-based ABMs and RABMS models is challenging due to the complexity and stochasticity of the modeled systems, and the lack of well-standardized metrics for comparing RL algorithms. In this study, we are developing domain-driven metrics for RL, while building on state-of-the-art metrics. We demonstrate our ``Domain-driven-RL-metrics'' using policy optimization on a rational ABM disease modeling case study to model masking behavior, vaccination, and lockdown in a pandemic. Our results show the use of domain-driven rewards in conjunction with traditional and state-of-the-art metrics for a few different simulation scenarios such as the differential availability of masks.
Problem

Research questions and friction points this paper is trying to address.

Developing domain-driven metrics for RL performance assessment
Optimizing agent-based epidemic control models using RL
Evaluating masking, vaccination, lockdown policies with RL metrics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Domain-driven metrics for reinforcement learning
Agent-based simulation for epidemic control
Policy optimization with domain-driven rewards
🔎 Similar Papers
No similar papers found.
R
Rishabh Gaur
Thoughtworks Technologies, Pune, INDIA
G
Gaurav Deshkar
Thoughtworks Technologies, Pune, INDIA
J
Jayanta Kshirsagar
Thoughtworks Technologies, Pune, INDIA
H
Harshal Hayatnagarkar
Thoughtworks Technologies, Pune, INDIA
Janani Venugopalan
Janani Venugopalan
Thoughtworks
Data MiningMachine LearningDeep Learning