🤖 AI Summary
This paper addresses the joint optimization of communication and control in discrete-time stochastic linear systems, focusing on coordinated decision-making between a scheduler (which decides when to communicate) and a controller (which designs inputs from intermittently received observations) under a partially nested information structure. The paper proves that the optimal controller in this setting admits a certainty-equivalence form. Leveraging this insight, the authors propose InterQ, a deep Q-network (DQN)-based learning framework for intermittent scheduling, in which the scheduling problem is formulated as a partially observable Markov decision process (POMDP). Experiments demonstrate that InterQ significantly outperforms periodic and event-triggered baselines in balancing control performance against communication cost, while generalizing well across system parameters and noise statistics. The implementation is publicly available.
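To illustrate the certainty-equivalence structure described above, the following minimal sketch (a scalar system with assumed parameters, not the paper's implementation) propagates the controller's estimate through the system model and resets it whenever the scheduler transmits; a simple error-threshold rule stands in for the learned scheduler:

```python
# Illustrative sketch (not the paper's implementation): one closed-loop run of a
# scalar discrete-time stochastic linear system x_{k+1} = a x_k + b u_k + w_k.
# The scheduler below uses a placeholder error threshold; InterQ would replace
# it with a learned DQN policy. All names and values are assumptions.
import random

a, b = 1.2, 1.0           # open-loop unstable scalar system (assumed values)
gain = (a - 0.5) / b      # any stabilizing feedback gain: a - b*gain = 0.5
noise_std = 0.1
threshold = 0.5           # transmit when the estimation error grows too large

x = 1.0                   # true state (always visible to the scheduler)
x_hat = 0.0               # controller's estimate from intermittent receptions
comms = 0
random.seed(0)

for k in range(50):
    # Scheduler: sees x, and can replicate x_hat because its information
    # contains everything the controller knows (nested information)
    if abs(x - x_hat) > threshold:
        x_hat = x         # transmit: the controller's estimate resets to x
        comms += 1
    # Controller: certainty-equivalent input, acting on the estimate
    u = -gain * x_hat
    # Plant and estimator propagate through the same model; only the plant
    # is driven by the process noise w_k
    w = random.gauss(0.0, noise_std)
    x = a * x + b * u + w
    x_hat = a * x_hat + b * u

print(f"final |x| = {abs(x):.3f}, transmissions = {comms}/50")
```

The nesting of information is what makes this work: since the scheduler can reconstruct the controller's estimate exactly, it can trigger a transmission precisely when the estimation error warrants one.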
📝 Abstract
In this letter, we explore the communication-control co-design of discrete-time stochastic linear systems through reinforcement learning. Specifically, we examine a closed-loop system involving two sequential decision-makers: a scheduler and a controller. The scheduler continuously monitors the system's state but transmits it to the controller only intermittently, balancing communication cost against control performance. The controller, in turn, computes the control input based on the intermittently received information. Given the partially nested information structure, we show that the optimal control policy takes a certainty-equivalence form. We then analyze the qualitative behavior of the scheduling policy. To obtain the optimal scheduling policy, we propose InterQ, a deep reinforcement learning algorithm that uses a deep neural network to approximate the Q-function. Through extensive numerical evaluations, we analyze the scheduling landscape and compare our approach against two baseline strategies: (a) a multi-period periodic scheduling policy, and (b) an event-triggered policy. The results demonstrate that our proposed method outperforms both baselines. The open-source implementation is available at https://github.com/AC-sh/InterQ.
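To make the scheduling formulation concrete, the sketch below casts the transmit/stay-silent decision as a reinforcement-learning problem, using a tabular Q-learning stand-in for the paper's DQN on a discretized estimation error; the reward trade-off, discretization, and all parameters are assumptions for illustration only:

```python
# Tabular Q-learning stand-in for a DQN scheduler (assumed setup, not the
# paper's implementation). The scheduler's observation is the estimation error
# |x - x_hat|, discretized into bins; actions: 0 = stay silent, 1 = transmit.
# Reward trades off quadratic state cost against a per-transmission price.
import random

a, b = 1.2, 1.0
gain = (a - 0.5) / b        # certainty-equivalent feedback gain (assumed)
comm_price = 0.2            # price per transmission (assumed trade-off weight)
bins, max_err = 10, 2.0

def bucket(err):
    """Map an estimation error to a discrete observation bin."""
    return min(int(abs(err) / max_err * bins), bins - 1)

Q = [[0.0, 0.0] for _ in range(bins)]
alpha, gamma, eps = 0.1, 0.95, 0.1   # learning rate, discount, exploration
random.seed(0)

for episode in range(500):
    x, x_hat = random.gauss(0.0, 1.0), 0.0
    for k in range(30):
        s = bucket(x - x_hat)
        # epsilon-greedy action selection over {silent, transmit}
        act = random.randrange(2) if random.random() < eps else int(Q[s][1] > Q[s][0])
        if act:
            x_hat = x                 # transmission resets the estimate
        u = -gain * x_hat             # certainty-equivalent control input
        w = random.gauss(0.0, 0.1)
        x = a * x + b * u + w
        x_hat = a * x_hat + b * u
        r = -(x * x + comm_price * act)
        s2 = bucket(x - x_hat)
        # standard one-step Q-learning update
        Q[s][act] += alpha * (r + gamma * max(Q[s2]) - Q[s][act])

# Learned policy over error bins: 1 = transmit, 0 = stay silent
policy = [int(Q[s][1] > Q[s][0]) for s in range(bins)]
print(policy)
```

InterQ replaces the table with a deep neural network over a continuous belief-like input, but the ingredients (intermittent transmission as the action, a control-cost-plus-communication-price reward) are the same shape as this toy version.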