Mutual Information Optimal Control of Discrete-Time Linear Systems

📅 2025-07-07
📈 Citations: 0
Influential: 0
🤖 AI Summary
This paper formulates the mutual information optimal control problem (MIOCP) for discrete-time linear systems, in which the control policy and the prior distribution are optimized jointly. This relaxes the assumption in maximum entropy optimal control (MEOCP) that the prior is fixed to the uniform distribution. With both the policy and the prior restricted to Gaussian classes, the authors derive the optimal policy for a fixed prior and the optimal prior for a fixed policy, and combine the two results into an alternating minimization algorithm. Numerical experiments illustrate how the algorithm behaves.

📝 Abstract
In this paper, we formulate a mutual information optimal control problem (MIOCP) for discrete-time linear systems. This problem can be regarded as an extension of the maximum entropy optimal control problem (MEOCP). Unlike the MEOCP, in which the prior is fixed to the uniform distribution, the MIOCP optimizes the policy and the prior simultaneously. Restricting the policy and prior classes to Gaussian distributions, we derive the optimal policy of the MIOCP with the prior fixed and the optimal prior with the policy fixed. Using these results, we propose an alternating minimization algorithm for the MIOCP. Through numerical experiments, we discuss how the proposed algorithm works.
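The alternating structure described in the abstract can be illustrated on a much simpler problem. The sketch below is not the paper's algorithm: it assumes a single-step problem with finitely many states and actions rather than a discrete-time linear system with Gaussian classes, and the names `cost`, `p_x`, `eps`, `objective`, and `alternating_minimization` are hypothetical. It alternates between the optimal policy for a fixed prior (a softmax reweighting of the prior) and the optimal prior for a fixed policy (the action marginal), in the style of the Blahut-Arimoto iteration.

```python
import numpy as np


def objective(cost, p_x, pi, rho, eps):
    """Expected cost plus eps times the expected KL(pi(.|x) || rho)."""
    tiny = 1e-300  # guards log(0) if a probability underflows
    kl = np.sum(pi * (np.log(pi + tiny) - np.log(rho + tiny)), axis=1)
    return float(p_x @ (np.sum(pi * cost, axis=1) + eps * kl))


def alternating_minimization(cost, p_x, eps=1.0, iters=200, tol=1e-10):
    """Alternate policy and prior updates on a finite one-step problem.

    cost: (n_states, n_actions) array of stage costs (illustrative).
    p_x:  (n_states,) state distribution.
    """
    n_u = cost.shape[1]
    rho = np.full(n_u, 1.0 / n_u)  # start from the uniform prior
    history = []
    for _ in range(iters):
        # Policy step (prior fixed): pi(u|x) proportional to rho(u) * exp(-cost/eps)
        logits = np.log(rho + 1e-300)[None, :] - cost / eps
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        pi = np.exp(logits)
        pi /= pi.sum(axis=1, keepdims=True)
        # Prior step (policy fixed): rho becomes the action marginal under pi
        rho = p_x @ pi
        history.append(objective(cost, p_x, pi, rho, eps))
        if len(history) > 1 and history[-2] - history[-1] < tol:
            break
    return pi, rho, history


# Small made-up example: three states, three actions
cost = np.array([[0.0, 1.0, 4.0],
                 [1.0, 0.0, 1.0],
                 [4.0, 1.0, 0.0]])
p_x = np.array([0.5, 0.3, 0.2])
pi, rho, history = alternating_minimization(cost, p_x, eps=0.5)
```

Because each step minimizes the same objective over one argument with the other held fixed, the recorded objective values in `history` never increase. The first policy step, taken with the uniform prior, corresponds to the maximum entropy solution in this toy setting.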
Problem

Research questions and friction points this paper is trying to address.

Extends maximum entropy control to optimize policy and prior jointly
Derives optimal Gaussian policy and prior for mutual information control
Proposes alternating minimization algorithm for mutual information optimization
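
As a rough orientation, the relationship between the two problems can be written out for a generic KL-regularized objective. The notation here (stage cost c_k, weight ε, policy π_k, prior ρ_k, horizon T) is assumed for illustration and is not taken from the paper. The paper restricts both the policy and the prior to Gaussian classes, so its exact expressions may differ from this unrestricted form.

```latex
% Assumed generic form of a KL-regularized objective (notation is illustrative)
J(\pi,\rho) = \mathbb{E}\Bigl[\sum_{k=0}^{T-1}\bigl(c_k(x_k,u_k)
  + \varepsilon\, D_{\mathrm{KL}}\bigl(\pi_k(\cdot\mid x_k)\,\|\,\rho_k\bigr)\bigr)\Bigr]

% Prior fixed to a uniform density on a set of finite volume V
% (maximum entropy case, up to an additive constant):
D_{\mathrm{KL}}\bigl(\pi_k(\cdot\mid x_k)\,\|\,\rho_k\bigr)
  = -\mathcal{H}\bigl(\pi_k(\cdot\mid x_k)\bigr) + \log V

% Prior optimized freely for a fixed policy (mutual information case):
\min_{\rho_k}\ \mathbb{E}_{x_k}\bigl[D_{\mathrm{KL}}\bigl(\pi_k(\cdot\mid x_k)\,\|\,\rho_k\bigr)\bigr]
  = I(x_k;u_k), \qquad \rho_k^\star(u) = \mathbb{E}_{x_k}\bigl[\pi_k(u\mid x_k)\bigr]
```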
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mutual information objective optimized jointly over the policy and the prior
Optimal policy and prior derived within Gaussian distribution classes
Alternating minimization algorithm proposed
Shoju Enami
Graduate School of Informatics, Kyoto University, Kyoto, Japan
Kenji Kashima
Associate Professor, Kyoto University
Control theory, Machine learning