Mutual Information Optimal Control of Discrete-Time Linear Systems

๐Ÿ“… 2025-07-07
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

275K/year
๐Ÿค– AI Summary
This paper addresses the mutual information optimal control problem (MIOCP) for discrete-time linear systems, aiming to jointly optimize both the control policy and the prior distributionโ€”thereby relaxing the restrictive assumption in maximum entropy optimal control (MEOCP) that the prior is fixed as uniform. Under Gaussian policy and Gaussian prior assumptions, we derive closed-form analytical solutions for both the policy and the prior via variational inference, and propose an alternating minimization algorithm with provable convergence. To the best of our knowledge, this is the first work to incorporate mutual information as the control objective within the discrete linear system framework, significantly enhancing policy expressiveness and improving the exploration-exploitation trade-off. Numerical experiments demonstrate superior control performance and robustness compared to baseline methods, establishing a novel information-theoretic paradigm for optimal control.

Technology Category

Application Category

๐Ÿ“ Abstract
In this paper, we formulate a mutual information optimal control problem (MIOCP) for discrete-time linear systems. This problem can be regarded as an extension of a maximum entropy optimal control problem (MEOCP). Differently from the MEOCP where the prior is fixed to the uniform distribution, the MIOCP optimizes the policy and prior simultaneously. As analytical results, under the policy and prior classes consisting of Gaussian distributions, we derive the optimal policy and prior of the MIOCP with the prior and policy fixed, respectively. Using the results, we propose an alternating minimization algorithm for the MIOCP. Through numerical experiments, we discuss how our proposed algorithm works.
Problem

Research questions and friction points this paper is trying to address.

Extends maximum entropy control to optimize policy and prior jointly
Derives optimal Gaussian policy and prior for mutual information control
Proposes alternating minimization algorithm for mutual information optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mutual information optimizes policy and prior
Gaussian distributions used for optimal solutions
Alternating minimization algorithm proposed
๐Ÿ”Ž Similar Papers
2024-10-04arXiv.orgCitations: 0