Abstract
In this paper, stochastic optimal strategy for unknown linear networked control system (NCS) quadratic zero-sum games related to H∞ optimal control in the presence of random delays and packet losses is solved in forward-in-time manner. This approach does not require the knowledge of the system matrices since it uses Q-learning. the proposed stochastic optimal control approach, referred as adaptive dynamic programming (ADP), involves solving the action dependent Q-function Q (z, u, d) of the zero-sum game instead of solving the state dependent value function J (z) which satisfies a corresponding Game Theoretic Riccati equation (GRE). an adaptive estimator (AE) is proposed to learn the Q-function online and value and policy iterations are not needed unlike in traditional ADP schemes. Update laws for tuning the unknown parameters of adaptive estimator (AE) are derived. Lyapunov theory is used to show that all signals are asymptotic stable (AS) and that the approximated control and disturbance signals converge to optimal control and disturbance inputs. Simulation results are included to show the effectiveness of the scheme. © 2011 IEEE.
Recommended Citation
H. Xu and S. Jagannathan, "Model-free H∞ Stochastic Optimal Design for Unknown Linear Networked Control System Zero-sum Games Via Q-learning," IEEE International Symposium on Intelligent Control - Proceedings, pp. 198 - 203, article no. 6045415, Institute of Electrical and Electronics Engineers, Nov 2011.
The definitive version is available at https://doi.org/10.1109/ISIC.2011.6045415
Department(s)
Electrical and Computer Engineering
Second Department
Computer Science
International Standard Book Number (ISBN)
978-145771104-6
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2024 Institute of Electrical and Electronics Engineers, All rights reserved.
Publication Date
07 Nov 2011