Abstract
Solving infinite time optimal control problems in an approximate dynamic programing framework with two-network structure has become popular in recent years. In this chapter, an alternative to the two-network structure is provided. We develop single network adaptive critics (SNAC) which eliminate the need to have a separate action network to output control. Two versions of SNAC are presented. The first version, called SNAC outputs the costates and the second version called J-SNAC outputs the cost function values. Special structure called Finite-SNAC to efficiently solve finite time problems is also presented. Illustrative infinite time and finite time problems are considered; numerical results clearly demonstrate the potential of the single network structures to solve optimal control problems. © 2013 The Institute of Electrical and Electronics Engineers, Inc.
Recommended Citation
J. Ding et al., "Single Network Adaptive Critics Networks-Development, Analysis, and Applications," Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, pp. 98 - 118, Wiley, Feb 2013.
The definitive version is available at https://doi.org/10.1002/9781118453988.ch5
Department(s)
Mechanical and Aerospace Engineering
Publication Status
Full Access
Keywords and Phrases
AC designs, HDP/DHP architectures; ADP, formulation with AC neural; Finite-SNAC, finite horizon of affine/DHP; SNAC networks, and applications; SNAC, separate action network elimination
International Standard Book Number (ISBN)
978-111810420-0
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2024 Wiley, All rights reserved.
Publication Date
07 Feb 2013