Convergence of Critic-based Training
Abstract
The paper discusses convergence issues when training adaptive critic designs (ACD) to control dynamic systems expressed as Markov sequences. We critically review two published convergence results of critic based training and propose to shift emphasis towards more practically valuable convergence proofs. We show a possible way to prove convergence of ACD training.
Recommended Citation
D. V. Prokhorov and D. C. Wunsch, "Convergence of Critic-based Training," Systems, Man, and Cybernetics, 1997. IEEE International Conference on Computational Cybernetics and Simulation, vol. 4, pp. 3057 - 3060, Institute of Electrical and Electronics Engineers (IEEE), Jan 1997.
The definitive version is available at https://doi.org/10.1109/ICSMC.1997.633056
Meeting Name
IEEE International Conference on Computational Cybernetics and Sumulation: Systems, Man and Cybernetics (1997: Oct. 12-15, Orlando, FL)
Department(s)
Electrical and Computer Engineering
International Standard Book Number (ISBN)
0000780340531
International Standard Serial Number (ISSN)
1062-922X
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 1997 Institute of Electrical and Electronics Engineers (IEEE), All rights reserved.
Publication Date
01 Jan 1997