Convergence of Critic-based Training
This paper discusses convergence issues that arise when training adaptive critic designs (ACDs) to control dynamic systems expressed as Markov sequences. We critically review two published convergence results for critic-based training and propose shifting the emphasis toward more practically valuable convergence proofs. We also show one possible way to prove convergence of ACD training.
D. V. Prokhorov and D. C. Wunsch, "Convergence of Critic-based Training," Systems, Man, and Cybernetics, 1997. IEEE International Conference on Computational Cybernetics and Simulation, vol. 4, pp. 3057-3060, Institute of Electrical and Electronics Engineers (IEEE), Jan 1997.
The definitive version is available at https://doi.org/10.1109/ICSMC.1997.633056
IEEE International Conference on Computational Cybernetics and Simulation: Systems, Man and Cybernetics (1997: Oct. 12-15, Orlando, FL)
Electrical and Computer Engineering
Article - Conference proceedings
© 1997 Institute of Electrical and Electronics Engineers (IEEE), All rights reserved.