Target-sensitive Control of Markov and Semi-markov Processes

Abstract

We develop the theory for Markov and semi-Markov control using dynamic programming and reinforcement learning in which a form of semi-variance which computes the variability of rewards below a pre-specified target is penalized. the objective is to optimize a function of the rewards and risk where risk is penalized. Penalizing variance, which is popular in the literature, has some drawbacks that can be avoided with semi-variance. © ICROS, KIEE and Springer 2011.

Department(s)

Engineering Management and Systems Engineering

Comments

National Science Foundation, Grant 0841055

Keywords and Phrases

Relative value iteration; Semi-Markov control; Semi-variance; Stochastic shortest path problem; Target-sensitive

International Standard Serial Number (ISSN)

2005-4092; 1598-6446

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2024 Springer, All rights reserved.

Publication Date

01 Oct 2011

Share

 
COinS