Engineering Management and Systems Engineering Faculty Research & Creative Works

Beyond Exponential Utility Functions: A Variance-Adjusted Approach for Risk-Averse Reinforcement Learning

Abhijit Gosavi, Missouri University of Science and TechnologyFollow
Sajal K. Das, Missouri University of Science and TechnologyFollow
Susan L. Murray, Missouri University of Science and TechnologyFollow

Abstract

Utility theory has served as a bedrock for modeling risk in economics. Where risk is involved in decision-making, for solving Markov decision processes (MDPs) via utility theory, the exponential utility (EU) function has been used in the literature as an objective function for capturing risk-averse behavior. The EU function framework uses a so-called risk-averseness coefficient (RAC) that seeks to quantify the risk appetite of the decision-maker. Unfortunately, as we show in this paper, the EU framework suffers from computational deficiencies that prevent it from being useful in practice for solution methods based on reinforcement learning (RL). In particular, the value function becomes very large and typically the computer overflows. We provide a simple example to demonstrate this. Further, we show empirically how a variance-adjusted (VA) approach, which approximates the EU function objective for reasonable values of the RAC, can be used in the RL algorithm. The VA framework in a sense has two objectives: maximize expected returns and minimize variance. We conduct empirical studies on a VA-based RL algorithm on the semi-MDP (SMDP), which is a more general version of the MDP. We conclude with a mathematical proof of the boundedness of the iterates in our algorithm.

Recommended Citation

A. Gosavi et al., "Beyond Exponential Utility Functions: A Variance-Adjusted Approach for Risk-Averse Reinforcement Learning," Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (2014, Orlando, FL), Institute of Electrical and Electronics Engineers (IEEE), Dec 2014.

The definitive version is available at https://doi.org/10.1109/ADPRL.2014.7010645

Meeting Name

IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (2014: Dec. 9-12, Orlando, FL)

Department(s)

Engineering Management and Systems Engineering

Second Department

Computer Science

Third Department

Psychological Science

Keywords and Phrases

Algorithms; Behavioral research; Computation theory; Decision making; Decision theory; Dynamic programming; Markov processes; Risk analysis; Risks; Computational deficiency; Empirical studies; Exponential utility; Exponential utility function; Markov Decision Processes; Mathematical proof; Objective functions; Reasonable value; Reinforcement learning

International Standard Book Number (ISBN)

978-1479945535

International Standard Serial Number (ISSN)

2325-1824

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

12 Dec 2014

Link to Full Text

COinS

Engineering Management and Systems Engineering Faculty Research & Creative Works

Beyond Exponential Utility Functions: A Variance-Adjusted Approach for Risk-Averse Reinforcement Learning

Abstract

Recommended Citation

Meeting Name

Department(s)

Second Department

Third Department

Keywords and Phrases

International Standard Book Number (ISBN)

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Search

Browse

Faculty Gallery

Author Corner

Related Content

Useful Links

Article Locations

Engineering Management and Systems Engineering Faculty Research & Creative Works

Beyond Exponential Utility Functions: A Variance-Adjusted Approach for Risk-Averse Reinforcement Learning

Author

Abstract

Recommended Citation

Meeting Name

Department(s)

Second Department

Third Department

Keywords and Phrases

International Standard Book Number (ISBN)

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Share

Search

Browse

Faculty Gallery

Author Corner

Related Content

Useful Links

Article Locations