Quantile Markov Decision Processes
The goal of a traditional Markov decision process (MDP) is to maximize expected cumulative reward over a defined horizon (possibly infinite). In many applications, however, a decision maker may be interested in optimizing a specific quantile of the cumulative reward instead of its expectation. In th...
Saved in:
| Published in: | Operations research Vol. 70; no. 3; p. 1428 |
|---|---|
| Main Authors: | , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
United States
01.05.2022
|
| Subjects: | |
| ISSN: | 0030-364X |
| Online Access: | Get more information |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!