Abstract
We consider a Markov decision process with an uncountable state space for which the vector performance functional has the form of expected total rewards. Under the single condition that the fixed initial distribution and transition probabilities are nonatomic, we prove that for any policy there is a nonrandomized Markov policy such that for these two policies the performance vectors are equal.
| Original language | English |
|---|---|
| Pages (from-to) | 723-728 |
| Number of pages | 6 |
| Journal | Proceedings of the IEEE Conference on Decision and Control |
| Volume | 1 |
| DOIs | |
| State | Published - 2000 |
Fingerprint
Dive into the research topics of 'Nonatomic total rewards Markov decision processes with multiple criteria'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver