Abstract
The Unichain classification problem detects whether a finite state and action MDP is unichain under all deterministic policies. This problem is N P-hard. This paper provides polynomial algorithms for this problem when there is a state that is either recurrent under all deterministic policies or absorbing under some action.
| Original language | English |
|---|---|
| Pages (from-to) | 527-530 |
| Number of pages | 4 |
| Journal | Operations Research Letters |
| Volume | 36 |
| Issue number | 5 |
| DOIs | |
| State | Published - Sep 2008 |
Keywords
- Markov Decision Process
- Recurrent state
- Unichain condition
Fingerprint
Dive into the research topics of 'On polynomial cases of the unichain classification problem for Markov Decision Processes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver