A distributed asynchronous algorithm for expected average cost dynamic programming
| Published in: | 29th IEEE Conference on Decision and Control, vol. 3, pp. 1394-1395 |
|---|---|
| Format: | Conference Proceeding |
| Language: | English |
| Published: | IEEE, 1990 |
| Summary: | A distributed asynchronous implementation of the value-iteration algorithm in dynamic programming is presented. The iteration step is carried out by a number of processors, each iterating on a subset of the value function vector. Each processor transmits its computed coordinates to other processors. The algorithm converges when the different processors iterate at different speeds. The information received by a processor regarding other coordinates may be outdated, and there may be an unpredictable delay in receiving information from other processors. |
|---|---|
| DOI: | 10.1109/CDC.1990.203839 |
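
The pattern the summary describes (processors that each own a subset of coordinates, run at different speeds, and read possibly outdated values from the others) can be illustrated with a small single-process simulation. This is a minimal sketch under stated assumptions, not the algorithm from the paper, whose details this record does not give. It uses a relative value iteration update for the average cost criterion, and every name in it (`bellman`, `async_relative_vi`, `toy_problem`, `owners`, `speeds`, `max_delay`) is hypothetical. The toy problem is deliberately chosen so that all transition rows share a large common component, which makes the update a sup-norm contraction; the sketch makes no claim about convergence for general problems.

```python
import random


def bellman(i, h, cost, P):
    """Backup for state i: min over actions a of cost[a][i] + sum_j P[a][i][j] * h[j]."""
    return min(
        cost[a][i] + sum(p * hj for p, hj in zip(P[a][i], h))
        for a in range(len(P))
    )


def async_relative_vi(cost, P, owners, speeds, max_delay=5, ticks=2000, ref=0, seed=0):
    """Simulate processors updating their own coordinates from stale views.

    owners[k] is the set of states processor k iterates on; speeds[k] is the
    probability that processor k performs an update in a given tick.
    (Hypothetical interface, for illustration only.)
    """
    rng = random.Random(seed)
    n = len(cost[0])
    h = [0.0] * n
    history = [list(h)]  # history[-1 - d] is the global vector d ticks ago
    for _ in range(ticks):
        new_h = list(h)
        for k, states in enumerate(owners):
            if rng.random() > speeds[k]:
                continue  # slow processor: no update this tick
            # Own coordinates are current; others arrive with a random delay.
            view = []
            for j in range(n):
                if j in states:
                    view.append(h[j])
                else:
                    d = rng.randint(0, min(max_delay, len(history) - 1))
                    view.append(history[-1 - d][j])
            offset = bellman(ref, view, cost, P)  # normalization at the reference state
            for i in states:
                new_h[i] = bellman(i, view, cost, P) - offset
        h = new_h
        history.append(list(h))
        if len(history) > max_delay + 1:
            history.pop(0)  # keep only as many past vectors as delays can reach
    gain = bellman(ref, h, cost, P)  # estimate of the average cost per stage
    return h, gain


def toy_problem(n=4):
    """Two-action toy MDP (assumed): each row is 0.8 * uniform + 0.2 * point mass."""
    cost = [[1.0, 2.0, 0.5, 3.0], [1.5, 0.7, 2.5, 1.0]]
    P = []
    for a in range(2):
        rows = []
        for i in range(n):
            target = i if a == 0 else (i + 1) % n  # action 0 stays, action 1 moves on
            rows.append([0.8 / n + (0.2 if j == target else 0.0) for j in range(n)])
        P.append(rows)
    return cost, P


cost, P = toy_problem()
h, gain = async_relative_vi(cost, P, owners=[{0, 1}, {2, 3}], speeds=[0.9, 0.3])
```

Here two simulated processors split the four states, one updating on most ticks and the other on roughly a third of them, with each reading the other's coordinates up to `max_delay` ticks late. Setting `owners=[{0, 1, 2, 3}]`, `speeds=[1.0]` and `max_delay=0` reduces the simulation to ordinary synchronous relative value iteration, which gives a baseline to compare against.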