A distributed asynchronous algorithm for expected average cost dynamic programming
A distributed asynchronous implementation of the value-iteration algorithm in dynamic programming is presented. The iteration step is carried out by a number of processors, each iterating on a subset of the value function vector. Each processor transmits its computed coordinates to other processors....
Gespeichert in:
| Veröffentlicht in: | 29th IEEE Conference on Decision and Control S. 1394 - 1395 vol.3 |
|---|---|
| Hauptverfasser: | , |
| Format: | Tagungsbericht |
| Sprache: | Englisch |
| Veröffentlicht: |
IEEE
1990
|
| Schlagworte: | |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | A distributed asynchronous implementation of the value-iteration algorithm in dynamic programming is presented. The iteration step is carried out by a number of processors, each iterating on a subset of the value function vector. Each processor transmits its computed coordinates to other processors. The algorithm converges when the different processors iterate at different speeds. The information received by a processor regarding other coordinates may be outdated, and there may be an unpredictable delay in receiving information from other processors.< > |
|---|---|
| DOI: | 10.1109/CDC.1990.203839 |