A distributed asynchronous algorithm for expected average cost dynamic programming

Bibliographic Details
Published in:	29th IEEE Conference on Decision and Control, pp. 1394-1395, vol. 3
Main authors:	Jalali, A.; Ferguson, M.J.
Format:	Conference Proceeding
Language:	English
Published:	IEEE 1990
Subjects:	Business; Communication networks; Cost function; Delay; Dynamic programming; Heuristic algorithms; Iterative algorithms; Out of order
Online access:	Full text
Description
Summary:	A distributed asynchronous implementation of the value-iteration algorithm in dynamic programming is presented. The iteration step is carried out by a number of processors, each iterating on a subset of the value function vector. Each processor transmits its computed coordinates to other processors. The algorithm converges when the different processors iterate at different speeds. The information received by a processor regarding other coordinates may be outdated, and there may be an unpredictable delay in receiving information from other processors.
DOI:	10.1109/CDC.1990.203839
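
Illustration:	The setting described in the summary can be simulated in a few lines. The sketch below is not the authors' algorithm, since this record does not give their update rule. It assumes a plain relative value iteration step (subtracting the value of a reference state) for the average cost problem, processors that iterate every `periods[p]` ticks, and messages with random delays where a late-arriving older value is discarded. All names (`bellman`, `async_relative_vi`, `owners`, `periods`, `max_delay`) are hypothetical, and convergence of this naive variant is not guaranteed in general.

```python
import random


def bellman(h, costs, trans, s):
    """One-step lookahead at state s: min over actions of cost plus expected h."""
    return min(
        costs[s][a] + sum(p * h[j] for j, p in enumerate(trans[s][a]))
        for a in range(len(costs[s]))
    )


def async_relative_vi(costs, trans, owners, periods, max_delay, ticks, seed=0, ref=0):
    """Simulate asynchronous relative value iteration with delayed messages.

    costs[s][a]    : one-step cost of action a in state s
    trans[s][a][j] : probability of moving from s to j under action a
    owners[s]      : index of the processor that updates coordinate s
    periods[p]     : processor p iterates every periods[p] ticks (its speed)
    max_delay      : each message takes 1 + (0..max_delay) ticks to arrive
    Assumption: this update rule and delay model are illustrative only.
    """
    rng = random.Random(seed)
    n, n_proc = len(costs), len(periods)
    local = [[0.0] * n for _ in range(n_proc)]  # each processor's copy of h
    stamp = [[-1] * n for _ in range(n_proc)]   # send time of newest value held
    in_flight = []                              # (arrival, sent, dest, state, value)

    for t in range(ticks):
        # Deliver messages that are due; drop values older than what is held,
        # so out-of-order arrivals never overwrite fresher information.
        pending = []
        for msg in in_flight:
            arrival, sent, dest, s, value = msg
            if arrival > t:
                pending.append(msg)
            elif sent > stamp[dest][s]:
                local[dest][s], stamp[dest][s] = value, sent
        in_flight = pending

        for p in range(n_proc):
            if t % periods[p] != 0:
                continue  # slower processors skip this tick
            h = local[p]  # possibly outdated view of the other coordinates
            offset = bellman(h, costs, trans, ref)
            # Compute all owned coordinates from the same snapshot, then write.
            updates = {
                s: bellman(h, costs, trans, s) - offset
                for s in range(n)
                if owners[s] == p
            }
            for s, value in updates.items():
                h[s] = value
                for q in range(n_proc):
                    if q != p:
                        arrival = t + 1 + rng.randint(0, max_delay)
                        in_flight.append((arrival, t, q, s, value))

    # Assemble the result from each owner's own coordinates.
    h = [local[owners[s]][s] for s in range(n)]
    return h, bellman(h, costs, trans, ref)
```

The function returns the assembled relative value vector and an estimate of the average cost obtained from one further lookahead at the reference state. Varying `periods` and `max_delay` gives a rough feel for how staleness affects the iterates on a given problem.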