Natural actor–critic algorithms

We present four new reinforcement learning algorithms based on actor–critic, natural-gradient and function-approximation ideas, and we provide their convergence proofs. Actor–critic reinforcement learning methods are online approximations to policy iteration in which the value-function parameters ar...

Full description

Saved in:
Bibliographic Details
Published in:Automatica (Oxford) Vol. 45; no. 11; pp. 2471 - 2482
Main Authors: Bhatnagar, Shalabh, Sutton, Richard S., Ghavamzadeh, Mohammad, Lee, Mark
Format: Journal Article
Language:English
Published: Kidlington Elsevier Ltd 01.11.2009
Elsevier
Subjects:
ISSN:0005-1098, 1873-2836
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first