Risk-sensitive reinforcement learning

We derive a family of risk-sensitive reinforcement learning methods for agents, who face sequential decision-making tasks in uncertain environments. By applying a utility function to the temporal difference (TD) error, nonlinear transformations are effectively applied not only to the received reward...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Neural computation Ročník 26; číslo 7; s. 1298
Hlavní autoři:	Shen, Yun, Tobia, Michael J, Sommer, Tobias, Obermayer, Klaus
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	United States 01.07.2014
Témata:	Algorithms Brain - physiology Brain Mapping Decision Making - physiology Humans Magnetic Resonance Imaging Markov Chains Models, Psychological Nonlinear Dynamics Oxygen - blood Probability Reinforcement (Psychology) Risk
ISSN:	1530-888X, 1530-888X
On-line přístup:	Zjistit podrobnosti o přístupu
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Buďte první, kdo okomentuje tento záznam!