Suchergebnisse - "IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"

1

Wird geladen …

Approximate real-time optimal control based on sparse Gaussian process models von Boedecker, Joschka, Springenberg, Jost Tobias, Wulfing, Jan, Riedmiller, Martin

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… In this paper we present a fully automated approach to (approximate) optimal control of non-linear systems. Our algorithm jointly learns a non-parametric model …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
2

Wird geladen …

Protecting against evaluation overfitting in empirical reinforcement learning von Whiteson, S., Tanner, B., Taylor, M. E., Stone, P.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2011

Veröffentlicht in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“… Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
3

Wird geladen …

Model-based multi-objective reinforcement learning von Wiering, Marco A., Withagen, Maikel, Drugan, Madalina M.

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… This paper describes a novel multi-objective reinforcement learning algorithm. The proposed algorithm first learns a model of the multi-objective sequential …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
4

Wird geladen …

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play von van der Ree, Michiel, Wiering, Marco

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2013

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.04.2013)
“… This paper compares three strategies in using reinforcement learning algorithms to let an artificial agent learn to play the game of Othello. The three …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
5

Wird geladen …

Pseudo-MDPs and factored linear action models von Hengshuai Yao, Szepesvari, Csaba, Pires, Bernardo Avila, Xinhua Zhang

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… In this paper we introduce the concept of pseudo-MDPs to develop abstractions. Pseudo-MDPs relax the requirement that the transition kernel has to be a …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
6

Wird geladen …

Reinforcement learning algorithms for solving classification problems von Wiering, M. A., van Hasselt, H., Pietersma, Auke-Dirk, Schomaker, L.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2011

Veröffentlicht in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“… We describe a new framework for applying reinforcement learning (RL) algorithms to solve classification tasks by letting an agent act on the inputs and learn …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
7

Wird geladen …

A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work? von Jiang, Daniel R., Pham, Thuy V., Powell, Warren B., Salas, Daniel F., Scott, Warren R.

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… As more renewable, yet volatile, forms of energy like solar and wind are being incorporated into the grid, the problem of finding optimal control policies for …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
8

Wird geladen …

Data-driven partially observable dynamic processes using adaptive dynamic programming von Xiangnan Zhong, Zhen Ni, Yufei Tang, Haibo He

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… Adaptive dynamic programming (ADP) has been widely recognized as one of the "core methodologies" to achieve optimal control for intelligent systems in Markov …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
9

Wird geladen …

Parametric value function approximation: A unified view von Geist, M., Pietquin, O.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2011

Veröffentlicht in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“… Reinforcement learning (RL) is a machine learning answer to the optimal control problem. It consists of learning an optimal control policy through interactions …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
10

Wird geladen …

Multi-objective reinforcement learning for AUV thruster failure recovery von Ahmadzadeh, Seyed Reza, Kormushev, Petar, Caldwell, Darwin G.

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
11

Wird geladen …

Annealing-pareto multi-objective multi-armed bandit algorithm von Yahyaa, Saba Q., Drugan, Madalina M., Manderick, Bernard

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… In the stochastic multi-objective multi-armed bandit (or MOMAB), arms generate a vector of stochastic rewards, one per objective, instead of a single scalar …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
12

Wird geladen …

Active learning for classification: An optimistic approach von Collet, Timothe, Pietquin, Olivier

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… In this paper, we propose to reformulate the active learning problem occurring in classification as a sequential decision making problem. We particularly focus …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
13

Wird geladen …

Approximate reinforcement learning: An overview von Busoniu, L., Ernst, D., De Schutter, B., Babuska, R.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2011

Veröffentlicht in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“… Reinforcement learning (RL) allows agents to learn how to optimally interact with complex environments. Fueled by recent advances in approximation-based …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
14

Wird geladen …

Neural network-based adaptive optimal consensus control of leaderless networked mobile robots von Guzey, Haci Mehmet, Hao Xu, Jagannathan, S.

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… A novel neural network (NN)-based optimal adaptive consensus control scheme is introduced in this paper for networked mobile robots in the presence of unknown …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
15

Wird geladen …

Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device von Francois-Lavet, Vincent, Fonteneau, Raphael, Ernst, Damien

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… This paper proposes a methodology to estimate the maximum revenue that can be generated by a company that operates a high-capacity storage device to buy or …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
16

Wird geladen …

Information-theoretic stochastic optimal control via incremental sampling-based algorithms von Arslan, Oktay, Theodorou, Evangelos A., Tsiotras, Panagiotis

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
17

Wird geladen …

Pareto Upper Confidence Bounds algorithms: An empirical study von Drugan, Madalina M., Nowe, Ann, Manderick, Bernard

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… Many real-world stochastic environments are inherently multi-objective environments with conflicting objectives. The multi-objective multi-armed bandits …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
18

Wird geladen …

Exploring the relationship of reward and punishment in reinforcement learning von Lowe, Robert, Ziemke, Tom

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2013

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.04.2013)
“… We present a reinforcement learning algorithm based on Dyna-Sarsa that utilizes separate representations of reward and punishment when guiding state-action …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
19

Wird geladen …

An analysis of optimistic, best-first search for minimax sequential decision making von Busoniu, Lucian, Munos, Remi, Pall, Elod

ISSN: 2325-1824

Veröffentlicht: IEEE 01.12.2014

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“… We consider problems in which a maximizer and a minimizer agent take actions in turn, such as games or optimal control with uncertainty modeled as an opponent …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
20

Wird geladen …

Real-time tracking on adaptive critic design with uniformly ultimately bounded condition von Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu

ISSN: 2325-1824

Veröffentlicht: IEEE 01.04.2013

Veröffentlicht in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.04.2013)
“… In this paper, we proposed a new nonlinear tracking controller based on heuristic dynamic programming (HDP) with the tracking filter. Specifically, we …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:

Suchergebnisse - "IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"

Approximate real-time optimal control based on sparse Gaussian process models von Boedecker, Joschka, Springenberg, Jost Tobias, Wulfing, Jan, Riedmiller, Martin

Protecting against evaluation overfitting in empirical reinforcement learning von Whiteson, S., Tanner, B., Taylor, M. E., Stone, P.

Model-based multi-objective reinforcement learning von Wiering, Marco A., Withagen, Maikel, Drugan, Madalina M.

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play von van der Ree, Michiel, Wiering, Marco

Pseudo-MDPs and factored linear action models von Hengshuai Yao, Szepesvari, Csaba, Pires, Bernardo Avila, Xinhua Zhang

Reinforcement learning algorithms for solving classification problems von Wiering, M. A., van Hasselt, H., Pietersma, Auke-Dirk, Schomaker, L.

A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work? von Jiang, Daniel R., Pham, Thuy V., Powell, Warren B., Salas, Daniel F., Scott, Warren R.

Data-driven partially observable dynamic processes using adaptive dynamic programming von Xiangnan Zhong, Zhen Ni, Yufei Tang, Haibo He

Parametric value function approximation: A unified view von Geist, M., Pietquin, O.

Multi-objective reinforcement learning for AUV thruster failure recovery von Ahmadzadeh, Seyed Reza, Kormushev, Petar, Caldwell, Darwin G.

Annealing-pareto multi-objective multi-armed bandit algorithm von Yahyaa, Saba Q., Drugan, Madalina M., Manderick, Bernard

Active learning for classification: An optimistic approach von Collet, Timothe, Pietquin, Olivier

Approximate reinforcement learning: An overview von Busoniu, L., Ernst, D., De Schutter, B., Babuska, R.

Neural network-based adaptive optimal consensus control of leaderless networked mobile robots von Guzey, Haci Mehmet, Hao Xu, Jagannathan, S.

Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device von Francois-Lavet, Vincent, Fonteneau, Raphael, Ernst, Damien

Information-theoretic stochastic optimal control via incremental sampling-based algorithms von Arslan, Oktay, Theodorou, Evangelos A., Tsiotras, Panagiotis

Pareto Upper Confidence Bounds algorithms: An empirical study von Drugan, Madalina M., Nowe, Ann, Manderick, Bernard

Exploring the relationship of reward and punishment in reinforcement learning von Lowe, Robert, Ziemke, Tom

An analysis of optimistic, best-first search for minimax sequential decision making von Busoniu, Lucian, Munos, Remi, Pall, Elod

Real-time tracking on adaptive critic design with uniformly ultimately bounded condition von Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu

Suchwerkzeuge:

Treffer weiter einschränken

Format

Schlagwortumfeld

Thema

Sprache

Erscheinungsjahr