Search Results - "IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"

1

Loading…

Approximate real-time optimal control based on sparse Gaussian process models by Boedecker, Joschka, Springenberg, Jost Tobias, Wulfing, Jan, Riedmiller, Martin

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…In this paper we present a fully automated approach to (approximate) optimal control of non-linear systems. Our algorithm jointly learns a non-parametric model…”

Get full text

Conference Proceeding

Save to List

Saved in:
2

Loading…

Protecting against evaluation overfitting in empirical reinforcement learning by Whiteson, S., Tanner, B., Taylor, M. E., Stone, P.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed…”

Get full text

Conference Proceeding

Save to List

Saved in:
3

Loading…

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play by van der Ree, Michiel, Wiering, Marco

ISSN: 2325-1824

Published: IEEE 01.04.2013

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.04.2013)
“…This paper compares three strategies in using reinforcement learning algorithms to let an artificial agent learn to play the game of Othello. The three…”

Get full text

Conference Proceeding

Save to List

Saved in:
4

Loading…

Model-based multi-objective reinforcement learning by Wiering, Marco A., Withagen, Maikel, Drugan, Madalina M.

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…This paper describes a novel multi-objective reinforcement learning algorithm. The proposed algorithm first learns a model of the multi-objective sequential…”

Get full text

Conference Proceeding

Save to List

Saved in:
5

Loading…

Pseudo-MDPs and factored linear action models by Hengshuai Yao, Szepesvari, Csaba, Pires, Bernardo Avila, Xinhua Zhang

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…In this paper we introduce the concept of pseudo-MDPs to develop abstractions. Pseudo-MDPs relax the requirement that the transition kernel has to be a…”

Get full text

Conference Proceeding

Save to List

Saved in:
6

Loading…

Reinforcement learning algorithms for solving classification problems by Wiering, M. A., van Hasselt, H., Pietersma, Auke-Dirk, Schomaker, L.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…We describe a new framework for applying reinforcement learning (RL) algorithms to solve classification tasks by letting an agent act on the inputs and learn…”

Get full text

Conference Proceeding

Save to List

Saved in:
7

Loading…

Approximate reinforcement learning: An overview by Busoniu, L., Ernst, D., De Schutter, B., Babuska, R.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…Reinforcement learning (RL) allows agents to learn how to optimally interact with complex environments. Fueled by recent advances in approximation-based…”

Get full text

Conference Proceeding

Save to List

Saved in:
8

Loading…

A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work? by Jiang, Daniel R., Pham, Thuy V., Powell, Warren B., Salas, Daniel F., Scott, Warren R.

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…As more renewable, yet volatile, forms of energy like solar and wind are being incorporated into the grid, the problem of finding optimal control policies for…”

Get full text

Conference Proceeding

Save to List

Saved in:
9

Loading…

Exploring the relationship of reward and punishment in reinforcement learning by Lowe, Robert, Ziemke, Tom

ISSN: 2325-1824

Published: IEEE 01.04.2013

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.04.2013)
“…We present a reinforcement learning algorithm based on Dyna-Sarsa that utilizes separate representations of reward and punishment when guiding state-action…”

Get full text

Conference Proceeding

Save to List

Saved in:
10

Loading…

Real-time tracking on adaptive critic design with uniformly ultimately bounded condition by Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu

ISSN: 2325-1824

Published: IEEE 01.04.2013

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.04.2013)
“…In this paper, we proposed a new nonlinear tracking controller based on heuristic dynamic programming (HDP) with the tracking filter. Specifically, we…”

Get full text

Conference Proceeding

Save to List

Saved in:
11

Loading…

Multi-objective reinforcement learning for AUV thruster failure recovery by Ahmadzadeh, Seyed Reza, Kormushev, Petar, Caldwell, Darwin G.

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles…”

Get full text

Conference Proceeding

Save to List

Saved in:
12

Loading…

Parametric value function approximation: A unified view by Geist, M., Pietquin, O.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…Reinforcement learning (RL) is a machine learning answer to the optimal control problem. It consists of learning an optimal control policy through interactions…”

Get full text

Conference Proceeding

Save to List

Saved in:
13

Loading…

Data-driven partially observable dynamic processes using adaptive dynamic programming by Xiangnan Zhong, Zhen Ni, Yufei Tang, Haibo He

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…Adaptive dynamic programming (ADP) has been widely recognized as one of the "core methodologies" to achieve optimal control for intelligent systems in Markov…”

Get full text

Conference Proceeding

Save to List

Saved in:
14

Loading…

Annealing-pareto multi-objective multi-armed bandit algorithm by Yahyaa, Saba Q., Drugan, Madalina M., Manderick, Bernard

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…In the stochastic multi-objective multi-armed bandit (or MOMAB), arms generate a vector of stochastic rewards, one per objective, instead of a single scalar…”

Get full text

Conference Proceeding

Save to List

Saved in:
15

Loading…

Grounding subgoals in information transitions by van Dijk, S. G., Polani, D.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…In reinforcement learning problems, the construction of subgoals has been identified as an important step to speed up learning and to enable skill transfer…”

Get full text

Conference Proceeding

Save to List

Saved in:
16

Loading…

Information-theoretic stochastic optimal control via incremental sampling-based algorithms by Arslan, Oktay, Theodorou, Evangelos A., Tsiotras, Panagiotis

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the…”

Get full text

Conference Proceeding

Save to List

Saved in:
17

Loading…

Path integral control and bounded rationality by Braun, D. A., Ortega, P. A., Theodorou, E., Schaal, S.

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…Path integral methods have recently been shown to be applicable to a very general class of optimal control problems. Here we examine the path integral…”

Get full text

Conference Proceeding

Save to List

Saved in:
18

Loading…

Feedback controller parameterizations for Reinforcement Learning by Roberts, John W., Manchester, Ian R., Tedrake, Russ

ISBN: 1424498872, 9781424498871

ISSN: 2325-1824

Published: IEEE 01.04.2011

Published in 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) (01.04.2011)
“…Reinforcement Learning offers a very general framework for learning controllers, but its effectiveness is closely tied to the controller parameterization used…”

Get full text

Conference Proceeding

Save to List

Saved in:
19

Loading…

Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device by Francois-Lavet, Vincent, Fonteneau, Raphael, Ernst, Damien

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…This paper proposes a methodology to estimate the maximum revenue that can be generated by a company that operates a high-capacity storage device to buy or…”

Get full text

Conference Proceeding

Save to List

Saved in:
20

Loading…

Pareto Upper Confidence Bounds algorithms: An empirical study by Drugan, Madalina M., Nowe, Ann, Manderick, Bernard

ISSN: 2325-1824

Published: IEEE 01.12.2014

Published in IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (01.12.2014)
“…Many real-world stochastic environments are inherently multi-objective environments with conflicting objectives. The multi-objective multi-armed bandits…”

Get full text

Conference Proceeding

Save to List

Saved in:

Search Results - "IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"

Approximate real-time optimal control based on sparse Gaussian process models by Boedecker, Joschka, Springenberg, Jost Tobias, Wulfing, Jan, Riedmiller, Martin

Protecting against evaluation overfitting in empirical reinforcement learning by Whiteson, S., Tanner, B., Taylor, M. E., Stone, P.

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play by van der Ree, Michiel, Wiering, Marco

Model-based multi-objective reinforcement learning by Wiering, Marco A., Withagen, Maikel, Drugan, Madalina M.

Pseudo-MDPs and factored linear action models by Hengshuai Yao, Szepesvari, Csaba, Pires, Bernardo Avila, Xinhua Zhang

Reinforcement learning algorithms for solving classification problems by Wiering, M. A., van Hasselt, H., Pietersma, Auke-Dirk, Schomaker, L.

Approximate reinforcement learning: An overview by Busoniu, L., Ernst, D., De Schutter, B., Babuska, R.

A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work? by Jiang, Daniel R., Pham, Thuy V., Powell, Warren B., Salas, Daniel F., Scott, Warren R.

Exploring the relationship of reward and punishment in reinforcement learning by Lowe, Robert, Ziemke, Tom

Real-time tracking on adaptive critic design with uniformly ultimately bounded condition by Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu

Multi-objective reinforcement learning for AUV thruster failure recovery by Ahmadzadeh, Seyed Reza, Kormushev, Petar, Caldwell, Darwin G.

Parametric value function approximation: A unified view by Geist, M., Pietquin, O.

Data-driven partially observable dynamic processes using adaptive dynamic programming by Xiangnan Zhong, Zhen Ni, Yufei Tang, Haibo He

Annealing-pareto multi-objective multi-armed bandit algorithm by Yahyaa, Saba Q., Drugan, Madalina M., Manderick, Bernard

Grounding subgoals in information transitions by van Dijk, S. G., Polani, D.

Information-theoretic stochastic optimal control via incremental sampling-based algorithms by Arslan, Oktay, Theodorou, Evangelos A., Tsiotras, Panagiotis

Path integral control and bounded rationality by Braun, D. A., Ortega, P. A., Theodorou, E., Schaal, S.

Feedback controller parameterizations for Reinforcement Learning by Roberts, John W., Manchester, Ian R., Tedrake, Russ

Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device by Francois-Lavet, Vincent, Fonteneau, Raphael, Ernst, Damien

Pareto Upper Confidence Bounds algorithms: An empirical study by Drugan, Madalina M., Nowe, Ann, Manderick, Bernard

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication