Search Results - "policy-iteration algorithm"

1

Loading…

A primal–dual policy iteration algorithm for constrained Markov decision processes by Liu, Zeyu, Li, Xueping, Khojandi, Anahita

ISSN: 0377-2217

Published: Elsevier B.V 01.01.2026

Published in European journal of operational research (01.01.2026)
“…The solution algorithms of Constrained Markov Decision Process (CMDP), a widely adopted model for sequential decision-making, have been intensively studied in…”

Get full text

Journal Article

Save to List

Saved in:
2

Loading…

Data-driven policy iteration algorithm for optimal control of continuous-time Itô stochastic systems with Markovian jumps by Song, Jun, He, Shuping, Liu, Fei, Niu, Yugang, Ding, Zhengtao

ISSN: 1751-8644, 1751-8652

Published: The Institution of Engineering and Technology 08.08.2016

Published in IET control theory & applications (08.08.2016)
“…This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm…”

Get full text

Journal Article

Save to List

Saved in:
3

Loading…

Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games With Application to H-Infinity Control by Li, Jie, Li, Shengbo Eben, Duan, Jingliang, Lyu, Yao, Zou, Wenjun, Guan, Yang, Yin, Yuming

ISSN: 0018-9286, 1558-2523

Published: New York IEEE 01.01.2024

Published in IEEE transactions on automatic control (01.01.2024)
“…Though policy evaluation error profoundly affects the direction of policy optimization and the convergence property, it is usually ignored in policy iteration…”

Get full text

Journal Article

Save to List

Saved in:
4

Loading…

Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm by Peng, Zhinan, Zhao, Yiyi, Hu, Jiangping, Ghosh, Bijoy Kumar

ISSN: 0020-0255, 1872-6291

Published: Elsevier Inc 01.05.2019

Published in Information sciences (01.05.2019)
“… Compared to the classical policy iteration ADP algorithm with two components, policy evaluation, and policy improvement, a two-stage policy iteration algorithm is proposed to obtain the iterative…”

Get full text

Journal Article

Save to List

Saved in:
5

Loading…

Online fault compensation control based on policy iteration algorithm for a class of affine non-linear systems with actuator failures by Zhao, Bo, Liu, Derong, Li, Yuanchun

ISSN: 1751-8644, 1751-8652

Published: The Institution of Engineering and Technology 10.10.2016

Published in IET control theory & applications (10.10.2016)
“…In this study, a novel online fault compensation control scheme based on policy iteration (PI) algorithm is developed for a class of affine non-linear systems…”

Get full text

Journal Article

Save to List

Saved in:
6

Loading…

A data-driven α-policy iteration algorithm for optimal leader-following consensus of discrete-time multi-agent systems by Xiang, Aoxue, Zhao, Xinyuan, Ma, Ruicheng

ISSN: 0020-7721, 1464-5319

Published: London Taylor & Francis 10.12.2025

Published in International journal of systems science (10.12.2025)
“…In this paper, the data-driven α-policy iteration (PI) algorithm is proposed to address the optimal leader-following consensus problem of discrete-time…”

Get full text

Journal Article

Save to List

Saved in:
7

Loading…

An online fault-tolerant control approach based on policy iteration algorithm for nonlinear time-delay systems by Rahimi, Farshad

ISSN: 0020-7721, 1464-5319

Published: Taylor & Francis 04.07.2025

Published in International journal of systems science (04.07.2025)
“…This paper introduces an online fault-tolerant control method for nonlinear time-delay systems with actuator faults using a policy iteration algorithm…”

Get full text

Journal Article

Save to List

Saved in:
8

Loading…

Data‐driven policy iteration algorithm for continuous‐time stochastic linear‐quadratic optimal control problems by Zhang, Heng, Li, Na

ISSN: 1561-8625, 1934-6093

Published: Hoboken Wiley Subscription Services, Inc 01.01.2024

Published in Asian journal of control (01.01.2024)
“… Combining the Kronecker product theory with an existing policy iteration algorithm, a data…”

Get full text

Journal Article

Save to List

Saved in:
9

Loading…

Policy Iteration Algorithm for Optimal Control of Stochastic Logical Dynamical Systems by Wu, Yuhu, Shen, Tielong

ISSN: 2162-237X, 2162-2388, 2162-2388

Published: United States IEEE 01.05.2018

Published in IEEE transaction on neural networks and learning systems (01.05.2018)
“… Then, employing the method of semitensor product of matrices and the increasing-dimension technique, a succinct algebraic form of the policy iteration algorithm is derived to solve the optimal control problem…”

Get full text

Journal Article

Save to List

Saved in:
10

Loading…

The policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér–Lundberg risk model by Liu, Guoxin, Liu, Xiaoying, Liu, Zhaoyang

ISSN: 0377-0427, 1879-1778

Published: Elsevier B.V 15.10.2022

Published in Journal of computational and applied mathematics (15.10.2022)
“…In this paper, we focus on the policy iteration algorithm (PIA) for the optimal dividend problem under the Cramér–Lundberg risk model…”

Get full text

Journal Article

Save to List

Saved in:
11

Loading…

Pseudo-Target Optimization Strategy Based on Policy Iteration Algorithm by Meng, Yiming, Liu, Daichen, Wang, Ziyuan

ISSN: 2169-3536, 2169-3536

Published: Piscataway IEEE 2025

Published in IEEE access (2025)
“… To address the issue of redundant states in the application of the policy iteration algorithm in the environmental model optimization process and to accelerate the convergence speed of the algorithm…”

Get full text

Journal Article

Save to List

Saved in:
12

Loading…

Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems by Wang, Ding, Liu, Derong, Li, Hongliang

ISSN: 1545-5955, 1558-3783

Published: New York IEEE 01.04.2014

Published in IEEE transactions on automation science and engineering (01.04.2014)
“…In this paper, a novel strategy is established to design the robust controller for a class of continuous-time nonlinear systems with uncertainties based on the online policy iteration algorithm…”

Get full text

Journal Article

Save to List

Saved in:
13

Loading…

Online adaptive optimal control for continuous-time Markov jump linear systems using a novel policy iteration algorithm by He, Shuping, Song, Jun, Ding, Zhengtao, Liu, Fei

ISSN: 1751-8644, 1751-8652

Published: The Institution of Engineering and Technology 25.06.2015

Published in IET control theory & applications (25.06.2015)
“…) based on a novel policy iteration algorithm. By utilising a new decoupling technique named subsystems transformation, the authors re-construct the MJLSs and a set of new coupled systems composed of N subsystems are obtained…”

Get full text

Journal Article

Save to List

Saved in:
14

Loading…

An off‐policy iteration algorithm for robust stabilization of constrained‐input uncertain nonlinear systems by Yang, Xiong, Wei, Qinglai

ISSN: 1049-8923, 1099-1239

Published: Bognor Regis Wiley Subscription Services, Inc 01.12.2018

Published in International journal of robust and nonlinear control (01.12.2018)
“… Then, under the framework of reinforcement learning, an off‐policy iteration algorithm is proposed to solve the constrained H2 optimal control problem. The off…”

Get full text

Journal Article

Save to List

Saved in:
15

Loading…

A Neural Network-Based Policy Iteration Algorithm with Global H2-Superlinear Convergence for Stochastic Games on Domains by Ito, Kazufumi, Reisinger, Christoph, Zhang, Yufei

ISSN: 1615-3375, 1615-3383

Published: New York Springer US 01.04.2021

Published in Foundations of computational mathematics (01.04.2021)
“…In this work, we propose a class of numerical schemes for solving semilinear Hamilton–Jacobi–Bellman–Isaacs (HJBI) boundary value problems which arise…”

Get full text

Journal Article

Save to List

Saved in:
16

Loading…

Dynamic event-triggered tolerant containment control protocol for discrete multiagent systems based on finite index policy iteration algorithm by Yan, Shuya, Li, Xiaocong, Qian, Huaming, Al Mamun, Abdullah

ISSN: 0019-0578, 1879-2022, 1879-2022

Published: United States Elsevier Ltd 01.03.2025

Published in ISA transactions (01.03.2025)
“…-triggered policy iteration algorithm is proposed. This algorithm only requires input and output data, without relying on system models, and simultaneously considers…”

Get full text

Journal Article

Save to List

Saved in:
17

Loading…

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm by Fürnkranz, Johannes, Hüllermeier, Eyke, Cheng, Weiwei, Park, Sang-Hyeun

ISSN: 0885-6125, 1573-0565

Published: Boston Springer US 01.10.2012

Published in Machine learning (01.10.2012)
“…This paper makes a first step toward the integration of two subfields of machine learning, namely preference learning and reinforcement learning (RL). An…”

Get full text

Journal Article

Save to List

Saved in:
18

Loading…

Machine Learning Structure for Controlling the Speed of Variable Reluctance Motor via Transitioning Policy Iteration Algorithm by Alharkan, Hamad

ISSN: 2032-6653, 2032-6653

Published: Basel MDPI AG 01.09.2024

Published in World electric vehicle journal (01.09.2024)
“… By formulating a policy iteration algorithm for VRM applications, the speed of the motor…”

Get full text

Journal Article

Save to List

Saved in:
19

Loading…

A Neural Network-Based Policy Iteration Algorithm with Global $$H^2$$-Superlinear Convergence for Stochastic Games on Domains by Ito, Kazufumi, Reisinger, Christoph, Zhang, Yufei

ISSN: 1615-3375, 1615-3383

Published: 01.04.2021

Published in Foundations of computational mathematics (01.04.2021)
“…In this work, we propose a class of numerical schemes for solving semilinear Hamilton–Jacobi–Bellman–Isaacs (HJBI) boundary value problems which arise…”

Get full text

Journal Article

Save to List

Saved in:
20

Loading…

Neuro-Optimal Control for Discrete Stochastic Processes via a Novel Policy Iteration Algorithm by Liang, Mingming, Wang, Ding, Liu, Derong

ISSN: 2168-2216, 2168-2232

Published: New York IEEE 01.11.2020

Published in IEEE transactions on systems, man, and cybernetics. Systems (01.11.2020)
“… Hence, we can significantly reduce the computational burden for the CPU in comparison with the conventional policy iteration algorithm…”

Get full text

Journal Article

Save to List

Saved in:

Search Results - "policy-iteration algorithm"

A primal–dual policy iteration algorithm for constrained Markov decision processes by Liu, Zeyu, Li, Xueping, Khojandi, Anahita

Data-driven policy iteration algorithm for optimal control of continuous-time Itô stochastic systems with Markovian jumps by Song, Jun, He, Shuping, Liu, Fei, Niu, Yugang, Ding, Zhengtao

Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games With Application to H-Infinity Control by Li, Jie, Li, Shengbo Eben, Duan, Jingliang, Lyu, Yao, Zou, Wenjun, Guan, Yang, Yin, Yuming

Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm by Peng, Zhinan, Zhao, Yiyi, Hu, Jiangping, Ghosh, Bijoy Kumar

Online fault compensation control based on policy iteration algorithm for a class of affine non-linear systems with actuator failures by Zhao, Bo, Liu, Derong, Li, Yuanchun

A data-driven α-policy iteration algorithm for optimal leader-following consensus of discrete-time multi-agent systems by Xiang, Aoxue, Zhao, Xinyuan, Ma, Ruicheng

An online fault-tolerant control approach based on policy iteration algorithm for nonlinear time-delay systems by Rahimi, Farshad

Data‐driven policy iteration algorithm for continuous‐time stochastic linear‐quadratic optimal control problems by Zhang, Heng, Li, Na

Policy Iteration Algorithm for Optimal Control of Stochastic Logical Dynamical Systems by Wu, Yuhu, Shen, Tielong

The policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér–Lundberg risk model by Liu, Guoxin, Liu, Xiaoying, Liu, Zhaoyang

Pseudo-Target Optimization Strategy Based on Policy Iteration Algorithm by Meng, Yiming, Liu, Daichen, Wang, Ziyuan

Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems by Wang, Ding, Liu, Derong, Li, Hongliang

Online adaptive optimal control for continuous-time Markov jump linear systems using a novel policy iteration algorithm by He, Shuping, Song, Jun, Ding, Zhengtao, Liu, Fei

An off‐policy iteration algorithm for robust stabilization of constrained‐input uncertain nonlinear systems by Yang, Xiong, Wei, Qinglai

A Neural Network-Based Policy Iteration Algorithm with Global H2-Superlinear Convergence for Stochastic Games on Domains by Ito, Kazufumi, Reisinger, Christoph, Zhang, Yufei

Dynamic event-triggered tolerant containment control protocol for discrete multiagent systems based on finite index policy iteration algorithm by Yan, Shuya, Li, Xiaocong, Qian, Huaming, Al Mamun, Abdullah

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm by Fürnkranz, Johannes, Hüllermeier, Eyke, Cheng, Weiwei, Park, Sang-Hyeun

Machine Learning Structure for Controlling the Speed of Variable Reluctance Motor via Transitioning Policy Iteration Algorithm by Alharkan, Hamad

A Neural Network-Based Policy Iteration Algorithm with Global $$H^2$$-Superlinear Convergence for Stochastic Games on Domains by Ito, Kazufumi, Reisinger, Christoph, Zhang, Yufei

Neuro-Optimal Control for Discrete Stochastic Processes via a Novel Policy Iteration Algorithm by Liang, Mingming, Wang, Ding, Liu, Derong

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication