Search results - "Policy-iteration algorithm"
-
1
A primal–dual policy iteration algorithm for constrained Markov decision processes
ISSN: 0377-2217
Publication details: Elsevier B.V, 01.01.2026
Published in: European journal of operational research (01.01.2026)
“…The solution algorithms of Constrained Markov Decision Process (CMDP), a widely adopted model for sequential decision-making, have been intensively studied in…”
Journal Article -
2
Data-driven policy iteration algorithm for optimal control of continuous-time Itô stochastic systems with Markovian jumps
ISSN: 1751-8644, 1751-8652
Publication details: The Institution of Engineering and Technology, 08.08.2016
Published in: IET control theory & applications (08.08.2016)
“…This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm…”
Journal Article -
3
Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games With Application to H-Infinity Control
ISSN: 0018-9286, 1558-2523
Publication details: New York: IEEE, 01.01.2024
Published in: IEEE transactions on automatic control (01.01.2024)
“…Though policy evaluation error profoundly affects the direction of policy optimization and the convergence property, it is usually ignored in policy iteration…”
Journal Article -
4
Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm
ISSN: 0020-0255, 1872-6291
Publication details: Elsevier Inc, 01.05.2019
Published in: Information sciences (01.05.2019)
“… Compared to the classical policy iteration ADP algorithm with two components, policy evaluation, and policy improvement, a two-stage policy iteration algorithm is proposed to obtain the iterative…”
Journal Article -
5
Online fault compensation control based on policy iteration algorithm for a class of affine non-linear systems with actuator failures
ISSN: 1751-8644, 1751-8652
Publication details: The Institution of Engineering and Technology, 10.10.2016
Published in: IET control theory & applications (10.10.2016)
“…In this study, a novel online fault compensation control scheme based on policy iteration (PI) algorithm is developed for a class of affine non-linear systems…”
Journal Article -
6
A data-driven α-policy iteration algorithm for optimal leader-following consensus of discrete-time multi-agent systems
ISSN: 0020-7721, 1464-5319
Publication details: London: Taylor & Francis, 10.12.2025
Published in: International journal of systems science (10.12.2025)
“…In this paper, the data-driven α-policy iteration (PI) algorithm is proposed to address the optimal leader-following consensus problem of discrete-time…”
Journal Article -
7
An online fault-tolerant control approach based on policy iteration algorithm for nonlinear time-delay systems
ISSN: 0020-7721, 1464-5319
Publication details: Taylor & Francis, 04.07.2025
Published in: International journal of systems science (04.07.2025)
“…This paper introduces an online fault-tolerant control method for nonlinear time-delay systems with actuator faults using a policy iteration algorithm…”
Journal Article -
8
Data‐driven policy iteration algorithm for continuous‐time stochastic linear‐quadratic optimal control problems
ISSN: 1561-8625, 1934-6093
Publication details: Hoboken: Wiley Subscription Services, Inc, 01.01.2024
Published in: Asian journal of control (01.01.2024)
“… Combining the Kronecker product theory with an existing policy iteration algorithm, a data…”
Journal Article -
9
Policy Iteration Algorithm for Optimal Control of Stochastic Logical Dynamical Systems
ISSN: 2162-237X, 2162-2388
Publication details: United States: IEEE, 01.05.2018
Published in: IEEE transactions on neural networks and learning systems (01.05.2018)
“… Then, employing the method of semitensor product of matrices and the increasing-dimension technique, a succinct algebraic form of the policy iteration algorithm is derived to solve the optimal control problem…”
Journal Article -
10
The policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér–Lundberg risk model
ISSN: 0377-0427, 1879-1778
Publication details: Elsevier B.V, 15.10.2022
Published in: Journal of computational and applied mathematics (15.10.2022)
“…In this paper, we focus on the policy iteration algorithm (PIA) for the optimal dividend problem under the Cramér–Lundberg risk model…”
Journal Article -
11
Pseudo-Target Optimization Strategy Based on Policy Iteration Algorithm
ISSN: 2169-3536
Publication details: Piscataway: IEEE, 2025
Published in: IEEE access (2025)
“… To address the issue of redundant states in the application of the policy iteration algorithm in the environmental model optimization process and to accelerate the convergence speed of the algorithm…”
Journal Article -
12
Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems
ISSN: 1545-5955, 1558-3783
Publication details: New York: IEEE, 01.04.2014
Published in: IEEE transactions on automation science and engineering (01.04.2014)
“…In this paper, a novel strategy is established to design the robust controller for a class of continuous-time nonlinear systems with uncertainties based on the online policy iteration algorithm…”
Journal Article -
13
Online adaptive optimal control for continuous-time Markov jump linear systems using a novel policy iteration algorithm
ISSN: 1751-8644, 1751-8652
Publication details: The Institution of Engineering and Technology, 25.06.2015
Published in: IET control theory & applications (25.06.2015)
“…) based on a novel policy iteration algorithm. By utilising a new decoupling technique named subsystems transformation, the authors re-construct the MJLSs and a set of new coupled systems composed of N subsystems are obtained…”
Journal Article -
14
An off‐policy iteration algorithm for robust stabilization of constrained‐input uncertain nonlinear systems
ISSN: 1049-8923, 1099-1239
Publication details: Bognor Regis: Wiley Subscription Services, Inc, 01.12.2018
Published in: International journal of robust and nonlinear control (01.12.2018)
“… Then, under the framework of reinforcement learning, an off‐policy iteration algorithm is proposed to solve the constrained H2 optimal control problem. The off…”
Journal Article -
15
A Neural Network-Based Policy Iteration Algorithm with Global H2-Superlinear Convergence for Stochastic Games on Domains
ISSN: 1615-3375, 1615-3383
Publication details: New York: Springer US, 01.04.2021
Published in: Foundations of computational mathematics (01.04.2021)
“…In this work, we propose a class of numerical schemes for solving semilinear Hamilton–Jacobi–Bellman–Isaacs (HJBI) boundary value problems which arise…”
Journal Article -
16
Dynamic event-triggered tolerant containment control protocol for discrete multiagent systems based on finite index policy iteration algorithm
ISSN: 0019-0578, 1879-2022
Publication details: United States: Elsevier Ltd, 01.03.2025
Published in: ISA transactions (01.03.2025)
“…-triggered policy iteration algorithm is proposed. This algorithm only requires input and output data, without relying on system models, and simultaneously considers…”
Journal Article -
17
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm
ISSN: 0885-6125, 1573-0565
Publication details: Boston: Springer US, 01.10.2012
Published in: Machine learning (01.10.2012)
“…This paper makes a first step toward the integration of two subfields of machine learning, namely preference learning and reinforcement learning (RL). An…”
Journal Article -
18
Machine Learning Structure for Controlling the Speed of Variable Reluctance Motor via Transitioning Policy Iteration Algorithm
ISSN: 2032-6653
Publication details: Basel: MDPI AG, 01.09.2024
Published in: World electric vehicle journal (01.09.2024)
“… By formulating a policy iteration algorithm for VRM applications, the speed of the motor…”
Journal Article -
19
A Neural Network-Based Policy Iteration Algorithm with Global $H^2$-Superlinear Convergence for Stochastic Games on Domains
ISSN: 1615-3375, 1615-3383
Publication details: 01.04.2021
Published in: Foundations of computational mathematics (01.04.2021)
“…In this work, we propose a class of numerical schemes for solving semilinear Hamilton–Jacobi–Bellman–Isaacs (HJBI) boundary value problems which arise…”
Journal Article -
20
Neuro-Optimal Control for Discrete Stochastic Processes via a Novel Policy Iteration Algorithm
ISSN: 2168-2216, 2168-2232
Publication details: New York: IEEE, 01.11.2020
Published in: IEEE transactions on systems, man, and cybernetics. Systems (01.11.2020)
“… Hence, we can significantly reduce the computational burden for the CPU in comparison with the conventional policy iteration algorithm…”
Journal Article