Search Results - "DPG Algorithm"

1

Loading…

Feature selection in deterministic policy gradient by Li, Luntong, Li, Dazi, Song, Tianheng

ISSN: 2051-3305, 2051-3305

Published: The Institution of Engineering and Technology 01.07.2020

Published in Journal of engineering (Stevenage, England) (01.07.2020)
“… In order to solve this problem, the authors extend DPG algorithm by adding an approximate-linear-dependency-based sparsification procedure, which makes DPG algorithm to automatically select…”

Get full text

Journal Article

Save to List

Saved in:
2

Loading…

Manipulators with Machine Learning-based Control with Reinforcement by Mannaa, Ali Sagae, Zarubin, Andrei O.

Published: IEEE 21.09.2023

Published in 2023 V International Conference on Control in Technical Systems (CTS) (21.09.2023)
“…This industry is created for a number of tasks in which there is no unambiguous solution algorithm. Accordingly, these tasks cannot be solved in a fully…”

Get full text

Conference Proceeding

Save to List

Saved in:
3

Loading…

Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles by Shi, Wenjie, Song, Shiji, Wu, Cheng, Chen, C. L. Philip

ISSN: 2162-237X, 2162-2388, 2162-2388

Published: United States IEEE 01.12.2019

Published in IEEE transaction on neural networks and learning systems (01.12.2019)
“…This paper investigates trajectory tracking problem for a class of underactuated autonomous underwater vehicles (AUVs) with unknown dynamics and constrained…”

Get full text

Journal Article

Save to List

Saved in:
4

Loading…

Composite optimization with coupling constraints via dual proximal gradient method with applications to asynchronous networks by Wang, Jianzheng, Hu, Guoqiang

ISSN: 1049-8923, 1099-1239

Published: Bognor Regis Wiley Subscription Services, Inc 25.05.2022

Published in International journal of robust and nonlinear control (25.05.2022)
“… Then, an asynchronous DPG (Asyn‐DPG) algorithm is proposed for the asynchronous networks with heterogeneous step…”

Get full text

Journal Article

Save to List

Saved in:
5

Loading…

Deep reinforcement learning based rate enhancement scheme for RIS assisted mobile users underlaying UAV by Joshi, Neeraj, Budhiraja, Ishan, Garg, Deepak, Garg, Sahil, Choi, Bong Jun, Alrashoud, Mubarak

ISSN: 1110-0168

Published: Elsevier B.V 01.03.2024

Published in Alexandria engineering journal (01.03.2024)
“…The fifth generation (5G) network enabled communication between devices has emerged as a state-of-the-art technology. In the era of proliferating smart devices…”

Get full text

Journal Article

Save to List

Saved in:
6

Loading…

Intelligent Trajectory Design in UAV-Aided Communications With Reinforcement Learning by Yin, Sixing, Zhao, Shuo, Zhao, Yifei, Yu, F. Richard

ISSN: 0018-9545, 1939-9359

Published: New York IEEE 01.08.2019

Published in IEEE transactions on vehicular technology (01.08.2019)
“… Due to the continuous and deterministic action space, the deterministic policy gradient (DPG…”

Get full text

Journal Article

Save to List

Saved in:
7

Loading…

Path Design and Resource Management for NOMA Enhanced Indoor Intelligent Robots by Zhong, Ruikang, Liu, Xiao, Liu, Yuanwei, Chen, Yue, Wang, Xianbin

ISSN: 1536-1276, 1558-2248

Published: New York IEEE 01.10.2022

Published in IEEE transactions on wireless communications (01.10.2022)
“…A communication enabled indoor intelligent robots (IRs) service framework is proposed, where non-orthogonal multiple access (NOMA) technique is adopted to…”

Get full text

Journal Article

Save to List

Saved in:
8

Loading…

UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking by Wei, Minggao, Wang, Song, Zheng, Jinfan, Chen, Dan

ISSN: 2169-3536, 2169-3536

Published: Piscataway IEEE 2018

Published in IEEE access (2018)
“…The success of robotic, such as UGV systems, largely benefits from the fundamental capability of autonomously finding collision-free path(s) to commit mobile…”

Get full text

Journal Article

Save to List

Saved in:
9

Loading…

Detect Insider Attacks Using CNN in Decentralized Optimization by Li, Gangqiang, Wu, Sissi Xiaoxiao, Zhang, Shengli, Li, Qiang

ISSN: 2379-190X

Published: IEEE 01.05.2020

Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (01.05.2020)
“…) algorithm, when it is applied for solving a decentralized multi-agent optimization. It is known that the gossip-based DPG algorithm is vulnerable to insider attacks because each agent locally estimates its (sub…”

Get full text

Conference Proceeding

Save to List

Saved in:
10

Loading…

Learning-based state estimation and control using MHE and MPC schemes with imperfect models by Nejatbakhsh Esfahani, Hossein, Bahari Kordabad, Arash, Cai, Wenqi, Gros, Sebastien

ISSN: 0947-3580, 1435-5671

Published: Elsevier Ltd 01.09.2023

Published in European journal of control (01.09.2023)
“… A compatible Deterministic Policy Gradient (DPG) algorithm is then proposed to directly tune the parameters of both the estimator (MHE) and controller (MPC…”

Get full text

Journal Article

Save to List

Saved in:
11

Loading…

A curriculum-based multi-agent DPG-ASC algorithm for UAV area defense by Sun, Miaoping, Xu, Zehao, Yang, Zequan, Nian, Xiaohong, Chen, Yong

ISSN: 1573-0484, 0920-8542, 1573-0484

Published: New York Springer Nature B.V 21.11.2025

Published in The Journal of supercomputing (21.11.2025)
“… Second, we propose an enhanced DPG algorithm integrating an attention mechanism with selective communication to reduce redundant input information for each agent, thereby lowering per-step…”

Get full text

Journal Article

Save to List

Saved in:
12

Loading…

Hybrid IRS-Assisted Secure Satellite Downlink Communications: A Fast Deep Reinforcement Learning Approach by Ngo, Quynh Tu, Phan, Khoa Tran, Mahmood, Abdun, Xiang, Wei

ISSN: 2471-285X, 2471-285X

Published: Piscataway IEEE 01.08.2024

Published in IEEE transactions on emerging topics in computational intelligence (01.08.2024)
“… information and IRS power consumption. We leverage deep reinforcement learning (DRL) to solve the problem by proposing a fast DRL algorithm, namely the deep post-decision state-deterministic policy gradient (DPDS-DPG) algorithm…”

Get full text

Journal Article

Save to List

Saved in:
13

Loading…

High-Level Tracking of Autonomous Underwater Vehicles Based on Pseudo Averaged Q-Learning by Shi, Wenjie, Song, Shiji, Wu, Cheng

ISSN: 2577-1655

Published: IEEE 01.10.2018

Published in Conference proceedings - IEEE International Conference on Systems, Man, and Cybernetics (01.10.2018)
“…In this paper, we investigate the trajectory tracking problem of underactuated autonomous underwater vehicles (AUVs) with input saturation. Our proposed…”

Get full text

Conference Proceeding

Save to List

Saved in:
14

Loading…

Reinforcement Learning Based Adaptive Load Shedding by CANFIS Controllers for Frequency Recovery Criterion-Oriented Control by Yang, Hao, Jin, Bo, Ding, Zhaohao, Sun, Zhenglong, Liu, Cheng, Yang, Dongfeng, Cai, Guowei, Chen, Jian

ISSN: 0885-8950, 1558-0679

Published: New York IEEE 01.01.2025

Published in IEEE transactions on power systems (01.01.2025)
“…To ensure modern power systems with the ability to ride through frequency excursions after a power deficit, a refined and strict frequency recovery criterion…”

Get full text

Journal Article

Save to List

Saved in:
15

Loading…

Approximating Nash Equilibrium in Day-ahead Electricity Market Bidding with Multi-agent Deep Reinforcement Learning by Yan Du, Fangxing Li, Helia Zandi, Yaosuo Xue

ISSN: 2196-5420

Published: IEEE 01.01.2021

Published in Journal of modern power systems and clean energy (01.01.2021)
“…), is applied for approximating the Nash equilibrium (NE) in the above Markov game. The MAD-DPG algorithm has the advantage of generalization due to the automatic feature extraction ability of the deep neural networks…”

Get full text

Journal Article

Save to List

Saved in:
16

Loading…

Dual-Based Online Learning of Dynamic Network Topologies by Saboksayr, Seyed Saman, Mateos, Gonzalo

ISSN: 2379-190X

Published: IEEE 04.06.2023

Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (04.06.2023)
“… We also show that the online DPG algorithm converges faster than a primal-based baseline of comparable complexity…”

Get full text

Conference Proceeding

Save to List

Saved in:
17

Loading…

Design, Modeling and Control of a Novel Morphing Quadrotor by Hu, Dada, Pei, Zhongcai, Shi, Jia, Tang, Zhiyong

ISSN: 2377-3766, 2377-3766

Published: Piscataway IEEE 01.10.2021

Published in IEEE robotics and automation letters (01.10.2021)
“…In this letter, the design, modeling and control of a novel morphing quadrotor are presented. The morphing quadrotor can fly stably and accurately in the air…”

Get full text

Journal Article

Save to List

Saved in:
18

Loading…

Localization of Data Injection Attacks on Distributed M-Estimation by Shalom, Or, Leshem, Amir, Scaglione, Anna

ISSN: 2373-776X, 2373-7778

Published: Piscataway IEEE 2022

Published in IEEE transactions on signal and information processing over networks (2022)
“… The agents' data are independent and identically distributed. We have previously proposed a novel data injection attack on the Distributed Projected Gradient (DPG…”

Get full text

Journal Article

Save to List

Saved in:
19

Loading…

A curriculum-based multi-agent DPG-ASC algorithm for UAV area defense: A curriculum-based multi-agent DPG-ASC algorithm by Sun, Miaoping, Xu, Zehao, Yang, Zequan, Nian, Xiaohong, Chen, Yong

ISSN: 1573-0484

Published: New York Springer US 21.11.2025

Published in The Journal of supercomputing (21.11.2025)
“… Second, we propose an enhanced DPG algorithm integrating an attention mechanism with selective communication to reduce redundant input information for each agent, thereby lowering per-step…”

Get full text

Journal Article

Save to List

Saved in:
20

Loading…

Single-Parameter-Tuned Attitude Control for Quadrotor with Unknown Disturbance by Hu, Dada, Pei, Zhongcai, Tang, Zhiyong

ISSN: 2076-3417, 2076-3417

Published: Basel MDPI AG 01.08.2020

Published in Applied sciences (01.08.2020)
“… A deterministic policy gradient (DPG) algorithm that is based on an actor-critic structure in a model-free style is used as the learning algorithm…”

Get full text

Journal Article

Save to List

Saved in:

Search Results - "DPG Algorithm"

Feature selection in deterministic policy gradient by Li, Luntong, Li, Dazi, Song, Tianheng

Manipulators with Machine Learning-based Control with Reinforcement by Mannaa, Ali Sagae, Zarubin, Andrei O.

Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles by Shi, Wenjie, Song, Shiji, Wu, Cheng, Chen, C. L. Philip

Composite optimization with coupling constraints via dual proximal gradient method with applications to asynchronous networks by Wang, Jianzheng, Hu, Guoqiang

Deep reinforcement learning based rate enhancement scheme for RIS assisted mobile users underlaying UAV by Joshi, Neeraj, Budhiraja, Ishan, Garg, Deepak, Garg, Sahil, Choi, Bong Jun, Alrashoud, Mubarak

Intelligent Trajectory Design in UAV-Aided Communications With Reinforcement Learning by Yin, Sixing, Zhao, Shuo, Zhao, Yifei, Yu, F. Richard

Path Design and Resource Management for NOMA Enhanced Indoor Intelligent Robots by Zhong, Ruikang, Liu, Xiao, Liu, Yuanwei, Chen, Yue, Wang, Xianbin

UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking by Wei, Minggao, Wang, Song, Zheng, Jinfan, Chen, Dan

Detect Insider Attacks Using CNN in Decentralized Optimization by Li, Gangqiang, Wu, Sissi Xiaoxiao, Zhang, Shengli, Li, Qiang

Learning-based state estimation and control using MHE and MPC schemes with imperfect models by Nejatbakhsh Esfahani, Hossein, Bahari Kordabad, Arash, Cai, Wenqi, Gros, Sebastien

A curriculum-based multi-agent DPG-ASC algorithm for UAV area defense by Sun, Miaoping, Xu, Zehao, Yang, Zequan, Nian, Xiaohong, Chen, Yong

Hybrid IRS-Assisted Secure Satellite Downlink Communications: A Fast Deep Reinforcement Learning Approach by Ngo, Quynh Tu, Phan, Khoa Tran, Mahmood, Abdun, Xiang, Wei

High-Level Tracking of Autonomous Underwater Vehicles Based on Pseudo Averaged Q-Learning by Shi, Wenjie, Song, Shiji, Wu, Cheng

Reinforcement Learning Based Adaptive Load Shedding by CANFIS Controllers for Frequency Recovery Criterion-Oriented Control by Yang, Hao, Jin, Bo, Ding, Zhaohao, Sun, Zhenglong, Liu, Cheng, Yang, Dongfeng, Cai, Guowei, Chen, Jian

Approximating Nash Equilibrium in Day-ahead Electricity Market Bidding with Multi-agent Deep Reinforcement Learning by Yan Du, Fangxing Li, Helia Zandi, Yaosuo Xue

Dual-Based Online Learning of Dynamic Network Topologies by Saboksayr, Seyed Saman, Mateos, Gonzalo

Design, Modeling and Control of a Novel Morphing Quadrotor by Hu, Dada, Pei, Zhongcai, Shi, Jia, Tang, Zhiyong

Localization of Data Injection Attacks on Distributed M-Estimation by Shalom, Or, Leshem, Amir, Scaglione, Anna

A curriculum-based multi-agent DPG-ASC algorithm for UAV area defense: A curriculum-based multi-agent DPG-ASC algorithm by Sun, Miaoping, Xu, Zehao, Yang, Zequan, Nian, Xiaohong, Chen, Yong

Single-Parameter-Tuned Attitude Control for Quadrotor with Unknown Disturbance by Hu, Dada, Pei, Zhongcai, Tang, Zhiyong

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication