Search Results - "DPG Algorithm"

  1.

    Feature selection in deterministic policy gradient by Li, Luntong, Li, Dazi, Song, Tianheng

    ISSN: 2051-3305
    Published: The Institution of Engineering and Technology 01.07.2020
    “… In order to solve this problem, the authors extend DPG algorithm by adding an approximate-linear-dependency-based sparsification procedure, which makes DPG algorithm to automatically select…”
    Journal Article
  2.

    Manipulators with Machine Learning-based Control with Reinforcement by Mannaa, Ali Sagae, Zarubin, Andrei O.

    Published: IEEE 21.09.2023
    “…This industry is created for a number of tasks in which there is no unambiguous solution algorithm. Accordingly, these tasks cannot be solved in a fully…”
    Conference Proceeding
  3.

    Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles by Shi, Wenjie, Song, Shiji, Wu, Cheng, Chen, C. L. Philip

    ISSN: 2162-237X, 2162-2388
    Published: United States IEEE 01.12.2019
    “…This paper investigates trajectory tracking problem for a class of underactuated autonomous underwater vehicles (AUVs) with unknown dynamics and constrained…”
    Journal Article
  4.

    Composite optimization with coupling constraints via dual proximal gradient method with applications to asynchronous networks by Wang, Jianzheng, Hu, Guoqiang

    ISSN: 1049-8923, 1099-1239
    Published: Bognor Regis Wiley Subscription Services, Inc 25.05.2022
    “… Then, an asynchronous DPG (Asyn‐DPG) algorithm is proposed for the asynchronous networks with heterogeneous step…”
    Journal Article
  5.

    Deep reinforcement learning based rate enhancement scheme for RIS assisted mobile users underlaying UAV by Joshi, Neeraj, Budhiraja, Ishan, Garg, Deepak, Garg, Sahil, Choi, Bong Jun, Alrashoud, Mubarak

    ISSN: 1110-0168
    Published: Elsevier B.V 01.03.2024
    Published in Alexandria engineering journal (01.03.2024)
    “…The fifth generation (5G) network enabled communication between devices has emerged as a state-of-the-art technology. In the era of proliferating smart devices…”
    Journal Article
  6.

    Intelligent Trajectory Design in UAV-Aided Communications With Reinforcement Learning by Yin, Sixing, Zhao, Shuo, Zhao, Yifei, Yu, F. Richard

    ISSN: 0018-9545, 1939-9359
    Published: New York IEEE 01.08.2019
    Published in IEEE transactions on vehicular technology (01.08.2019)
    “… Due to the continuous and deterministic action space, the deterministic policy gradient (DPG…”
    Journal Article
  7.

    Path Design and Resource Management for NOMA Enhanced Indoor Intelligent Robots by Zhong, Ruikang, Liu, Xiao, Liu, Yuanwei, Chen, Yue, Wang, Xianbin

    ISSN: 1536-1276, 1558-2248
    Published: New York IEEE 01.10.2022
    “…A communication enabled indoor intelligent robots (IRs) service framework is proposed, where non-orthogonal multiple access (NOMA) technique is adopted to…”
    Journal Article
  8.

    UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking by Wei, Minggao, Wang, Song, Zheng, Jinfan, Chen, Dan

    ISSN: 2169-3536
    Published: Piscataway IEEE 2018
    Published in IEEE access (2018)
    “…The success of robotic, such as UGV systems, largely benefits from the fundamental capability of autonomously finding collision-free path(s) to commit mobile…”
    Journal Article
  9.

    Detect Insider Attacks Using CNN in Decentralized Optimization by Li, Gangqiang, Wu, Sissi Xiaoxiao, Zhang, Shengli, Li, Qiang

    ISSN: 2379-190X
    Published: IEEE 01.05.2020
    “…) algorithm, when it is applied for solving a decentralized multi-agent optimization. It is known that the gossip-based DPG algorithm is vulnerable to insider attacks because each agent locally estimates its (sub…”
    Conference Proceeding
  10.

    Learning-based state estimation and control using MHE and MPC schemes with imperfect models by Nejatbakhsh Esfahani, Hossein, Bahari Kordabad, Arash, Cai, Wenqi, Gros, Sebastien

    ISSN: 0947-3580, 1435-5671
    Published: Elsevier Ltd 01.09.2023
    Published in European journal of control (01.09.2023)
    “… A compatible Deterministic Policy Gradient (DPG) algorithm is then proposed to directly tune the parameters of both the estimator (MHE) and controller (MPC…”
    Journal Article
  11.

    A curriculum-based multi-agent DPG-ASC algorithm for UAV area defense by Sun, Miaoping, Xu, Zehao, Yang, Zequan, Nian, Xiaohong, Chen, Yong

    ISSN: 1573-0484, 0920-8542
    Published: New York Springer Nature B.V 21.11.2025
    Published in The Journal of supercomputing (21.11.2025)
    “… Second, we propose an enhanced DPG algorithm integrating an attention mechanism with selective communication to reduce redundant input information for each agent, thereby lowering per-step…”
    Journal Article
  12.

    Hybrid IRS-Assisted Secure Satellite Downlink Communications: A Fast Deep Reinforcement Learning Approach by Ngo, Quynh Tu, Phan, Khoa Tran, Mahmood, Abdun, Xiang, Wei

    ISSN: 2471-285X
    Published: Piscataway IEEE 01.08.2024
    “… information and IRS power consumption. We leverage deep reinforcement learning (DRL) to solve the problem by proposing a fast DRL algorithm, namely the deep post-decision state-deterministic policy gradient (DPDS-DPG) algorithm…”
    Journal Article
  13.

    High-Level Tracking of Autonomous Underwater Vehicles Based on Pseudo Averaged Q-Learning by Shi, Wenjie, Song, Shiji, Wu, Cheng

    ISSN: 2577-1655
    Published: IEEE 01.10.2018
    “…In this paper, we investigate the trajectory tracking problem of underactuated autonomous underwater vehicles (AUVs) with input saturation. Our proposed…”
    Conference Proceeding
  14.

    Reinforcement Learning Based Adaptive Load Shedding by CANFIS Controllers for Frequency Recovery Criterion-Oriented Control by Yang, Hao, Jin, Bo, Ding, Zhaohao, Sun, Zhenglong, Liu, Cheng, Yang, Dongfeng, Cai, Guowei, Chen, Jian

    ISSN: 0885-8950, 1558-0679
    Published: New York IEEE 01.01.2025
    Published in IEEE transactions on power systems (01.01.2025)
    “…To ensure modern power systems with the ability to ride through frequency excursions after a power deficit, a refined and strict frequency recovery criterion…”
    Journal Article
  15.

    Approximating Nash Equilibrium in Day-ahead Electricity Market Bidding with Multi-agent Deep Reinforcement Learning by Du, Yan, Li, Fangxing, Zandi, Helia, Xue, Yaosuo

    ISSN: 2196-5420
    Published: IEEE 01.01.2021
    “…), is applied for approximating the Nash equilibrium (NE) in the above Markov game. The MAD-DPG algorithm has the advantage of generalization due to the automatic feature extraction ability of the deep neural networks…”
    Journal Article
  16.

    Dual-Based Online Learning of Dynamic Network Topologies by Saboksayr, Seyed Saman, Mateos, Gonzalo

    ISSN: 2379-190X
    Published: IEEE 04.06.2023
    “… We also show that the online DPG algorithm converges faster than a primal-based baseline of comparable complexity…”
    Conference Proceeding
  17.

    Design, Modeling and Control of a Novel Morphing Quadrotor by Hu, Dada, Pei, Zhongcai, Shi, Jia, Tang, Zhiyong

    ISSN: 2377-3766
    Published: Piscataway IEEE 01.10.2021
    Published in IEEE robotics and automation letters (01.10.2021)
    “…In this letter, the design, modeling and control of a novel morphing quadrotor are presented. The morphing quadrotor can fly stably and accurately in the air…”
    Journal Article
  18.

    Localization of Data Injection Attacks on Distributed M-Estimation by Shalom, Or, Leshem, Amir, Scaglione, Anna

    ISSN: 2373-776X, 2373-7778
    Published: Piscataway IEEE 2022
    “… The agents' data are independent and identically distributed. We have previously proposed a novel data injection attack on the Distributed Projected Gradient (DPG…”
    Journal Article
  19.

    A curriculum-based multi-agent DPG-ASC algorithm for UAV area defense by Sun, Miaoping, Xu, Zehao, Yang, Zequan, Nian, Xiaohong, Chen, Yong

    ISSN: 1573-0484
    Published: New York Springer US 21.11.2025
    Published in The Journal of supercomputing (21.11.2025)
    “… Second, we propose an enhanced DPG algorithm integrating an attention mechanism with selective communication to reduce redundant input information for each agent, thereby lowering per-step…”
    Journal Article
  20.

    Single-Parameter-Tuned Attitude Control for Quadrotor with Unknown Disturbance by Hu, Dada, Pei, Zhongcai, Tang, Zhiyong

    ISSN: 2076-3417
    Published: Basel MDPI AG 01.08.2020
    Published in Applied sciences (01.08.2020)
    “… A deterministic policy gradient (DPG) algorithm that is based on an actor-critic structure in a model-free style is used as the learning algorithm…”
    Journal Article