Search results - "Policy gradient algorithm"

  1.

    Motion planning for 7-degree-of-freedom bionic arm: Deep deterministic policy gradient algorithm based on imitation of human action by Li, Baojiang, Qiu, Shengjie, Ye, Haiyan, Guo, Yuting, Wang, Haiyan, Bai, Jibo

    ISSN: 0952-1976
    Published: Elsevier Ltd 15.01.2025
    Published in Engineering applications of artificial intelligence (15.01.2025)
    “… Meanwhile, bionic arms often require specific programming to be implemented for the subject to initially meet the control requirements, which makes it difficult to match the motion of the bionic arm …”
    Full text
    Journal Article
  2.

    Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm by Wu, Junta, Li, Huiyun

    ISSN: 1024-123X, 1563-5147
    Published: Cairo, Egypt Hindawi Publishing Corporation 2020
    Published in Mathematical problems in engineering (2020)
    “… Deep deterministic policy gradient algorithm operating over continuous space of actions has attracted great attention for reinforcement learning …”
    Full text
    Journal Article
  3.

    Optimization control of the double‐capacity water tank‐level system using the deep deterministic policy gradient algorithm by Ye, Likun, Jiang, Pei

    ISSN: 2577-8196, 2577-8196
    Published: Hoboken John Wiley & Sons, Inc 01.11.2023
    Published in Engineering reports (Hoboken, N.J.) (01.11.2023)
    “… Process control systems are subject to external factors such as changes in working conditions and perturbation interference, which can significantly affect the system's stability and overall performance …”
    Full text
    Journal Article
  4.

    Robust control for anaerobic digestion systems of Tequila vinasses under uncertainty: A Deep Deterministic Policy Gradient Algorithm by Mendiola-Rodriguez, Tannia A., Ricardez-Sandoval, Luis A.

    ISSN: 2772-5081, 2772-5081
    Published: Elsevier Ltd 01.06.2022
    Published in Digital Chemical Engineering (01.06.2022)
    “… Anaerobic digestion is a complex system subject to external perturbations and parametric uncertainty …”
    Full text
    Journal Article
  5.

    Brain-Inspired Deep Meta-Reinforcement Learning for Active Coordinated Fault-Tolerant Load Frequency Control of Multi-Area Grids by Li, Jiawen, Zhou, Tao, Cui, Haoyang

    ISSN: 1545-5955, 1558-3783
    Published: IEEE 01.07.2024
    “… In addition, this paper proposes a brain-Inspired deep meta-deterministic policy gradient algorithm (BIMA-DMDPG …”
    Full text
    Journal Article
  6.

    Wide-Range Variable Cycle Engine Control Based on Deep Reinforcement Learning by Ding, Yaoyao, Wang, Fengming, Mu, Yuanwei, Sun, Hongfei

    ISSN: 2226-4310, 2226-4310
    Published: Basel MDPI AG 01.05.2025
    Published in Aerospace (01.05.2025)
    “… deterministic policy gradient algorithm, and it applies an action space pruning technique to optimize …”
    Full text
    Journal Article
  7.

    Robust Energy-Efficient DRL-Based Optimization in UAV-Mounted RIS Systems with Jitter by Salim, Mahmoud M., Rabie, Khaled M., Muqaibel, Ali H.

    ISSN: 1089-7798, 1558-2558
    Published: IEEE 2025
    Published in IEEE communications letters (2025)
    “… To address this, we reformulate the problem as a deep reinforcement learning (DRL) environment and develop a smoothed softmax dual deep deterministic policy gradient algorithm …”
    Full text
    Journal Article
  8.

    Deep-Reinforcement-Learning-Based Computation Offloading for Servicing Dynamic Demand in Multi-UAV-Assisted IoT Network by Lin, Na, Bai, Lu, Hawbani, Ammar, Guan, Yunchong, Mao, Chaojin, Liu, Zhi, Zhao, Liang

    ISSN: 2327-4662, 2327-4662
    Published: Piscataway IEEE 15.05.2024
    Published in IEEE internet of things journal (15.05.2024)
    “… The dynamic scheduling and computation offloading of UAVs is the subject of this article. Specifically, we propose a deep deterministic policy gradient algorithm based on a greedy …”
    Full text
    Journal Article
  9.

    Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID by Guan, Wei, Xi, Zhaoyong, Cui, Zhewen, Zhang, Xianku

    ISSN: 0007-215X, 1845-5859
    Published: Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje 01.01.2025
    Published in Brodogradnja (01.01.2025)
    “… An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface …”
    Full text
    Journal Article Paper
  10.

    Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach by Wang, Yang, Gao, Zhen

    Published: IEEE 28.07.2021
    “… ), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e …”
    Full text
    Conference Proceeding
  11.

    Online Assignment of Satellite Data Transmission Tasks Based on Twin Delayed Deep Deterministic Policy Gradient by Li, Ke, Xiong, Shunrui, Yang, Mingying, Ma, Sai

    Published: IEEE 12.04.2024
    “… This study focuses on the scheduling problem of satellite data transmission tasks, where ground station resources are scarce and subject to multiple constraints, such as limited funding, long …”
    Full text
    Conference Proceeding
  12.

    Learning Constrained Resource Allocation Policies in Wireless Control Systems by Lima, Vinicius, Eisen, Mark, Ribeiro, Alejandro

    ISSN: 2576-2370
    Published: IEEE 14.12.2020
    “… As wireless networks are noisy and subject to packet losses - which might impact the operation of the control system - proper distribution of communication resources among components of the wireless …”
    Full text
    Conference Proceeding
  13.

    Coordinated Scheduling Strategy for Multi-Agent System in Active Distribution Grid based on Deep Reinforcement Learning Method by Liu, Xuan, Li, Wanbin, Song, Lin, Yao, Zhanfeng, Lu, Tianguang, Bai, Xue, Zhang, Yuqi, Wang, Wenxin, Zheng, Yanan, Liu, Chunxiu

    Published: IEEE 14.07.2024
    “… First, each subject such as distributed power sources, energy storage devices and flexible loads is taken as an intelligent agent in an active distribution grid, and the real-time state information …”
    Full text
    Conference Proceeding
  14.

    Independent Learning in Constrained Markov Potential Games by Jordan, Philip, Barakat, Anas, He, Niao

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 27.02.2024
    Published in arXiv.org (27.02.2024)
    “… We propose an independent policy gradient algorithm for learning approximate constrained Nash equilibria …”
    Full text
    Paper
  15.

    Model-free Distortion Canceling and Control of Quantum Devices by Fouad, Ahmed F, Youssry, Akram, El-Rafei, Ahmed, Hammad, Sherif

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 13.07.2024
    Published in arXiv.org (13.07.2024)
    “… First, in practice the control signals are usually subject to unknown classical distortions that could arise from the device fabrication, material properties and/or instruments generating those signals …”
    Full text
    Paper
  16.

    Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning by Lin, Bo, Zhong, Yangzheng, Ren, Weiqing

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 08.04.2024
    Published in arXiv.org (08.04.2024)
    “… Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology …”
    Full text
    Paper
  17.

    A deep reinforcement learning approach to assess the low-altitude airspace capacity for urban air mobility by Mehditabrizi, Asal, Samadzad, Mahdi, Sabzekar, Sina

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 23.01.2023
    Published in arXiv.org (23.01.2023)
    “… Path planning is a vital subject in urban air mobility which could enable a large number of UAVs to fly simultaneously in the airspace without facing the risk of collision …”
    Full text
    Paper
  18.

    A Deep Reinforcement Learning Approach for Online Parcel Assignment by Zeng, Hao, Wu, Qiong, Han, Kunpeng, He, Junying, Hu, Haoyuan

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 17.01.2023
    Published in arXiv.org (17.01.2023)
    “… More specifically, we introduce a novel Markov Decision Process (MDP) framework to model the OPA problem, and develop a policy gradient algorithm that adopts …”
    Full text
    Paper
  19.

    Sampling, Communication, and Prediction Co-Design for Synchronizing the Real-World Device and Digital Model in Metaverse by Meng, Zhen, She, Changyang, Zhao, Guodong, De Martini, Daniele

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 31.07.2022
    Published in arXiv.org (31.07.2022)
    “… This work proposes a sampling, communication and prediction co-design framework to minimize the communication load subject to a constraint on tracking the Mean Squared Error (MSE …”
    Full text
    Paper
  20.

    Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach by Wang, Yang, Gao, Zhen

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 02.08.2021
    Published in arXiv.org (02.08.2021)
    “… ), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e …”
    Full text
    Paper