Search results - "deep deterministic policy gradient algorithm"

  • Showing results 1 - 17 of 17
  1.

    Motion planning for 7-degree-of-freedom bionic arm: Deep deterministic policy gradient algorithm based on imitation of human action
    Author: Li, Baojiang; Qiu, Shengjie; Ye, Haiyan; Guo, Yuting; Wang, Haiyan; Bai, Jibo

    ISSN: 0952-1976
    Published: Elsevier Ltd, 15 January 2025
    “… Meanwhile, bionic arms often require specific programming to be implemented for the subject to initially meet the control requirements, which makes it difficult to match the motion of the bionic arm…”
    Journal Article
  2.

    Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm
    Author: Wu, Junta; Li, Huiyun

    ISSN: 1024-123X, 1563-5147
    Published: Cairo, Egypt: Hindawi Publishing Corporation, 2020
    “…Deep deterministic policy gradient algorithm operating over continuous space of actions has attracted great attention for reinforcement learning…”
    Journal Article
  3.

    Optimization control of the double‐capacity water tank‐level system using the deep deterministic policy gradient algorithm
    Author: Ye, Likun; Jiang, Pei

    ISSN: 2577-8196
    Published: Hoboken: John Wiley & Sons, Inc, 1 November 2023
    Published in: Engineering reports (Hoboken, N.J.) (1 November 2023)
    “…Process control systems are subject to external factors such as changes in working conditions and perturbation interference, which can significantly affect the system's stability and overall performance…”
    Journal Article
  4.

    Robust control for anaerobic digestion systems of Tequila vinasses under uncertainty: A Deep Deterministic Policy Gradient Algorithm
    Author: Mendiola-Rodriguez, Tannia A.; Ricardez-Sandoval, Luis A.

    ISSN: 2772-5081
    Published: Elsevier Ltd, 1 June 2022
    Published in: Digital Chemical Engineering (1 June 2022)
    “… Anaerobic digestion is a complex system subject to external perturbations and parametric uncertainty…”
    Journal Article
  5.

    Wide-Range Variable Cycle Engine Control Based on Deep Reinforcement Learning
    Author: Ding, Yaoyao; Wang, Fengming; Mu, Yuanwei; Sun, Hongfei

    ISSN: 2226-4310
    Published: Basel: MDPI AG, 1 May 2025
    Published in: Aerospace (1 May 2025)
    “… deterministic policy gradient algorithm, and it applies an action space pruning technique to optimize…”
    Journal Article
  6.

    Robust Energy-Efficient DRL-Based Optimization in UAV-Mounted RIS Systems with Jitter
    Author: Salim, Mahmoud M.; Rabie, Khaled M.; Muqaibel, Ali H.

    ISSN: 1089-7798, 1558-2558
    Published: IEEE, 2025
    Published in: IEEE communications letters (2025)
    “… To address this, we reformulate the problem as a deep reinforcement learning (DRL) environment and develop a smoothed softmax dual deep deterministic policy gradient algorithm…”
    Journal Article
  7.

    Deep-Reinforcement-Learning-Based Computation Offloading for Servicing Dynamic Demand in Multi-UAV-Assisted IoT Network
    Author: Lin, Na; Bai, Lu; Hawbani, Ammar; Guan, Yunchong; Mao, Chaojin; Liu, Zhi; Zhao, Liang

    ISSN: 2327-4662
    Published: Piscataway: IEEE, 15 May 2024
    Published in: IEEE internet of things journal (15 May 2024)
    “… The dynamic scheduling and computation offloading of UAVs is the subject of this article. Specifically, we propose a deep deterministic policy gradient algorithm based on a greedy…”
    Journal Article
  8.

    Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
    Author: Guan, Wei; Xi, Zhaoyong; Cui, Zhewen; Zhang, Xianku

    ISSN: 0007-215X, 1845-5859
    Published: Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje, 1 January 2025
    Published in: Brodogradnja (1 January 2025)
    “…An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface…”
    Journal Article Paper
  9.

    Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach
    Author: Wang, Yang; Gao, Zhen

    Published: IEEE, 28 July 2021
    “…), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e…”
    Conference Paper
  10.

    Online Assignment of Satellite Data Transmission Tasks Based on Twin Delayed Deep Deterministic Policy Gradient
    Author: Li, Ke; Xiong, Shunrui; Yang, Mingying; Ma, Sai

    Published: IEEE, 12 April 2024
    “…This study focuses on the scheduling problem of satellite data transmission tasks, where ground station resources are scarce and subject to multiple constraints, such as limited funding, long…”
    Conference Paper
  11.

    Coordinated Scheduling Strategy for Multi-Agent System in Active Distribution Grid based on Deep Reinforcement Learning Method
    Author: Liu, Xuan; Li, Wanbin; Song, Lin; Yao, Zhanfeng; Lu, Tianguang; Bai, Xue; Zhang, Yuqi; Wang, Wenxin; Zheng, Yanan; Liu, Chunxiu

    Published: IEEE, 14 July 2024
    “… First, each subject such as distributed power sources, energy storage devices and flexible loads is taken as an intelligent agent in an active distribution grid, and the real-time state information…”
    Conference Paper
  12.

    Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
    Author: Lin, Bo; Zhong, Yangzheng; Ren, Weiqing

    ISSN: 2331-8422
    Published: Ithaca: Cornell University Library, arXiv.org, 8 April 2024
    Published in: arXiv.org (8 April 2024)
    “…Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology…”
    Paper
  13.

    A deep reinforcement learning approach to assess the low-altitude airspace capacity for urban air mobility
    Author: Asal Mehditabrizi; Samadzad, Mahdi; Sabzekar, Sina

    ISSN: 2331-8422
    Published: Ithaca: Cornell University Library, arXiv.org, 23 January 2023
    Published in: arXiv.org (23 January 2023)
    “… Path planning is a vital subject in urban air mobility which could enable a large number of UAVs to fly simultaneously in the airspace without facing the risk of collision…”
    Paper
  14.

    Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach
    Author: Wang, Yang; Gao, Zhen

    ISSN: 2331-8422
    Published: Ithaca: Cornell University Library, arXiv.org, 2 August 2021
    Published in: arXiv.org (2 August 2021)
    “…), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e…”
    Paper
  15.

    Deterministic Policy Gradients With General State Transitions
    Author: Cai, Qingpeng; Pan, Ling; Tang, Pingzhong

    ISSN: 2331-8422
    Published: Ithaca: Cornell University Library, arXiv.org, 2 October 2018
    Published in: arXiv.org (2 October 2018)
    “…We study a reinforcement learning setting, where the state transition function is a convex combination of a stochastic continuous function and a deterministic…”
    Paper
  16.

    Guidance Design for Escape Flight Vehicle Using Evolution Strategy Enhanced Deep Reinforcement Learning
    Author: Hu, Xiao; Wang, Tianshu; Gong, Min; Yang, Shaoshi

    ISSN: 2169-3536
    Published: Piscataway: IEEE, 2024
    Published in: IEEE access (2024)
    “…Guidance commands of flight vehicles can be regarded as a series of data sets having fixed time intervals, thus guidance design constitutes a typical…”
    Journal Article
  17.

    Guidance Design for Escape Flight Vehicle Using Evolution Strategy Enhanced Deep Reinforcement Learning
    Author: Hu, Xiao; Wang, Tianshu; Gong, Min; Yang, Shaoshi

    ISSN: 2331-8422
    Published: Ithaca: Cornell University Library, arXiv.org, 4 May 2024
    Published in: arXiv.org (4 May 2024)
    “… For the EFV, the objective of the guidance design entails progressively maximizing the residual velocity, subject to the constraint imposed by the given evasion distance…”
    Paper