Search results - "Policy gradient algorithm"

1

Motion planning for 7-degree-of-freedom bionic arm: Deep deterministic policy gradient algorithm based on imitation of human action by Li, Baojiang; Qiu, Shengjie; Ye, Haiyan; Guo, Yuting; Wang, Haiyan; Bai, Jibo

ISSN: 0952-1976

Published: Elsevier Ltd, 15 January 2025

Published in Engineering applications of artificial intelligence (15 January 2025)
“… Meanwhile, bionic arms often require specific programming to be implemented for the subject to initially meet the control requirements, which makes it difficult to match the motion of the bionic arm …”

Journal Article

2

Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm by Wu, Junta; Li, Huiyun

ISSN: 1024-123X, 1563-5147

Published: Cairo, Egypt: Hindawi Publishing Corporation, 2020

Published in Mathematical problems in engineering (2020)
“… Deep deterministic policy gradient algorithm operating over continuous space of actions has attracted great attention for reinforcement learning …”

Journal Article

3

Optimization control of the double-capacity water tank-level system using the deep deterministic policy gradient algorithm by Ye, Likun; Jiang, Pei

ISSN: 2577-8196

Published: Hoboken: John Wiley & Sons, Inc, 1 November 2023

Published in Engineering reports (Hoboken, N.J.) (1 November 2023)
“… Process control systems are subject to external factors such as changes in working conditions and perturbation interference, which can significantly affect the system's stability and overall performance …”

Journal Article

4

Robust control for anaerobic digestion systems of Tequila vinasses under uncertainty: A Deep Deterministic Policy Gradient Algorithm by Mendiola-Rodriguez, Tannia A.; Ricardez-Sandoval, Luis A.

ISSN: 2772-5081

Published: Elsevier Ltd, 1 June 2022

Published in Digital Chemical Engineering (1 June 2022)
“… Anaerobic digestion is a complex system subject to external perturbations and parametric uncertainty …”

Journal Article

5

Brain-Inspired Deep Meta-Reinforcement Learning for Active Coordinated Fault-Tolerant Load Frequency Control of Multi-Area Grids by Li, Jiawen; Zhou, Tao; Cui, Haoyang

ISSN: 1545-5955, 1558-3783

Published: IEEE, 1 July 2024

Published in IEEE transactions on automation science and engineering (1 July 2024)
“… In addition, this paper proposes a brain-Inspired deep meta-deterministic policy gradient algorithm (BIMA-DMDPG …”

Journal Article

6

Wide-Range Variable Cycle Engine Control Based on Deep Reinforcement Learning by Ding, Yaoyao; Wang, Fengming; Mu, Yuanwei; Sun, Hongfei

ISSN: 2226-4310

Published: Basel: MDPI AG, 1 May 2025

Published in Aerospace (1 May 2025)
“… deterministic policy gradient algorithm, and it applies an action space pruning technique to optimize …”

Journal Article

7

Robust Energy-Efficient DRL-Based Optimization in UAV-Mounted RIS Systems with Jitter by Salim, Mahmoud M.; Rabie, Khaled M.; Muqaibel, Ali H.

ISSN: 1089-7798, 1558-2558

Published: IEEE, 2025

Published in IEEE communications letters (2025)
“… To address this, we reformulate the problem as a deep reinforcement learning (DRL) environment and develop a smoothed softmax dual deep deterministic policy gradient algorithm …”

Journal Article

8

Deep-Reinforcement-Learning-Based Computation Offloading for Servicing Dynamic Demand in Multi-UAV-Assisted IoT Network by Lin, Na; Bai, Lu; Hawbani, Ammar; Guan, Yunchong; Mao, Chaojin; Liu, Zhi; Zhao, Liang

ISSN: 2327-4662

Published: Piscataway: IEEE, 15 May 2024

Published in IEEE internet of things journal (15 May 2024)
“… The dynamic scheduling and computation offloading of UAVs is the subject of this article. Specifically, we propose a deep deterministic policy gradient algorithm based on a greedy …”

Journal Article

9

Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID by Guan, Wei; Xi, Zhaoyong; Cui, Zhewen; Zhang, Xianku

ISSN: 0007-215X, 1845-5859

Published: Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje, 1 January 2025

Published in Brodogradnja (1 January 2025)
“… An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface …”

Journal Article Paper

10

Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach by Wang, Yang; Gao, Zhen

Published: IEEE, 28 July 2021

Published in 2021 IEEE/CIC International Conference on Communications in China (ICCC) (28 July 2021)
“… ), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e …”

Conference Proceeding

11

Online Assignment of Satellite Data Transmission Tasks Based on Twin Delayed Deep Deterministic Policy Gradient by Li, Ke; Xiong, Shunrui; Yang, Mingying; Ma, Sai

Published: IEEE, 12 April 2024

Published in 2024 7th World Conference on Computing and Communication Technologies (WCCCT) (12 April 2024)
“… This study focuses on the scheduling problem of satellite data transmission tasks, where ground station resources are scarce and subject to multiple constraints, such as limited funding, long …”

Conference Proceeding

12

Learning Constrained Resource Allocation Policies in Wireless Control Systems by Lima, Vinicius; Eisen, Mark; Ribeiro, Alejandro

ISSN: 2576-2370

Published: IEEE, 14 December 2020

Published in Proceedings of the IEEE Conference on Decision & Control (14 December 2020)
“… As wireless networks are noisy and subject to packet losses - which might impact the operation of the control system - proper distribution of communication resources among components of the wireless …”

Conference Proceeding

13

Coordinated Scheduling Strategy for Multi-Agent System in Active Distribution Grid based on Deep Reinforcement Learning Method by Liu, Xuan; Li, Wanbin; Song, Lin; Yao, Zhanfeng; Lu, Tianguang; Bai, Xue; Zhang, Yuqi; Wang, Wenxin; Zheng, Yanan; Liu, Chunxiu

Published: IEEE, 14 July 2024

Published in 2024 3rd International Conference on Energy and Electrical Power Systems (ICEEPS) (14 July 2024)
“… First, each subject such as distributed power sources, energy storage devices and flexible loads is taken as an intelligent agent in an active distribution grid, and the real-time state information …”

Conference Proceeding

14

Independent Learning in Constrained Markov Potential Games by Jordan, Philip; Barakat, Anas; He, Niao

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 27 February 2024

Published in arXiv.org (27 February 2024)
“… We propose an independent policy gradient algorithm for learning approximate constrained Nash equilibria …”

Paper

15

Model-free Distortion Canceling and Control of Quantum Devices by Fouad, Ahmed F; Youssry, Akram; El-Rafei, Ahmed; Hammad, Sherif

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 13 July 2024

Published in arXiv.org (13 July 2024)
“… First, in practice the control signals are usually subject to unknown classical distortions that could arise from the device fabrication, material properties and/or instruments generating those signals …”

Paper

16

Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning by Lin, Bo; Zhong, Yangzheng; Ren, Weiqing

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 8 April 2024

Published in arXiv.org (8 April 2024)
“… Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology …”

Paper

17

A deep reinforcement learning approach to assess the low-altitude airspace capacity for urban air mobility by Mehditabrizi, Asal; Samadzad, Mahdi; Sabzekar, Sina

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 23 January 2023

Published in arXiv.org (23 January 2023)
“… Path planning is a vital subject in urban air mobility which could enable a large number of UAVs to fly simultaneously in the airspace without facing the risk of collision …”

Paper

18

A Deep Reinforcement Learning Approach for Online Parcel Assignment by Zeng, Hao; Wu, Qiong; Han, Kunpeng; He, Junying; Hu, Haoyuan

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 17 January 2023

Published in arXiv.org (17 January 2023)
“… More specifically, we introduce a novel Markov Decision Process (MDP) framework to model the OPA problem, and develop a policy gradient algorithm that adopts …”

Paper

19

Sampling, Communication, and Prediction Co-Design for Synchronizing the Real-World Device and Digital Model in Metaverse by Meng, Zhen; She, Changyang; Zhao, Guodong; De Martini, Daniele

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 31 July 2022

Published in arXiv.org (31 July 2022)
“… This work proposes a sampling, communication and prediction co-design framework to minimize the communication load subject to a constraint on tracking the Mean Squared Error (MSE …”

Paper

20

Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach by Wang, Yang; Gao, Zhen

ISSN: 2331-8422

Published: Ithaca: Cornell University Library, arXiv.org, 2 August 2021

Published in arXiv.org (2 August 2021)
“… ), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e …”

Paper

