Optimal dual-model controller of solid oxide fuel cell output voltage using imitation distributed deep reinforcement learning
Solid oxide fuel cell (SOFC) is disadvantaged by significant nonlinearity, which makes it difficult to control output voltage of SOFC and satisfy the constraints of fuel utilization simultaneously. In order to solve this problem, a dual-model control framework (DMCF) is proposed. In particular, ther...
Uloženo v:
| Vydáno v: | International journal of hydrogen energy Ročník 48; číslo 37; s. 14053 - 14067 |
|---|---|
| Hlavní autoři: | , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Elsevier Ltd
30.04.2023
|
| Témata: | |
| ISSN: | 0360-3199, 1879-3487 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | Solid oxide fuel cell (SOFC) is disadvantaged by significant nonlinearity, which makes it difficult to control output voltage of SOFC and satisfy the constraints of fuel utilization simultaneously. In order to solve this problem, a dual-model control framework (DMCF) is proposed. In particular, there are two controllers deployed under this framework, with an PID controller and a supplementary dynamic controller to track the SOFC output voltage. The supplementary dynamic controller is conducive to the stabilization of tracking by adapting to the uncertainties, considering the constraint on fuel utilization. In addition, an imitation distributed deep deterministic policy gradient (ID3PG) algorithm, which integrates imitation learning and distributed deep reinforcement learning to enhance the robustness and adaptive capacity of this framework, is proposed for the supplementary dynamic controller. The simulation results obtained in this work have demonstrated that the proposed framework is effective in imposing control on SOFC output voltage and preventing constraint violations of fuel utilization.
•An ID3PG-PID dual-model control framework is proposed.•A new deep reinforcement learning algorithm is proposed.•ID3PG is introduced for enhancing adaptive capacity.•The proposed framework is effective in preventing constraint violations. |
|---|---|
| ISSN: | 0360-3199 1879-3487 |
| DOI: | 10.1016/j.ijhydene.2022.12.194 |