Distributed deep reinforcement learning-based gas supply system coordination management method for solid oxide fuel cell
In order to sustain solid oxide fuel cell (SOFC) net output power and prevent violation of oxygen excess ratio (OER) constraint and fuel utilization (FU) constraint, a data-driven gas supply system coordination management method is proposed. Accordingly, a population evolution-based multi-agent doub...
Gespeichert in:
| Veröffentlicht in: | Engineering applications of artificial intelligence Jg. 120; S. 105818 |
|---|---|
| Hauptverfasser: | , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
Elsevier Ltd
01.04.2023
|
| Schlagworte: | |
| ISSN: | 0952-1976, 1873-6769 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | In order to sustain solid oxide fuel cell (SOFC) net output power and prevent violation of oxygen excess ratio (OER) constraint and fuel utilization (FU) constraint, a data-driven gas supply system coordination management method is proposed. Accordingly, a population evolution-based multi-agent double delay deep deterministic policy gradient (PE-MA4DPG) algorithm is introduced. The artificial intelligence design of the algorithm is guided by the concepts of imitation learning and curriculum learning, whereby different agents of different combinations are trained in different environments, thus improving the robustness of the coordination strategy. In this algorithm, the hydrogen controller and the air controller are treated as two agents. The centralized training enables agents with different objectives to coordinate with each other. The effectiveness of the proposed algorithm is demonstrated in three experiments, wherein the proposed algorithm is compared with a group of existing algorithms.
•A 5kW SOFC gas supply system model considering various operating parameters is proposed.•A data-driven gas supply system coordination management method is proposed.•A novel large-scale deep reinforcement learning algorithm is proposed for this method.•The PE-MA4DPG algorithm proposed is characterized by superior robustness.•The proposed algorithm can guarantee better SOFC stability, performance, and efficiency, whilst satisfying the SOFC constraints. |
|---|---|
| ISSN: | 0952-1976 1873-6769 |
| DOI: | 10.1016/j.engappai.2023.105818 |