Distributed deep reinforcement learning-based gas supply system coordination management method for solid oxide fuel cell

Bibliographic details
Published in: Engineering applications of artificial intelligence, Vol. 120, p. 105818
Main authors: Li, Jiawen; Cui, Haoyang; Jiang, Wei
Format: Journal Article
Language: English
Published: Elsevier Ltd, 1 April 2023
ISSN:0952-1976, 1873-6769
Description
Abstract: To sustain solid oxide fuel cell (SOFC) net output power and prevent violations of the oxygen excess ratio (OER) and fuel utilization (FU) constraints, a data-driven gas supply system coordination management method is proposed. Accordingly, a population evolution-based multi-agent double delay deep deterministic policy gradient (PE-MA4DPG) algorithm is introduced. The artificial intelligence design of the algorithm is guided by the concepts of imitation learning and curriculum learning, whereby different agents of different combinations are trained in different environments, thus improving the robustness of the coordination strategy. In this algorithm, the hydrogen controller and the air controller are treated as two agents. The centralized training enables agents with different objectives to coordinate with each other. The effectiveness of the proposed algorithm is demonstrated in three experiments, wherein the proposed algorithm is compared with a group of existing algorithms.
• A 5 kW SOFC gas supply system model considering various operating parameters is proposed.
• A data-driven gas supply system coordination management method is proposed.
• A novel large-scale deep reinforcement learning algorithm is proposed for this method.
• The proposed PE-MA4DPG algorithm is characterized by superior robustness.
• The proposed algorithm can guarantee better SOFC stability, performance, and efficiency, whilst satisfying the SOFC constraints.
DOI:10.1016/j.engappai.2023.105818