Broad reinforcement learning based adaptive state transition algorithm for global optimization

The state transition algorithm (STA) is an efficient intelligent optimization method with superior search capabilities in diverse applications, while its key operator selection strategies depend on manual design. The integration of deep reinforcement learning (DRL) with STA offers a promising paradi...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Swarm and evolutionary computation Ročník 97; s. 102038
Hlavní autoři: Du, Yangyi, Zhou, Xiaojun, Yang, Chunhua, Gui, Weihua
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier B.V 01.08.2025
Témata:
ISSN:2210-6502
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:The state transition algorithm (STA) is an efficient intelligent optimization method with superior search capabilities in diverse applications, while its key operator selection strategies depend on manual design. The integration of deep reinforcement learning (DRL) with STA offers a promising paradigm for adaptive selection strategy during optimization. However, conventional DRL methods require extensive training data and iterative model refinement, creating fundamental barriers with limited evaluation budgets. Therefore, this paper proposes a novel STA framework incorporating broad reinforcement learning to develop an adaptive operator selection mechanism. First, the selection strategy is formulated as a Markov decision process, where an agent learns to identify optimal operators based on real-time state. Specifically, environmental states are characterized through systematic landscape analysis derived from population information. Second, a broad learning system replaces neural networks in DRL frameworks. The associated incremental learning mechanism is carefully designed to enhance training efficiency. Third, a Gaussian mixture model-based data augmentation mechanism is proposed to generate sufficient training samples under limited interactions. The proposed method is evaluated using benchmark functions and practical applications, with comparisons against STA variants and other prominent optimization algorithms. Experimental results demonstrate that BRL-STA achieves competitive performance compared with competitors.
ISSN:2210-6502
DOI:10.1016/j.swevo.2025.102038