Reinforcement-learning-based parallel genetic algorithm for the multi-objective hot-rolling scheduling problem of wide-thick slab
Hot rolling is the core process of steel production, and its scheduling problem is the key to determining the rolling rhythm. Compared with traditional hot-rolling scheduling, hot-rolling scheduling optimisation of wide-thick slabs needs to consider the characteristics of cross rolling and the influ...
Saved in:
| Published in: | Applied soft computing Vol. 179; p. 113355 |
|---|---|
| Main Authors: | , , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Elsevier B.V
01.07.2025
|
| Subjects: | |
| ISSN: | 1568-4946 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Hot rolling is the core process of steel production, and its scheduling problem is the key to determining the rolling rhythm. Compared with traditional hot-rolling scheduling, hot-rolling scheduling optimisation of wide-thick slabs needs to consider the characteristics of cross rolling and the influence of the reheating furnace, which significantly increase the difficulty of problem-solving. In this paper, the characteristics of cross rolling and the influence of the reheating furnace are analysed, and the hot-rolling scheduling problem of the wide-thick slab (WTS-HRSP) is mapped to a multi-objective asymmetric vehicle routing problem (MAVRP) model. Combined with the multi-objective characteristics of MAVRP, a reinforcement-learning-based parallel genetic algorithm (RPGA) was designed, which implements parallel computation of multiple populations based on the master-slave island model, designs three priority rules to initialise populations, and optimises the parameters of the genetic operation using Q-Learning. Based on the rolling data of the wide-thick slab hot-rolling mill of a Chinese iron and steel group, experiments were conducted to compare RPGA with five advanced multi-objective optimisation algorithms, and the results showed that RPGA had better convergence and stability and was more suitable for the WTS-HRSP. The findings can help optimise hot-rolling scheduling.
•The multi-objective hot-rolling scheduling model of wide-thick slab is proposed.•Designing a reinforcement-learning-based parallel genetic algorithm.•Designing a parallel computation based on the master-slave island model.•Designing the priority rules and Q-Learning to improve the genetic algorithm.•Experiments demonstrate the proposed scheduling algorithm has the best performance. |
|---|---|
| ISSN: | 1568-4946 |
| DOI: | 10.1016/j.asoc.2025.113355 |