Fractional-Order Optimal Control and FIOV-MASAC Reinforcement Learning for Combating Malware Spread in Internet of Vehicles
Internet of Vehicles (IoV) is gradually becoming popular, but it also brings more opportunities for malware intrusion. The intrusion of malware into IoV will cause a series of security issues and increase the incidence of road accidents. Therefore, the suppressing measures to combat the spread of ma...
Saved in:
| Published in: | IEEE transactions on automation science and engineering Vol. 22; pp. 10313 - 10332 |
|---|---|
| Main Authors: | , , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
IEEE
01.01.2025
|
| Subjects: | |
| ISSN: | 1545-5955, 1558-3783 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Internet of Vehicles (IoV) is gradually becoming popular, but it also brings more opportunities for malware intrusion. The intrusion of malware into IoV will cause a series of security issues and increase the incidence of road accidents. Therefore, the suppressing measures to combat the spread of malware in IoV will be fundamental and urgent. To address this critical issue, this paper proposes a fractional-order IoV (FIOV) to investigate malware propagation patterns in Road Side Unit (RSU) and Vehicles. To accurately reflect the actual spread of malware, the traffic density, the channel fading and the actual connectivity are considered in mathematical model. Then, the model-based optimal treatment and quarantine control strategy is derived by optimal control theory. Additionally, a novel model-free FIOV multi-agent soft actor-critic (FIOV-MASAC) approach is first proposed to suppress the malware propagation in IoV. Simulation experiments demonstrate that the proposed FIOV-MASAC approach exhibits better learning ability compared to other reinforcement learning (RL) algorithms.Note to Practitioners-Frequent attacks by malware on IoV are recognized as being challenging to prevent, with these attacks posing threats to data security and potentially resulting in traffic accidents and vehicle malfunctions. In response, a novel mathematical model has been introduced within this study to better predict the propagation trends of malware in IoV, effectively managing its spread within the vehicular network systems. While RL methods have been extensively utilized in the domain of control systems, it is noted that current RL methods depend on rich experience pools, rendering them inapplicable to more complex systems without adaptation. To address this, an effective and pragmatic RL algorithm has been devised in this study. This algorithm, devoid of the requirement for complex model establishment, is capable of intelligently learning and adjusting to the sophisticated environment of IoV, thereby effectively countering the propagation of malware. It should be highlighted that the RL method proposed herein is applicable to the majority of epidemic systems, enabling the achievement of stable control while substantially minimizing control expenditures. The integration of this method is anticipated to augment the security and robustness of IoV in the face of malware attacks. |
|---|---|
| ISSN: | 1545-5955 1558-3783 |
| DOI: | 10.1109/TASE.2024.3521614 |