Reinforcement Learning for Dynamic Optimization of Eco-Driving in Smart Healthcare Transportation Networks

Smart transportation networks face increasing demands for efficiency and sustainability. This study presents a reinforcement learning approach that optimizes eco-driving strategies for connected and automated vehicles (CAVs) in urban environments, with a particular application to healthcare logistic...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE transactions on intelligent transportation systems s. 1 - 12
Hlavní autoři: Cai, Wang, Anwlnkom, Tomley, Zhang, Lingling, Basheer, Shakila, Yang, Jing
Médium: Journal Article
Jazyk:angličtina
Vydáno: IEEE 2025
Témata:
ISSN:1524-9050, 1558-0016
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Smart transportation networks face increasing demands for efficiency and sustainability. This study presents a reinforcement learning approach that optimizes eco-driving strategies for connected and automated vehicles (CAVs) in urban environments, with a particular application to healthcare logistics. Specifically, we propose a novel approach using reinforcement learning, specifically a twin delayed deep deterministic policy gradient (TD3) algorithm, to dynamically optimize CAV trajectories at signalized intersections. The proposed healthcare eco-driving trajectory optimization (TD3-HETO) model incorporates real-time traffic conditions, signal timing information, and healthcare urgency levels to generate optimal acceleration profiles. The reward function is designed to balance energy efficiency, traffic flow, safety, comfort, and healthcare delivery timeliness. Additionally, the model introduces a dynamic exploration strategy that adapts to healthcare task urgency, enabling efficient balancing between energy consumption and delivery timelines. Experimental results show that TD3-HETO reduces energy consumption by up to 28.7% compared to baseline methods while improving average speeds by 3.7% for urgent healthcare deliveries. The model achieves superior safety performance with 98.7% of time steps showing zero conflicts, compared to 95.3% for the best baseline. TD3-HETO also demonstrates remarkable adaptability to varying traffic demands and signal timings, maintaining consistent performance even at high traffic volumes. This research contributes to developing intelligent transportation systems to enhance environmental sustainability and healthcare accessibility in smart cities, potentially improving patient outcomes and operational efficiency in urban healthcare logistics.
ISSN:1524-9050
1558-0016
DOI:10.1109/TITS.2025.3561034