Toward the Age in Forwarding: A Deep Reinforcement Learning Enabled Routing Mechanism for Large-Scale Satellite Networks via Spatial-Temporal Graph Neural Networks

In large-scale satellite networks, end-to-end data delivery confronts long propagation latency and dynamical connectivity, heavily challenging current routing strategies due to the time-changing topology. For the tasks with high timeliness requirements, especially, it will cause obviously low freshn...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE Transactions on Networking pp. 1 - 16
Main Authors:	Gao, Ronghao, Zhang, Bo, Zhang, Qinyu, Yang, Zhihua
Format:	Journal Article
Language:	English
Published:	IEEE 15.08.2025
Subjects:	age of information (AoI) Costs deep reinforcement learning (DRL) Feature extraction graph neural network (GNN) Graph neural networks Heuristic algorithms Information age Low earth orbit satellites Network topology Routing Satellite networks Satellites Topology
ISSN:	2998-4157, 2998-4157
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	In large-scale satellite networks, end-to-end data delivery confronts long propagation latency and dynamical connectivity, heavily challenging current routing strategies due to the time-changing topology. For the tasks with high timeliness requirements, especially, it will cause obviously low freshness of data since huge computational cost and transmission overhead are required for implementing optimal routing decisions. To address this issue, in this paper, we propose an Age-driven Deep Reinforcement Learning (DRL) enabled Routing Mechanism (ADRLRM) for the large-scale satellite network by considering the impact of time-varying topology on the timeliness of data, in which a Parameter-adjustable Two-step Slotting Algorithm (PTSA)-assisted Spatial-Temporal Graph Neural Network (STGNN) is well-designed to extract the topological features both in temporal and spatial dimensions. In particular, we develop a novel metric of data freshness called Routing-aware Age of Information (RAoI) to evaluate the timeliness of data delivery, which is incorporated into the customized reward function of the proposed mechanism as an optimization objective. The simulation results indicate that the proposed mechanism performs better in reducing the end-to-end delivery latency, number of forwarding hops, and average RAoI compared with the conventional Open Shortest Path First (OSPF) routing algorithm, Deep Q-Networks based Intelligent Routing (DQN-IR), and GNN-based GraphPR routing algorithm, respectively.
ISSN:	2998-4157 2998-4157
DOI:	10.1109/TON.2025.3597928