Online refueling policy for liner ships with offline learning

•We model online refueling in liner shipping with distribution-free fuel prices as a finite-time MDP via Bellman equation.•We design a Bayesian GM model to predict fuel prices, capturing spatio-temporal port correlations for adaptive refueling.•We propose an approximate dynamic programming method us...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:European journal of operational research
Hlavní autori: Liu, Haitao, Liang, Jinpeng, Li, Liming, Tan, Zhijia
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier B.V 01.11.2025
Predmet:
ISSN:0377-2217
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:•We model online refueling in liner shipping with distribution-free fuel prices as a finite-time MDP via Bellman equation.•We design a Bayesian GM model to predict fuel prices, capturing spatio-temporal port correlations for adaptive refueling.•We propose an approximate dynamic programming method using GPs and dynamic sampling to efficiently learn refueling.•We validate the approach on an Asia-Mediterranean route, showing lower costs and better performance than benchmark. Liner shipping vessels consume substantial bunker fuel to maintain reliable service on fixed routes, yet volatile and spatially-correlated fuel prices across ports pose critical challenges for refueling optimization. This study addresses the online refueling problem where fuel prices are revealed sequentially, requiring real-time decisions on whether and how much to refuel at each port based on current inventory and observed prices. We formulate the problem as a Markov decision process and characterize the optimal policy through the Bellman equation. To capture the complex dynamics of fuel prices, we develop a multivariate Gaussian mixture (GM) model that approximates the joint price distribution across ports, coupled with GM regression for dynamic price prediction at unvisited locations. We propose an approximate dynamic programming framework that leverages Gaussian processes to model value functions and introduces a novel sampling procedure to generate informative training samples for policy learning. Through extensive computational experiments on synthetic datasets as well as empirical data from an Asia-Mediterranean shipping route, we demonstrate that our proposed policy achieves superior performance relative to established benchmark policies. Our findings reveal the GM model’s exceptional capability in capturing complex fuel price dependencies and volatility patterns, while the integration of predictive learning within the optimization framework yields significant cost reductions. This work contributes a theoretically grounded, data-driven methodology for dynamic refueling optimization under uncertainty, offering significant implications for operational efficiency and strategic fuel management in maritime logistics.
ISSN:0377-2217
DOI:10.1016/j.ejor.2025.11.013