Inventory replenishment and fulfilment decisions for an omnichannel retailer: a reinforcement learning-based method.
Saved in:
| Title: | Inventory replenishment and fulfilment decisions for an omnichannel retailer: a reinforcement learning-based method. |
|---|---|
| Authors: | Kolyaei, Maryam, Zhang, Lele, Blom, Michelle |
| Source: | International Journal of Production Research; Dec2025, Vol. 63 Issue 24, p9571-9592, 22p |
| Subject Terms: | REINFORCEMENT learning, INVENTORY control, STOCK management (Finance), RETAIL industry, PROFIT maximization, ONLINE shopping, MARKOV processes |
| Abstract: | We address the replenishment and fulfilment challenges faced by an omnichannel retailer within a capacitated retail network, selling products to a large region across a multi-period horizon. This horizon is partitioned into cycles, where replenishment occurs at the start of each cycle and fulfilment decisions regarding how much to replenish and allocate across sales channels take place in each time period. Our model considers Click and Collect (C&C) – also known as Buy Online and Pick-up in Store (BOPS) – as well as ship-from-store strategies with the aim of maximising the retailer's expected total profit. The problem is formulated as a Markov Decision Process (MDP). To solve the MDP, a tailored Proximal Policy Optimisation (PPO) algorithm, a form of Deep Reinforcement Learning (DRL), is adopted. We conduct experiments across varying product and store numbers, store capacities, and demand variability to evaluate the performance and robustness of our approach. Furthermore, we evaluate the impact of different demand patterns by first training decision-making policies on specific patterns and then testing them on alternative patterns. Our results reveal that the tailored approach effectively handles high-dimensional decision-making, different demand patterns, uncertainty, and constrained capacity environments while improving profitability compared to baseline methods. [ABSTRACT FROM AUTHOR] |
| Copyright of International Journal of Production Research is the property of Taylor & Francis Ltd and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Complementary Index |
Be the first to leave a comment!
Full Text Finder
Nájsť tento článok vo Web of Science