Inventory replenishment and fulfilment decisions for an omnichannel retailer: a reinforcement learning-based method.

Uloženo v:
Podrobná bibliografie
Název: Inventory replenishment and fulfilment decisions for an omnichannel retailer: a reinforcement learning-based method.
Autoři: Kolyaei, Maryam, Zhang, Lele, Blom, Michelle
Zdroj: International Journal of Production Research; Dec2025, Vol. 63 Issue 24, p9571-9592, 22p
Témata: REINFORCEMENT learning, INVENTORY control, STOCK management (Finance), RETAIL industry, PROFIT maximization, ONLINE shopping, MARKOV processes
Abstrakt: We address the replenishment and fulfilment challenges faced by an omnichannel retailer within a capacitated retail network, selling products to a large region across a multi-period horizon. This horizon is partitioned into cycles, where replenishment occurs at the start of each cycle and fulfilment decisions regarding how much to replenish and allocate across sales channels take place in each time period. Our model considers Click and Collect (C&C) – also known as Buy Online and Pick-up in Store (BOPS) – as well as ship-from-store strategies with the aim of maximising the retailer's expected total profit. The problem is formulated as a Markov Decision Process (MDP). To solve the MDP, a tailored Proximal Policy Optimisation (PPO) algorithm, a form of Deep Reinforcement Learning (DRL), is adopted. We conduct experiments across varying product and store numbers, store capacities, and demand variability to evaluate the performance and robustness of our approach. Furthermore, we evaluate the impact of different demand patterns by first training decision-making policies on specific patterns and then testing them on alternative patterns. Our results reveal that the tailored approach effectively handles high-dimensional decision-making, different demand patterns, uncertainty, and constrained capacity environments while improving profitability compared to baseline methods. [ABSTRACT FROM AUTHOR]
Copyright of International Journal of Production Research is the property of Taylor & Francis Ltd and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Databáze: Complementary Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edb&genre=article&issn=00207543&ISBN=&volume=63&issue=24&date=20251215&spage=9571&pages=9571-9592&title=International Journal of Production Research&atitle=Inventory%20replenishment%20and%20fulfilment%20decisions%20for%20an%20omnichannel%20retailer%3A%20a%20reinforcement%20learning-based%20method.&aulast=Kolyaei%2C%20Maryam&id=DOI:10.1080/00207543.2025.2520596
    Name: Full Text Finder
    Category: fullText
    Text: Full Text Finder
    Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif
    MouseOverText: Full Text Finder
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Kolyaei%20M
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edb
DbLabel: Complementary Index
An: 189933513
RelevancyScore: 1162
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 1162.15161132813
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Inventory replenishment and fulfilment decisions for an omnichannel retailer: a reinforcement learning-based method.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Kolyaei%2C+Maryam%22">Kolyaei, Maryam</searchLink><br /><searchLink fieldCode="AR" term="%22Zhang%2C+Lele%22">Zhang, Lele</searchLink><br /><searchLink fieldCode="AR" term="%22Blom%2C+Michelle%22">Blom, Michelle</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: International Journal of Production Research; Dec2025, Vol. 63 Issue 24, p9571-9592, 22p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22REINFORCEMENT+learning%22">REINFORCEMENT learning</searchLink><br /><searchLink fieldCode="DE" term="%22INVENTORY+control%22">INVENTORY control</searchLink><br /><searchLink fieldCode="DE" term="%22STOCK+management+%28Finance%29%22">STOCK management (Finance)</searchLink><br /><searchLink fieldCode="DE" term="%22RETAIL+industry%22">RETAIL industry</searchLink><br /><searchLink fieldCode="DE" term="%22PROFIT+maximization%22">PROFIT maximization</searchLink><br /><searchLink fieldCode="DE" term="%22ONLINE+shopping%22">ONLINE shopping</searchLink><br /><searchLink fieldCode="DE" term="%22MARKOV+processes%22">MARKOV processes</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: We address the replenishment and fulfilment challenges faced by an omnichannel retailer within a capacitated retail network, selling products to a large region across a multi-period horizon. This horizon is partitioned into cycles, where replenishment occurs at the start of each cycle and fulfilment decisions regarding how much to replenish and allocate across sales channels take place in each time period. Our model considers Click and Collect (C&C) – also known as Buy Online and Pick-up in Store (BOPS) – as well as ship-from-store strategies with the aim of maximising the retailer's expected total profit. The problem is formulated as a Markov Decision Process (MDP). To solve the MDP, a tailored Proximal Policy Optimisation (PPO) algorithm, a form of Deep Reinforcement Learning (DRL), is adopted. We conduct experiments across varying product and store numbers, store capacities, and demand variability to evaluate the performance and robustness of our approach. Furthermore, we evaluate the impact of different demand patterns by first training decision-making policies on specific patterns and then testing them on alternative patterns. Our results reveal that the tailored approach effectively handles high-dimensional decision-making, different demand patterns, uncertainty, and constrained capacity environments while improving profitability compared to baseline methods. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of International Journal of Production Research is the property of Taylor & Francis Ltd and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=189933513
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1080/00207543.2025.2520596
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 22
        StartPage: 9571
    Subjects:
      – SubjectFull: REINFORCEMENT learning
        Type: general
      – SubjectFull: INVENTORY control
        Type: general
      – SubjectFull: STOCK management (Finance)
        Type: general
      – SubjectFull: RETAIL industry
        Type: general
      – SubjectFull: PROFIT maximization
        Type: general
      – SubjectFull: ONLINE shopping
        Type: general
      – SubjectFull: MARKOV processes
        Type: general
    Titles:
      – TitleFull: Inventory replenishment and fulfilment decisions for an omnichannel retailer: a reinforcement learning-based method.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Kolyaei, Maryam
      – PersonEntity:
          Name:
            NameFull: Zhang, Lele
      – PersonEntity:
          Name:
            NameFull: Blom, Michelle
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 15
              M: 12
              Text: Dec2025
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 00207543
          Numbering:
            – Type: volume
              Value: 63
            – Type: issue
              Value: 24
          Titles:
            – TitleFull: International Journal of Production Research
              Type: main
ResultId 1