Virtual power plant containing electric vehicles scheduling strategies based on deep reinforcement learning
•VPP agent and EV charging station agent games to obtain electricity price.•The VPP tends to use mixed strategy, while EVs tend to use pure strategies.•Using Stackelberg game to prevent VPP from obtaining excess profit from EV members. Virtual power plants (VPPs), which aggregate customer-side flexi...
Uloženo v:
| Vydáno v: | Electric power systems research Ročník 205; s. 107714 |
|---|---|
| Hlavní autoři: | , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Amsterdam
Elsevier B.V
01.04.2022
Elsevier Science Ltd |
| Témata: | |
| ISSN: | 0378-7796, 1873-2046 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | •VPP agent and EV charging station agent games to obtain electricity price.•The VPP tends to use mixed strategy, while EVs tend to use pure strategies.•Using Stackelberg game to prevent VPP from obtaining excess profit from EV members.
Virtual power plants (VPPs), which aggregate customer-side flexibility resources, provide an effective way for customers to participate in the electricity market, and provide a variety of flexible technologies and services to the market. Importantly, VPPs can provide services to electric vehicle (EV) charging stations. In this paper, we constructed a deep reinforcement learning (DRL) based Stackelberg game model for a VPP with EV charging stations. Considering the interests of both sides of the game, soft actor-critic (SAC) algorithm is used for the VPP agent and twin delay deep deterministic policy gradient (TD3) algorithm is used for the EV charging station agent. By alternately training the network parameters of the agents, the strategy and solution at the equilibrium of the game are calculated. Results of cases demonstrate that the VPP agent can learn the strategy of selling electricity to EVs, optimize the scheduling of distributed energy resources (DERs), and bidding strategy for participation in the electricity market. Meanwhile, the EV aggregation agent can learn scheduling strategies for charging and discharging EVs. When the EV aggregator uses a deterministic strategy and the virtual power plant uses a stochastic strategy, energy complementarity is achieved and the overall operating economy is improved. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 0378-7796 1873-2046 |
| DOI: | 10.1016/j.epsr.2021.107714 |