Real-Time interaction of active distribution network and virtual microgrids: Market paradigm and data-driven stakeholder behavior analysis
2021; Elsevier BV; Volume: 297; Linguagem: Inglês
10.1016/j.apenergy.2021.117107
ISSN1872-9118
AutoresZiqing Zhu, Ka Wing Chan, Siqi Bu, Bin Zhou, Shiwei Xia,
Tópico(s)Optimal Power Flow Distribution
Resumo• Market paradigm for energy and ancillary service trading is proposed. • The bi-level trading and market clearing procedure is formulated. • The bidding game among virtual microgrids is by WoLF-PHC algorithm. • The real-time dispatching model of virtual microgrids considering risk is proposed. • The dynamics of multi-agent reinforcement learning procedure is analysed. In order to incorporate the independent Virtual Microgrids (VMGs) to the real-time operation of upstream active distribution network (ADN), an interactive dispatching model of VMGs and ADN is proposed, in which the downstream VMGs perform self-dispatching while trading both energy and ancillary service procurement to the Distribution System Operator (DSO). The bi-level bidding and market clearing model is modelled as a data-driven Multi-Agent Reinforcement Learning (MARL) with the solution of Win-or-Learn-Fast Policy Hill-Climbing (WoLF-PHC) algorithm, which is an online and fully-distributed training, enabling VMGs to dynamically update their bidding strategies based on previous market clearing results. VMGs would thereafter conduct the economic dispatching considering the conditional value-at-risk (CVaR) of penalties caused by the curtailment of renewables, load loss, and failure of providing energy or ancillary service to DSO. Finally, the evolutionary game theory (EGT) with replication dynamic equations (RDEs) is adopted to analyze the inherent dynamics of the proposed MARL driven by WoLF-PHC, revealing the relation between VMGs’ bidding strategy convergence and the trading paradigm. The case study validates the advancement of computational performance of WoLF-PHC compared with conventional Q-learning in the aspects of convergence and computation speed, and the impact of risk coefficient on the VMGs’ real-time dispatching strategies is also demonstrated.
Referência(s)