电力市场环境下的购销竞价策略是一个复杂的动态问题,传统的数学解析方法很难对其进行分析计算,这在中长期交易时间段尤为突出。文章提出了以最优潮流为基础的双层中长期最优竞价策略学习模型。采用启发式动态规划方法,将动态规划的策略行为和奖惩因子迭代过程看作智能体(agent),将外界不确定因素当作该智能体的“外部环境”。在所处的环境条件下,agent通过评价环境作出判断来选择可行的策略方案,通过学习以往的报价经验和对手的行为来指导自身达到最优生产效益。在标准IEEE5节点6支路系统中进行实例计算,证明了所提出方法的适应性和优越性。
The bidding strategy for electricity purchase and sale in electricity market environment is a complex dynamic problem that is especially prominent in the duration of medium- and long-term transaction, so it is hard for traditional mathematical analysis method to analyze and calculate such a problem. An optimal power flow based learning model for two-layer medium- and long-term optimal bidding strategy is proposed. Using heuristic dynamic programming method, the strategic behavior of dynamic programming and iteration of reward and penalty factors are regarded as agent and the external uncertain factors as the external environment of the agent. Under this condition, through the assessment and judgment on environment, the agent chooses feasible strategic scheme guides itself to attain optimal production revenue by learning former bidding experiences and the behavior of competitors. Taking IEEE 5-bus 6-branch test system as the case, calculation results prove that the proposed method is adaptive and superior.