Learn to Bid as a Price-Maker Wind Power Producer
Shobhit Singhal, Marta Fochesato, Liviu Aolaritei, Florian Dörfler
TL;DR
The paper tackles revenue optimization for price-maker wind power producers in short-term markets, where bids influence both dispatch and prices. It introduces a delayed-feedback Lipschitz contextual multi-armed bandit approach that uses contextual forecasts to learn bidding decisions, with a formal regret bound and a practical market-simulation framework built on Nord Pool and ENTSO-E data. Key contributions include reformulating the price-maker bidding problem as a context-dependent stochastic program suitable for CMAB, adapting the algorithm to delayed feedback, and demonstrating empirical gains over benchmarks in a German market setting. The work offers a practical, scalable method for WPPs to exploit price-maker effects and informs market participants on how contextual information can improve bidding performance in real-time and day-ahead markets.
Abstract
Wind power producers (WPPs) participating in short-term power markets face significant imbalance costs due to their non-dispatchable and variable production. While some WPPs have a large enough market share to influence prices with their bidding decisions, existing optimal bidding methods rarely account for this aspect. Price-maker approaches typically model bidding as a bilevel optimization problem, but these methods require complex market models, estimating other participants' actions, and are computationally demanding. To address these challenges, we propose an online learning algorithm that leverages contextual information to optimize WPP bids in the price-maker setting. We formulate the strategic bidding problem as a contextual multi-armed bandit, ensuring provable regret minimization. The algorithm's performance is evaluated against various benchmark strategies using a numerical simulation of the German day-ahead and real-time markets.
