Next Article in Journal
Time–Energy Correlation for Multithreaded Matrix Factorizations
Previous Article in Journal
A SOC Correction Method Based on Unsynchronized Full Charge and Discharge Control Strategy in Multi-Branch Battery System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dynamic Overlapping Coalition Formation in Electricity Markets: An Extended Formal Model

Group Digitalized Energy Systems, Carl von Ossietzky University of Oldenburg, 26129 Oldenburg, Germany
*
Author to whom correspondence should be addressed.
Energies 2023, 16(17), 6289; https://doi.org/10.3390/en16176289
Submission received: 31 July 2023 / Revised: 24 August 2023 / Accepted: 25 August 2023 / Published: 29 August 2023
(This article belongs to the Section C: Energy Economics and Policy)

Abstract

:
The future power system will be characterized by many small decentralized power plants—so-called distributed energy resources (DERs). The integration of these DERs is vital from an economic and grid operation point of view. One approach to this is the aggregation of such DERs. The formation of coalitions as an aggregation method has already been examined in the literature and applied in virtual power plants, active distribution networks, and microgrids. The spread of DERs also increases the need for flexibility and dynamics in the power grid. One approach to address this can be overlapping coalitions. Therefore, in this paper, we first performed an analysis of related work and, in this context, found no work on overlapping coalitions for energy use cases in the literature. We then described a method for dynamic coalition formation, called dynamic coalition in electricity markets (DYCE), and analyzed how DYCE would need to be extended to include overlapping coalition formation. The extension includes the phases of product portfolio optimization and the actual coalition formation. Our analysis of DYCE shows that the methods used for the optimization of the DYCE sub-tasks are not suitable for overlapping coalitions and would have to be replaced by other methods in order to be able to form overlapping coalitions.

1. Introduction

In order to achieve a sustainable future for our planet, the power grid is currently undergoing a transformation from a system with a few large energy generators (such as gas or coal-fired power plants) to a system based on distributed generation (DG), mainly based on distributed energy resources (DERs) and renewable energy sources (RESs) [1,2]. DERs are small- or medium-sized plants—typically under 10 MW in capacity. They are connected directly to the distribution grid and are allocated close to the load. They can consist of fuel generators, solar arrays, small wind farms, or battery energy storage systems (BESSs) [3].
With the further increase in DERs and a simultaneous dismantling of traditional generation plants, the power system will also face some challenges. The unpredictability and spatial dispersion of power production by DG systems will increase the complexity of system balancing (in spatial and temporal terms). Further challenges arise from the fact that DERs are mainly connected to low- and medium-voltage distribution systems, which will lead to a paradigm shift in power systems. In “traditional” power systems, a top-down philosophy typically prevails, which implies that power generation is mainly in the form of centrally controlled large power plants in the high-voltage grid. This development implies increased complexity in distribution grid management but also creates new opportunities and needs for overall system optimization. It allows distribution grids to participate in system operation actively and enables DERs to participate in distribution grid management actively. For example, they can provide ancillary services (AS) to the power grid, including balancing energy, frequency regulation, voltage support, congestion management, and reserves [3,4].
At the same time, integrating DERs creates new planning problems for system operators, as the system’s complexity increases significantly due to the many new plants. Operating such a decentralized and dynamic infrastructure will require the ability to solve large-scale problems in real time with hundreds of thousands of DERs simultaneously online [5]. For example, information from a multitude of plants must be collected, processed, and the plants must be planned and controlled afterwards. One approach to reducing this technical and operational complexity is to aggregate DERs into pools of plants.
The aggregation of DERs leads to a reduction in the technical and operational complexity and provides a significant amount of DER capacity to participate in electricity markets by turning many small entities into large market participants. It also enables integrated risk management along with advanced forecasting, resulting in an overall increase in combined reliability and addressing the requirements for the provision of electricity products by DERs [6,7]. Through the aggregation of generation or consumption potential, small-scale participants can meet the minimum quantity requirements of electricity wholesale markets, enabling them to engage in the trading of products surpassing their individual capacities. In addition, aggregating assets with different reliability levels or complementary capabilities provides the opportunity to reduce or offset unpredictable impacts. As a result, product fulfillment reliability increases, leading to cost savings for members. By pooling resources and targeting coordination among members, a collective pool enables a greater degree of adaptability in the face of changing market dynamics. This adaptability, in turn, enables members to compete more effectively in markets [8,9]. With respect to typical power plant characteristics such as minimum/maximum capacity and ramp criteria, aggregated DERs can behave like conventional power plants and participate in markets by selling power and AS or offer their flexibility [7,8].
The formation of coalitions is a concept from game theory and multi-agent system (MAS) research that can be part of a virtual power plant (VPP), microgrid (MG), or active distribution network (ADN) to allow operators to obtain a short-term, robust, and scalable aggregation method through a distributed design, taking into account the requirement for reliable product delivery as well as the increasing dynamics in the markets [9,10]. The coalition formation (CF) is performed in a decentralized manner by self-organized agents, with each agent taking control of one or more DER units. Thus, the agents act as representatives of the DER units. As part of the DER control, agents then form coalitions with other agents to maximize the utility of the units they control. For simplicity, in the further course, it will always be assumed that an agent controls only one DER unit. In the CF process, the agents dynamically group into coalitions, e.g., on a product basis, such that all DERs fulfill the same product. Aggregating DERs into coalitions offers several advantages. Thus, coalitions can be formed in a completely decentralized manner by agents, which takes into account the decentralization of the future energy system. In addition, various constraints regarding the formation of coalitions can be considered, for example, the provision of local energy products such as the provision of reactive power by DERs. Topological information of DERs can be explicitly considered in CF to take advantage of the aggregation of small plants while serving local power products [9]. Finally, CF as an aggregation concept can reduce the uncertainty about expected energy output [11]. An example of aggregation of DERs using CF is shown in Figure 1. In this example, different types of DERs in a low-voltage grid are controlled by agents and are aggregated into coalitions, e.g., to provide various types of electricity products.
CF in MGs aims to optimize energy sharing and management among different entities within the MG. By forming coalitions, these entities can work together more efficiently, share resources, and achieve common objectives. In an ADN, CF aims to optimize the use of DERs and improve the overall efficiency, reliability, and flexibility of the grid. The key benefits of CF in MGs and ADNs are optimal resource utilization, improved resilience and reliability, cost optimization, grid decentralization, and islanded operation.
VPPs are long-established aggregation concepts in which a group of DERs is aggregated, optimized, coordinated, and controlled to behave like a single resource. The VPP can function as a dispatchable unit in power system operations and as a tradeable unit in energy markets [12]. While some concepts on dynamic [13,14], hierarchical [15], holonic [16], and self-organizing [17,18] VPPs have been presented in the literature, VPPs in the field still are typically static systems with a fixed pool of assets [19]. This fixed pool is fully aggregated and marketed for a single purpose at a time, with the result that the flexibility potential of the assets cannot be fully exploited. However, flexibility plays a vital role in the future power system and, therefore, will become more economically viable in the future [20,21]. Hence, it is essential to develop and investigate concepts that exploit the full potential of plant flexibility while meeting the requirements for reliability in providing energy products provided by DERs.
The integration of CF into a VPP as an extended aggregation concept enables multiple products to be offered at one point in time through all the assets of a VPP. However, other concepts are needed to exploit the full flexibility potential of DER plants and, thus, make these aggregation concepts ready for use in the field. For example, to achieve the economical operation of a BESS and to use its full flexibility potential, multi-purpose usage of BESSs is necessary [22,23,24].
A concept that enables the provision of multiple products at one time is the model of cooperative games with overlapping coalitions, which can be described as an overlapping coalition formation (OCF) game [25,26]. This concept allows one DER plant to be in multiple coalitions at one time, which also allows that plant to fulfill various products simultaneously.
All approaches to the formation of coalitions as an aggregation method presented so far consider only disjoint coalitions, and, therefore, limit the approach, especially with regard to the use of flexibility potentials. The contribution of the paper is twofold. On the one hand, an extended formal model for OCF in the energy domain is presented. This formal model provides a clear and precise representation of the underlying system (the energy system), the process of OCF, and the underlying concepts from game theory. On the other hand, based on the formal model, an approach for dynamic CF with overlapping coalitions is presented. For this purpose, the requirements regarding the extensions for overlapping coalitions are discussed on the basis of an already existing approach and the extensions are outlined afterwards.
The remainder of this paper is structured as follows. In the following Section 2, we review the related work on the aggregation of DERs, including overlapping and non-overlapping CF in the energy domain. In Section 3, we will present a formal model for CF and show new formalizations to extend the model to overlapping CF. Using the formalizations from the previous section, in Section 4, we present a model for the dynamic formation of coalitions from the literature and discuss where the model would need to be extended to allow overlapping coalitions to be formed. Finally, the paper closes with a summary and outlook in Section 5.

2. Aggregation of Distributed Energy Resources

Due to the increasing number of DERs and the resulting challenges, the topic of the aggregation of DERs has gained increasing scientific attention in this decade. As a result, there is also a large body of work describing various aggregation concepts based on the formation of coalitions. Therefore, for the following literature review, Section 2.1 first introduces the criteria by which the literature comparison was conducted. Then, in Section 2.2, the relevant papers are described and compared based on the criteria. Finally, a summary of the literature review is given in Section 2.3.

2.1. Literature Review Criteria

The challenge of generating a coalition structure (CS), referred to as the coalition structure generation (CSG) problem, represents a fundamental abstraction of a key issue within MAS [27]. This issue revolves around how agents can organize themselves into cooperative groups to enhance their collective performance. Consequently, CF is a broad academic domain that garners substantial attention, particularly in game theory [10] and MAS research [28]. Nonetheless, deriving optimal CS configurations within MAS poses significant computational complexities [29], spurring researchers to devise an array of algorithms and heuristics to tackle this difficulty [30,31,32]. In addition to research on the process of CF, the equitable distribution of gains [33], and stability states of CS [34], there are also already some applications of CF as an aggregation method of DERs in the energy domain.
Concerning applications addressing the aggregation of DERs based on the concept of CF, a distinction can be made between different characteristics, which are briefly described below and then used to compare different approaches, as summarized in Table 1. The work presented can be distinguished as to the CF’s objective, e.g., the reliable provision of electricity products or the economic optimization of a coalition of DERs, leading to different requirements regarding uncertainty handling or guarantees. Furthermore, a distinction can be made in how coalitions are formed, how the CS is created, i.e., how the membership of an agent in a coalition is determined. The CS can be initiated endogenously, e.g., by initiator agents and built up in a process, or the CS initiation can be exogenous, by an external central entity. This also concerns the coalition value calculation mechanism, which can be executed in a distributed manner by the agents or calculated centrally. It can also be distinguished how the problem of finding an optimal CS can be solved. For example, this can be formulated as an optimization problem and solved by suitable solvers [35], or learned by learning algorithms [36]. It can also be distinguished by whether the CF is temporally static or dynamic [37], that is, whether the CS is fixed or whether the structure can change continuously, even after the CSG process. The presented approaches also differ in how the topology of the power grid is reflected in the CSG concept [38], which is necessary for specific application purposes, such as the provision of reactive power. In addition, the approaches can be distinguished by whether the flexibility of the individual systems is taken into account in the CF process and whether a reliability assessment or reliability requirements about the coalitions are taken into account. Finally, the approaches can be distinguished in terms of whether overlapping coalitions are allowed when creating a CS.

2.2. Coalition Formation Literature

In [35], Bremer and Lehnhoff present a decentralized CF approach based on the combinatorial optimization heuristics for distributed agents (COHDA) for predictive scheduling with the characteristic of keeping all information about the local decision base and local operational constraints private. Two interlocking optimization processes orchestrate an overall procedure that adapts a CS to fit a given set of energy products best while considering the DER flexibilities. The approach is then evaluated in several simulation scenarios with different types of established DER integration models and extended to the induced surplus distribution use case. During the CF process with COHDA, no topological information about the power system or the coalition reliability is considered, and the results of CF are disjoint coalitions.
Lehnhoff et al. present a methodology in [39] to uncover decentralized coalitions comprising units operated by agents capable of providing frequency response reserve to meet specific reliability goals. In their approach, they introduce the concepts of base and core coalitions, wherein the core coalition agents actively partake in providing AS. The task of identifying the most suitable agent coalitions is cast as an optimization problem. The goal is to select a core coalition with the longest operational duration and minimal costs, guided by the attributes of the agents while ensuring a predefined level of reliability. To tackle this challenge, they employ a multi-objective optimization approach facilitated by solvers such as IBM ILOG CPLEX. It is important to note, however, that this approach does not account for power plant flexibilities and exclusively forms non-overlapping coalitions.
The doctoral thesis of Beer [9] introduces a game-theoretic framework for non- overlapping CF named dynamic coalition formation in electricity markets, abbreviated as DYCE. This approach, centered around agents, addresses dynamic CF within electricity markets. The core objective of DYCE is to optimize distributed energy resources (DERs) by efficiently fulfilling aggregated power products. What sets DYCE apart from existing aggregation methods is its fully distributed nature, which takes into account the individual preferences of units as well as the broader supply system. DYCE incorporates the grid’s topological information, facilitating the trading of electricity products localized to specific areas in the power grid. Since coalitions are structured to fulfill singular products, they exhibit temporal flexibility to adapt to changing circumstances. Once a successful trade of a product occurs, the surplus gained is shared equitably among the coalition members, guided by a game-theoretic distribution model. The process of CF is decentralized in DYCE, initiating with agents forming individual coalitions and progressively evolving these coalitions to better fulfill their intended products. This iterative process involves agents regrouping based on a heuristic approach. Inter-agent communication adheres to the ContractNet protocol, with each agent aiming to maximize the average coalition value, thus contributing to the overall value of the CS. Due to the inherent complexity of the problem, the implemented heuristics prevent the formation of overlapping coalitions. While flexibility is introduced in the formal model of DYCE, it is not considered further for CF.
Another approach for dynamic CF of DERs is presented in [40] by Ossowski et al. Finding an optimal CS is first modeled as a dynamic CF game. The actual CF process is achieved by opportunistic aggregation of self-organized agents using a negotiation mechanism, which makes the approach fully distributed. The formation process consists of three phases: coalition initiation, provider aggregation phase, and consumer aggregation phase. The formation and coalition self-adaptation mechanisms introduced perform an open-ended adaptation of groups of organizational agents, converging towards stable configurations. In the context of the experiments conducted, no information about the grid topology and only non-overlapping coalitions were considered. While the approach is motivated by maximizing reliability and the benefits of flexibility, neither is explicitly considered in the approach presented by the authors.
In their work [41], Shabanzadeh et al. introduce a medium-term model for CF involving diverse DERs within a VPP. The main aim is to maximize the anticipated profit of the VPP. Their model comprises two distinct stages: The initial stage focuses on optimal decision making concerning the selection of DERs for VPP coalitions, the agreed-upon quantities through bilateral negotiation, and the determination of the type and quantity of forward market contracts. The subsequent stage involves decisions based on the most likely outcomes of stochastic pricing in the day-ahead market. Both stages are orchestrated by a VPP manager, leading to predetermined CS. The authors frame the CSG problem as a mixed-integer programming challenge that can be tackled using high-performance solvers. The approach they propose solely accommodates non-overlapping coalitions. No consideration is given to incorporating information about the power grid’s topology during the coalition formation process. Furthermore, neither the flexibilities of DERs nor the reliability of coalitions is taken into account within their proposed framework.
In [42], Valinejad et al. introduce a CF approach tailored for microgrids (MGs) equipped with DERs and BESSs in energy markets. The authors present a hierarchical structure consisting of two levels to design and plan the integration of DERs and BESS within home-microgrids (H-MGs). Their approach leverages demand response strategies to harness the flexibility potential within these systems. At the upper level of this hierarchical structure, a competitive scenario unfolds among H-MGs, aiming to maximize their individual and collective earnings through the process of CF with other H-MGs. To realize this, the upper-level problem is deconstructed into a series of lower-level market-clearing challenges. This comprehensive model addresses both electricity and heat markets concurrently. Subsequently, the intricate algorithm of this bilevel hierarchical framework is resolved by transforming it into an equivalent single-level model, utilizing the Karush–Kuhn–Tucker optimization conditions. The approach is adept at accommodating various forms of DERs but does not cater to coalitions with overlapping memberships. The quest for an optimal CS is formulated as an optimization problem, which is tackled using a central solver. The findings underscore that the establishment of coalitions among H-MGs within a grid not only influences the scheduling and control of generated power but also has repercussions on consumer behavior and demand within demand response initiatives. No topology information was used during the CF process, and the reliability was not assessed.
In [43], Vinyals et al. present an approach using social relationships of customers to organize them into coalitions of virtual electricity consumers to gain discounts through the collective buying of electricity. Therefore, the problem of finding an optimal CS is modeled as a coalitional game, and an algorithm based on linear programming to form coalitions is provided. The CF process consists of three differentiated activities determining which coalitions can be formed and the coalition value calculation, finding a CS with maximal value, and dividing the payments generated among the coalition members. Multiple agents calculate the coalition values while creating a CS is solved using an integer programming approach by a single entity. No overlapping coalitions are considered, no information about the topology of the power system is considered during the formation of the coalitions, and neither the plant’s flexibilities nor the reliability of the resulting coalitions are considered in their approach.
An approach to finding optimal large-scale CS in smart grids based on hierarchical population-based learning is proposed by Lee et al. [44]. The objective of the CF is the aggregation to reduce fluctuations in RES feed-in. To form the coalitions, a Top k merit weighting population-based incremental learning (PBIL-MW) algorithm is used, which can consider many plants in the CF process. The learning algorithm is part of a central agent that subsequently specifies a CS. Different types of plants, such as BESS, PV plants, or wind turbines, were considered as part of the experiments. In the experiments conducted, the authors show that the learning-based approach presented scales well in terms of runtime for solving the CSG up to a number of 80 agents, whereas previous methods based on dynamic programming scale only up to a size of 20 agents. The CS is formed only from disjoint coalitions, and no topological information, flexibility, or reliability are considered in the formation process.
Another learning-based approach is proposed by Sadeghi et al. in [45]. Coalitions are formed to address the problem of minimizing costs in MG communities while considering the dynamic nature of the system. The objective of one coalition is to efficiently group buyers and sellers in a way that minimizes the cost of energy transactions compared to buying or selling energy to the macrogrid. To create suitable CS, the deep-learning-based algorithm deep Q-learning-based CF (DQN-CF) is introduced. Only households in the MG are considered for CF, so the approach only allows for homogeneous plants. In the approach presented, plant flexibility and reliability of coalitions are not considered.

2.3. Summary

In Section 2.1, we first described various criteria for reviewing CF approaches from the literature. In Section 2.2, different approaches to aggregate DERs by CF were presented and compared based on the criteria introduced before. An overview of the presented CF approaches is given in Table 1.
In the brief literature review, we examined nine papers in the context of CF for power plant aggregation. The literature review shows that the objectives of CF are mostly to minimize costs or reduce the variability in RES. In four of the nine approaches, the CS is centrally or externally specified (exogenous). In three approaches, the CS is formed decentrally, and one approach pursues both. Finding a CS is usually defined in the papers as an optimization problem and solved by, e.g., a heuristic. Most approaches do not consider asset flexibility for CF (six of nine), and only one paper considers asset and coalition reliability. None of the approaches allow for the formation of overlapping coalitions. During the brief literature search, no work on OCF in the energy context could be found either.

3. Formalizing Overlapping Coalition Formation

In this section, some formalizations for CF are introduced, which will later serve as the basis for describing a method for CF. For this purpose, basic formalizations of CF in the context of CPES are first introduced in Section 3.1. In Section 3.2, these formalizations are extended with respect to overlapping coalitions.

3.1. Basic Formal Model

The following definitions are mostly taken from [9,13] and modified by us. The formalizations can be divided into four domains: physical device representation in Section 3.1.1, coalition in Section 3.1.2, product in Section 3.1.3, and optimization problem in Section 3.1.4.

3.1.1. Domain 1—Physical Device Representation

The power grid is relevant in the context of CF for assessing the neighborhood of units, e.g., for determining the sensitivity of reactive power products. A power grid is denoted as G = ( V G , E G ) , represented through a weighted graph structure. Here, V G = { v G , 1 , , v G , n } constitutes a collection of vertices corresponding to the nodes within the grid, while E G comprises a set of edges { v G , i , v G , j } , where v G , i , v G , j V G , and i j , signifying the power transmission lines. Each specific edge { v G , i , v G , j } possesses a numerical weight termed as the distance weight, denoted by d w ( v G , i , v G , j ) which quantifies the spatial separation, with higher values indicating greater distances between the respective nodes. The calculation of each distance weight relies on a predefined assortment of distance criteria referred to as c d i s , which encompasses parameters such as line impedance.
Units are atomic elements connected to the power grid and are of the type producers, consumers, or both. A unit u is a technical installation physically connected to a power grid G via a grid connection point and can influence current or voltage. Let U be the set of all units.
Units are represented in terms of information technology via agents assigned to them. Therefore, there is an agent for each unit considered in the CF process. Let A = { a 1 , , a n } be a finite non-empty set of agents. Each agent a i is assigned a non-empty subset U a i of units u i U according to the assignment u a (unit assignment): u a : A P ( U ) , u a ( a i ) U a i , U a i U . According to their intended purpose, agents are also referred to as participants in an electricity market. For ease of understanding, we assume that an agent represents exactly one unit, i.e., u a ( a i ) = u i . From an agent’s perspective, the task of CF is directly related to planning the operations of its associated units. In general, all planning activities take place concerning a planning horizon, which is a discrete-time frame for which agents plan in advance.
Let t b u represent a time base unit, and T N 0 denote the set of time values in t b u . The planning horizon is defined as T p l = { t p l ( i ) | 0 i i m a x } , with t p l ( i ) = [ t i · j , t ( i + 1 ) · j ) as a planning interval. Here, t m a x is the planning horizon length in N , Δ t is the planning interval length in N , both measured in t b u . We have t i · j T , mapped to a real-time point t r t through a time mapping function τ : T R , t t r t , where i is an interval index and i m a x = t m a x j · t u 1 is the maximum interval index within a planning horizon. Here, t u N is a constant in t b u , determining the time unit of a planning interval. Δ t is a multiple of t u , and j = Δ t t u is the interval length, measured in the time unit defined by t u .
Given the definition of a planning horizon T p l , the operation schedule of a unit u is defined as a function o s u : T p l R 3 , where o s u ( t p l ( i ) ) = ( e u ( i ) , e r r u ( i ) , c u ( i ) ) specifies the amount of electrical energy e u ( i ) which is produced or consumed by unit u with an error e r r u ( i ) at cost c u ( i ) in planning interval t p l ( i ) . The electricity amount e u ( i ) can be equivalently expressed as amount of electric power p o w u ( i ) = e u ( i ) Δ t which is provided or demanded by unit u over a planning interval t p l ( i ) . Let e o s u denote the electrical energy produced or consumed by unit u, accompanied by error e r r o s u at a cost of c o s u predefined by o s u within T p l . The collection of all operation schedules is O S U , while the feasible operating schedules of agent a i are defined as O S a i .
Specific unit types can offer flexibility about their scheduled amounts of electrical energy. Given a planning horizon T p l and an operation schedule o s u , the operational flexibility of a unit u within a planning interval t p l ( i ) is denoted by o f u ( i ) = ( e u , ( i ) , e u , + ( i ) ) R 2 . This tuple indicates the adjustable range of produced or consumed electrical energy. Here, e u , ( i ) and e u , + ( i ) are relative to e u ( i ) and define the negative and positive bounds for modifications. These bounds represent the extent of energy that can be added to or subtracted from e u ( i ) . The coalitional flexibility of a coalition C during a planning interval t p l ( i ) , termed as o f C ( i ) , is the summation of individual unit flexibilities within C :
o f C ( i ) = ( e C , ( i ) , e C , + ( i ) ) , where e C , ( i ) = u U C e u , ( i ) and e C , + ( i ) = u U C e u , + ( i )
This aggregation accounts for the operational flexibility of the member units (it is important to note that this simplified approach does not incorporate temporal and technical dependencies in flexibility modeling, whereas more intricate models, such as [22], do encompass these complexities). Given this definition, a unit u can assume multiple types with respect to its capacity to generate or consume electrical energy and its operational flexibility. The operational flexibility of a unit extends certain advantages to the controlling agent during CF. This expanded array of choices enables the agent to decide on trading products or contributing to a coalition with greater flexibility. The integration of flexibility within coalitions generally leads to enhanced dependability in fulfilling product commitments. This is attributed to the capability to adapt to unexpected deviations from scheduled electricity amounts. In the context of CF, operational flexibilities acquire even more significance. Coalition members gain the ability to compensate for errors committed by others, thus averting the need for costly balancing reserves.

3.1.2. Domain 2—Coalition

Given a set of agents A, a coalition is defined as a finite, non-empty set C A . The set of all possible coalitions is denoted as C ˜ = P ( A ) . Agents a C are also acknowledged as members of C . The size of a coalition is defined as the number of its members | C | , where in the case of | C | = 1 , it can also be denoted as a singleton coalition. Let U C be the set of all assigned units to C . A coalition is an organizational aggregation of one or more agents for the trade of a power product. The commitment of each member to a coalition is limited by time, as a coalition dissolves once a product has been physically fulfilled. CF results in a partition that divides the entire set of agents into distinct subsets. A coalition structure  CS is a set of coalitions that partitions the set of all agents A into disjunct coalitions, i.e.,
C , C CS w i t h C C : C C = C CS C = A
The size of CS is defined as the number of comprised coalitions | CS | . The set of all possible CS is given by CS ˜ , where the number of valid coalition structures | CS ˜ | = B n for non-OCF is equal to the Bell number B n with n = | A | being the number of agents.
The requirement that all coalitions of a given CS have to be disjunct (see Equation (2)) guarantees that these are independent of each other with regard to their products, leading to only non-overlapping CS . The value v of a coalition C is defined by the coalition value and is defined as a characteristic function v : C ˜ R , where v ( C ) is also referred as the value (or worth) of C . The function maps each coalition C to its value v ( C ) . While a higher coalition value typically results in higher revenues or lower expenses for a coalition, it does not directly correspond to the amount of money it ultimately receives or pays when trading a power product. Therefore, the characteristic function describes the payoff available to coalition members but does not define a way of distributing these payoffs.

3.1.3. Domain 3—Product

Based on a planning horizon T p l , we define a product horizon T p r ( p ) T p l of a product p as a set T p r ( p ) = { t p r ( 0 ) , , t p r ( j m a x ) } with 0 j m a x i m a x , where t ( j ) T p r ( p ) is termed as the product interval. Thus, a product horizon T p r ( p ) is a set of product intervals corresponding to the planning intervals of a particular planning horizon T p l in which a product p is to be physically fulfilled.
Given the definition of the product horizon, a product p determines the amount of electrical energy e p ( i ) which is produced or consumed by a coalition C with a particular error e r r p ( i ) at a particular cost c p ( i ) in each product interval t p r ( i ) , such that p : T p r ( p ) R 3 , where p ( t p r ( i ) ) = ( e p ( i ) , e r r p ( i ) , c p ( i ) ) . Let e p C be the amount of electrical energy consumed or produced by a coalition C with error e r r p C at cost c p C in T p r ( p ) . The set of all potential products is represented as P ˜ .
To fulfill its intended target products, an agent a i joins or initiates coalitions to which it contributes according to the technical capabilities of its units U a i . Given an agent a i controlling a unit u a ( a i ) = U a i = u , the contribution of a to a product p is defined as the function c o n a , p : T p r ( p ) R 3 , where c o n a , p ( t p r ( i ) ) = ( e u ( i ) , e r r u ( i ) , c u ( i ) ) specifies the amount of electrical energy e u ( i ) which is produced or consumed by u with error e r r u ( i ) at cost c u ( i ) in the product interval t p r ( i ) as determined by its operation schedule o s u ( t p l ( i ) ) . Let C O N C , p be the set of all contributions of the members of a coalition C . Since, so far, only non-overlapping coalitions can be formed, an agent a can only contribute to one product in each product horizon T p r ( p ) .
The payoff  ρ ˘ resulting from a trade of p by a coalition C is defined as the final total cost which the members of C receive from or pays to its trading partner. Given a coalition’s payoff ρ ˘ , the payoff distribution is defined by a function ψ : C ˜ × R R | C | , where the allocation vector ψ ( ( C , ρ ˘ ) ) = x C specifies the individual shares ( x 1 , , x n ) for the coalition members { a 1 , , a n } . The computation of x C is predicated on a set of criteria, denoted as c ψ . The distribution function ψ allocates the coalition’s payoff ρ ˘ among its members. The shares x i delineate individual revenues or expenses. This distribution hinges on dedicated criteria c ψ that are relevant for its calculation, such as a member’s contributed electrical energy amount or its reliability in ensuring physical product fulfillment.
Given a planning horizon T p l and a template portfolio T P = { p t m p , 1 , , p t m p , n } containing product templates p t m p , a product portfolio P P = { p t a r , 1 , , p t a r , n } is defined as a set of n template-compliant target products p t a r which an agent intends to trade in T p l at market M by forming coalitions. In this context, a target product p t a r is defined as the function p t a r : T p r ( p t a r ) R 3 , where p t a r ( t p r ( i ) ) = ( e p t a r ( i ) , e r r p t a r ( i ) , c p t a r ( i ) ) specifies the target amount of electrical energy e p t a r ( i ) , which is produced or consumed with target error e r r p t a r ( i ) at target cost c p t a r ( i ) in t p r ( i ) .
The time model, in particular the relationship between the real time and the product, and the planning time, as well as the notation used for the different times, is shown in Figure 2.

3.1.4. Domain 4—Optimization Problem

Given these formalizations, the problem of finding an optimal coalition structure CS can be seen as a characteristic function game (CFG) or non-OCF game Γ . In a CFG, a coalition’s value depends solely on its members’ identities. Such a game is given by a tuple Γ = ( A , v ) , where A is a set of agents and v is the characteristic value function. In CFGs, for any coalition C A , the value of a coalition structure CS CS ˜ is denoted by V ( CS ) and is given by V ( CS ) = C i CS v ( C i ) . The coalition structure generation problem is the problem of finding a CS over A whose value V ( CS ) is maximal. In CFGs, the problem of finding CS can be defined as:
CS = arg max CS CS ˜ , s . t . H C P ˜ V ( CS ) , V ( CS ) = C i CS v ( C i )
This is a combinatorial optimization problem, which in principle, could be solved to optimality using a brute-force search. However, this is not practicable, as the number of possible coalition structures over n agents is O ( 2 n 1 ) . The CSG problem is NP-complete, and research mainly investigates concepts that provide an efficient approximation to an optimal solution [46]. In particular, algorithms from the field of dynamic programming (DP), anytime exact algorithms such as integer-partition-based search (IP) or a combination of both approaches, such as the ODP-IP algorithm [30], are used to find optimal CS [27].

3.2. Extended Formal Model

In the following, we extend the model of a non-OCF game from the previous subsection to a cooperative game with overlapping coalitions. The game-theoretic definitions of overlapping coalitions are mainly taken from [25]. The presented model can be applied when agents allocate different portions of their associated units to contribute to multiple products as members of different coalitions simultaneously. We define overlapping coalition formation as the process of forming coalitions of entities with shared goals or interests to achieve a specific objective. These coalitions may overlap, meaning that some entities may belong to multiple coalitions simultaneously.

Extending Agents Contributions

In the game theory literature, a game with overlapping coalitions is typically defined as an OCF game with a set of players N = { 1 , , n } and a function v : [ 0 , 1 ] n R , where v ( 0 n ) = 0 [25]. An overlapping coalition is now given by a vector r = ( r 1 , , r n ) where r j is the fraction of agent j’s resources contribution and the value function v maps each partial coalition r to its corresponding payoff. This approach allows not only the pure membership in a coalition to be mapped (which would be the case if always r j { 0 , 1 } ) but also to what extent an agent a j distributes its resource among different coalitions, i.e., r j [ 0 , 1 ] .
An agent’s a contribution to a coalition is already explicit in the formalization for non-overlapping CF. The definition of a non-overlapping coalition includes only membership information over its members, but agents explicitly specify a contribution c o n a , p to a product p. Since, in our context, products are provided by coalitions, there is already a link between the faction of an agent’s resource and the contribution to a product.
In Equation (2), we described the constraint for disjoint coalitions that applies to non-OCF games. To make the previous concepts applicable to overlapping coalitions or OCF games, the condition in Equation (2) does not apply to OCF. This means that the intersection of both coalitions now does not necessarily have to be an empty set but can also include the agents that contribute to both coalitions.
Since an agent a can now participate in more than one coalition in a planning horizon T p l , its contributions c o n a , p and c o n a , p fulfilling products p and p having the same product horizon T p r p = T p r p or at least overlapping product intervals t p r T p r p t p r T p r p , is bounded by the operation schedule considering a’s operational flexibility, i.e., the amount of electrical energy that a contributes to all products in a time interval must not exceed the amount determined by the operational schedule, including the possible positive flexibility bound. The extended contribution function
c o n a + : T p l R 3 × n , where c o n a + ( t p l ( i ) ) = ( e u ( i ) , err u ( i ) , c u ( i ) )
now determines a tuple of vectors containing the electrical amount e u ( i ) , with error err u ( i ) at cost c u ( i ) for all n products for which a contributes in t p l ( i ) :
e u ( i ) = e p 1 ( i ) e p 2 ( i ) e p n ( i ) , err u ( i ) = e r r p 1 ( i ) e r r p 2 ( i ) e r r p n ( i ) , c u ( i ) = c p 1 ( i ) c p 2 ( i ) c p n ( i ) .
It must hold that:
t ( i ) T p l : κ a ( i ) ( e u ( i ) + e u , + ( i ) ) , where κ a ( i ) = e e u ( i ) e p j ( i )

4. Discussion of Overlapping Coalition Formation in DYCE

The CF process can be divided into three phases: (1) forming a coalition structure involves each agent joining a coalition. This can be achieved either through the agents making decisions independently using a bargaining process, or through external means such as a system designer. The result of this phase is a CS. (2) Solving the (optimization) problem of each coalition—this includes fulfilling the coalition’s purpose, such as solving an optimization problem or providing an electricity product. (3) Dividing the reward of each coalition among its members—if a coalition receives benefits from cooperative action, the members of the coalition will need to decide how to (e.g., fairly) distribute these benefits among themselves [27].
In the following Section 4.1, we first provide an overview of the DYCE methodology. In Section 4.2, we briefly explain the different value maximization levels in DYCE before we discuss for each of the phases in DYCE what problems exist within the phase regarding the formation of overlapping coalitions in Section 4.3.

4.1. Introducing the DYCE Method

The DYCE method presented by Beer in [9] is a method for dynamic CF in electricity markets, including the three phases above. DYCE involves an agent performing four activities in an iterative and partially parallel manner, as shown in Figure 3. Each activity addresses a specific sub-problem of the overall problem and involves solving an associated optimization problem. The DYCE method improves both local and global utility by considering the individual benefits of agents during product portfolio management and optimizing the value of the CS when forming coalitions.

4.1.1. Product Portfolio Management

In this phase, the agents in the DYCE activity have three main tasks. The first is to build a portfolio of target products that it will trade within its planning horizon, using product templates and price predictions. The second is to choose an operation schedule from its controlled units, which will be used to make contributions to coalitions during formation processes. The portfolio and schedule are designed to optimize local utility. Finally, the agent plans and carries out CF processes for its identified target products according to the created schedule.

4.1.2. Neighborhood Formation

If an agent initiates negotiations during CF, it first creates a group of nearest neighbors based on the physical distance between units in the grid. This group, called a neighborhood, is limited in size to reduce communication and computational costs while considering the layout of the grid. If the current neighbors are not successful in forming a coalition, the agent can expand its neighborhood to include a larger number of potential partners.

4.1.3. Coalition Formation

During the third DYCE activity, agents form coalitions with their neighbors to fulfill their target products jointly. The formation process aims to maximize the overall value of the CS. An agent has two options if a coalition cannot be formed with the current neighbors. The first is to expand its neighborhood to include more potential partners and increase the chances of forming a successful coalition (see condition c 1 in Figure 3). As a second option, if the neighborhood already covers the whole grid, the agent can try to create a new portfolio of target products as an alternative (see condition c 2 in Figure 3). If neither of these options is possible, the agent will end the formation process and schedule the following target product for which to form a coalition (see condition c 3 in Figure 3). If the agent is a member of a successful coalition, it will complete the final activity of the overall process before ending the process (see condition c 4 in Figure 3).

4.1.4. Payoff Distribution

If a coalition is formed successfully, it will seek to trade its fulfilled target product and distribute the resulting payoff fairly among its members. The distribution is typically based on the Shapley value, a game-theoretical model that determines the individual shares of the coalition’s utility. The resulting distribution vector specifies the share that each member receives from the cooperation.

4.2. Value Maximization Level

When forming coalitions, agents can maximize value with regard to different levels. Therefore, we first introduce the different levels of value maximization for both disjoint and overlapping CF. Depending on the agents’ utilized objective function, value optimization within the context of CF can occur on three different levels [9]:
  • At the agent level (a-level), each agent a i strives to maximize its personal share x i acquired as a member of a coalition C CS , as indicated by the objective function
    max C CS x i , C
  • At the coalition level (C-level), the agents try to maximize the value function of single coalitions C CS as indicated by the objective function
    max C CS v ( C )
  • At the coalition structure level (CS-level), the agents try to maximize the value of a coalition structure CS as reflected by the objective function
    max C CS v ( C ) ¯ , where v ( C ) ¯ = C CS v ( C ) | CS |
The choice of objective function an agent employs for its specific task is contingent upon its optimization objective. This can entail optimizing its individual payoff (a-level), the value of individual coalitions (C-level), or the value of the entire coalition structure (CS-level). From a more pragmatic perspective, these choices align with optimizing the value for the unit owner, unit pools, or the entire system. Figure 4 shows the different levels of value maximization in abstract terms, including the differences between non-overlapping and overlapping CF. The figure shows that at the C- and CS-levels, overlapping coalitions create dependencies on other coalitions that do not exist for disjoint coalitions.
The objectives pursued across distinct levels might conflict. For example, when maximizing value on the C-level, an agent might opt to join a coalition even if it leads to a reduction in its own share or the overall value of the CS. The choice of objective function for an agent ultimately hinges on the specific use case and regulatory constraints defined by hard regulations.

4.3. Overlapping Coalition Formation in DYCE

In the remainder of this section, we will systematically examine the four stages of DYCE and identify the aspects within the model that necessitate a modification to facilitate the emergence of overlapping coalitions.

4.3.1. Product Portfolio Management for OCF

In the phase of product portfolio management (PPM), each agent tries to maximize its utility function by selecting an optimal template portfolio T P and operation schedule o s u to create a product portfolio P P . The expected utility of a producer u t p or consumer u t p for a given operation schedule o s u and product template p t m p is given by
u t i l i t y ( p t m p , o s u ) = i = 0 j m a x e u ( i ) · c p t m p ( i ) e u ( i ) · c u ( i ) if u is of type u t p , i = 0 j m a x e u ( i ) · c u ( i ) e u ( i ) · c p t m p ( i ) if u is of type u t c .
Given Equation (10), the utility of a template portfolio T P and an operation schedule o s u is defined as
u t i l i t y ( T P , o s u ) = p t m p T P u t i l i t y ( p t m p , o s u )
The above Formula (11) enables an agent to identify an optimal pair of template portfolio T P and operation schedule o s u . Assuming that template portfolio T P comprises n product templates, the corresponding optimization problem is given by
maximize u t i l i t y ( T P , o s u )
subject to T P P ( P t m p ) ,
o s u O S S u ,
T p r ( p t m p , i ) T p l = T p r ( p t m p , i ) = , 1 i n ,
s a t h c u ( o s u ) = 1 h c u H C u .
As each agent in the context of the PPM seeks to maximize its utility, optimization occurs at the a-level. The first phase of DYCE results in the creation of a product portfolio P P , which comprises all target products p t a r that the agent aims to trade within a given planning horizon T p l across one or more sub-markets M s . The specification of P P draws upon the template portfolio T P , which represents a selection of product templates p t m p from a catalog of templates P t m p that maximizes the agent’s expected utility.
To create a product portfolio P P , an agent performs six consecutive tasks. First, the agent establishes the template catalog P t m p , which serves as a basis for making a utility-maximizing template portfolio T P . If an agent trades only standardized products, e.g., on an energy exchange, T P is predefined. Subsequently, an agent creates price forecasts c p t m p ( i ) for all product templates p t m p , which are used to determine the expected utility from T P and o s U . The next task of the agent is to determine the operation schedule space O S S u . For this purpose, the agent forecasts the operational behavior of the unit u by identifying all feasible plans within the considered planning horizon. Depending on the type of plant, these forecasts must take into account controllable factors, such as planned events in a production process, or uncontrollable factors, such as future weather conditions. Before the agent determines T P and o s u using a heuristic, it creates a global formation schedule that includes time intervals in which to form the coalitions for the target products. The template catalog P t m p , the price predictions C P t m p , and the operation schedule space O S S u from the previous tasks then serve as input to the combinatorial optimization for portfolio enhancement (COPE) heuristic. The agent next runs COPE to obtain P P and o s u to maximize its expected utility. Finally, the agent schedules the next of its target products according to the previously created formation schedule and starts the activity of neighborhood formation when the corresponding formation period begins.

4.3.2. Extended Product Portfolio P P +

The main result of the PPM is a product portfolio P P , which for a planning horizon T p l comprises several target products p t a r P P and an associated operation schedule o s u O S S u . However, the P P comprises only one product at a time. For an agent to trade multiple products at a time, its capacity based on an o s u must be proportionally allocated to one or more products, resulting in an advanced product portfolio P P + . Figure 5 shows an example representation of a P P for only one product per time in a T p l with a total of 11 target products, including the associated o s u , as well as an extended P P + that allows multiple target products p t a r per time and a total of 15 target products. Both product portfolios P P and P P + use the same operation schedule o s u and planning interval T p l .
When trading multiple products simultaneously is allowed, the problem of finding an optimal P P is no longer a combinatorial problem. Instead of selecting one product for a period, the resource must now be allocated to one or more products as optimally as possible. This type of problem can be described as a resource allocation problem, a particular case of nonlinear programming problems. Therefore, searching for an optimal P P + is no longer possible with the heuristic COPE and other methods must be used to find an optimal P P + .

4.3.3. Neighborhood Formation for OCF

After an agent a i chooses a product portfolio P P and operation schedule o s u that maximizes its expected utility, the agent performs the neighborhood formation to limit the set of potential interaction partners. This activity consists of four individual sub-tasks which are executed sequentially by the agent a i . Initiating the procedure for a predetermined target product p t a r , the agent constructs an intrinsic model of the provided power grid G, encompassing all linked units U a i and assigned agents a j . Following this, principles from graph theory are employed to estimate distances between grid nodes, relying on the previously established grid model (in this context, the distance between two grid nodes v G , i and v G , j is represented by the impedance of the connecting power lines). The created internal grid model and the distances to the neighbors of a i are used in the next sub-task to calculate the corresponding abstract trust values by the trust function t r u s t ( t , a i , a j ) to evaluate the trustworthiness of potential cooperation partners a j of a i for a specific point in time t. Finally, the agent selects its initial role for the forthcoming CF negotiations. This involves deciding whether to proactively initiate (the i n i t i a t o r role ) or respond (¬initiator role) to formation requests. The agent determines the choice of role randomly based on the given probabilities P ( A ) = r i n i t (probability of choosing the role initiator) and P ( A ¯ ) = 1 r i n i t (probability of choosing the role ¬initiator). This leads to having a certain number of initiator agents | A i n i t | since the ratio r i n i t = | A i n i t | | A | has implications for the global communication cost and runtime of the CF.
Since the Neighborhood Formation is performed separately for each target product p t a r , the sub-tasks apply to both classical product portfolios P P as well as extended product portfolios P P + with overlapping products. However, trading multiple products in one period may also have different requirements for determining different types of neighborhoods. For example, for the formation of a coalition to deliver a reactive power product, the local proximity of the agents to each other plays an important role, whereas for the delivery of an active power product, the dispersed nature of the agents may be beneficial for reliable fulfillment of the product. In addition, there may be other requirements regarding the ratio r i n i t , as overlapping coalitions may lead to a larger number of coalitions, and, therefore, there is a larger communication overhead. It would, therefore, be conceivable, for example, that an agent who is in two coalitions and is the initiator in one coalition also takes on this role in the second coalition.

4.3.4. Coalition Formation for OCF

The activity of coalition formation is the third step of the DYCE method. It is structured as a decentralized procedure where agents initially form singleton coalitions and, through an interaction protocol and heuristic, progressively create optimized coalitions to achieve their target products. Throughout this process, agents aim to maximize the mean coalition value v ( C ) ¯ , consequently enhancing the overall value of the coalition structure CS . Thus, the optimization problem that agents address during the formation process is:
maximize v ( C ) ¯
subject   to C C S ˜ ,
C , C CS with C C : C C = C CS C = A .
The problem can be solved optimally by finding a coalition structure CS in which all the coalitions involved achieve their desired product perfectly. To accomplish this, agents aggregate the technical capabilities of the units they control to fulfill an intended target product p t a r and perform regroupings to improve the mean coalition values.
The contribution an agent a makes to a product p with product horizon T p r ( p ) = { t p r ( 0 ) , , t p r ( j m a x ) } depends mainly on the operation schedule o s u generated in the previous phase and is given by the function c o n a , p : T p r ( p ) R 3 , whose values c o n a , p ( t p r ( i ) ) = ( e u ( i ) , e r r u ( i ) , c u ( i ) ) specify the electricity amount e u ( i ) produced or consumed by unit u with error e r r u ( i ) at cost c u ( i ) in each product interval t p r ( i ) T p r ( p ) . Since DYCE only allows each agent to be a member in one coalition in each product interval, it follows that
t p r ( i ) T p r ( p ) with t p r ( i ) = t p l ( i ) : c o n a , p ( t p r ( i ) ) = o s u ( t p l ( i ) )
Aggregating the contributions of all participating agents of a coalition C results in the cumulative contribution c o n C , p ( t p r ( i ) ) = ( e C ( i ) , e r r C ( i ) , c C ( i ) ) . To achieve meaningful outcomes, it is necessary to aggregate these attributes suitably. Given a coalition C = { a 1 , , a n } with associated units U C = { u 1 , , u n } , the cumulative electricity amount (Equation (21)), error (Equation (22)), and cost (Equation (23)) in a product interval t p r ( i ) are calculated as
e C ( i ) = u U C e u ( i )
e r r C ( i ) = u U C ( e r r u ( i ) · | e u ( i ) | ) 2 | e C ( i ) |
c C ( i ) = u U C c u ( i ) · e u ( i ) | e C ( i ) |
During the process of forming coalitions, agents must evaluate the worth of their coalition to determine whether a regrouping enhances the overall value of the CS , as defined by optimization problem (17). A regrouping ( C ^ o l d , C ^ n e w ) is generally considered as beneficial if
v ( C n e w ) ¯ > v ( C o l d ) ¯ ,
i.e., if the mean coalition value of a coalition resulting from regrouping v ( C n e w ) ¯ has a higher value than the previous coalition v ( C o l d ) ¯ . The main objective of a coalition C is given by the trade of its intended product p t a r . Therefore, a reasonable way to specify the worth of C is to determine its ability to fulfill p t a r . Given the cumulative contribution c o n C , p t a r for a target product p t a r it is optimally fulfilled if
t p r ( i ) T p r ( p t a r ) : c o n C , p t a r ( t p r ( i ) ) = ( e C ( i ) , e r r C ( i ) , c C ( i ) ) = ( e p t a r ( i ) , e r r p t a r ( i ) , c p t a r ( i ) ) = p t a r ( t p r ( i ) )
The approximation of a cumulative contribution to a target product is then defined in terms of the following attribute-based formulas:
e C , f ( i ) = ϱ ( 1 e C ( i ) e p t a r ( i ) ) ,
e r r C , f ( i ) = ϱ ( e r r p t a r ( i ) e r r C ( i ) ) ,
c C , f ( i ) = ϱ ( c p t a r ( i ) c C ( i ) ) ,
with
ϱ : R ] 0 , 1 ] , ϱ ( x ) = 1 1 + | 3 · x | .
The approximation values from Equations (26) to (28) indicate the degree of fulfillment of the target electricity, target error, and target cost. Their assessment involves computing the real approximations of the target values. These results are then standardized by mapping them onto the interval ] 0 , 1 ] through the application of the function ϱ (Equation (29)). As a result, optimal values with no deviation ( x = 0 ) from the target values will have a value of 1, which indicates that the pursued target value is exactly met.
Given the above definitions, the worth of a coalition C can now be assessed by the value function
v ( C ) = w e · e C , f ¯ + w e r r · e r r C , f ¯ + w c · c C , f ¯ if e C , f ¯ 0 , 0 else , i a t t r w i = 1 .
The variables w i denote weights employed to prioritize distinct attributes, while e C , f ¯ , e r r C , f ¯ , c C , f ¯ indicate the average degrees of fulfillment computed across all product intervals t p r ( i ) T p r ( p t a r ) using Equations (26) to (28). Therefore, the formula (in Equation (30)) allows rating a coalition according to its ability to fulfill the average interval-related target values of the pursued product and enables agents to reasonably argue about the worth of different coalitions during the interactive formation process.
Given the ability to assess the value of a coalition, agents conduct the activity of CF to cooperate to fulfill their pursued target products optimally. The agents begin as singleton coalitions and then use an interaction protocol based on the well-established ContractNet protocol to evaluate possible regroupings. Through this iterative process, they optimize their coalitions in a fully decentralized and self-organized manner. Based on the role (initiator a I or responder a R ) chosen by an agent during the neighborhood formation, the agent initiates the formation of a coalition or waits for requests from other agents. When an agent a I has the initiator role, it first removes from its list neighboring agents who are already in its coalition, those who fall below a minimum trust value, or those who are in the set of unlikely cooperation partners u c p . Then, a I sends a call for proposal (CFP) to all remaining agents, with a payload that includes its coalition C I , the individual contributions of its members C O N C I , p t a r , the target product p t a r , and a corresponding tolerance band p t a r , t o l , as well as a timestamp t d , C F P as a deadline for responses to the CFP. A responder agent a R receiving such CFPs first checks whether the requesting agent a I is in its neighborhood, whether the target product p t a r matches, for local power products whether agents in its coalition C R match for that product, and whether agents from the requesting coalition C I have sufficient trust values. If all preconditions are met, the responding agent a R identifies a potential regrouping using a heuristic called combined coalition regrouping (CCOR).
CCOR enables a member transfer between an initiating coalition C I , o l d and a set of responding coalitions C ˜ R , o l d . This is achieved by iterating over C ˜ R , o l d and performing regrouping in each cycle if the value of the new coalition is greater than the old one. More precisely, the method essentially consists of two complementary steps. First, members are unilaterally transferred from responding to the initiating coalition to accumulate potential. Then, members are exchanged between the two coalitions to approximate the target values of the target product. The outputs of CCOR are two new optimized initiator and responder coalitions ( C ^ I , n e w , C ^ R , n e w ) . It is generally true, that 0 | C ^ R , n e w | | C ^ I , n e w | 1 and C ^ R , n e w = C ^ I , n e w = if no optimization was possible.
After an agent a R has checked the preconditions and executed CCOR, it responds to the CFP to a I with either a refusal or proposal. The refusal always includes the reason and is sent, for example, if the trust values of the requesting coalitions are insufficient. The proposal consists of the new optimized coalitions ( C ^ I , n e w , C ^ R , n e w ) as well as the optimization cycle parameter o c , which stores the cycles CCOR needed to find an optimal solution. When a I receives a refusal, it adds the agents from the coalition C R to its list of u c p . After a I has processed all refusals, it checks the trust values of all new members it received from a R . Subsequently, a R executes CCOR with the replying coalitions and obtains coalitions C ^ I , c o m and C ^ R , c o m as a result. After each cycle, a new initiator agent may need to be decided for each coalition, as the previous initiator agent may have been transferred to another coalition.
The interaction of the agents and the heuristic CCOR allows the agents to optimize the coalition structure CS in a decentralized and fully self-organized manner. An important constraint of CCOR is that the transfer of members from one coalition to another can never degrade the quality of the fulfillment of a product. In addition, various parameters, such as o c , are also used to prevent dead- and livelocks. Finding a CS ends when a certain number of cycles have been run or an optimal coalition structure CS has been found. The fulfillment of a coalition is considered successful if each unit in a coalition delivers the agreed contribution so that the target product can be fulfilled as planned (see Equation (25)).

Extended Coalition Formation

The CF in DYCE in the form of the interaction protocol and the heuristic is designed in such a way that the result can only be disjoint coalitions. This is also reflected in the constraints of the optimization problem, which ensures that all coalitions are disjoint (Equation (19)) and the assumption that the contribution of a unit equals its operation schedule (Equation (20)). However, since coalitions are formed individually for each target product, an extension of the current CF process is possible. Given an extended P P + that already allows agents to contribute to multiple products in a time interval, as illustrated in Formula (4), the expansion mainly concerns the heuristic CCOR in which the mean coalition values are maximized.
CCOR maximizes the mean coalition values by transferring agents between coalitions, leading to new coalitions, while never diminishing the value of a coalition by member transfers. However, due to overlapping contributions by an agent a, dependencies on other coalitions in which a is also a member can arise. These transitive dependencies must be taken into account by CCOR during optimization. At the same time, overlapping contributions also create other possibilities. For example, during the formation, an agent can explicitly use information during the optimization of CS from both coalitions in which it is located for the optimization.

4.3.5. Payoff Distribution for OCF

The payoff distribution is the last of the four phases in DYCE. Here, the added values of a coalition generated by a trade of a product are fairly distributed among the coalition members. If the trade of a product p with a product horizon T p r ( p ) = { t p r ( 0 ) , , t p r ( j m a x ) } was successful, the resulting payoff ρ ˘ is defined as:
ρ ˘ = i = 0 j m a x e p ( i ) · c ˘ ( i )
where c ˘ ( i ) are the actually contracted costs per unit of electrical energy in a product interval t ( i ) . In reality, the achieved payoff ρ ˘ usually differs from the originally determined payoff ρ . The payoff ρ determined by the agents in a coalition for a product p is defined as follows:
ρ = i = 0 j m a x e p ( i ) · c p ( i )
The payoff ultimately establishes the utility attained by a coalition through the trading of a product p. This utility is defined as follows for a coalition of producer agents C p and a coalition of consumer agents C c :
u t i l i t y C ( p ) = i = 0 j m a x u t i l i t y C ( i ) ( p ) = i = 0 j m a x e p ( i ) · c ˘ ( i ) e p ( i ) · c p ( i ) = ρ ˘ ρ if C is C p , i = 0 j m a x e p ( i ) · c p ( i ) e p ( i ) · c ˘ ( i ) = ρ ρ ˘ if C is C c .
The coalition’s utility is established as the sum of interval-associated utilities u t i l i t y C ( i ) ( p ) . For a producer coalition C p , the utilities are computed by subtracting the initial payoffs from the final payoffs ( ρ ˘ ρ ) . For a consumer coalition C c , the utilities are determined by subtracting the final payoffs from the initial payoffs ( ρ ρ ˘ ) , as depicted in Equation (33).
All organizational and contractual tasks within a coalition C are performed by a representative agent a C , r e p , which is a member of C . This includes, among other things, placing orders on the electricity markets for a trade of the target product. If the product trade was successful, a C , r e p informs all coalition members to control their units accordingly. After the physical fulfillment of a product, a C , r e p takes over the fair distribution of the payoff. For this purpose, a C , r e p asks for the actual contributions of the members of C , as they may differ from the agreed values due to unforeseen reasons. Based on the error an agent had in its contribution to a coalition, the agents calculate new trust values for that agent. If an agent has a low trust value, this may make future collaborations with other agents difficult. Agents, therefore, are interested in fulfilling their agreed contributions to a coalition, which positively impacts the reliability of coalitions. Subsequently, a C , r e p determines the fair distribution of the payoff for all members of C . For this purpose, a C , r e p uses a distribution model based on cooperative game theory methods to determine the distribution vector x C = ( x 1 , , x n ) , which indicates the individual shares of all members. Given a product horizon T p r ( p ) = { t p r ( 0 ) , , t p r ( j m a x ) } of a successfully traded product p, the individual share x k of a coalition member a k is calculated as
x k = i = 0 j m a x x k ( i ) = i = 0 j m a x c u ( i ) · e u ( i ) + ϕ k ( i ) ( C , v ) · u t i l i t y C ( i ) ( p ) ,
where ϕ k ( i ) ( C , v ) is a fair percentage in terms of the Shapley value, and v is referred to as the characteristic function. Hence, the agent a k ’s total share is outlined as the sum of interval-associated shares x k ( i ) , wherein an interval-linked share constitutes the accumulation of the corresponding schedule cost c u ( i ) · e u ( i ) alongside an equitable portion of the coalition’s utility ϕ ( i ) k ( C , v ) · u t i l i t y C ( i ) ( p ) . The equitable proportion ϕ k ( i ) ( C , v ) is determined through the utilization of Shapley values, a concept drawn from the realm of cooperative game theory. Shapley values serve to create a just distribution vector in a given coalition game Γ , where the notion of fairness is characterized by specific axioms that encapsulate attributes of the resultant distribution [47].
An extension or adaptation for overlapping coalitions is not necessary because the distribution of payoffs in DYCE happens purely on the C-level.

5. Conclusions and Outlook

5.1. Conclusions

The future energy system will be characterized by a variety of DERs because of its transformation driven by the energy transition. Due to their nature, these DERs place various requirements on the future power grid, such as a volatile feed-in due to weather-dependent power generation. For an economical and grid-serving use of DERs, they must be integrated into the power system. One type of integration that addresses various grid integration and economical requirements is the aggregation of DERs into pools. The formation of coalitions is a method of aggregation of DERs that enables dynamic product-based aggregation of DERs. Due to an increasing need for dynamics and a flexibilization of DERs in the future, further requirements for the aggregation of DERs arise. By forming overlapping coalitions, these requirements potentially can be addressed. Here, DERs can be members of more than one coalition at a time.
In this paper, we first investigated the related work in the context of the CF of DERs. In the analysis of related work, we found that a large body of work studies the aggregation of DERs for different purposes. The approaches to the formation of coalitions are primarily based on heuristics for finding an optimal CS or agent-based negotiations. There are approaches to both the centralized and decentralized formation of coalitions. However, very few works (none in the energy context) study the formation of overlapping coalitions.
Therefore, we presented the DYCE model and analyzed where DYCE would need to be extended to allow overlapping coalitions to be formed. For this purpose, we first described the formal model of DYCE and expanded it to include the formalisms for overlapping coalitions. These extended formalizations mainly concern the game-theoretic modeling of overlapping coalitions as an overlapping CF game and the coalition structure generation problem, as well as the extension of the contribution of a unit to coalitions. We then described the four phases of DYCE and, for each phase, examined the extensions needed to allow overlapping coalitions. The most extensions are necessary for the first phase, the product portfolio management. In this phase, an agent creates a product portfolio P P for a planning horizon, comprising the products the agent wants to trade. An optimal P P for an agent is searched using a heuristic. For an agent to be able to trade overlapping products, an extended product portfolio P P + is necessary, which, however, can no longer be determined with the previous heuristic. Further adjustments are required for the third phase, the actual CF. In this phase, agents decentrally form coalitions based on an interaction protocol to maximize the value of the CS. To achieve this, they can perform member transfers between coalitions. As part of the member transfer, the agents optimize the values of the coalitions using a heuristic. However, this optimization does not allow the formation of overlapping coalitions since, among other things, transitive dependencies between coalitions and the membership of an agent in several coalitions are not considered in the optimization.
Finally, Table 2 summarizes all of the previously described extensions required for OCF in DYCE, grouped by phase in DYCE.

5.2. Outlook

The formation of overlapping coalitions increases the overall complexity of the optimizations in the different phases. Therefore, the heuristics used in DYCE are no longer applicable. Future work should consequently investigate different approaches for the creation of a product portfolio with overlapping products as well as for the process of optimizing the CS in the context of agent interaction, while keeping the requirements such as the decentralized formation. For example, the investigation of methods from the field of multi-agent learning for the formation of overlapping coalitions would be conceivable.
The flexibility of the units is already rudimentarily taken into account in the DYCE formalizations but not in the formation of coalitions. However, with the increasing importance of flexibility in the future energy system, the need for modeling flexibility also increases. An extension of DYCE is, therefore, also conceivable with regard to the consideration of unit flexibility in the process of CF.
Since AS will also have to be provided by aggregates of DERs in the future, the reliability of product delivery plays an important role. Although there are already some approaches to assessing the reliability of coalitions (e.g., in [48]), these approaches only assess the reliability of an already existing CS. Therefore, DYCE could be extended to consider maximizing the reliability of a CS already during the formation process.
Within the CF process, the agents communicate based on the ContractNet protocol. ContractNet is a very general, application-unspecific protocol. However, due to the increasing complexity during the CF process, caused by overlapping coalitions, the use of an application specific protocol may be necessary, such as those presented in [49].
Overlapping coalitions offer additional advantages which could be analyzed in an extended version of DYCE. For example, an agent’s membership in two coalitions could be used to identify unneeded capacities from units in one coalition and make it available in the other coalition if it is needed there in the short term to improve coalition reliability.

Author Contributions

Conceptualization, T.W. and A.N.; methodology, T.W. and A.N.; formal analysis, T.W.; resources, T.W.; writing—original draft preparation, T.W.; writing—review and editing, T.W. and A.N.; visualization, T.W.; supervision, A.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Nomenclature

GPower grid
V G Set of vertices of power grid G
E G Set of edges representing power lines in grid G
d w ( v G , i , v G , j ) Distance weight function
c d i s Distance weight criteria
uA unit in a power grid
UThe set of all units
AA set of agents
a i A specific agent a A
U a i Non-empty set of units assigned to agent a i
u a ( a i ) Unit assignment function of agent a i
t b u Time base unit
TThe set of all time values measured in t b u
T p l Planning horizon
t p l ( i ) Planning interval
t m a x Planning horizon length
τ Time mapping function
o s u Operation schedule of a unit u
e u ( i ) Consumed or produced energy of unit u in planning interval t p l ( i )
e o s u Consumed or produced energy of unit u
e r r u ( i ) Error/deviation of energy of unit u in planning interval t p l ( i )
e r r o s u Error/deviation of energy of unit u
c u ( i ) Cost of consumed or produced energy of unit u in planning interval t p l ( i )
c u Cost of consumed or produced energy of unit u
O S U Set of all operation schedules
O S a i Set of all possible operating schedule of agent a i
O S S u Operation schedule space of unit u
o f u ( i ) Operational flexibility of unit u in planning interval t p l ( i )
e u , ( i ) Negative flexibility bound of unit u
e u , + ( i ) Positive flexibility bound of unit u
o f C ( i ) Operational flexibility of coalition C in planning interval t p l ( i )
e C , ( i ) Negative flexibility bound of coalition C
e C , + ( i ) Positive flexibility bound of coalition C
C A coalition of agents
C ˜ The set of all coalitions
| C | Number of agents in C
U C Set of all assigned units to a coalition C
CS A coalition structure
| CS | The amount of coalitions in CS
CS ˜ The set of all possible coalition structures
v ( C ) Coalition value function
pElectricity product
P ˜ Set of all electricity products
e p ( i ) Electrical energy for product p in each product interval t p r ( i )
e r r p ( i ) Error/deviation in the provision of a product p in a product interval t p r ( i )
c p ( i ) Cost for the provision of a product p in product interval t p r ( i )
T p r ( p ) A product horizon of a product p
c o n a , p Function to determine the contribution of an agent a to a product p
e u ( i ) Electrical energy for product p in each product interval by unit u
e r r u ( i ) Error in the provision of a product p in a product interval by unit u
c u ( i ) Cost for the provision of a product p by unit u
C O N C , p The set of all contributions of the members of a coalition C
ρ ˘ The payoff resulting from a trade of p by a coalition C
ψ ( ( C , ρ ˘ ) ) Payoff distribution function
x C Payoff distribution vector
c ψ Payoff distribution criteria
P P Product portfolio containing a set of target products
T P Template portfolio of product templates p t m p
p t a r Target product
Γ Non-overlapping coalition formation game
V ( CS ) Coalition structure value function
CS Optimal coalition structure
c o n a + Function to determine the contributions an agent a makes in a time interval
e u ( i ) Vector containing the electrical amounts for n products in a time interval
err u ( i ) Vector containing the errors for n products in a time interval
c u ( i ) Vector containing the costs for n products in a time interval

Abbreviations

The following abbreviations are used in this manuscript:
ADNActive distribution network
ASAncillary services
BESSBattery energy storage system
CFCoalition formation
CFGCharacteristic function game
CFPCall for proposal
COHDACombinatorial optimization heuristics for distributed agents
CSCoalition structure
CSGCoalition structure generation
DERDistributed energy resource
DGDistributed generation
DRDemand response
DSODistribution system operator
H-MGHome-microgrid
MASMulti-agent system
MDPIMultidisciplinary Digital Publishing Institute
MGMicrogrid
OCFOverlapping coalition formation
PPMProduct portfolio management
RESRenewable energy source
VPPVirtual power plant

References

  1. Gielen, D.; Boshell, F.; Saygin, D.; Bazilian, M.D.; Wagner, N.; Gorini, R. The role of renewable energy in the global energy transformation. Energy Strategy Rev. 2019, 24, 38–50. [Google Scholar] [CrossRef]
  2. Quint, R.; Stringer, N.; Dangelmaier, L.; Green, I.; Edelson, D.; Ganugula, V.; Kaneshiro, R.; Pigeon, J.; Quaintance, B.; Riesz, J. Transformation of the Grid: The Impact of Distributed Energy Resources on Bulk Power Systems. IEEE Power Energy Mag. 2019, 17, 35–45. [Google Scholar] [CrossRef]
  3. IRENA. Innovation Landscape Brief: Market Integration of Distributed Energy Resources; International Renewable Energy Agency: Abu Dhabi, United Arab Emirates, 2019; ISBN 978-92-9260-132-4. [Google Scholar]
  4. Damsgaard, N.; Helbrink, J.; Papaefthymiou, G.; Grave, K.; Giordano, V.; Gentili, P. Study on the Effective Integration of Distributed Energy Resources for Providing Flexibility to the Electricity System. Report to the European Commission; European Commission: Brussels, Belgium, 2015. [Google Scholar] [CrossRef]
  5. Wang, T.; O’Neill, D.; Kamath, H. Dynamic Control and Optimization of Distributed Energy Resources in a Microgrid. IEEE Trans. Smart Grid 2015, 6, 2884–2894. [Google Scholar] [CrossRef]
  6. Braun, M.; Philipp, S. A Review on Aggregation Approaches of Controllable Distributed Energy Units in Electrical Power Systems. Int. J. Distrib. Energy Resour. 2008, 4, 297–319. [Google Scholar]
  7. Gao, Z.; Alshehri, K.; Birge, J.R. On Efficient Aggregation of Distributed Energy Resources. In Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, 14–17 December 2021; pp. 7064–7069. [Google Scholar] [CrossRef]
  8. Alshehri, K.; Ndrio, M.; Bose, S.; Başar, T. The Impact of Aggregating Distributed Energy Resources on Electricity Market Efficiency. In Proceedings of the 2019 53rd Annual Conference on Information Sciences and Systems, CISS 2019, Baltimore, MD, USA, 20–22 March 2019; ISBN 9781728111513. [Google Scholar] [CrossRef]
  9. Beer, S. Dynamic Coalition Formation in Electricity Markets. Ph.D. Thesis, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany, 2016. [Google Scholar]
  10. Ray, D. A Game-Theoretic Perspective on Coalition Formation; Oxford University Press: Oxford, UK, 2007; pp. 2013–2015. [Google Scholar] [CrossRef]
  11. Bitar, E.Y.; Baeyens, E.; Khargonekar, P.P.; Poolla, K.; Varaiya, P. Optimal Sharing of Quantity Risk for a Coalition of Wind Power Producers Facing Nodal Prices. In Proceedings of the American Control Conference, Montreal, QC, Canada, 27–29 June 2012. [Google Scholar]
  12. Wang, X.; Liu, Z.; Zhang, H.; Zhao, Y.; Shi, J.; Ding, H. A Review on Virtual Power Plant Concept, Application and Challenges. In Proceedings of the 2019 IEEE Innovative Smart Grid Technologies-Asia (ISGT Asia), Chengdu, China, 21–24 May 2019; pp. 4328–4333. [Google Scholar] [CrossRef]
  13. Nieße, A. Verteilte kontinuierliche Einsatzplanung in Dynamischen Virtuellen Kraftwerken. Ph.D. Thesis, Carl von Ossietzky University Oldenburg, Oldenburg, Germany, 2015. [Google Scholar]
  14. Marinescu, B.; Gomis-Bellmunt, O.; Dörfler, F.; Schulte, H.; Sigrist, L. Dynamic Virtual Power Plant: A New Concept for Grid Integration of Renewable Energy Sources. arXiv 2021, arXiv:2108.00153. [Google Scholar] [CrossRef]
  15. Saito, T.; Yoshida, A.; Kashikawa, T.; Kimura, K.; Amano, Y. Combinatorial Optimization-based Hierarchical Management of Residential Energy Systems as Virtual Power Plant. In Proceedings of the 2020 59th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE), Chiang Mai, Thailand, 23–26 September 2020; pp. 1833–1839. [Google Scholar] [CrossRef]
  16. Tröschel, M. Aktive Einsatzplanung in Holonischen Virtuellen Kraftwerken. Ph.D. Thesis, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany, 2010. [Google Scholar]
  17. Hinrichs, C. Selbstorganisierte Einsatzplanung Dezentraler Akteure im Smart Grid. Ph.D. Thesis, Carl von Ossietzky University Oldenburg, Oldenburg, Germany, 2014. [Google Scholar]
  18. Bremer, J.; Lehnhoff, S. Controlled Self-organization for Steering Local Multi-objective Optimization in Virtual Power Plants. In Highlights in Practical Applications of Agents, Multi-Agent Systems, and Trust-Worthiness. The PAAMS Collection; De La Prieta, F., Mathieu, P., Rincón Arango, J.A., El Bolock, A., Del Val, E., Jordán Prunera, J., Carneiro, J., Fuentes, R., Lopes, F., Julian, V., Eds.; Communications in Computer and Information Science; Springer International Publishing: Berlin/Heidelberg, Germany, 2020; Volume 1233, pp. 314–325. [Google Scholar] [CrossRef]
  19. Goia, B.; Cioara, T.; Anghel, I. Virtual Power Plant Optimization in Smart Grids: A Narrative Review. Future Internet 2022, 14, 128. [Google Scholar] [CrossRef]
  20. Abrell, J.; Rausch, S.; Streitberger, C. The Economic and Climate Value of Flexibility in Green Energy Markets. Environ. Resour. Econ. 2022, 83, 289–312. [Google Scholar] [CrossRef]
  21. Degefa, M.Z.; Sperstad, I.B.; Sæle, H. Comprehensive classifications and characterizations of power system flexibility resources. Electr. Power Syst. Res. 2021, 194, 107022. [Google Scholar] [CrossRef]
  22. Tiemann, P.H.; Nebel-Wenner, M.; Holly, S.; Frost, E.; Jimenez Martinez, A.; Nieße, A. Operational Flexibility for Multi-Purpose Usage of Pooled Battery Storage Systems. Energy Inf. 2022, 5, 14. [Google Scholar] [CrossRef]
  23. Englberger, S.; Jossen, A.; Hesse, H. Unlocking the Potential of Battery Storage with the Dynamic Stacking of Multiple Applications. Cell Rep. Phys. Sci. 2020, 1, 100238. [Google Scholar] [CrossRef]
  24. Hart, J.; Bull, P.; Albers, K.; Sevilla, S. Multiple-Use Applications of Distributed Energy Resources to Meet Customer Needs and Provide Utility and Grid Services. Nat. Gas Electr. 2019, 36, 19–28. [Google Scholar] [CrossRef]
  25. Chalkiadakis, G.; Elkind, E.; Markakis, E.; Jennings, N.R. Overlapping Coalition Formation. In Internet and Network Economics, Proceedings of the 4th International Workshop, WINE 2008, Shanghai, China, 17–20 December 2008; Papadimitriou, C., Zhang, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2008; pp. 307–321. [Google Scholar]
  26. Mahdiraji, H.A.; Razghandi, E.; Hatami-Marbini, A. Overlapping coalition formation in game theory: A state-of-the-art review. Expert Syst. Appl. 2021, 174, 114752. [Google Scholar] [CrossRef]
  27. Rahwan, T.; Michalak, T.P.; Wooldridge, M.; Jennings, N.R. Coalition structure generation: A survey. Artif. Intell. 2015, 229, 139–174. [Google Scholar] [CrossRef]
  28. Shehory, O.M.; Sycara, K.; Jha, S. Multi-agent coordination through coalition formation. In Intelligent Agents IV Agent Theories, Architectures, and Languages; Singh, M.P., Rao, A., Wooldridge, M.J., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1998; Volume 1365, pp. 143–154. [Google Scholar] [CrossRef]
  29. Aziz, H.; de Keijzer, B. Complexity of Coalition Structure Generation. In Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems, Taipei, Taiwan, 2–6 May 2011; International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, USA, 2011; Volume 1, pp. 191–198. [Google Scholar]
  30. Rahwan, T.; Jennings, N.R. Coalition Structure Generation: Dynamic Programming Meets Anytime Optimization. In Proceedings of the 23rd National Conference on Artificial Intelligence, Chicago, IL, USA, 13–17 July 2008; AAAI Press: Chicago, IL, USA, 2008; Volume 1, pp. 156–161. [Google Scholar]
  31. Krausburg, T.; Bordini, R.H. Constrained coalition formation among heterogeneous agents for the multi-agent programming contest. In Proceedings of the 11th International Conference on Agents and Artificial Intelligence, Prague, Czech Republic, 19–21 February 2019; Volume 1, pp. 162–169, ISBN 9789897583506. [Google Scholar] [CrossRef]
  32. Capezzuto, L.; Tarapore, D.; Ramchurn, S. Anytime and Efficient Coalition Formation with Spatial and Temporal Constraints. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Cham, Switzerland, 2020; Volume 12520 LNAI, pp. 589–606. ISBN 9783030664114. [Google Scholar] [CrossRef]
  33. Hausken, K. The Shapley value of coalitions to other coalitions. Humanit. Soc. Sci. Commun. 2020, 7, 104. [Google Scholar] [CrossRef]
  34. Chalkiadakis, G.; Elkind, E.; Markakis, E.; Polukarov, M.; Jennings, N.R. Stability of overlapping coalitions. ACM SIGecom Exch. 2009, 8, 1–5. [Google Scholar] [CrossRef]
  35. Bremer, J.; Lehnhoff, S. Decentralized Coalition Formation with Agent-based Combinatorial Heuristics. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 2017, 6, 29–44. [Google Scholar] [CrossRef]
  36. Li, X.; Soh, L.K. Investigating reinforcement learning in multiagent coalition formation. In Proceedings of the AAAI Conference on Artificial Intelligence, San Jose, CA, USA, 25–29 July 2004; Volume WS-04-06, pp. 22–28. [Google Scholar]
  37. Arnold, T.; Schwalbe, U. Dynamic coalition formation and the core. J. Econ. Behav. Organ. 2002, 49, 363–380. [Google Scholar] [CrossRef]
  38. Capezzuto, L.; Tarapore, D.; Ramchurn, S.D. Large-Scale, Dynamic and Distributed Coalition Formation with Spatial and Temporal Constraints. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Cham, Switzerland, 2021; Volume 12802 LNAI, pp. 108–125. ISBN 9783030822538. [Google Scholar] [CrossRef]
  39. Lehnhoff, S.; Klingenberg, T.; Blank, M.; Calabria, M.; Schumacher, W. Distributed coalitions for reliable and stable provision of frequency response reserve. In Proceedings of the 2013 IEEE International Workshop on Inteligent Energy Systems (IWIES), Vienna, Austria, 14 November 2013; pp. 11–18. [Google Scholar] [CrossRef]
  40. Mihailescu, R.C.; Vasirani, M.; Ossowski, S. Dynamic Coalition Adaptation for Efficient Agent-Based Virtual Power Plants. In Multiagent System Technologies, Proceedings of the Multiagent System Technologies, 8th German Conference, MATES 2011, Leipzig, Germany, 6–7 October 2011; Klügl, F., Ossowski, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 101–112. [Google Scholar] [CrossRef]
  41. Shabanzadeh, M.; Sheikh-El-Eslami, M.K.; Haghifam, M.R. A medium-term coalition-forming model of heterogeneous DERs for a commercial virtual power plant. Appl. Energy 2016, 169, 663–681. [Google Scholar] [CrossRef]
  42. Valinejad, J.; Marzband, M.; Korkali, M.; Xu, Y.; Saad Al-Sumaiti, A. Coalition Formation of Microgrids with Distributed Energy Resources and Energy Storage in Energy Market. J. Mod. Power Syst. Clean Energy 2020, 8, 906–918. [Google Scholar] [CrossRef]
  43. Vinyals, M.; Bistaffa, F.; Farinelli, A.; Rogers, A. Coalitional energy purchasing in the smart grid. In Proceedings of the 2012 IEEE International Energy Conference and Exhibition (ENERGYCON), Florence, Italy, 9–12 September 2012; pp. 848–853. [Google Scholar] [CrossRef]
  44. Lee, S.H.S.; Deng, J.D.; Purvis, M.K.; Purvis, M. Hierarchical population-based learning for optimal large-scale coalition structure generation in smart grids. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Berlin/Heidelberg, Germany, 2018; Volume 11320. [Google Scholar] [CrossRef]
  45. Sadeghi, M.; Erol-Kantarci, M. Deep Reinforcement Learning Based Coalition Formation for Energy Trading in Smart Grid. In Proceedings of the 2021 IEEE 4th 5G World Forum (5GWF), Montreal, QC, Canada, 13–15 October 2021; pp. 200–205. [Google Scholar] [CrossRef]
  46. Sandholm, T.; Larson, K.; Andersson, M.; Shehory, O.; Tohmé, F. Coalition structure generation with worst case guarantees. Artif. Intell. 1999, 111, 209–238. [Google Scholar] [CrossRef]
  47. Shapley, L.S. 17. A Value for n-Person Games. In Contributions to the Theory of Games (AM-28), Volume II; Kuhn, H.W., Tucker, A.W., Eds.; Princeton University Press: Princeton, NJ, USA, 1953; pp. 307–318. [Google Scholar] [CrossRef]
  48. Blank, M. Reliability Assessment of Coalitions for the Provision of Ancillary Services. Ph.D. Thesis, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany, 2015. [Google Scholar]
  49. Tošić, P.T.; Ordonez, C. Distributed Protocols for Multi-Agent Coalition Formation: A Negotiation Perspective. In Active Media Technology, Proceedings of the Active Media Technology 8th International Conference, AMT 2012, Macau, China, 4–7 December 2012; Huang, R., Ghorbani, A.A., Pasi, G., Yamaguchi, T., Yen, N.Y., Jin, B., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; pp. 93–102. [Google Scholar]
Figure 1. Example of coalition formation as an aggregation concept for DERs in a low-voltage grid to provide different electricity products P n with agents controlling different types of units.
Figure 1. Example of coalition formation as an aggregation concept for DERs in a low-voltage grid to provide different electricity products P n with agents controlling different types of units.
Energies 16 06289 g001
Figure 2. Representation of planning horizon and product horizon in relation to real time based on [9].
Figure 2. Representation of planning horizon and product horizon in relation to real time based on [9].
Energies 16 06289 g002
Figure 3. DYCE activity diagram showing the four phases and the flow of the phases within DYCE based on [9].
Figure 3. DYCE activity diagram showing the four phases and the flow of the phases within DYCE based on [9].
Energies 16 06289 g003
Figure 4. Value maximization level for disjoint and overlapping coalitions based on [9].
Figure 4. Value maximization level for disjoint and overlapping coalitions based on [9].
Energies 16 06289 g004
Figure 5. Product portfolio P P as a result of COPE with only one product at a time and product portfolio P P + allowing multiple products per time.
Figure 5. Product portfolio P P as a result of COPE with only one product at a time and product portfolio P P + allowing multiple products per time.
Energies 16 06289 g005
Table 1. Comparison of the presented literature regarding the aggregation of DERs based on the formation of coalitions. The criterion was fully implemented: ✓. The criterion was not implemented: ✗. The criterion was partially implemented: (✓).
Table 1. Comparison of the presented literature regarding the aggregation of DERs based on the formation of coalitions. The criterion was fully implemented: ✓. The criterion was not implemented: ✗. The criterion was partially implemented: (✓).
PaperObjectiveEndogenous vs. ExogenousCSG SolutionDynamic AggregationReflection of Grid TopologyFlexibility AssessmentReliability AssessmentOverlapping CF
[35]Fit a given set of energy productsEndogenousCombinatorial heuristics
[39]Reliable provision of primary frequency controlExogenousSolving multi-objective optimization problem(✓)
[9]Fulfill power productsEndogenousHeuristic(✓)
[40]Integration of DERsEndogenousNegotiation mechanism
[41]Maximize expected profitExogenousSolving an MILP
[42]Profit maximizationExogenousSolving bilevel optimization problem
[43]Discount through collective buyingMixedInteger programming(✓)
[44]Reduction in the variability of RESExogenousTop k merit weighting PBIL
[45]Cost minimization in peer-to-peer energy tradingExogenousDeep Q-learning-based coalition formation(✓)
Table 2. Summary of all extensions required for overlapping coalition formation in DYCE, grouped by phase in DYCE.
Table 2. Summary of all extensions required for overlapping coalition formation in DYCE, grouped by phase in DYCE.
PhaseExtension
1. Product Portfolio ManagementExtended product portfolio P P + as part of the formal model to formalize the possibility of trading multiple products at a time.
Method for solving the resource allocation problem, which considers several products at a time and provides a product portfolio as a result.
2. Neighborhood FormationMaintaining different neighborhoods to consider different constraints for different types of electricity products.
Investigation of different values for the ratio r i n i t , since the communication overhead for overlapping coalitions could be larger.
3. Coalition FormationExtended agent contribution function c o n a + as part of the formal model to formalize the possibility of agent contributions to multiple coalitions at a time.
Heuristic for finding an optimal CS with overlapping coalitions that takes into account the characteristics of OCF, such as transitive dependencies agents in multiple coalitions.
4. Payoff DistributionNo extensions necessary.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wolff, T.; Nieße, A. Dynamic Overlapping Coalition Formation in Electricity Markets: An Extended Formal Model. Energies 2023, 16, 6289. https://doi.org/10.3390/en16176289

AMA Style

Wolff T, Nieße A. Dynamic Overlapping Coalition Formation in Electricity Markets: An Extended Formal Model. Energies. 2023; 16(17):6289. https://doi.org/10.3390/en16176289

Chicago/Turabian Style

Wolff, Torge, and Astrid Nieße. 2023. "Dynamic Overlapping Coalition Formation in Electricity Markets: An Extended Formal Model" Energies 16, no. 17: 6289. https://doi.org/10.3390/en16176289

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop