Optimizing user participation incentives and system performance while ensuring data privacy is a significant challenge in resource-constrained data-sharing systems. We propose a strategy that combines federated learning, differential privacy, and a Stackelberg game-based incentive mechanism. This method balances user utility, resource consumption, and privacy protection under the constraints outlined in the problem model. Our key innovation is the seamless integration of these components to address the multi-objective optimization problem in resource-constrained environments.
Traditional centralized data-sharing systems require users to upload their raw data to a central server for processing. This raises significant privacy concerns and leads to excessive resource consumption on user devices. Federated learning enables users to train a global model collaboratively without sharing their raw data, which addresses these issues. We enhance privacy protection by incorporating differential privacy into the federated learning process.
Other key variables are essential for understanding the model. The local model parameters of user $i$ are denoted by $\mathbf{w}_i \in \mathbb{R}^d$, where $d$ represents the dimensionality of the parameter space. After incorporating differential privacy mechanisms, the parameters are transformed into noisy model parameters, denoted by $\tilde{\mathbf{w}}_i$. The platform aggregates these local parameters from all users to obtain the global model parameters, represented as $\mathbf{w}$. Each user $i$ introduces Gaussian noise with standard deviation $\sigma_i$ to their parameters to ensure privacy. The sensitivity of the loss function for user $i$, which captures the impact of adding or removing a single sample, is denoted by $\Delta f_i$. For a given data sample $(x_j, y_j)$, the loss function is expressed as $\ell(\mathbf{w}_i; x_j, y_j)$, reflecting how well the model fits the data. To incentivize participation, the platform allocates a reward $R_i$ to user $i$. However, the chosen privacy level $\epsilon_i$ incurs a privacy cost for the user, represented by $C_i^{\text{priv}}$. Balancing these various aspects, the user's utility function incorporates weighting coefficients $\alpha_i$, $\beta_i$, and $\gamma_i$, which govern the trade-offs between incentives, resource consumption, and privacy costs, ensuring the model aligns with system goals and user interests.
Our federated learning process consists of the following steps, aiming to balance model performance, resource consumption, and data privacy.
4.1.1. Differential Privacy and Federated Learning Mechanism
This mechanism is designed to satisfy the privacy protection constraint in Equation (5) of the problem model by ensuring that each user's contribution to the global model is differentially private. It also addresses the resource constraints in Equations (1) and (2) by optimizing resource consumption during local training and communication.
Each user $i$ trains a local model by minimizing a local loss function $F_i(\mathbf{w}_i)$:
$$F_i(\mathbf{w}_i) = \frac{1}{|D_i|} \sum_{(x_j, y_j) \in D_i} \ell(\mathbf{w}_i; x_j, y_j),$$
where $\ell(\mathbf{w}_i; x_j, y_j)$ could be, for example, the mean squared error (MSE) for regression tasks:
$$\ell(\mathbf{w}_i; x_j, y_j) = \left( y_j - \mathbf{w}_i^{\top} x_j \right)^2,$$
or the cross-entropy loss for classification tasks:
$$\ell(\mathbf{w}_i; x_j, y_j) = -\left[ y_j \log \sigma(\mathbf{w}_i^{\top} x_j) + (1 - y_j) \log\!\left( 1 - \sigma(\mathbf{w}_i^{\top} x_j) \right) \right],$$
where $\sigma(\cdot)$ denotes the sigmoid function. Users perform multiple local iterations of stochastic gradient descent (SGD) to minimize $F_i(\mathbf{w}_i)$:
$$\mathbf{w}_i^{(k+1)} = \mathbf{w}_i^{(k)} - \eta \nabla F_i\!\left(\mathbf{w}_i^{(k)}\right),$$
where $\eta$ is the learning rate and $k$ indexes the local iterations. To protect user privacy, each user applies a differential privacy mechanism to their model updates. Specifically, each user adds Gaussian noise to their model parameters:
$$\tilde{\mathbf{w}}_i = \mathbf{w}_i + \mathcal{N}\!\left(0, \sigma_i^2 I_d\right),$$
where $\mathcal{N}(0, \sigma_i^2 I_d)$ denotes a multivariate Gaussian distribution with mean zero and covariance matrix $\sigma_i^2 I_d$, and $I_d$ is the identity matrix of size $d \times d$.
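To make these local steps concrete, the following sketch implements a user's local SGD pass and Gaussian perturbation in NumPy. It is a minimal illustration, not the paper's implementation: the MSE loss, data shapes, and function names (`local_sgd`, `add_gaussian_noise`) are our own assumptions.

```python
import numpy as np

def local_sgd(w, X, y, eta=0.05, local_iters=200, rng=None):
    """Run `local_iters` SGD steps on an MSE loss (illustrative choice)."""
    rng = rng or np.random.default_rng(0)
    n = len(y)
    for _ in range(local_iters):
        j = rng.integers(n)                      # sample one data point
        grad = 2 * (X[j] @ w - y[j]) * X[j]      # gradient of (y - w^T x)^2
        w = w - eta * grad                       # SGD update
    return w

def add_gaussian_noise(w, sigma, rng=None):
    """Perturb parameters with N(0, sigma^2 I_d) before upload."""
    rng = rng or np.random.default_rng(1)
    return w + rng.normal(0.0, sigma, size=w.shape)

# Toy usage: d = 5 model dimension, 100 local samples (hypothetical sizes).
rng = np.random.default_rng(42)
X, w_true = rng.normal(size=(100, 5)), np.ones(5)
y = X @ w_true + 0.1 * rng.normal(size=100)
w_local = local_sgd(np.zeros(5), X, y, rng=rng)
w_noisy = add_gaussian_noise(w_local, sigma=0.05, rng=rng)
```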
The $\ell_2$-sensitivity $\Delta f_i$ is defined as
$$\Delta f_i = \max_{D_i, D_i'} \left\| f(D_i) - f(D_i') \right\|_2,$$
where $D_i$ and $D_i'$ are neighboring datasets differing in a single data point. For functions that are averages over data samples, the sensitivity can be bounded as
$$\Delta f_i \leq \frac{2C}{|D_i|},$$
where $C$ bounds the magnitude of each per-sample term. Assuming the loss function is bounded and the data are normalized, $\Delta f_i$ can be considered a known constant. To achieve $(\epsilon_i, \delta)$-differential privacy with a failure probability $\delta$, the noise scale $\sigma_i$ is calculated as
$$\sigma_i = \frac{\Delta f_i \sqrt{2 \ln(1.25/\delta)}}{\epsilon_i}.$$
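This calibration translates directly into code. The sketch below computes the Gaussian-mechanism noise scale from a sensitivity bound; the averaged-loss bound $2C/|D_i|$ follows the assumption just stated, and the concrete parameter values are illustrative.

```python
import math

def noise_scale(delta_f, epsilon, delta):
    """Gaussian mechanism: sigma = delta_f * sqrt(2 ln(1.25/delta)) / epsilon."""
    return delta_f * math.sqrt(2 * math.log(1.25 / delta)) / epsilon

def sensitivity_of_average(C, n):
    """Sensitivity bound 2C/n for an average of per-sample terms bounded by C."""
    return 2 * C / n

# Example: loss bounded by C = 1 on 1000 samples, target (1.0, 1e-5)-DP.
sigma = noise_scale(sensitivity_of_average(1.0, 1000), epsilon=1.0, delta=1e-5)
print(f"noise scale sigma = {sigma:.5f}")   # ~0.00969
```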
Lemma 1. Adding Gaussian noise as per Equation (17) ensures that the mechanism satisfies $(\epsilon, \delta)$-differential privacy [42].

Proof. To establish that adding Gaussian noise to the function $f$ satisfies $(\epsilon, \delta)$-differential privacy, we proceed by applying the Gaussian mechanism for differential privacy. According to the Gaussian mechanism, a function $f$ achieves $(\epsilon, \delta)$-differential privacy if Gaussian noise with standard deviation $\sigma$ is added, where the noise distribution is $\mathcal{N}(0, \sigma^2 I_d)$. The relationship between the noise scale $\sigma$ and the desired privacy parameters $\epsilon$ and $\delta$ is given by
$$\sigma = \frac{\Delta f \sqrt{2 \ln(1.25/\delta)}}{\epsilon},$$
where $\Delta f$ denotes the $\ell_2$-sensitivity of the function $f$, defined as the maximum possible change in the output of $f$ over any two neighboring datasets $D$ and $D'$. For our specific case, the sensitivity is $\Delta f_i$, which corresponds to the sensitivity of the loss function used in our federated learning framework.

Setting the noise scale according to the above equation ensures that the amount of noise added aligns with the desired privacy parameters. Thus, the noisy version of the function $f$, denoted as $\tilde{f}(D) = f(D) + \mathcal{N}(0, \sigma^2 I_d)$, meets the privacy requirements. To verify this formally, consider two neighboring datasets $D$ and $D'$ that differ in at most one element. For any measurable subset $S$, the Gaussian mechanism ensures that the probability of the noisy function's output falling within $S$ is bounded by
$$\Pr\!\left[\tilde{f}(D) \in S\right] \leq e^{\epsilon} \Pr\!\left[\tilde{f}(D') \in S\right] + \delta.$$
This inequality shows that the presence or absence of a single element in the dataset has a limited impact on the probability distribution of the noisy output, controlled by the parameters $\epsilon$ and $\delta$. The term $e^{\epsilon}$ ensures that the probability ratio between the two neighboring datasets remains bounded, while $\delta$ accounts for a small probability of failure where the guarantee might not hold. The Gaussian noise with variance $\sigma^2$, scaled as described, ensures that this bound is respected. Consequently, the noisy function satisfies the definition of $(\epsilon, \delta)$-differential privacy, as required. □
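As an informal sanity check on the lemma (not part of the proof), one can evaluate the $(\epsilon, \delta)$ inequality numerically in one dimension. For two Gaussians with shifted means, the worst-case sets are half-lines, so scanning lower-tail sets $S = (-\infty, t]$ suffices; the values of $\Delta f$, $\epsilon$, and $\delta$ below are arbitrary test inputs.

```python
import math

def gaussian_cdf(x, mu, sigma):
    """P[N(mu, sigma^2) <= x] via the complementary error function."""
    return 0.5 * math.erfc((mu - x) / (sigma * math.sqrt(2)))

delta_f, eps, delta = 1.0, 0.5, 1e-5          # test inputs (eps < 1, classical regime)
sigma = delta_f * math.sqrt(2 * math.log(1.25 / delta)) / eps

# Neighboring outputs f(D) = 0 and f(D') = delta_f; scan lower-tail sets
# S = (-inf, t], the binding direction since the mass of f(D) dominates there.
violation = max(
    gaussian_cdf(t, 0.0, sigma)
    - (math.exp(eps) * gaussian_cdf(t, delta_f, sigma) + delta)
    for t in [sigma * 0.01 * i for i in range(-800, 801)]
)
print("max violation of the (eps, delta) bound:", violation)  # expected <= 0
```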
The platform aggregates the noisy model parameters received from all users to update the global model:
$$\mathbf{w} = \frac{1}{N} \sum_{i=1}^{N} \tilde{\mathbf{w}}_i.$$
This aggregation reduces the variance introduced by the added noise due to the averaging effect. Assuming all $\sigma_i = \sigma$, the noise variance of the aggregated model is reduced by a factor of $N$:
$$\operatorname{Var}\!\left[ \frac{1}{N} \sum_{i=1}^{N} \mathcal{N}(0, \sigma^2 I_d) \right] = \frac{\sigma^2}{N} I_d.$$
The updated global model $\mathbf{w}$ is then distributed back to all users, who update their local models accordingly for the next iteration.
This process ensures that the privacy protection constraint is met, while the resource constraints are managed by controlling the number of local iterations and the communication frequency, thereby optimizing resource consumption.
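The variance-reduction claim is easy to check empirically. The toy sketch below perturbs identical local parameters for $N$ users and compares the empirical noise variance of the aggregate against $\sigma^2/N$; all sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
N, d, sigma = 50, 10_000, 0.1       # users, parameter dimension, noise scale (toy)
w = rng.normal(size=d)              # identical local parameters, to isolate the noise

noisy = w + rng.normal(0.0, sigma, size=(N, d))   # each user's perturbed submission
w_global = noisy.mean(axis=0)                     # platform-side aggregation

emp_var = float(np.mean((w_global - w) ** 2))
print(f"empirical noise variance {emp_var:.2e} vs sigma^2/N = {sigma**2 / N:.2e}")
```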
Under standard assumptions for convex loss functions and bounded gradients, federated learning with differential privacy converges to a neighborhood of the optimal solution. The added noise introduces a bias in estimating the global model, but the impact can be minimized with sufficient iterations and appropriate noise scaling.
Theorem 2 (Convergence of Federated Learning with Differential Privacy). Let $F(\mathbf{w})$ be the global loss function. Under assumptions of Lipschitz continuity and strong convexity of $F$, the federated learning algorithm converges to the global optimum within a bounded error due to noise.

Proof. Assume each local loss function $F_i$ is $L$-smooth, meaning that its gradient is Lipschitz continuous. For any two model parameters $\mathbf{w}$ and $\mathbf{w}'$, the following inequality holds:
$$\left\| \nabla F_i(\mathbf{w}) - \nabla F_i(\mathbf{w}') \right\| \leq L \left\| \mathbf{w} - \mathbf{w}' \right\|.$$
Additionally, assume that $F$ is $\mu$-strongly convex, indicating that the function grows at least quadratically within the parameter space:
$$F(\mathbf{w}') \geq F(\mathbf{w}) + \nabla F(\mathbf{w})^{\top} (\mathbf{w}' - \mathbf{w}) + \frac{\mu}{2} \left\| \mathbf{w}' - \mathbf{w} \right\|^2.$$
The goal is to minimize the global loss function $F(\mathbf{w})$ through federated learning. In each iteration $t$, the model parameters are updated using stochastic gradient descent (SGD) with the following update rule:
$$\mathbf{w}^{(t+1)} = \mathbf{w}^{(t)} - \eta \left( \nabla F(\mathbf{w}^{(t)}) + \xi^{(t)} \right),$$
where $\xi^{(t)}$ represents the noise term drawn from a Gaussian distribution with variance $\sigma^2$. We analyze the convergence by estimating the expected squared distance between the model parameters $\mathbf{w}^{(t)}$ and the optimal solution $\mathbf{w}^*$.

We introduce the following recurrence relation to describe the distance between $\mathbf{w}^{(t)}$ and $\mathbf{w}^*$ at each iteration:
$$\mathbb{E}\!\left[ \left\| \mathbf{w}^{(t+1)} - \mathbf{w}^* \right\|^2 \right] = \mathbb{E}\!\left[ \left\| \mathbf{w}^{(t)} - \eta \left( \nabla F(\mathbf{w}^{(t)}) + \xi^{(t)} \right) - \mathbf{w}^* \right\|^2 \right].$$
Using the assumptions of Lipschitz continuity and strong convexity, we obtain the following bound for the gradient difference:
$$\left\| \nabla F(\mathbf{w}^{(t)}) - \nabla F(\mathbf{w}^*) \right\| \leq L \left\| \mathbf{w}^{(t)} - \mathbf{w}^* \right\|.$$
Substituting this bound into the model update equation and applying the strong convexity condition, we derive the following recursive inequality:
$$\mathbb{E}\!\left[ \left\| \mathbf{w}^{(t+1)} - \mathbf{w}^* \right\|^2 \right] \leq (1 - \eta \mu)\, \mathbb{E}\!\left[ \left\| \mathbf{w}^{(t)} - \mathbf{w}^* \right\|^2 \right] + \eta^2 \sigma^2.$$
By expanding the recurrence relation over $T$ iterations and solving the resulting inequality, we obtain the following upper bound:
$$\mathbb{E}\!\left[ \left\| \mathbf{w}^{(T)} - \mathbf{w}^* \right\|^2 \right] \leq (1 - \eta \mu)^T \left\| \mathbf{w}^{(0)} - \mathbf{w}^* \right\|^2 + \frac{\eta \sigma^2}{\mu}.$$
As $T \to \infty$, the first term in the bound converges to zero, meaning that the parameters converge to a neighborhood around the optimal solution $\mathbf{w}^*$. The radius of this neighborhood is proportional to the noise variance $\sigma^2$, as shown by the following expression:
$$\lim_{T \to \infty} \mathbb{E}\!\left[ \left\| \mathbf{w}^{(T)} - \mathbf{w}^* \right\|^2 \right] \leq \frac{\eta \sigma^2}{\mu}.$$
Therefore, with appropriate noise scaling and a sufficiently small learning rate, the federated learning algorithm converges to a solution that is close to the optimal parameters. This demonstrates that the algorithm achieves stable convergence even in the presence of noise. □
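The behavior described by Theorem 2 can be illustrated on a strongly convex quadratic, where noisy gradient descent contracts at rate $(1 - \eta\mu)$ down to a noise floor. The sketch below uses isotropic $d$-dimensional noise, so the floor scales as $\eta \sigma^2 d / \mu$ rather than the scalar bound above; the objective, step size, and noise level are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
d, mu, eta, sigma, T = 20, 1.0, 0.05, 0.1, 2000
w_star = rng.normal(size=d)           # optimum of F(w) = (mu/2) ||w - w*||^2
w = np.zeros(d)

dists = []
for _ in range(T):
    grad = mu * (w - w_star)                       # exact gradient of the quadratic
    noise = rng.normal(0.0, sigma, size=d)         # DP-style Gaussian perturbation
    w = w - eta * (grad + noise)                   # noisy gradient step
    dists.append(float(np.sum((w - w_star) ** 2)))

# Iterates contract at rate (1 - eta*mu) and settle at a noise floor of
# roughly eta * sigma^2 * d / (2 * mu) for d-dimensional isotropic noise.
print(f"mean squared distance over last 500 steps: {np.mean(dists[-500:]):.5f}")
print(f"noise-floor estimate eta*sigma^2*d/(2*mu):  {eta * sigma**2 * d / (2 * mu):.5f}")
```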
4.1.2. Incentive Mechanism Based on Stackelberg Game Theory
This incentive mechanism addresses the incentive allocation and fairness constraints in Equations (4) and (7) of the problem model by modeling the interaction between the platform and users as a Stackelberg game. The platform (leader) aims to maximize overall system utility, while the users (followers) adjust their strategies to maximize their individual utilities under the specified constraints.
We model the interaction between the platform and the users as a Stackelberg game, where the platform aims to optimize the overall system performance by strategically allocating incentives $R_i$ and determining acceptable privacy levels $\epsilon_i^{\min}$ for each user $i$. The users, in turn, decide on their resource consumption $E_i$ and privacy levels $\epsilon_i$ to maximize their individual utilities, given the platform's strategies. The utility function for each user $i$ is defined as
$$U_i = \alpha_i R_i - \beta_i E_i - \gamma_i C_i^{\text{priv}},$$
where $\alpha_i$, $\beta_i$, and $\gamma_i$ are weighting coefficients reflecting the importance of the incentive, resource consumption, and privacy cost, respectively. The privacy cost $C_i^{\text{priv}}$ is modeled as
$$C_i^{\text{priv}} = \lambda_i \epsilon_i,$$
where $\lambda_i$ is the privacy sensitivity coefficient of user $i$. The resource consumption $E_i$ comprises computation and communication costs:
$$E_i = E_i^{\text{cmp}} + E_i^{\text{com}},$$
where $E_i^{\text{cmp}} = \kappa\, |D_i|\, \tau$ is the computation cost, $|D_i|$ is the size of user $i$'s dataset, $\tau$ is the number of local training epochs, and $\kappa$ is the unit computation cost. The communication cost is $E_i^{\text{com}} = \rho\, d$, with $\rho$ being the unit communication cost and $d$ the dimension of the model parameters.
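The utility and cost model above maps directly to a few lines of code; the coefficient values ($\kappa$, $\rho$, $\alpha_i$, $\beta_i$, $\gamma_i$, $\lambda_i$) below are hypothetical placeholders.

```python
def resource_cost(data_size, epochs, d, kappa=1e-6, rho=1e-4):
    """E_i = kappa * |D_i| * tau  (computation)  +  rho * d  (communication)."""
    return kappa * data_size * epochs + rho * d

def user_utility(R_i, E_i, eps_i, alpha=1.0, beta=0.5, gamma=0.5, lam=2.0):
    """U_i = alpha*R_i - beta*E_i - gamma*C_priv, with C_priv = lam * eps_i."""
    return alpha * R_i - beta * E_i - gamma * lam * eps_i

E_i = resource_cost(data_size=1000, epochs=5, d=100)       # -> 0.015
print(f"E_i = {E_i:.4f}, U_i = {user_utility(1.0, E_i, 0.5):.4f}")
```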
The users aim to maximize their utilities $U_i$ by choosing appropriate $E_i$ and $\epsilon_i$, subject to the following constraints:
$$E_i \leq E_i^{\max},$$
ensuring that the resource consumption does not exceed the maximum allowable limit, and
$$\epsilon_i \geq \epsilon_i^{\min},$$
guaranteeing a minimum level of privacy protection.
The platform's decision variables are the incentives $R_i$ and the acceptable privacy levels $\epsilon_i^{\min}$. The platform is subject to the following constraints:
$$\sum_{i=1}^{N} R_i \leq B,$$
where $B$ is the total incentive budget available to the platform, and
$$|R_i - R_j| \leq \Delta_R, \quad \forall\, i, j,$$
ensuring that the difference in incentives between any two users does not exceed a threshold $\Delta_R$, promoting fairness.
The platform's objective is to maximize the overall system utility, which can be represented as the sum of all users' utilities, while adhering to the budget and fairness constraints:
$$\max_{\{R_i\},\, \{\epsilon_i^{\min}\}} \; \sum_{i=1}^{N} U_i.$$
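A minimal encoding of the platform's feasibility conditions and objective, assuming the utility form defined earlier with the users' resource and privacy terms folded into per-user constants; the names `feasible` and `platform_objective` are our own.

```python
def feasible(R, B, delta_R):
    """Budget: sum(R) <= B; fairness: |R_i - R_j| <= delta_R for all pairs."""
    return sum(R) <= B and (max(R) - min(R)) <= delta_R

def platform_objective(R, alphas, const_costs):
    """Sum of user utilities sum_i (alpha_i * R_i - const_i), where const_i
    collects the resource and privacy terms fixed by the users' responses."""
    return sum(a * r - c for a, r, c in zip(alphas, R, const_costs))

R = [3.0, 3.0, 3.0]                        # candidate equal allocation
print(feasible(R, B=9.0, delta_R=0.1),     # True
      platform_objective(R, alphas=[1.0, 0.9, 1.1], const_costs=[0.5] * 3))
```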
To solve this hierarchical optimization problem, we use backward induction. First, we derive the users' best responses to the platform's strategies by solving their individual optimization problems. Then, we incorporate these responses into the platform's optimization problem to determine the optimal incentives and acceptable privacy levels.
Each user $i$ aims to maximize their utility $U_i$ with respect to $E_i$ and $\epsilon_i$, given $R_i$ and the constraints in Equations (26) and (27). Formally, the user's optimization problem is
$$\max_{E_i,\, \epsilon_i} \; U_i = \alpha_i R_i - \beta_i E_i - \gamma_i \lambda_i \epsilon_i \quad \text{s.t.} \quad E_i \leq E_i^{\max}, \;\; \epsilon_i \geq \epsilon_i^{\min}.$$
Since $R_i$ is given from the platform's strategy, the term $\alpha_i R_i$ is constant with respect to the user's decision variables $E_i$ and $\epsilon_i$. To solve the user's optimization problem, we formulate the Lagrangian function:
$$\mathcal{L}_i = \alpha_i R_i - \beta_i E_i - \gamma_i \lambda_i \epsilon_i + \mu_i \left( E_i^{\max} - E_i \right) + \nu_i \left( \epsilon_i - \epsilon_i^{\min} \right),$$
where $\mu_i \geq 0$ and $\nu_i \geq 0$ are the Lagrange multipliers associated with the resource and privacy constraints, respectively. Taking the partial derivatives of the Lagrangian with respect to $E_i$ and $\epsilon_i$, we have
$$\frac{\partial \mathcal{L}_i}{\partial E_i} = -\beta_i - \mu_i = 0, \qquad \frac{\partial \mathcal{L}_i}{\partial \epsilon_i} = -\gamma_i \lambda_i + \nu_i = 0.$$
From Equation (35), we find $\mu_i = -\beta_i$. Since $\mu_i \geq 0$ and $\beta_i > 0$, this leads to a contradiction unless $\beta_i = 0$, which is not practical. Therefore, the resource constraint is active, and the user consumes the maximum allowable resources:
$$E_i^* = E_i^{\max}.$$
Similarly, from Equation (36), we obtain
$$\nu_i = \gamma_i \lambda_i.$$
Since $\gamma_i > 0$ and $\lambda_i > 0$, it follows that $\nu_i > 0$, so by complementary slackness the privacy constraint is active:
$$\epsilon_i^* = \epsilon_i^{\min}.$$
Thus, the user opts for the minimum acceptable privacy level to maximize utility. Substituting the users' optimal responses into the platform's objective function Equation (32), we have
$$\max_{\{R_i\}} \; \sum_{i=1}^{N} \left( \alpha_i R_i - \beta_i E_i^{\max} - \gamma_i \lambda_i \epsilon_i^{\min} \right).$$
Since $E_i^{\max}$ and $\epsilon_i^{\min}$ are constants, the platform's problem simplifies to maximizing $\sum_{i=1}^{N} \alpha_i R_i$ under the constraints in Equations (29)–(31). To solve this, we observe that maximizing the incentives $R_i$ will maximize the platform's objective, given that $\alpha_i > 0$. Therefore, the platform should allocate the entire budget $B$ to the users.
Considering the fairness constraint Equation (30), an equitable distribution of incentives is
$$R_i^* = \frac{B}{N}, \quad i = 1, \ldots, N,$$
which satisfies $|R_i^* - R_j^*| = 0 \leq \Delta_R$. To ensure that the obtained solutions satisfy the Karush-Kuhn-Tucker (KKT) conditions, we verify the complementary slackness conditions.
For the user, the complementary slackness conditions
$$\mu_i \left( E_i^{\max} - E_i^* \right) = 0, \qquad \nu_i \left( \epsilon_i^* - \epsilon_i^{\min} \right) = 0$$
hold because both constraints are active at the optimum. For the platform, the incentive budget constraint holds with equality, $\sum_{i=1}^{N} R_i^* = B$, and the incentive fairness constraint is satisfied by the equal distribution. Therefore, the KKT conditions are satisfied, confirming that the solutions $E_i^* = E_i^{\max}$, $\epsilon_i^* = \epsilon_i^{\min}$, and $R_i^* = B/N$ are optimal.
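The equilibrium derived above can be assembled and checked programmatically. The sketch below constructs $(R_i^*, E_i^*, \epsilon_i^*)$ per the closed-form solution and verifies feasibility and the active-constraint conditions; the numeric inputs are hypothetical.

```python
def equilibrium(N, B, E_max, eps_min):
    """Stackelberg equilibrium derived in the text: users consume maximum
    resources at the minimum acceptable privacy level; equal budget split."""
    E_star = list(E_max)              # E_i* = E_i^max   (active resource constraint)
    eps_star = list(eps_min)          # eps_i* = eps_i^min (active privacy constraint)
    R_star = [B / N] * N              # R_i* = B/N (budget exhausted, fair split)
    return R_star, E_star, eps_star

def check_kkt(R, E, eps, B, E_max, eps_min, delta_R, tol=1e-9):
    """Feasibility plus complementary slackness for the equilibrium above."""
    ok_budget = abs(sum(R) - B) < tol                       # budget binds
    ok_fair = (max(R) - min(R)) <= delta_R + tol            # fairness
    ok_resource = all(abs(e - em) < tol for e, em in zip(E, E_max))
    ok_privacy = all(abs(p - pm) < tol for p, pm in zip(eps, eps_min))
    return ok_budget and ok_fair and ok_resource and ok_privacy

R, E, eps = equilibrium(N=3, B=9.0, E_max=[1.0, 2.0, 1.5], eps_min=[0.5] * 3)
print(check_kkt(R, E, eps, B=9.0, E_max=[1.0, 2.0, 1.5],
                eps_min=[0.5] * 3, delta_R=0.1))            # True
```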
In the federated learning framework, the users perform local model training using their maximum allowable resources and adopt the minimum acceptable privacy levels. The platform aggregates the users' contributions and updates the global model accordingly. At each iteration $t$, users update their local models:
$$\mathbf{w}_i^{(t+1)} = \mathbf{w}_i^{(t)} - \eta \nabla F_i\!\left(\mathbf{w}_i^{(t)}\right),$$
where $\eta$ is the learning rate and $\nabla F_i(\cdot)$ is the gradient of the local loss function. Then, users add differential privacy noise:
$$\tilde{\mathbf{w}}_i^{(t+1)} = \mathbf{w}_i^{(t+1)} + \mathcal{N}\!\left(0, \sigma_i^2 I_d\right),$$
with the noise scale computed using
$$\sigma_i = \frac{\Delta f_i \sqrt{2 \ln(1.25/\delta)}}{\epsilon_i^{\min}},$$
where $\Delta f_i$ is the sensitivity of the loss function and $\delta$ is the failure probability.
The platform aggregates the noisy models:
$$\mathbf{w}^{(t+1)} = \frac{1}{N} \sum_{i=1}^{N} \tilde{\mathbf{w}}_i^{(t+1)}.$$
Figure 3 presents the architecture of the proposed federated learning framework with a Stackelberg game-based incentive mechanism. The platform, positioned at the top, acts as the leader, allocating incentives and setting privacy levels. Each user device, represented by rounded rectangles, performs local model training and applies differential privacy before sending noisy parameters to the platform. The platform aggregates these parameters to form the global model. This figure illustrates the interaction flow between the platform and user devices, balancing incentives, resource consumption, and privacy protection to optimize learning performance.
Algorithm 1, Federated Learning with Stackelberg Game-Based Incentive Mechanism (FL-SGIM), integrates a federated learning process with a Stackelberg game-based incentive mechanism. It aims to efficiently train a global model while balancing user participation, privacy protection, and resource usage. By encouraging active user participation within their resource and privacy constraints, the algorithm ensures fairness and scalability.
Algorithm 1 FL-SGIM for Model Optimization
1: Input: Initial global model parameters $\mathbf{w}^{(0)}$, incentive budget $B$, minimum privacy levels $\epsilon_i^{\min}$, maximum resource capacities $E_i^{\max}$, learning rate $\eta$, convergence threshold $\theta$.
2: Output: Optimized global model $\mathbf{w}^*$.
3: Initialize incentives $R_i = B/N$ for all users $i$.
4: Set privacy levels $\epsilon_i = \epsilon_i^{\min}$ for all users.
5: Initialize iteration counter $t = 0$.
6: repeat
7:   $t \leftarrow t + 1$
8:   for each user $i$ in parallel do
9:     Perform local training using Equation (46).
10:     Compute noise scale $\sigma_i$ using Equation (48).
11:     Generate noisy model parameters $\tilde{\mathbf{w}}_i^{(t)}$ using Equation (47).
12:     Ensure resource consumption $E_i \leq E_i^{\max}$.
13:     Send $\tilde{\mathbf{w}}_i^{(t)}$ to the platform.
14:   end for
15:   Aggregate the global model using Equation (49).
16: until $\left\| \mathbf{w}^{(t)} - \mathbf{w}^{(t-1)} \right\| < \theta$
17: return Optimized global model $\mathbf{w}^* = \mathbf{w}^{(t)}$.
The algorithm starts by evenly distributing the total incentive budget, giving each user $R_i = B/N$ (Line 3). It assigns the minimum privacy level $\epsilon_i^{\min}$ to all users to ensure baseline privacy protection (Line 4) and initializes the iteration counter $t = 0$ (Line 5). During each iteration, the counter $t$ is incremented (Line 7). All users $i$ perform parallel local training using stochastic gradient descent (SGD) as per Equation (46) (Line 9). To maintain differential privacy, each user computes the noise scale $\sigma_i$ (Line 10) and adds Gaussian noise to generate private model parameters $\tilde{\mathbf{w}}_i^{(t)}$ (Line 11). Users ensure their resource consumption $E_i$ stays within the allowed limit $E_i^{\max}$ (Line 12) and send the noisy parameters $\tilde{\mathbf{w}}_i^{(t)}$ to the platform (Line 13). The platform aggregates the models using Equation (49) (Line 15) and checks whether the difference between the current and previous global models, measured by $\| \mathbf{w}^{(t)} - \mathbf{w}^{(t-1)} \|$, falls below the threshold $\theta$ (Line 16). If not, further iterations are performed. Once convergence is achieved, the optimized global model $\mathbf{w}^*$ is returned, completing the training process (Line 17).
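Putting the pieces together, the following toy instantiation of Algorithm 1 runs the full FL-SGIM loop under the simplifying assumptions used in our earlier examples (quadratic local losses, equal incentives, a fixed minimum privacy level). It is a sketch of the control flow, not the authors' code; the convergence threshold is kept loose because the differential privacy noise bounds the achievable per-round change.

```python
import numpy as np

def fl_sgim(datasets, d, B, eps_min, delta=1e-5, eta=0.05,
            local_iters=20, theta=0.1, max_rounds=200, seed=0):
    """Toy FL-SGIM loop: equal incentives, minimum privacy levels,
    Gaussian-perturbed local MSE models, mean aggregation."""
    rng = np.random.default_rng(seed)
    N = len(datasets)
    R = [B / N] * N            # Line 3: equal incentives (recorded; not in update)
    w_global = np.zeros(d)
    for t in range(1, max_rounds + 1):                  # Lines 6-7
        noisy = []
        for X, y in datasets:                           # Line 8 (serial stand-in)
            w = w_global.copy()
            for _ in range(local_iters):                # Line 9: local SGD (MSE)
                j = rng.integers(len(y))
                w -= eta * 2 * (X[j] @ w - y[j]) * X[j]
            delta_f = 2.0 / len(y)                      # averaged-loss sensitivity
            sigma = delta_f * np.sqrt(2 * np.log(1.25 / delta)) / eps_min  # Line 10
            noisy.append(w + rng.normal(0.0, sigma, d))                    # Line 11
        w_new = np.mean(noisy, axis=0)                  # Line 15: aggregation
        if np.linalg.norm(w_new - w_global) < theta:    # Line 16: convergence test
            return w_new, t
        w_global = w_new
    return w_global, max_rounds

# Toy data: 5 users sharing a linear model w_true (hypothetical setup).
rng = np.random.default_rng(1)
d, w_true = 5, np.ones(5)
datasets = []
for _ in range(5):
    X = rng.normal(size=(200, d))
    datasets.append((X, X @ w_true + 0.1 * rng.normal(size=200)))

w_out, rounds = fl_sgim(datasets, d, B=10.0, eps_min=1.0)
print(f"stopped after {rounds} rounds; error = {np.linalg.norm(w_out - w_true):.3f}")
```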
The computational complexity for each user is $O(|D_i| \cdot d)$ per round, where $|D_i|$ is the data size and $d$ is the model dimension. The communication complexity per user is $O(d)$ per round. The overall complexity scales linearly with the number of users $N$, making the algorithm efficient for large-scale systems.
Under the standard assumptions of convexity and Lipschitz continuity of the loss functions, the algorithm converges to a global model within a bounded error. The noise introduced by differential privacy diminishes with the number of users due to the averaging effect during aggregation. The choice of parameters such as the privacy budget $\epsilon$ involves a trade-off between privacy protection and model accuracy: a smaller $\epsilon$ offers stronger privacy but introduces more noise, degrading model performance. Careful tuning of the learning rate $\eta$ and convergence threshold $\theta$ ensures the stability and efficiency of the learning process.
This algorithm integrates a Stackelberg game-based incentive mechanism into federated learning, addressing key user participation, privacy, and resource allocation challenges. It ensures fair and transparent incentive distribution while maintaining user privacy through differential privacy techniques. The algorithm also accommodates user heterogeneity by managing resource consumption limits and supporting personalized incentives. The platform achieves a scalable, efficient, and privacy-preserving federated learning system through this approach.
In this method, the incentive allocation mechanism between the platform and users is implemented by introducing Stackelberg game theory. Specifically, as the leader of the game, the platform first formulates incentive strategies $R_i$ and privacy levels $\epsilon_i^{\min}$ based on the overall goals and constraints of the system. When formulating these strategies, the platform takes into account the users' optimal responses, that is, how users optimize their resource consumption $E_i$ and privacy choices $\epsilon_i$ to maximize their own utility under the given incentives and privacy requirements. As followers of the game, users choose their optimal resource consumption and privacy level based on their own utility functions once the platform's strategy is determined.
Through this leader–follower game structure, the platform is able to anticipate and guide user behavior, thereby optimizing overall system utility. In each round of the game, the platform adjusts the incentive amounts and privacy parameters based on the current incentive allocation and privacy requirements, encouraging users to strike a balance between resource utilization and privacy protection. Because the platform's incentive allocation strategy is based on the users' optimal responses, the system ultimately reaches an equilibrium in which incentive allocation is both fair and transparent while meeting the budget and fairness constraints. By combining the Shapley-value-based dynamic incentive allocation model, the platform can not only quantify each user's contribution but also ensure fairness and efficient use of resources in the incentive allocation process through game-theoretic methods, thereby enhancing user trust and willingness to participate.