In this section, we conduct simulated experiments to evaluate the utility and the robustness of our proposed DP-DHR framework.
In certain fields where sensitive information is collected, such as finance and healthcare, ensuring cyberspace security and data privacy is of the utmost importance. In these contexts, the frequency features of private data are critical factors to consider. For example, the incidence rate of a specific disease in healthcare, or the proportion of high- or low-income individuals in finance, are crucial real-world statistical measures. Therefore, in this section, we consider ‘frequency statistics’ as the executor functionality of the DP-DHR framework and analyze its utility.
To simulate our experiment, we generate a dataset consisting of 100 data points, each with 30 features. This dataset can be represented as a 100 × 30 matrix whose element in the i-th row and j-th column takes a binary value (0 or 1). This representation is common in real-life situations, such as medical databases that record the presence or absence of diseases in a population, or financial databases that encode salary ranges for various individuals.
In all the experiments described below, we fix the candidate percentage parameter (introduced in Algorithm 2) as , indicating that of the perturbed intermediate results are selected as candidates C.
5.2. Two-Dimensional Outputs
If we consider two-dimensional outputs, the quantities of interest are the frequency of data points with a value of 1 in the first dimension and the corresponding frequency in the second dimension. The expected output of the system can therefore be represented as the pairwise frequency formed by these two values. For instance, for the database D given in (10), the expected output of the system is the corresponding pair of frequencies. This type of frequency is often utilized when assessing the concordance property of disease prevalence.
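As a concrete illustration of this executor functionality, the following sketch computes the pairwise frequency from a binary dataset. The dataset, seed, and function name are hypothetical stand-ins, not taken from the paper's implementation.

```python
import random

def pairwise_frequency(data):
    """Fraction of records with value 1 in the first and second features."""
    n = len(data)
    f1 = sum(row[0] for row in data) / n
    f2 = sum(row[1] for row in data) / n
    return (f1, f2)

# A toy stand-in for the 100 x 30 binary dataset described above.
random.seed(0)
dataset = [[random.randint(0, 1) for _ in range(30)] for _ in range(100)]
print(pairwise_frequency(dataset))
```

Each honest executor would evaluate this functionality on the private database before noise is injected.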
In our simulated database, the traditional DHR architecture (referred to as T-DHR), without being subjected to any attacks, produces the exact (unperturbed) pairwise frequency as its output. The disparity between the outputs of the DP-DHR framework and the T-DHR architecture is illustrated in Figure 4.
Figure 4a depicts the outputs obtained at various privacy budgets ε, with a randomly chosen number of executors k. The green point in the figure represents the output produced by the T-DHR architecture, while the red points correspond to the outputs generated by the DP-DHR framework for different values of ε, with the respective values of ε marked next to the points for reference.
Similarly, Figure 4b shows the outputs at different numbers of executors k, with a randomly chosen privacy budget ε. The green point in the figure represents the output of the T-DHR architecture, whereas the red points denote the outputs generated by the DP-DHR framework, with the respective values of k marked adjacent to the red points.
As shown in Figure 4, the difference between the outputs generated by the T-DHR architecture and the DP-DHR framework (measured by the Euclidean distance) diminishes as ε and k increase, respectively. This observation is similar to the phenomenon discussed in the experiments involving ‘1-dimensional outputs’ in Section 5.1; moreover, the former (Figure 4a) demonstrates the tradeoff between utility and privacy established in Remark 3.
As is evident, the hypersphere clustering method plays a crucial role in the advanced decision strategy outlined in Algorithm 1. Thus, our focus is on evaluating the utility of our proposed clustering method under both normal (unattacked) and attacked conditions.
To begin, we conduct experiments under normal conditions. To evaluate the performance of our proposed hypersphere clustering method, we randomly choose the privacy budget ε and showcase the clustering results for different numbers of executors k = 10, 50, 100, 200, 300, 400.
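The details of Algorithm 2 are not reproduced here, but its candidate/outlier split can be sketched with a simple distance-based stand-in: center a hypersphere at a robust anchor (the coordinate-wise median, an assumption of ours) and keep the closest fraction of points as candidates. The true output, noise scale, and candidate ratio below are hypothetical.

```python
import math
import random
import statistics

def hypersphere_cluster(points, candidate_ratio):
    """Simplified stand-in for the hypersphere clustering of Algorithm 2.

    Keeps the `candidate_ratio` fraction of points closest to the
    coordinate-wise median as candidates C; the rest become outliers O.
    """
    dims = range(len(points[0]))
    center = [statistics.median(p[d] for p in points) for d in dims]
    ranked = sorted(points, key=lambda p: math.dist(p, center))
    m = max(1, round(candidate_ratio * len(points)))
    return ranked[:m], ranked[m:]

# k perturbed 2-D intermediate results around a hypothetical true output.
random.seed(0)
true_out = (0.6, 0.4)
results = [(true_out[0] + random.gauss(0, 0.2),
            true_out[1] + random.gauss(0, 0.2)) for _ in range(100)]
C, O = hypersphere_cluster(results, 0.6)
print(len(C), len(O))
```

The actual algorithm may choose the sphere differently; the point is only that nearby results are retained and far-off ones are rejected.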
The clustering results are shown in Figure 5. The blue points are the perturbed intermediate results classified as candidates C, while the red points represent the perturbed intermediate results classified as outliers O. The green points represent the outputs of the T-DHR architecture (as mentioned before), and the brown points represent the outputs generated by the DP-DHR framework.
In Figure 5a–f, we consider the candidate percentage mentioned above; accordingly, the number of blue points is slightly larger than this percentage of k, as per Algorithm 2. From Figure 5, it can be seen that for the DP-DHR framework, the injected random noise causes the perturbed intermediate results (the red and blue points) to deviate from the expected system output (the green points), and the gaps are particularly large for some individual executors. However, since the Gaussian random noise added to the intermediate results is zero-mean, the deviations of individual executors are weakened by averaging over all the candidates, as discussed in the analysis of Algorithm 1. Due to this property, the gap between the outputs of the DP-DHR framework and the T-DHR architecture decreases as the number of executors k increases: for moderate k the difference between them is already relatively small, and for the largest k the outputs of the DP-DHR framework and the T-DHR architecture are almost identical.
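This averaging effect can be checked numerically: with zero-mean Gaussian noise, the mean of the perturbed results concentrates around the true output at a rate of roughly σ/√k. The true output and noise scale below are hypothetical, not the values of the paper's experiments.

```python
import math
import random

random.seed(1)
true_out = (0.6, 0.4)   # hypothetical expected pairwise frequency
sigma = 0.5             # hypothetical Gaussian noise scale

def averaged_output(k):
    """Mean of k Gaussian-perturbed intermediate results."""
    perturbed = [(true_out[0] + random.gauss(0, sigma),
                  true_out[1] + random.gauss(0, sigma)) for _ in range(k)]
    return tuple(sum(c) / k for c in zip(*perturbed))

# The Euclidean gap to the true output shrinks as k grows.
for k in (10, 100, 1000, 10000):
    print(k, round(math.dist(averaged_output(k), true_out), 4))
```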
Then, we perform experiments under attacked conditions, in which the adversary gains control over a certain percentage of executors and aims to manipulate the system output. In our experiments, we consider different attack rates (AR), where 10%, 20%, 30%, or 40% of the executors are assumed to be under the adversary’s control. Because of the majority-rule decision strategy, all the executors controlled by the adversary are assumed to output the same result. Furthermore, for the attacked conditions, we consider two cases: (1) the random noise injection process is not controlled by the adversary, and (2) the random noise injection process is controlled, in which case the attack is more sophisticated. We start with the former case.
In the first case, where the random noise injection process is not controlled, the outputs of the compromised executors are assumed to be fixed values. Random noise is then added to these outputs o to obtain the perturbed intermediate results. The clustering results are shown in Figure 6 and Figure 7, where the green and brown points represent the same quantities as in Figure 5. Additionally, in these figures, the light blue points represent the outputs given by the attacked executors that are classified into the candidates C, and the dark blue points represent the outputs given by the normal executors that are classified into the candidates C. The light red points correspond to the outputs given by the attacked executors that are classified into the outliers O, while the dark red points correspond to the outputs given by the normal executors that are classified into the outliers O.
In the case where the adversary outputs a value far outside the valid range, we simulate the scenario where the adversary intends to mislead the system output regardless of whether the attack is detected. Since frequency values naturally lie in (0, 1), such an output is obviously far from the reasonable range, and this simulation represents a blatant attempt by the adversary to manipulate the system. In this case, Figure 6 illustrates that all the outputs generated by the attacked executors are classified as outliers O, except in Figure 6d. The reason for this exception is that, with the chosen candidate percentage and an attack ratio of AR = 40%, the candidates must include at least one attacked perturbed intermediate result according to Algorithm 2. However, this does not affect the final system output. In general, unless the candidate percentage exceeds the fraction of normal executors, it is highly unlikely for the outputs given by the attacked executors to be classified as candidates when the adversary’s output is significantly different from the expected value. This indicates that our proposed hypersphere clustering method is robust against this kind of attack.
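This filtering behavior can be sketched in simulation. Below, a hypothetical 30% of executors emit an out-of-range value before noise injection, and a simplified median-distance stand-in for the hypersphere filter (the executor count, noise scale, and attack value are all assumptions) rejects them.

```python
import math
import random
import statistics

random.seed(2)
k, AR, ratio = 100, 0.3, 0.6        # hypothetical executor count, attack rate, candidate ratio
true_out, sigma = (0.6, 0.4), 0.1
attack_out = (3.0, 3.0)             # far outside the natural frequency range (0, 1)

def perturb(p):
    return (p[0] + random.gauss(0, sigma), p[1] + random.gauss(0, sigma))

# Compromised executors all emit the same implausible value; honest
# executors emit the true output. Zero-mean noise is added to every result.
results = [perturb(attack_out) for _ in range(int(AR * k))]
results += [perturb(true_out) for _ in range(k - int(AR * k))]

# Simplified hypersphere filter: keep the points closest to the median.
center = [statistics.median(p[d] for p in results) for d in (0, 1)]
candidates = sorted(results, key=lambda p: math.dist(p, center))[:round(ratio * k)]
output = tuple(sum(c) / len(candidates) for c in zip(*candidates))
print(output)
```

Because the attacked results sit far from the bulk of honest results, none of them survives the candidate selection and the averaged output stays near the true frequency.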
In the case where the adversary outputs a fixed value within the valid range, we simulate the scenario where the adversary intends to mislead the system output without being easily detected, since frequency values naturally lie in (0, 1). In this case, Figure 7 demonstrates that while some of the outputs generated by the attacked executors are classified as candidates, the impact of the adversary on the final output of the DP-DHR framework is minimal. This phenomenon becomes more pronounced with an increase in AR, indicating the robustness of Algorithm 2 against this type of attack.
Now, let us consider a more complex scenario where the random noise injection process is controlled by the adversary. In this case, we assume that the outputs o of the executors controlled by the adversary are as follows: (1) both dimensions of o are set to 3; (2) the two dimensions of o are randomly chosen within the valid frequency range; (3) both dimensions of o are set to 1. The clustering results for these scenarios are depicted in Figure 8, Figure 9 and Figure 10, respectively. The point colors in these figures correspond to those in Figure 6 and Figure 7.
Regarding the adversary’s output (3, 3), similarly to Figure 6, we simulate the scenario where the adversary intends to mislead the system output regardless of whether the attack is detected. In Figure 8, the light red point located at the top right corner (coordinates (3, 3)) represents all the intermediate results generated by the attacked executors, since all the controlled intermediate results are mapped to this single point. Specifically, in Figure 8a–d, 40, 80, 120, and 160 intermediate results, respectively, are mapped to this point.
The clustering results presented in Figure 8 demonstrate that the perturbed intermediate results classified as candidates C are almost exclusively generated by the normal executors rather than by the attacked executors controlled by the adversary. Furthermore, the results obtained from the T-DHR architecture and the DP-DHR framework are similar, indicating that our proposed hypersphere clustering method is robust against this kind of attack.
In the case where the adversary’s output is randomized, we simulate an adversary that randomly generates the two dimensions of o within the valid range. The intention of the adversary is to mislead the system output while avoiding easy detection, as frequency values naturally lie in (0, 1). Figure 9 displays the clustering results in this scenario. Here, the intermediate results classified as candidates C may contain results generated by the attacked executors, and the ratio of light blue points (attacked candidates) to the total number of blue points (attacked plus normal candidates) increases as AR grows. However, the difference between the outputs of the T-DHR architecture and the DP-DHR framework is not significant, regardless of the scale of AR. One reason for this is that the expected output in the T-DHR architecture is close to the center of the range from which the adversary samples; if the expected result were different (e.g., (0, 0)), the gap between the outputs might be larger.
In the case where the adversary’s output is (1, 1), we simulate the scenario where the adversary aims to mislead the system output without being easily detected. The clustering results are shown in Figure 10, in which all the attacked results are mapped to the single point (1, 1), represented by the light blue point. As the light blue color indicates, all the attacked results are classified as candidates. This means that all the intermediate results generated by the attacked executors are misclassified and the system is successfully attacked.
As shown in Figure 10, the gap between the outputs given by the DP-DHR framework and the T-DHR architecture is larger than under the previous conditions. Comparing Figure 10a–d with Figure 9 (or Figure 8a–d), the gaps between the brown and green points are notably wider in Figure 10. Due to the relatively large distance between (1, 1) and the expected output, the output of the DP-DHR framework (the brown point) is ‘dragged’ away from the expected output of the T-DHR architecture (the green point) by the attacked executors. When AR is small, the consequence is not very pronounced: for example, in Figure 10a, although all the attacked results are classified as candidates C, the gap between the DP-DHR output and the T-DHR output is not significant due to the small AR. On the contrary, when a large AR is employed, as shown in Figure 10d, the attacked results significantly mislead the output of the DP-DHR framework. In conclusion, the gap between the DP-DHR framework and the T-DHR architecture grows with AR, as shown in Figure 10a–d. This observation aligns with intuition and our expectations.
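The dragging effect admits a simple back-of-the-envelope form: if a fraction a of the candidate set consists of adversarial results fixed at some value, the candidate average moves linearly from the true output toward that value (zero-mean noise averages out and is omitted). The numbers below are hypothetical illustrations, not the paper's measurements.

```python
import math

true_out = (0.6, 0.4)    # hypothetical expected output
attack_out = (1.0, 1.0)  # in-range adversarial value, as in the third case

def dragged_output(a):
    """Candidate average when a fraction `a` of the candidates is adversarial."""
    return tuple((1 - a) * t + a * v for t, v in zip(true_out, attack_out))

# The gap to the true output grows linearly with the adversarial fraction.
for a in (0.1, 0.2, 0.3, 0.4):
    print(a, round(math.dist(dragged_output(a), true_out), 3))
```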
In summary, the proposed DP-DHR framework demonstrates resilience against the majority of attack conditions, with one notable exception: when the adversary coerces all the attacked executors to produce the same value within a reasonable range (e.g., (0, 1) for frequencies), particularly when the disparity between this value and the expected correct output is substantial. This issue is exacerbated at higher values of AR, as shown in Figure 10.
Here, we discuss the effects of an Advanced Persistent Threat (APT) attack, using the coordinated multi-vector attack as an example. In the DHR architecture (or the proposed DP-DHR framework), a coordinated multi-vector attack occurs when more than one executor is controlled by the adversary, and we classify such attacks into two categories:
- 1. The adversary controls no more than half of the executors: Due to the majority-principle decision strategy, the output of the traditional DHR architecture cannot be tampered with in this case, because the intermediate results provided by the controlled executors are categorized as ‘attacked’, while the correct intermediate results are the ones considered when making decisions. As a result, the integrity and the availability of the traditional DHR architecture do not decrease; however, the confidentiality (data privacy) is not protected. For the proposed DP-DHR framework, on the other hand, the theoretical and experimental results given above (Theorem 1 and Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10) show that the confidentiality (data privacy) is enhanced without much sacrifice of the integrity and the availability.
- 2. The adversary controls more than half of the executors: If more than half of the executors are controlled by the adversary, the adversary can fully determine the output of the traditional DHR architecture, because the majority-principle decision strategy allows the adversary to map the majority of the intermediate results to a single value. In this case, neither the traditional DHR architecture nor the proposed DP-DHR framework can ensure the integrity and the availability. However, the confidentiality (data privacy) is still guaranteed in the DP-DHR framework, unlike in the traditional DHR architecture. It is worth noting that such a case is unlikely to occur in the DHR architecture (and the DP-DHR framework), because the executors are heterogeneously designed, which reduces the probability of the majority of them being simultaneously controlled. Additionally, the dynamic property of the DHR architecture (and the DP-DHR framework) further decreases the probability of this case.
Remark 5. Through our analysis of the attacked results, we observed that the outputs generated by the controlled executors can alter the distribution of the injected Gaussian random noise, which in turn affects the privacy properties of the entire DP-DHR system. This phenomenon is even more pronounced in the DHR architecture, as it is designed to tolerate the compromise of several executors, which allows the adversary to use a certain number of controlled executors to construct particular probability distributions that affect the privacy properties.
For instance, to guarantee DP, the online executors not under the adversary’s control sample zero-mean Gaussian random noise. However, if the adversary employs a different distribution, such as a uniform distribution over a symmetric range, and adds this noise to the intermediate results of the controlled executors, then the noise distribution received by the decision module becomes unpredictable after aggregation, as it no longer follows the original Gaussian distribution.
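This distribution shift can be illustrated numerically. Below, honest executors draw zero-mean Gaussian noise while compromised executors draw uniform noise; the aggregate noise sample seen by the decision module then has a visibly inflated variance relative to the calibrated Gaussian. The σ, range, and attack rate are hypothetical choices for illustration only.

```python
import random
import statistics

random.seed(4)
k, AR = 10_000, 0.3      # hypothetical sample count and attack rate
sigma = 0.5              # Gaussian scale used by honest executors
half_width = 2.0         # adversary samples uniformly from [-2, 2]

honest = [random.gauss(0, sigma) for _ in range(int((1 - AR) * k))]
attacked = [random.uniform(-half_width, half_width) for _ in range(int(AR * k))]
mixture = honest + attacked

# The decision module no longer sees the calibrated Gaussian noise:
# the Gaussian/uniform mixture has a noticeably larger variance.
print(round(statistics.pvariance(honest), 3), round(statistics.pvariance(mixture), 3))
```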
Furthermore, due to the dynamic feature of the DHR architecture, an attacked online executor may be scheduled offline while new executors are scheduled online, making the situation even more complicated. This issue is a downstream consequence of our proposed framework, which aims to tackle privacy concerns in the DHR architecture; it stems from the specific noise distribution of the DP tool employed and holds significant research value both theoretically and empirically. We will therefore conduct more detailed research on this matter in future work.
Remark 6. Here, we delve into the implementation and potential implications of the DP-DHR framework in practical scenarios. On one hand, network latency in real-world deployments could impact the efficiency of the DP-DHR framework. Since the framework cannot verify the correctness of individual intermediate results, the effectiveness of the hypersphere clustering method (Algorithm 2) relies on the majority principle. Consequently, the decision module must await the aggregation of most (if not all) of the k intermediate results before reaching a verdict, which may reduce efficiency under prolonged network latency. Moreover, the heterogeneous nature of the DP-DHR framework further complicates efficiency considerations: heterogeneously designed executors exhibit varying processing times for identical inputs and experience divergent network latency within the same cyber environment. This mirrors challenges encountered in traditional DHR architectures. On the other hand, because of the heterogeneously designed executors, the DHR architecture requires more hardware and software resources than a traditional architecture, and the hypersphere clustering method adds further time complexity to the DP-DHR framework, which may increase processing time. However, both the resource costs associated with the DHR architecture and the time complexity introduced by the DP-DHR framework stem from security and privacy requirements and are therefore deemed acceptable. Validation and corresponding experiments under realistic conditions (including live data streams) will be performed in future work.
Based on the comprehensive discussions above, we observed that the gap between the output of our proposed DP-DHR framework, which incorporates the advanced decision strategy and the hypersphere clustering method, and the output of the T-DHR architecture is relatively small under most circumstances. Specifically, the DP-DHR framework is resilient against attacks designed to manipulate the system output regardless of whether they are detected. However, against attacks that aim to mislead the system output without being easily detected, the robustness of the DP-DHR framework diminishes, particularly when the attack rate is high; this behavior is similar to that observed in the T-DHR architecture.