A Novel Edge Cache-Based Private Set Intersection Protocol via Lightweight Oblivious PRF

Zhang, Jing; Yang, Li; Tang, Yongli; Jin, Minglu; Wang, Shujing

doi:10.3390/e25091347

College of Software, Henan Polytechnic University, Jiaozuo 454000, China

Entropy 2023, 25(9), 1347; https://doi.org/10.3390/e25091347

Submission received: 12 June 2023 / Revised: 1 September 2023 / Accepted: 12 September 2023 / Published: 16 September 2023

(This article belongs to the Special Issue Information-Theoretic Privacy in Retrieval, Computing, and Learning)

Abstract

With the rapid development of edge computing and the Internet of Things, the problem of information resource sharing can be effectively solved through multi-party collaboration, but the risk of data leakage is also increasing. To address the above issues, we propose an efficient multi-party private set intersection (MPSI) protocol via a multi-point oblivious pseudorandom function (OPRF). Then, we apply it to work on a specific commercial application: edge caching. The proposed MPSI uses oblivious transfer (OT) together with a probe-and-XOR of strings (PaXoS) as the main building blocks. It not only provides one-sided malicious security, but also achieves a better balance between communication and computational overhead. From the communication pattern perspective, the client only needs to perform OT with the leader and send a data structure PaXoS to the designated party, making the protocol extremely efficient. Moreover, in the setting of edge caching, many parties hold a set of items containing an identity and its associated value. All parties can identify a set of the most frequently accessed common items without revealing the underlying data.

Keywords:

private set intersection; edge computing; multi-party cooperative cache; concrete efficiency

1. Introduction

Co-creation and sharing gained significance in the transition from the era of information technology to the era of digital technology. While information sharing brings convenience, the risk of privacy breaches also rises. The private set intersection (PSI) protocol is a widely used approach to distributed set computation. It is devoted to the joint intersection calculation of data from two or more parties. The PSI protocol guarantees that all parties can collaboratively calculate the intersection of the sets without disclosing anything beyond that intersection. PSI plays an important role in improving pattern matching [1], private contact discovery [2], advertisement conversion rate [3], and edge caching [4]. Edge caching is a key technology for communication networks. In order to utilize cache resources more efficiently, individual operators tend to keep their public items in a shared cache that can be accessed by all parties. However, since the cache is shared among multiple parties, these parties aim to identify the set of most frequently visited common data items and add them to the network edge cache. Their objective is to achieve this without revealing the actual underlying data. This is known as the multi-party shared cache problem, where determining the common term is a typical private set intersection problem.

Most current efficient PSI protocols are built on OT [5,6,7]. OT-based PSI protocols offer advantages in communication and computation over PSI based on public-key encryption [8,9] and PSI based on garbled circuits [10,11,12]. Efficient OT extension techniques allow parties to generate many OT instances at a low computational cost using only a few public-key operations. Chase et al. [5] implemented a two-party PSI protocol with one-sided malicious security. This protocol uses OT and a multi-point OPRF to achieve a good balance between computational and communication overhead. However, the protocol supports only two parties, and multiple runs are required to compute an intersection among multiple parties. Kavousi et al. [13] proposed an MPSI protocol based on OT and a multi-point OPRF, but it is secure only in the semi-honest model. Inbar et al. [14] presented an enhanced semi-honest MPSI protocol based on OT and a garbled Bloom filter (GBF). However, the protocols in [13,14] require the transmission of GBFs, which creates a communication burden.

In response to the above issues, we construct an MPSI protocol secure against malicious clients that combines a PaXoS with an OT-based multi-point OPRF. The protocol relies only on symmetric-key primitives, hashing, coding techniques, and bitwise operations, thus providing good computational performance. With a simple transformation, the protocol can also solve the problem of privacy-preserving cooperative edge cache sharing. Our contributions are as follows:

Multi-party PSI protocol: We propose a concretely efficient MPSI protocol utilizing OT and a PaXoS. The PaXoS can be seen as a pair of Encode/Decode algorithms achieving a constant rate, so our protocol has good computational performance. The protocol has low communication overhead, since each client only needs to send a single data structure. Theoretical analysis shows that the protocol achieves a better balance between communication and computational cost.
Security against malicious clients: Our protocol uses the PaXoS data structure to hide the key during encoding, which achieves one-sided malicious security against the clients with almost no additional overhead. We also prove that the protocol resists collusion attacks by malicious clients.
Multi-party cooperative cache: Our MPSI protocol can be applied to edge caching scenarios by using cuckoo hashing and simple hashing. The protocol supports data associated with each input and extends payloads to the multi-party setting. In a multi-party cooperative cache (MPCCache) setting, the MPCCache protocol allows the parties to compute a sum over the data associated with the intersection items. Compared with [4], our MPCCache protocol eliminates the computing burden associated with polynomial interpolation and improves computational efficiency.

2. Related Work

PSI. The development of efficient constructions for PSI functionality has received considerable research attention over the last decade. Some recent relevant works on PSI are listed in Table 1. Ghosh et al. [15] presented an MPSI protocol using oblivious linear function evaluation (OLE) with optimal asymptotic communication complexity. However, its balance between communication and computation cost is poor. Kolesnikov et al. [16] proposed a two-party PSI protocol against semi-honest adversaries. The protocol is mainly based on OT techniques for secure string equality testing and is computationally efficient. Pinkas et al. [17] proposed a two-party semi-honest PSI protocol based on OT and a GBF. Parallelized processing allows for some improvement in the efficiency of the protocol. Nevo et al. [18] proposed a malicious PSI protocol utilizing an oblivious programmable PRF (OPPRF) and an oblivious key-value store (OKVS), which solves the problem of multi-party PSI against malicious adversaries. However, this protocol does not achieve a good trade-off between communication and computational overhead. Pinkas et al. [19] also proposed a two-party PSI protocol in the malicious model which uses a PaXoS to implement, for the first time, a maliciously secure PSI using cuckoo hashing. Ben-Efraim et al. [20] implemented malicious MPSI based on a GBF for multiple parties. However, GBFs suffer from a certain false positive rate and high communication overhead. Bui et al. [21] constructed an optimized semi-honest PSI based on a pseudorandom correlation generator (PCG). Additionally, the PCG can be used to construct protocols with fully malicious security in the standard model.

Function-based PSI. Many studies have focused on developing efficient techniques for PSI construction. Some of them have also explored computing a function over the intersection rather than outputting the intersection itself, which allows extensions to various business scenarios. Table 2 shows recent related works on function-based PSI. Ion et al. [3] proposed a private intersection-sum (PI-Sum) protocol based on the decisional Diffie–Hellman (DDH) assumption and homomorphic encryption (HE). Consider the advertising (Ad) conversion problem: Ad providers may want to analyze Ad effectiveness by age, which cannot be solved using PI-Sum. Chida et al. [22] proposed a new function based on OPRF and DDH assumptions to calculate the weighted sum of two-party private sets (PIW-sum), which has more practical application value. Pinkas et al. [11] proposed computing payloads based on circuits, OPPRF, and cuckoo hashing, which allows each input item from one party to have payload data attached to it and computes specific functions of the payloads in the intersection. Based on a new shuffled distributed oblivious PRF (DOPRF), Miao et al. [23] constructed a two-party PSI cardinality (PSI-CA) protocol for malicious settings which achieves good computation and communication costs. In the above protocols, only one party can own the payload data, which limits their practical applicability. Nguyen et al. [4] extended payload data to the multi-party setting and proposed an MPCCache sharing framework based on polynomial interpolation and OPPRF, which enables multiple parties to calculate the sum of the data payloads for each common data item and to identify the most frequently accessed data items.

3. Preliminaries

3.1. Notions

The computational and statistical security parameters are denoted by $\lambda$ and $\sigma$. $[n]$ denotes the set $\{1, \dots, n\}$. $\overset{R}{\leftarrow}$ indicates uniformly random selection. The notation $\|$ denotes concatenation of strings. $\{0, 1\}^{*}$ denotes the set of strings consisting of 0 and 1, where $*$ means that the strings in the set can be of any length. We use $\overset{c}{\approx}$ to indicate that the real world is computationally indistinguishable from the ideal world. We denote by $v[i]$ the $i$-th element of a vector $v$ of length $l$. The $i$-th column vector, $i \in [m]$, of a matrix $M_{n \times m}$ is denoted by $M_{i}$. The Hamming weight of a binary string $x$ is denoted by $\|x\|_{H}$.

3.2. One-Sided Malicious Security

One-sided malicious security [5] is a security property of cryptographic protocols in which one party may behave arbitrarily maliciously in an attempt to compromise security, while the other parties follow the protocol as specified. In this setting, security is guaranteed only against malicious behavior by the designated party; the other parties are assumed to keep to their assigned roles. Our MPSI protocol achieves one-sided malicious security against the clients, which are considered as a whole. We further prove that the proposed MPSI protocol is secure against malicious clients.

3.3. Security Model

MPSI is a special case of secure multi-party computation (MPC). We adhere to the standard MPC security definition. The ideal functionality of MPSI is defined in Figure 1.

The security models [24] of secure multi-party computation are divided into the semi-honest and malicious models. In the semi-honest model, an adversary follows the protocol faithfully but may record all the data generated during the execution and try to learn additional information from it. An adversary in the malicious model can not only infer sensitive information from the protocol data but also refuse to participate in the protocol, alter its private input set, or abort the protocol prematurely. Our protocol achieves one-sided malicious security.

Definition 1.

(Malicious security against the clients) A protocol $\Pi$ is secure against malicious clients if, for every PPT adversary $A$ who may arbitrarily deviate from the protocol in the real world, there exists a PPT adversary $S$ in the ideal world, who can modify the inputs to the ideal functionality $F$ and abort the output, such that for all inputs $X_{1}, \dots, X_{n}$:

$$\mathrm{Real}_{A}^{\Pi}(X_{1}, \dots, X_{n}) \overset{c}{\approx} \mathrm{Ideal}_{S}^{F}(X_{1}, \dots, X_{n}). \tag{1}$$

3.4. Oblivious Transfer

Rabin et al. [25] proposed OT, a crucial cryptographic primitive. In a 1-out-of-2 OT, the receiver holds a choice bit $b \in \{0, 1\}$, while the sender holds input strings $(m_{0}, m_{1})$. The receiver obtains $m_{b}$; OT prevents the receiver from learning anything about $m_{1 - b}$ and prevents the sender from learning anything about $b$. OT requires costly public-key operations. Ishai et al. [26] described an OT extension technique that permits many OT executions at the cost of only a few public-key operations. We can use the OT instantiation in [15]. The ideal functionality of OT is defined in Figure 2.
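The input/output behavior of 1-out-of-2 OT can be sketched as a trusted-party function. This is only an illustration of the ideal functionality described above, not a secure protocol; the function name `ideal_ot` is a hypothetical helper and not taken from Figure 2.

```python
import secrets


def ideal_ot(m0: bytes, m1: bytes, b: int) -> bytes:
    """Ideal 1-out-of-2 OT (illustrative): a trusted party hands m_b to the receiver.

    The sender never sees b and the receiver never sees m_{1-b}; here this is
    modeled only by what the function returns, not enforced cryptographically.
    """
    if b not in (0, 1):
        raise ValueError("choice bit must be 0 or 1")
    return m1 if b else m0


# Sender's two strings and the receiver's choice bit.
m0, m1 = secrets.token_bytes(16), secrets.token_bytes(16)
b = secrets.randbelow(2)
received = ideal_ot(m0, m1, b)
```

A real instantiation replaces the trusted party with a cryptographic protocol, typically using OT extension so that only a few public-key operations are needed.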

3.5. PaXoS

The following describes how a PaXoS [19] encodes a key-value mapping into a compact data structure. A PaXoS is defined by a mapping $u$ from keys to binary vectors of length $m$, but it is often more convenient to describe it through the associated Encode/Decode algorithms than through the mapping $u$ itself.

$\mathrm{Encode}((x_{1}, y_{1}), \dots, (x_{t}, y_{t}))$: Given $t$ items $(x_{i}, y_{i})$, where $x_{i} \in \{0, 1\}^{*}$ and $y_{i} \in \{0, 1\}^{w}$, denote by $M$ the $t \times m$ matrix whose $i$-th row is $u(x_{i})$, the result of applying the mapping $u$ to $x_{i}$. Encode finds a data structure (matrix) $D = (d_{1}, \dots, d_{m})^{T} \in (\{0, 1\}^{w})^{m}$ satisfying $M \times D = (y_{1}, \dots, y_{t})^{T}$. In particular, the following linear system of equations can be satisfied when the $u(x_{i})$'s are linearly independent:

$$\begin{bmatrix} - u(x_{1}) - \\ - u(x_{2}) - \\ \vdots \\ - u(x_{t}) - \end{bmatrix} \times \begin{bmatrix} d_{1} \\ d_{2} \\ \vdots \\ d_{m} \end{bmatrix} = \begin{bmatrix} y_{1} \\ y_{2} \\ \vdots \\ y_{t} \end{bmatrix}. \tag{2}$$

$\mathrm{Decode}(D, x)$: Given $D \in (\{0, 1\}^{w})^{m}$ and $x \in \{0, 1\}^{*}$, we can extract the corresponding "value" via $y = \langle u(x), D \rangle \overset{def}{=} \bigoplus_{j : u(x)_{j} = 1} d_{j}$.
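The Encode/Decode interface above can be illustrated with a small solver for the linear system in Equation (2) over GF(2). This is a sketch under stated assumptions: it swaps in a dense hash-derived mapping `u` and plain Gaussian elimination, whereas the PaXoS of [19] uses a structured mapping that makes encoding much faster. All function names and parameter choices here are hypothetical.

```python
import hashlib
import secrets


def u(x: bytes, m: int) -> int:
    """Assumed mapping u: hash x to an m-bit row, returned as an integer bitmask."""
    out, counter = b"", 0
    while len(out) * 8 < m:
        out += hashlib.sha256(counter.to_bytes(4, "big") + x).digest()
        counter += 1
    return int.from_bytes(out, "big") & ((1 << m) - 1)


def encode(pairs, m, w):
    """Solve M * D = y over GF(2); returns D as a list of m integers of w bits."""
    pivots = {}  # leading column -> (reduced row bitmask, reduced value)
    for x, y in pairs:
        row, val = u(x, m), y
        while row:
            col = row.bit_length() - 1
            if col not in pivots:
                pivots[col] = (row, val)
                break
            prow, pval = pivots[col]
            row, val = row ^ prow, val ^ pval
        else:
            raise ValueError("the rows u(x_i) are linearly dependent")
    # Free positions keep random values; pivot positions are back-substituted
    # from the lowest leading column upwards.
    D = [secrets.randbits(w) for _ in range(m)]
    for col in sorted(pivots):
        row, acc = pivots[col]
        rest = row ^ (1 << col)
        while rest:
            j = rest.bit_length() - 1
            acc ^= D[j]
            rest ^= 1 << j
        D[col] = acc
    return D


def decode(D, x):
    """y = XOR of all d_j with u(x)_j = 1."""
    row, acc, j = u(x, len(D)), 0, 0
    while row:
        if row & 1:
            acc ^= D[j]
        row >>= 1
        j += 1
    return acc


pairs = [(f"item-{i}".encode(), (i * 2654435761) % (1 << 16)) for i in range(8)]
D = encode(pairs, m=48, w=16)
```

Choosing `m` somewhat larger than the number of items keeps the rows linearly independent with high probability; decoding a key that was never encoded simply returns the XOR of unrelated entries of `D`.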

3.6. Multi-Point OPRF

Chase et al. [5] presented a two-party PSI protocol based on a multi-point OPRF. The sender chooses a random seed $s \overset{R}{\leftarrow} \{0, 1\}^{w}$, and the receiver evaluates a pseudorandom function $v = F_{k}(x_{i})$ on each of its set elements to construct two matrices, $A_{m \times w}$ and $B_{m \times w}$. For each $x_{i} \in X_{1}$, the corresponding bits of the two matrices are the same, while all other bits differ. The sender runs $w$ OTs with the receiver and obtains a matrix $C_{m \times w}$ that depends on the seed $s$: each column $C_{j}$ is either $A_{j}$ or $B_{j}$ for all $j \in [w]$. Then, the sender computes $v = F_{k}(x_{i})$ for each element $x_{i} \in X_{2}$ to obtain the OPRF values $φ = H(C_{1}[v[1]] \| \dots \| C_{w}[v[w]])$ and sends them to the receiver. Finally, the receiver finds the intersection of the two sets by comparing the received values with the OPRF values it computes on its own elements.
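The matrix construction above can be simulated in a few lines. The sketch below is a toy, not the implementation of [5]: the OTs are replaced by direct column selection, HMAC-SHA256 stands in for the PRF $F_{k}$, and the parameters `M` and `W` are small assumed values rather than the parameters recommended in [5]. All helper names are hypothetical.

```python
import hashlib
import hmac
import secrets

M, W = 64, 128  # toy matrix height and width (assumed values)


def prf(k, x):
    """v = F_k(x): W row indices in [0, M), derived with HMAC-SHA256 (stand-in PRF)."""
    return [
        int.from_bytes(hmac.new(k, x + bytes([j]), hashlib.sha256).digest()[:4], "big") % M
        for j in range(W)
    ]


def select(cols, v):
    """C_1[v[1]] || ... || C_w[v[w]] as a byte string with one bit per byte."""
    return bytes(cols[j][v[j]] for j in range(W))


def H(bits):
    return hashlib.sha256(bits).hexdigest()


def two_party_psi(X1, X2):
    """X1 is the receiver's set, X2 the sender's; returns the receiver's output."""
    k = secrets.token_bytes(16)
    # Receiver: E is all ones except at positions touched by its own items,
    # so A and B = A xor E agree exactly at those positions.
    E = [[1] * M for _ in range(W)]
    for x in X1:
        for j, idx in enumerate(prf(k, x)):
            E[j][idx] = 0
    A = [[secrets.randbelow(2) for _ in range(M)] for _ in range(W)]
    B = [[a ^ e for a, e in zip(A[j], E[j])] for j in range(W)]
    # Sender: seed s selects column A_j or B_j (the w OTs are simulated here).
    s = [secrets.randbelow(2) for _ in range(W)]
    C = [B[j] if s[j] else A[j] for j in range(W)]
    sender_values = {H(select(C, prf(k, x))) for x in X2}
    # Receiver compares against the OPRF values of its own elements.
    return {x for x in X1 if H(select(A, prf(k, x))) in sender_values}


result = two_party_psi({b"a", b"b", b"c"}, {b"b", b"c", b"d"})
```

For an element outside the receiver's set, the sender's selected bits differ from the receiver's matrix wherever `E` is 1 and the seed bit is 1, so the hashes match only with negligible probability for these toy parameters.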

3.7. Hamming Correlation Robustness

Under the assumption of correlation robustness for the underlying hash function, our MPSI construction is proven to be secure.

Definition 2.

(Hamming Correlation Robustness [5]) Let $H$ be a hash function with input length $n$. $H$ is $d$-Hamming correlation robust if, for any $a_{1}, \dots, a_{m}$, $b_{1}, \dots, b_{m} \in \{0, 1\}^{n}$ with $\|b_{i}\|_{H} \geq d$ for each $i \in [m]$, the distribution induced by sampling $s \leftarrow \{0, 1\}^{n}$ uniformly at random is pseudorandom. Namely:

$$(H(a_{1} \oplus [b_{1} \cdot s]), \dots, H(a_{m} \oplus [b_{m} \cdot s])) \overset{c}{\approx} (F(a_{1} \oplus [b_{1} \cdot s]), \dots, F(a_{m} \oplus [b_{m} \cdot s])), \tag{3}$$

where $\cdot$ and $\oplus$ denote bitwise-AND and bitwise-XOR, respectively, and $F$ is a random function.
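As a small illustration of the expression $a \oplus [b \cdot s]$ in Definition 2, the following sketch represents bit strings as Python integers. The helper names are hypothetical; the point is only that the secret $s$ influences the hash input exactly at the positions where $b$ has a 1, which is why the definition requires $\|b\|_{H} \geq d$.

```python
def hamming_weight(x: int) -> int:
    """Number of 1 bits in the binary string represented by x."""
    return bin(x).count("1")


def masked_input(a: int, b: int, s: int) -> int:
    """a XOR (b AND s): bits of s enter the result only where b has a 1."""
    return a ^ (b & s)


a, b, s = 0b1100, 0b1010, 0b0110
value = masked_input(a, b, s)  # only bit positions 1 and 3 can depend on s
```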

3.8. Cuckoo Hashing and Simple Hashing

Hashing is one of the essential tools for optimizing communication and computational complexity in PSI protocols. Two constructions are commonly used: simple hashing and cuckoo hashing [10]. Simple hashing maps each element to $k$ positions in a hash table using $k$ hash functions, and each bucket can store multiple elements. Cuckoo hashing stores each element at a single location in the hash table, and its basic idea is to use multiple hash functions to handle collisions. When a collision occurs, cuckoo hashing evicts the element occupying the position and rehomes it to one of its alternative positions. If that position is also occupied, the process repeats until all elements have found their homes. Typically, cuckoo hashing and simple hashing are combined to achieve optimal results in PSI protocols.
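The two hashing schemes can be sketched as follows. This is a minimal illustration under assumptions: the hash functions are derived from SHA-256, the eviction policy simply moves an evicted element to its next hash function, and there is no stash. The names `h`, `simple_hash`, and `cuckoo_hash` are hypothetical.

```python
import hashlib


def h(i, x, n_bins):
    """The i-th hash function, derived from SHA-256 (assumed instantiation)."""
    digest = hashlib.sha256(bytes([i]) + x).digest()
    return int.from_bytes(digest[:8], "big") % n_bins


def simple_hash(items, n_bins, k=3):
    """Place every item in all of its k candidate bins; bins hold many items."""
    bins = [[] for _ in range(n_bins)]
    for x in items:
        for b in {h(i, x, n_bins) for i in range(k)}:
            bins[b].append(x)
    return bins


def cuckoo_hash(items, n_bins, k=3, max_evictions=500):
    """Place every item in exactly one of its k candidate bins, evicting on collision."""
    table = [None] * n_bins  # each bin holds (item, index of the hash function used)
    for x in items:
        cur, i = x, 0
        for _ in range(max_evictions):
            b = h(i, cur, n_bins)
            if table[b] is None:
                table[b] = (cur, i)
                break
            # Evict the occupant and rehome it with its next hash function.
            evicted, ei = table[b]
            table[b] = (cur, i)
            cur, i = evicted, (ei + 1) % k
        else:
            raise RuntimeError("cuckoo insertion failed; rehash or use a stash")
    return table


items = [f"item-{i}".encode() for i in range(16)]
table = cuckoo_hash(items, 64)
bins = simple_hash(items, 64)
```

The property that makes the combination useful in PSI is that the single cuckoo location of an element is always one of its simple-hashing bins, so two parties holding the same element compare it within the same bucket.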

4. Our MPSI Protocol

4.1. Overview

In this section, we present the MPSI protocol. A set of parties $P_{1}, \dots, P_{n}$ with respective private input sets $X_{1}, \dots, X_{n}$ wish to jointly compute the intersection $I = X_{1} \cap \dots \cap X_{n}$ without disclosing any further information. We denote by $t$ the set size of each party and regard $P_{n}$ as the leader and $P_{i \in [n - 1]}$ as the clients. The system model of the MPSI protocol is shown in Figure 3.

For each client $P_{i \in [n - 1]}$, $P_{n}$ chooses random strings $A_{j}^{i} \overset{R}{\leftarrow} \{0, 1\}^{m}$ as the columns of a random matrix $A^{i}_{m \times w}$ and sets $A_{j}^{n} = A_{j}^{1} \oplus \dots \oplus A_{j}^{n - 1}$, where $j \in [w]$. For each $i \in [n - 1]$, $P_{n}$ also constructs a special matrix $B^{i}_{m \times w}$ from its input elements. To this end, $P_{n}$ first initializes a matrix $E_{m \times w}$ to all 1's. For each $x \in X_{n}$, it computes $v = F_{k}(H_{1}(x))$ and sets $E_{j}[v[j]] = 0$ for all $j \in [w]$. $B^{i}$ is designed so that it agrees with $A^{i}$ exactly where $E$ is 0. Each client $P_{i}$ then runs $w$ OTs with $P_{n}$ and receives a matrix $C^{i}_{m \times w}$ whose $j$-th column is either $A_{j}^{i}$ or $B_{j}^{i}$, and hence $A_{j}^{i}[v[j]] = B_{j}^{i}[v[j]] = C_{j}^{i}[v[j]]$ for all $i \in [n - 1]$ and $j \in [w]$. Then, each $P_{i \in [n - 2]}$ locally encodes its input set into a PaXoS data structure $D^{i} \leftarrow \mathrm{Encode}(\{(x, C_{1}^{i}[v[1]] \| \dots \| C_{w}^{i}[v[w]])\})$ using the entries of the received matrix and sends $D^{i}$ to $P_{n - 1}$. $P_{n - 1}$ decodes all the $D^{i}$. It then computes the OPRF values $φ = H_{2}(\oplus_{i = 1}^{n - 2} \mathrm{Decode}(D^{i}, x) \oplus (C_{1}^{n - 1}[v[1]] \| \dots \| C_{w}^{n - 1}[v[w]]))$ for its elements and sends them to $P_{n}$. After receiving the OPRF values, $P_{n}$ computes $φ = H_{2}(A_{1}^{n}[v[1]] \| \dots \| A_{w}^{n}[v[w]])$ for each element of its input set, which allows $P_{n}$ to find the intersection. If $x \in I$, the hash inputs computed by $P_{n - 1}$ and $P_{n}$ are equal. If $x \notin I$, the output of the PRF is pseudorandom to $P_{n}$, and the hash input computed by $P_{n - 1}$ differs substantially from every hash input of $P_{n}$.
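The message flow of the overview can be simulated end to end on toy inputs. The sketch below is illustrative only and makes several assumptions: the OTs are replaced by direct selection, a Python dictionary stands in for the PaXoS (with a fresh random value returned for absent keys), HMAC-SHA256 and SHA-256 stand in for $F_{k}$, $H_{1}$, and $H_{2}$, and `M`, `W` are small assumed parameters. It checks functionality, not security.

```python
import hashlib
import hmac
import secrets

M, W = 64, 128  # toy parameters (assumed values)


def indices(k, x):
    """v = F_k(H_1(x)): W row indices in [0, M)."""
    hx = hashlib.sha256(x).digest()
    return [
        int.from_bytes(hmac.new(k, hx + bytes([j]), hashlib.sha256).digest()[:4], "big") % M
        for j in range(W)
    ]


def pick(cols, v):
    """C[v] = C_1[v[1]] || ... || C_w[v[w]], packed into a W-bit integer."""
    out = 0
    for j in range(W):
        out = (out << 1) | cols[j][v[j]]
    return out


def h2(value):
    return hashlib.sha256(value.to_bytes(W // 8, "big")).hexdigest()


def toy_mpsi(sets):
    """sets[-1] is the leader P_n, sets[-2] the designated client P_{n-1}."""
    n, k = len(sets), secrets.token_bytes(16)
    leader_set, clients = sets[-1], sets[:-1]
    E = [[1] * M for _ in range(W)]
    for x in leader_set:
        for j, idx in enumerate(indices(k, x)):
            E[j][idx] = 0
    A_n = [[0] * M for _ in range(W)]  # column-wise XOR of all client shares A^i
    C = []
    for _ in clients:
        A_i = [[secrets.randbelow(2) for _ in range(M)] for _ in range(W)]
        s_i = [secrets.randbelow(2) for _ in range(W)]
        # Simulated OT: column j is A_j^i if s_i[j] = 0, else B_j^i = A_j^i xor E_j.
        C.append([[a ^ (e & s_i[j]) for a, e in zip(A_i[j], E[j])] for j in range(W)])
        for j in range(W):
            A_n[j] = [p ^ q for p, q in zip(A_n[j], A_i[j])]
    # P_1..P_{n-2} encode their values; a dict stands in for the PaXoS D^i.
    D = [{x: pick(C[i], indices(k, x)) for x in clients[i]} for i in range(n - 2)]
    phis = set()
    for x in clients[-1]:
        acc = pick(C[-1], indices(k, x))
        for D_i in D:
            # Emulate Decode on an absent key by a random-looking value.
            acc ^= D_i.get(x, secrets.randbits(W))
        phis.add(h2(acc))
    return {x for x in leader_set if h2(pick(A_n, indices(k, x))) in phis}


sets = [
    {b"a", b"b", b"c"},
    {b"b", b"c", b"d"},
    {b"b", b"c", b"e", b"g"},
    {b"a", b"b", b"c", b"f", b"g"},
]
intersection = toy_mpsi(sets)
```

In this run, `b"g"` is held by $P_{n-1}$ and the leader but not by the other clients, so at least one decoded share is unrelated to the leader's matrix and the hash values do not match.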

4.2. Our Protocol

We show our MPSI protocol in Figure 4. The selection of $m$, $w$, $l_{1}$, and $l_{2}$ in our MPSI protocol follows [5], which shows how to choose the parameters concretely.

4.3. Protocol Correctness

$P_{n}$ constructs the special matrices $A^{i}$ and $B^{i}$ for $P_{i \in [n - 1]}$ such that $v = F_{k}(H_{1}(x))$ computed for each $x \in X_{n}$ satisfies $A_{j}^{i}[v[j]] = B_{j}^{i}[v[j]]$ for all $j \in [w]$. Let $x$ be an intersection element. After the client $P_{i \in [n - 1]}$ runs the OTs with $P_{n}$, it obtains the matrix $C^{i}$, whose columns satisfy $A_{j}^{i}[v[j]] = C_{j}^{i}[v[j]]$. Since each column $A_{j}^{n}$ is composed of uniformly random shares as $A_{j}^{n} = A_{j}^{1} \oplus \dots \oplus A_{j}^{n - 1}$ for $j \in [w]$, it holds that $A_{j}^{n}[v[j]] = \oplus_{i = 1}^{n - 1} C_{j}^{i}[v[j]]$ for each $x \in I$.

Based on the nature of the constructed data structure, we have $\mathrm{Decode}(D^{i}, x) = C_{1}^{i}[v[1]] \| \dots \| C_{w}^{i}[v[w]]$. So, for $x \in I$ and $v = F_{k}(H_{1}(x))$, we always have $H_{2}(\oplus_{i = 1}^{n - 1}(C_{1}^{i}[v[1]] \| \dots \| C_{w}^{i}[v[w]])) = H_{2}(A_{1}^{n}[v[1]] \| \dots \| A_{w}^{n}[v[w]])$.
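The cancellation of the shares can be written out column by column. The following derivation only restates the argument above in the notation of this section, for an intersection element $x$ with $v = F_{k}(H_{1}(x))$ and any $j \in [w]$:

```latex
\begin{aligned}
\bigoplus_{i=1}^{n-1} C_j^i[v[j]]
  &= \bigoplus_{i=1}^{n-1} A_j^i[v[j]]
  && \text{(since } A_j^i[v[j]] = C_j^i[v[j]] \text{ for } x \in X_n\text{)} \\
  &= A_j^n[v[j]]
  && \text{(since } A_j^n = A_j^1 \oplus \dots \oplus A_j^{n-1}\text{)}
\end{aligned}
```

Concatenating these bits over all $j \in [w]$ gives identical inputs to $H_{2}$ on both sides, so the two OPRF values coincide.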

4.4. Protocol Security

Theorem 1.

If $F$ is a PRF, $H_{1}$ and $H_{2}$ are random oracles, and the underlying OT is secure against malicious receivers, then our MPSI protocol achieves one-sided malicious security against malicious clients when $m$, $w$, $l_{1}$, and $l_{2}$ are chosen appropriately.

Proof of Theorem 1.

We consider any subset of the clients $P = \{P_{1}, \dots, P_{n - 1}\}$ corrupted by an adversary $A$. Let the $l$ clients $P_{1}, \dots, P_{l}$ be corrupted, so that the number of uncorrupted clients is $(n - l - 1)$. Given $\{X_{i}\}_{i \in [l]}$, the simulator $S$ interacts with $\{P_{i}\}_{i \in [l]}$ as follows.

$S$ samples random matrices $\{C^{i}\}_{i \in [l]} \in \{0, 1\}^{m \times w}$ and runs the malicious OT simulator on $\{P_{i}\}_{i \in [l]}$ with outputs $C_{1}^{i}, \dots, C_{w}^{i}$. $S$ honestly chooses a PRF key $k$ and sends $k$ to $\{P_{i}\}_{i \in [l]}$. The simulator $S$ constructs random data structures representing the honest parties according to the randomness of the matrices. The tables $T_{1}^{i}$ and $T_{2}$ are initialized as empty. On $P_{i \in [n - 1]}$'s query $x$ to $H_{1}$, $S$ records $(x, H_{1}(x))$ in table $T_{1}^{i}$. On $P_{n - 1}$'s query $y$ to $H_{2}$, $S$ records $(y, H_{2}(y))$ in table $T_{2}$. When $P_{n}$ receives the set of OPRF values $Ψ$, $S$ finds all $φ \in Ψ$ such that $φ = H_{2}(y)$ for some $y$ in $T_{2}$, and $y = \oplus_{i \in [l]}(C_{1}^{i}[v[1]] \| \dots \| C_{w}^{i}[v[w]]) \oplus (\oplus_{j \in [n - l - 1]}(C_{1}^{j}[v[1]] \| \dots \| C_{w}^{j}[v[w]]))$ where $v = F_{k}(H_{1}(x))$ for $x \in T_{1}^{1} \cap \dots \cap T_{1}^{n - 1}$. Finally, $S$ sends these $x$ to the ideal functionality.

Let $Q_{1}^{i}$ and $Q_{2}$ be the sets of queries that $P_{i \in [n - 1]}$ and $P_{n - 1}$ make to $H_{1}$ and $H_{2}$, respectively, and let $Q = \cap_{i = 1}^{n - 1} Q_{1}^{i}$. We write $|Q_{1}^{i}|$ and $|Q_{2}|$ for the sizes of these sets. We abuse notation as follows: for a matrix $C_{m \times w}$ and a vector $v \in [m]^{w}$, $C[v]$ means $C_{1}[v[1]] \| \dots \| C_{w}[v[w]]$. For a set $V$ of vectors in $[m]^{w}$, $C[V]$ denotes the set $\{C[v] \mid v \in V\}$.

We prove $\mathrm{Real}_{A}^{\Pi}(X_{1}, \dots, X_{n}) \overset{c}{\approx} \mathrm{Ideal}_{S}^{F}(X_{1}, \dots, X_{n})$ through the following sequence of hybrids.

Hyb0: The outputs of parties in the real world.
Hyb1: Similar to $Hyb 0$ , but $S$ runs the OT simulator on ${P_{i}}_{i \in [l]}$ to obtain $s_{i}$ . For each $j \in [w]$ , if $s_{i} [j] = 0$ , it randomly chooses a string $A_{j}^{i}$ of length $m$ and sets $B_{j}^{i} = A_{j}^{i} \oplus E_{j}$ ; otherwise, it randomly chooses a string $B_{j}^{i}$ of length $m$ and sets $A_{j}^{i} = B_{j}^{i} \oplus E_{j}$ . It then gives $C_{1}^{i}, \dots, C_{w}^{i}$ to the OT simulator as output. $Hyb 1$ is computationally indistinguishable from $Hyb 0$ due to the security of OT against a malicious receiver.
Hyb2: Similar to $Hyb 1$ except that the protocol terminates if there exists $x^{a}, x^{b} \in X_{1} \cup X_{2} \cup \dots \cup X_{n}$ , $x^{a} \neq x^{b}$ such that $H_{1} (x^{a}) = H_{1} (x^{b})$ . Since $H_{1}$ is a random oracle, the protocol is aborted with negligible probability.
Hyb3: Same as $Hyb 2$ , but, for each OPRF value $φ$ received by $P_{n}$ , if $φ \notin H_{2} (Q_{2})$ , then $P_{n}$ ignores $φ$ . This changes $P_{n}$ ’s output only if such a $φ$ equals the output of $H_{2}$ on one of $P_{n}$ ’s elements. Since $H_{2}$ is a random oracle, this happens with negligible probability.
Hyb4: Same as $Hyb 3$ except that the protocol terminates if there exists $y \in Q_{2}$ , $y^{'} \in A [F_{k} (H_{1} (X_{n}))]$ with $y \neq y^{'}$ and $H_{2} (y) = H_{2} (y^{'})$ . Since $H_{2}$ is a random oracle, the protocol is aborted with negligible probability.
Hyb5: Same as $Hyb 4$ , but, for each OPRF value $φ$ received by $P_{n}$ , $P_{n}$ ignores $φ$ when calculating the set intersection if $φ = H_{2} (y)$ for some $y \in Q_{2}$ , where $y \notin (\oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (Q))])) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (Q))]))$ .
This hybrid changes the output only if there exists $x \in X_{n}$ satisfying $φ = H_{2} (A [F_{k} (H_{1} (x))])$ , which implies $y = A [F_{k} (H_{1} (x))]$ by the termination condition added in $Hyb 4$ .
Note that if $x \in X_{n}$ and $x \in Q$ , then, because of the construction of $E$ , we have $y = A [F_{k} (H_{1} (x))] = \oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x))])) \in (\oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (Q))]))$ $\oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (Q))]))$ . Therefore, we need only consider $x \in X_{n} \setminus Q$ . Since, for all $x \in X_{n}$ , $A [F_{k} (H_{1} (x))] = \oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x))]))$ , the output of $Hyb 5$ changes only if there exist $x \in X_{n} \setminus Q$ , $y \in Q_{2}$ satisfying $y = \oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x))]))$ .
Suppose there is a PPT adversary $A$ that, with non-negligible probability, produces $Q$ , $Q_{2}$ , and $X_{n}$ such that there exist $y \in Q_{2}$ , $x \in X_{n} \setminus Q$ satisfying $y = \oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x))]))$ . Then, as shown in [5], we can break the security of the PRF.
Hyb6: Same as $Hyb 5$ except that the protocol terminates if there exist $x^{a} \in Q$ , $x^{b} \in X_{n}$ such that $y = \oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x^{a}))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x^{a}))])) = A [F_{k} (H_{1} (x^{b}))]$ but $x^{a} \neq x^{b}$ . The protocol is aborted with negligible probability because of the security of the PRF.
Hyb7: Same as $Hyb 6$ except that $P_{n}$ ’s outputs are substituted by its outputs in the ideal world. $Hyb 7$ can change $P_{n}$ ’s outputs if and only if there exists a value $φ$ received by $P_{n}$ and considered by $P_{n}$ such that $φ = H_{2} (\oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x^{a}))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x^{a}))])))$ for some $x^{a} \in Q$ , and $φ = H_{2} (A [F_{k} (H_{1} (x^{b}))])$ for some $x^{b} \in X_{n}$ , $x^{a} \neq x^{b}$ . Because $H_{2}$ is a random oracle, this implies, except with negligible probability, that $\oplus_{i \in [l]} (C^{i} [F_{k} (H_{1} (x^{a}))]) \oplus (\oplus_{j \in [n - l - 1]} (C^{j} [F_{k} (H_{1} (x^{a}))])) = A [F_{k} (H_{1} (x^{b}))]$ , a case that is aborted by the termination condition in $Hyb 6$ .
Hyb8: Same as $Hyb 7$ except that the protocol does not terminate. $Hyb 7$ and $Hyb 8$ are computationally indistinguishable since $H_{1}$ and $H_{2}$ are random oracles and $F_{k}$ is a PRF.
Hyb9: The output in the ideal world. The difference between $Hyb 9$ and $Hyb 8$ is that $S$ samples a random matrix $C$ and encodes a data structure PaXoS, which is identically distributed.

□

5. Performance Evaluation

5.1. Complexity Analysis

To evaluate the complexity of the protocol, we first analyze the overall protocol process. Note that the protocol uses only inexpensive tools such as OTs and bitwise operations, making it concretely efficient. We treat $t$ as the set size and set $m = t$ as in [5]. With $m$ and $t$ fixed, $w$ can be viewed as a value that depends on $λ$.

Party $P_{n}$ is the leader and carries the majority of the protocol's overhead, while the other parties are clients. $P_{n}$ designs matrices of a particular form, which requires complexity linear in $t$. It then performs $w$ OTs with each client independently, resulting in complexity linear in the number of OTs. Moreover, each $P_{i \in [n - 2]}$ only performs the encoding operation for its data structure $D^{i}$, and $P_{n - 1}$ performs hashing, bitwise-XOR, and decoding operations, all of which require linear communication and computation complexity. Although the computational overhead of $P_{n - 1}$ is larger than that of the other clients, it does not need to encode and send a data structure. Hence, we can regard the overall communication and computation costs as roughly evenly distributed across all clients.

Note that our protocol can be divided into offline and online phases. Only lightweight procedures are required in the online phase, and communication and computation costs associated with performing OT can be handled in the offline phase. In addition, the bits exchanged among the parties concerning the random OT and the optimized malicious OT extension are summarized in Table 3.

5.2. Comparison

Because the architectures and security levels of the protocols vary, making a fair comparison is challenging. Nevertheless, we have endeavored to include recent studies covering diverse security models (e.g., semi-honest and malicious). We compare the communication and computation complexity with [13,14,15] in Table 4, where $n$ is the number of parties, $k$ is the number of hash functions, $t$ is the size of the input sets, and $λ$ is the security parameter. In our MPSI protocol, the communication and computation complexity of the leader is $O(t n λ)$, which is linear in the number of parties. Meanwhile, the complexity for a client is $O(t λ)$, independent of the number of parties, because the client $P_{i \in [n - 1]}$ only needs to compute and send a data structure $D^{i}$ and does not need to perform additional data transfers with other parties. Therefore, our protocol achieves a good trade-off between communication and computation overhead.

Figure 5 shows the security levels of the discussed protocols. Compared with [13], our protocol achieves a stronger security model without sacrificing communication and computation costs. Our protocol achieves one-sided malicious security, whereas [14] achieves augmented (Aug) semi-honest security. It is difficult to say which of these two security models is more practical, but our protocol has better computation and communication performance. Although the security model in [15] is stronger, our protocol has better communication performance and achieves a better trade-off between communication and computation.

5.3. Experimental Evaluation

To compare the runtime overhead of the protocols more intuitively, we performed simulation experiments and analyzed the results. The reported running times are averages over multiple experiments. The experimental platform was Windows 10 with an Intel(R) Core(TM) i5-8250U CPU @ 1.60 GHz 1.80 GHz and 8.00 GB of RAM, and the code was compiled with Dev-C++ 5.11.

We first consider the total time required for each protocol to execute with different numbers of set elements. We fix $n = 100$ and $k = λ = 128$ and choose $t = 2^{10}, 2^{11}, 2^{12}, 2^{13}$ for the comparison experiment. Figure 6 shows the total running time of each protocol as a function of the number of elements contained in the set.

As Figure 6 shows, the total time overhead of each protocol grows essentially linearly as the number of set elements increases. However, the running time of our MPSI protocol grows the most slowly among the compared protocols.

In addition, we consider the effect of the number of parties on the running time of the protocol. Suppose that the maximum number of elements contained in the set is $t = 1000$, the security parameters are kept fixed at $k = λ = 128$, and the number of parties $n = 10^{1}, 10^{2}, 10^{3}, 10^{4}$ is selected for the comparison experiment. The total protocol runtime as a function of the number of parties is shown in Figure 7.

From Figure 7, the running time of all protocols increases gradually with the number of parties. The time overheads of our MPSI protocol are lower than those of the other protocols when n is fixed. In addition, our MPSI protocol has the slowest time growth rate.

6. MPCCache in Edge Computing

This section addresses the problem of edge collaborative content caching, wherein all parties jointly cache the most frequently accessed common data items in shared caches. Figure 8 shows the difference between the traditional cache model and the edge cache model. The challenge is to determine a set of the most frequently accessed common items without revealing any underlying data.

6.1. Our MPCCache

We describe how to use our MPCCache protocol to handle the edge cache case. Each network operator $P_{i \in [n]}$ owns a set $K_{i} = \{(x_{1}^{i}, z_{1}^{i}), \dots, (x_{t}^{i}, z_{t}^{i})\}$, where $x^{i} \in \{0, 1\}^{*}$ denotes an identifier and $z^{i} \in \{0, 1\}^{w}$ denotes its associated value. The latter may represent the anticipated frequency with which the content is accessed or the value of the cached content to the network operators. Let the common items $I = \cap_{i = 1}^{n} X_{i} = \{x_{1}, x_{2}, \dots\}$ be the intersection of the identifiers, where $X_{i} = \{x_{1}^{i}, \dots, x_{t}^{i}\}$ is the set of identifiers of $P_{i \in [n]}$. For each common item $x \in I$, the sum of the associated values $z$ is calculated; that is, $sum^{(x)} = \sum_{i = 1}^{n} z^{i}$, where $z^{i}$ is the value that $P_{i}$ associates with $x$. In other words, the sum of a common item is the total of the individual values that the operators hold for that item.
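As a plaintext reference for what the protocol is meant to compute (with no privacy at all), the following sketch intersects the identifier sets and sums the associated values of each common item. The operator tables, the identifiers, and the helper names `intersection_sums` and `top_items` are hypothetical and chosen only for illustration.

```python
from typing import Dict, List


def intersection_sums(tables: List[Dict[str, int]]) -> Dict[str, int]:
    """For every identifier held by all parties, sum the parties' associated values."""
    if not tables:
        return {}
    common = set(tables[0])
    for table in tables[1:]:
        common &= set(table)
    return {x: sum(table[x] for table in tables) for x in common}


def top_items(sums: Dict[str, int], k: int) -> List[str]:
    """Return the k common identifiers with the largest sums (ties broken by identifier)."""
    return sorted(sums, key=lambda x: (-sums[x], x))[:k]


# Three hypothetical operators K_1, K_2, K_3 (identifier -> associated value).
K = [
    {"video-a": 5, "video-b": 2, "video-c": 9},
    {"video-a": 1, "video-c": 4, "video-d": 7},
    {"video-a": 3, "video-b": 8, "video-c": 6},
]
sums = intersection_sums(K)       # {"video-a": 9, "video-c": 19}
best = top_items(sums, 1)         # ["video-c"]
```

Only `video-a` and `video-c` are held by all three operators, so only these two receive a sum; the secure protocol aims to reveal this kind of output without exposing the individual tables.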

We present the MPCCache protocol in Figure 9. $P_{i \in [n - 1]}$ conduct simple hashing and $P_{n}$ conducts cuckoo hashing, which maps common items to the same bucket. Using the PaXoS, all the buckets are compressed into a single data structure so that $P_{n}$ can efficiently compute the MPCCache.

In detail, $P_{i \in [n - 1]}$ choose $q_{j}^{i}$ and $s_{j}^{i}$ uniformly at random for $j \in [β]$. Notice that $A_{j}^{n} [v [j]] = \oplus_{i = 1}^{n - 1} C_{j}^{i} [v [j]]$ for each $x \in I$. For $(x, z) \in K_{i}$ and $v = F_{k} (H_{1} (x))$, $P_{i \in [n - 1]}$ compute $f_{x_{j}}^{i} \overset{\mathrm{def}}{=} (C_{1}^{i} [v [1]] \| \dots \| C_{w}^{i} [v [w]]) \oplus q_{j}^{i}$ and $g_{x_{j}}^{i} \overset{\mathrm{def}}{=} z - s_{j}^{i}$, and send the encodings $Encode (x \| j, f_{x_{j}}^{i})$ and $Encode (x \| j, g_{x_{j}}^{i})$ to $P_{n}$, where $x_{j} = (HT [j] \| j)$ means that $x$ is in the $j$-th bucket.

$P_{n}$ can use $x_{j}^{n} \in \{(x \| j) \mid x \in GT_{n} [j]\}_{j \in [β]}$ to obtain the correct decodings $f_{x_{j}}^{i}$ and $g_{x_{j}}^{i}$ if $x_{j}^{n} = x_{j}^{i}$; otherwise, the decoded values are random. Then, $P_{n}$ computes $q_{j}^{n} \overset{\mathrm{def}}{=} \oplus_{i = 1}^{n - 1} f_{x_{j}}^{i} \oplus (A_{1}^{n} [v [1]] \| \dots \| A_{w}^{n} [v [w]])$ and $s_{j}^{n} \overset{\mathrm{def}}{=} \sum_{i = 1}^{n - 1} g_{x_{j}}^{i} + z$. Finally, $P_{i \in [n]}$ input $q_{j \in [β]}^{i}$ and $s_{j \in [β]}^{i}$, respectively, into a garbled circuit that checks whether $\oplus_{i = 1}^{n} q_{j}^{i} = 0$ and, if so, outputs the sum of the corresponding common item, $\sum_{i = 1}^{n} s_{j}^{i}$.
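The masking and share-combination step for a single bucket can be illustrated with a toy simulation. It is a sketch under loud assumptions, not the protocol itself: a Python dict (`ToyStore`) stands in for the PaXoS and offers no obliviousness or security, the matrix-derived strings $C^{i}$ are replaced by random bytes, the relation between $A^{n}$ and the $C^{i}$ is simply imposed, values are added modulo $2^{32}$, and the garbled circuit is replaced by a plain function. All names and parameters below are hypothetical.

```python
import secrets
from functools import reduce

MASK_BYTES = 16            # assumed length of the concatenated matrix entries
VALUE_BITS = 32            # assumed bit length of an associated value z
MOD = 1 << VALUE_BITS      # additive shares are taken modulo 2^VALUE_BITS (assumption)


def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def xor_all(items) -> bytes:
    return reduce(xor_bytes, items, bytes(MASK_BYTES))


class ToyStore:
    """Dict-based stand-in for a PaXoS: unknown keys decode to random values. NOT secure."""

    def __init__(self, mapping, sampler):
        self._mapping = dict(mapping)
        self._sampler = sampler

    def decode(self, key):
        return self._mapping[key] if key in self._mapping else self._sampler()


def client_encode(x, j, c_i, z_i):
    """Client P_i: mask its string with q_i and its value with s_i, then encode both."""
    q_i = secrets.token_bytes(MASK_BYTES)
    s_i = secrets.randbelow(MOD)
    d_x = ToyStore({(x, j): xor_bytes(c_i, q_i)}, lambda: secrets.token_bytes(MASK_BYTES))
    d_z = ToyStore({(x, j): (z_i - s_i) % MOD}, lambda: secrets.randbelow(MOD))
    return q_i, s_i, d_x, d_z


def leader_shares(x, j, a_n, z_n, stores):
    """Leader P_n: decode every client's structures at (x, j) and derive q_n and s_n."""
    f = [d_x.decode((x, j)) for d_x, _ in stores]
    g = [d_z.decode((x, j)) for _, d_z in stores]
    return xor_bytes(xor_all(f), a_n), (sum(g) + z_n) % MOD


def circuit(q_shares, s_shares):
    """Plain stand-in for the garbled circuit: reveal the sum only if the q-shares cancel."""
    if xor_all(q_shares) == bytes(MASK_BYTES):
        return sum(s_shares) % MOD
    return None


def run(leader_item, client_item, client_values, leader_value, j=0):
    c = [secrets.token_bytes(MASK_BYTES) for _ in client_values]
    a_n = xor_all(c)  # relation between A^n and the C^i for common items, imposed here
    encoded = [client_encode(client_item, j, c_i, z_i) for c_i, z_i in zip(c, client_values)]
    stores = [(d_x, d_z) for _, _, d_x, d_z in encoded]
    q_n, s_n = leader_shares(leader_item, j, a_n, leader_value, stores)
    return circuit([e[0] for e in encoded] + [q_n], [e[1] for e in encoded] + [s_n])
```

With a matching item, for example `run("x", "x", [5, 2], 9)`, the masks cancel and the function returns `16`. With a mismatched item, such as `run("y", "x", [5, 2], 9)`, the decodings are random and the check fails except with negligible probability, so the result is `None`.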

6.2. Correctness and Security

Correctness: Section 4.3 proves that $A_{j}^{n} [v [j]] = \oplus_{i = 1}^{n - 1} C_{j}^{i} [v [j]]$ for each $x \in I$ and $v = F_{k} (H_{1} (x))$; that is, $A_{1}^{n} [v [1]] \| \dots \| A_{w}^{n} [v [w]] = \oplus_{i = 1}^{n - 1} (C_{1}^{i} [v [1]] \| \dots \| C_{w}^{i} [v [w]])$. By the property of the PaXoS data structures $D_{x}^{i}$ and $D_{z}^{i}$ constructed by $P_{i \in [n - 1]}$, for $x \in I$, $j \in [β]$, and $v = F_{k} (H_{1} (x))$, we always have $\oplus_{i = 1}^{n - 1} Decode (D_{x}^{i}, x \| j) = \oplus_{i = 1}^{n - 1} ((C_{1}^{i} [v [1]] \| \dots \| C_{w}^{i} [v [w]]) \oplus q_{j}^{i})$ and $\sum_{i = 1}^{n - 1} Decode (D_{z}^{i}, x \| j) = \sum_{i = 1}^{n - 1} (z^{i} - s_{j}^{i})$. At the same time, $P_{n}$ defines $q_{j}^{n} \overset{\mathrm{def}}{=} (\oplus_{i = 1}^{n - 1} Decode (D_{x}^{i}, x \| j)) \oplus (A_{1}^{n} [v [1]] \| \dots \| A_{w}^{n} [v [w]])$ and $s_{j}^{n} \overset{\mathrm{def}}{=} \sum_{i = 1}^{n - 1} Decode (D_{z}^{i}, x \| j) + z$ in terms of the $D_{x}^{i}$ and $D_{z}^{i}$ it receives from $P_{i \in [n - 1]}$. That is, when $x \in I$, it always holds that $\oplus_{i = 1}^{n} q_{j}^{i} = 0$ and $\sum_{i = 1}^{n} s_{j}^{i} = \sum_{i = 1}^{n} z^{i}$.
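The two identities can be checked by direct substitution. In the sketch below, $c^{i}$ and $a^{n}$ are shorthand introduced here for $C_{1}^{i} [v [1]] \| \dots \| C_{w}^{i} [v [w]]$ and $A_{1}^{n} [v [1]] \| \dots \| A_{w}^{n} [v [w]]$, $z^{n}$ denotes the value $z$ held by $P_{n}$, and the value shares are read as additive; these conventions are assumptions made for the derivation rather than notation from the protocol figure.

```latex
\begin{align*}
\bigoplus_{i=1}^{n} q_{j}^{i}
  &= \Big(\bigoplus_{i=1}^{n-1} q_{j}^{i}\Big) \oplus q_{j}^{n}
   = \Big(\bigoplus_{i=1}^{n-1} q_{j}^{i}\Big) \oplus
     \bigoplus_{i=1}^{n-1} \big(c^{i} \oplus q_{j}^{i}\big) \oplus a^{n} \\
  &= \Big(\bigoplus_{i=1}^{n-1} c^{i}\Big) \oplus a^{n} = 0
     \qquad \text{since } a^{n} = \bigoplus_{i=1}^{n-1} c^{i} \text{ for } x \in I, \\[4pt]
\sum_{i=1}^{n} s_{j}^{i}
  &= \sum_{i=1}^{n-1} s_{j}^{i} + s_{j}^{n}
   = \sum_{i=1}^{n-1} s_{j}^{i} + \sum_{i=1}^{n-1} \big(z^{i} - s_{j}^{i}\big) + z^{n}
   = \sum_{i=1}^{n} z^{i}.
\end{align*}
```

In both lines the clients' random masks $q_{j}^{i}$ and $s_{j}^{i}$ appear exactly twice and cancel, which is why the combined shares reveal only the zero test and the sum.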

Theorem 2. If $F$ is a PRF and $H_{1}$ is a random oracle, then the construction of our MPCCache protocol has colluding semi-honest security, given the OT, PaXoS, GC, and appropriate parameters.

Proof of Theorem 2. Suppose that $l$ parties $\{P_{i}\}_{i \in [l]}$ are corrupted by an adversary $\mathcal{A}$, so that the number of uncorrupted parties is $(n - l)$. Given $\{K_{i}\}_{i \in [l]}$, the simulator $S$ interacts with $\{P_{i}\}_{i \in [l]}$ as follows. $S$ samples random matrices, performs OT, chooses the PRF key $k$, and sends $k$ to $\{P_{i}\}_{i \in [l]}$. $S$ then constructs random data structures representing the honest parties according to the randomness of the matrices, and sends the two data structures $D_{x}^{i}$ and $D_{z}^{i}$ constructed on a PaXoS to the ideal functionality. We prove that $\mathrm{Real}_{\mathcal{A}}^{\Pi} (K_{1}, \dots, K_{n}) \overset{c}{\approx} \mathrm{Ideal}_{S}^{\mathcal{F}} (K_{1}, \dots, K_{n})$.

Hyb0: The outputs of parties in the real world.
Hyb1: Same as $Hyb 1$ , $Hyb 2$ , and $Hyb 6$ in Section 4.4.
Hyb2: Similar to $Hyb 1$ except that the decoding executions of the PaXoS are replaced as follows. When $\{P_{i}\}_{i \in [l]}$ does not contain $P_{n}$ , $S$ receives nothing from the data structure PaXoS. When $\{P_{i}\}_{i \in [l]}$ contains $P_{n}$ , if $x \in I$ , $P_{n}$ receives $D_{x}^{i}$ and $D_{z}^{i}$ and thus obtains $(C_{1}^{i} [v [1]] \| \dots \| C_{w}^{i} [v [w]]) \oplus q_{j}^{i}$ and $(z^{i} - s_{j}^{i})$ for the PaXoS involving the non-colluding parties $\{P_{i}\}_{i \in [n - l]}$ and $j \in [β]$ . Note that $q_{j}^{i}$ and $s_{j}^{i}$ are used in the above expressions for each bin $j \in [β]$ . Since these values are uniform, so are $D_{x}^{i}$ and $D_{z}^{i}$ . Therefore, we replace the decoding outputs of the PaXoS with random ones. Otherwise (if $x \notin I$ ), all the decoding outputs of the PaXoS are uniformly random from the perspective of $P_{n}$ and $\{P_{i}\}_{i \in [l]}$ . $Hyb 2$ is computationally indistinguishable from $Hyb 1$ due to the PaXoS’s security.
Hyb3: The output in the ideal world. The only difference between $Hyb 3$ and $Hyb 2$ is that $S$ executes the output of the circuit.

□

7. Conclusions

In this work, we design an efficient MPSI protocol and the MPCCache protocol to better address the information leakage problem in resource sharing. The proposed MPSI protocol, derived from a multi-point OPRF, demonstrates concrete efficiency while achieving one-sided malicious security. The protocol also leads to a better trade-off between communication and computational overhead. It is based on OT and the PaXoS data structure and achieves computation and communication complexity that is linear in the input set size of each party. In our MPSI protocol, the asymptotic communication and computational complexity of the clients is largely determined by the size of the input sets rather than the number of parties (namely, $O (t λ)$). Overall, this research contributes to the development of efficient MPSI protocols for multiple parties in practice. We also apply the MPCCache protocol to edge caching scenarios using a simple transformation of the MPSI protocol. The MPCCache protocol, under the semi-honest model, can support the computation of specific functions on intersections. We believe that future work can improve the fairness of the MPSI protocol and propose more application scenarios of practical value.

Author Contributions

Conceptualization, J.Z., L.Y. and Y.T.; methodology, L.Y. and Y.T.; validation, J.Z., L.Y. and Y.T.; formal analysis, L.Y. and M.J.; writing—original draft preparation, L.Y.; writing—review and editing, Y.T., S.W. and M.J.; supervision, Y.T. and S.W.; funding acquisition, J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Henan Key Laboratory of Network Cryptography Technology (No. LNCT2022-A11), the Henan Province Key R&D and Promotion Special Project (No. 212102210166), and the PhD Foundation of Henan Polytechnic University (No. B2021-41).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Wei, X.; Xu, L.; Cai, G.; Wang, H. Secure approximate pattern matching protocol via Boolean threshold private set intersection. Int. J. Intell. Syst. 2022, 37, 9245–9266.
2. Kales, D.; Rechberger, C.; Schneider, T.; Senker, M.; Weinert, C. Mobile private contact discovery at scale. In Proceedings of the 28th USENIX Security Symposium (USENIX Security 19), Santa Clara, CA, USA, 14–16 August 2019; pp. 1447–1464.
3. Ion, M.; Kreuter, B.; Nergiz, E.; Patel, S.; Saxena, S.; Seth, K.; Shanahan, D.; Yung, M. Private intersection-sum protocol with applications to attributing aggregate ad conversions. Cryptol. ePrint Arch. 2017. preprint. Available online: https://eprint.iacr.org/2017/738 (accessed on 11 September 2023).
4. Nguyen, D.T.; Trieu, N. MPCCache: Privacy-preserving multi-party cooperative cache sharing at the edge. In Financial Cryptography and Data Security: 26th International Conference, FC 2022, Grenada; Springer International Publishing: Berlin/Heidelberg, Germany, 2022; pp. 80–99.
5. Chase, M.; Miao, P. Private set intersection in the internet setting from lightweight oblivious PRF. In Proceedings of the Advances in Cryptology–CRYPTO 2020: 40th Annual International Cryptology Conference, Santa Barbara, CA, USA, 17–21 August 2020; pp. 34–63.
6. Pinkas, B.; Rosulek, M.; Trieu, N.; Yanai, A. SpOT-light: Lightweight private set intersection from sparse OT extension. In Proceedings of the Advances in Cryptology–CRYPTO 2019: 39th Annual International Cryptology Conference, Santa Barbara, CA, USA, 18–22 August 2019; pp. 401–431.
7. Pinkas, B.; Schneider, T.; Zohner, M. Scalable private set intersection based on OT extension. ACM Trans. Priv. Secur. TOPS 2018, 21, 1–35.
8. Cong, K.; Moreno, R.C.; da Gama, M.B.; Dai, W.; Iliashenko, I.; Laine, K.; Rosenberg, M. Labeled PSI from homomorphic encryption with reduced computation and communication. In Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, 15–19 November 2021; pp. 1135–1150.
9. Chen, H.; Huang, Z.; Laine, K.; Rindal, P. Labeled PSI from fully homomorphic encryption with malicious security. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, Toronto, ON, Canada, 15–19 October 2018; pp. 1223–1237.
10. Pinkas, B.; Schneider, T.; Weinert, C.; Wieder, U. Efficient circuit-based PSI via cuckoo hashing. In Proceedings of the Advances in Cryptology–EUROCRYPT 2018: 37th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Tel Aviv, Israel, 29 April–3 May 2018; pp. 125–157.
11. Pinkas, B.; Schneider, T.; Tkachenko, O.; Yanai, A. Efficient circuit-based PSI with linear communication. In Proceedings of the Advances in Cryptology–EUROCRYPT 2019: 38th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Darmstadt, Germany, 19–23 May 2019; pp. 122–153.
12. Chandran, N.; Gupta, D.; Shah, A. Circuit-PSI With Linear Complexity via Relaxed Batch OPPRF. Proc. Priv. Enhancing Technol. 2022, 1, 353–372.
13. Kavousi, A.; Mohajeri, J.; Salmasizadeh, M. Efficient scalable multi-party private set intersection using oblivious PRF. In Proceedings of the Security and Trust Management: 17th International Workshop, STM 2021, Darmstadt, Germany, 8 October 2021; pp. 81–99.
14. Inbar, R.; Omri, E.; Pinkas, B. Efficient scalable multiparty private set-intersection via garbled bloom filters. In Proceedings of the Security and Cryptography for Networks: 11th International Conference, SCN 2018, Amalfi, Italy, 5–7 September 2018; pp. 235–252.
15. Ghosh, S.; Nilges, T. An algebraic approach to maliciously secure private set intersection. In Proceedings of the Advances in Cryptology–EUROCRYPT 2019: 38th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Darmstadt, Germany, 19–23 May 2019; pp. 154–185.
16. Kolesnikov, V.; Kumaresan, R.; Rosulek, M.; Trieu, N. Efficient batched oblivious PRF with applications to private set intersection. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, 24–28 October 2016; pp. 818–829.
17. Pinkas, B.; Schneider, T.; Zohner, M. Faster private set intersection based on OT extension. In Proceedings of the 23rd USENIX Security Symposium (USENIX Security 14), San Diego, CA, USA, 20–22 August 2014; pp. 797–812.
18. Nevo, O.; Trieu, N.; Yanai, A. Simple, fast malicious multiparty private set intersection. In Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, 15–19 November 2021; pp. 1151–1165.
19. Pinkas, B.; Rosulek, M.; Trieu, N.; Yanai, A. PSI from PaXoS: Fast, malicious private set intersection. In Proceedings of the Advances in Cryptology–EUROCRYPT 2020: 39th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Zagreb, Croatia, 10–14 May 2020; pp. 739–767.
20. Ben-Efraim, A.; Nissenbaum, O.; Omri, E.; Paskin-Cherniavsky, A. Psimple: Practical multiparty maliciously-secure private set intersection. In Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security, 30 May–2 June 2022; pp. 1098–1112.
21. Bui, D.; Couteau, G. Private Set Intersection from Pseudorandom Correlation Generators. IACR Cryptol. ePrint Arch. 2022, 2022, 334.
22. Chida, K.; Hamada, K.; Ichikawa, A.; Kii, M.; Tomida, J. Communication-Efficient Inner Product Private Join and Compute with Cardinality. Cryptol. ePrint Arch. 2022. preprint. Available online: https://eprint.iacr.org/2022/338 (accessed on 11 September 2023).
23. Miao, P.; Patel, S.; Raykova, M.; Seth, K.; Yung, M. Two-sided malicious security for private intersection-sum with cardinality. In Proceedings of the Advances in Cryptology–CRYPTO 2020: 40th Annual International Cryptology Conference, Santa Barbara, CA, USA, 17–21 August 2020; pp. 3–33.
24. Goldreich, O. Foundations of Cryptography: Volume 2, Basic Applications; Cambridge University Press: New York, NY, USA, 2009.
25. Rabin, M.O. How to exchange secrets with oblivious transfer. Cryptol. ePrint Arch. 2005. preprint. Available online: https://eprint.iacr.org/2005/187 (accessed on 11 September 2023).
26. Ishai, Y.; Kilian, J.; Nissim, K.; Petrank, E. Extending Oblivious Transfers Efficiently. Crypto 2003, 2729, 145–161.

Figure 1. Ideal functionality of MPSI $F_{MPSI}$.

Figure 2. Ideal functionality of OT $F_{OT}$.

Figure 3. System model.

Figure 4. Our MPSI protocol.

Figure 5. Comparison of security levels.

Figure 6. Running time vs. set cardinality.

Figure 7. Running time vs. the number of parties.

Figure 8. Traditional cache model and edge cache model.

Figure 9. Our MPCCache protocol.

Table 1. The related work of PSI.

Protocol	Technique	Number of Parties	Security Model
[15]	OLE	multi-party	Malicious
[16]	OT	two-party	Semi-Honest
[17]	GBF + OT	two-party	Semi-Honest
[18]	OPPRF + OKVS	multi-party	Malicious
[19]	PaXoS	two-party	Malicious
[20]	GBF	multi-party	Malicious
[21]	PCG	two-party	Semi-Honest

Table 2. The related work of function-based PSI.

Protocol	Technique	Number of Parties	Protocol Type
[3]	DDH + HE	two-party	PI-Sum
[4]	OPPRF	multi-party	MPCCache
[11]	OPPRF + Circuit	multi-party	PSI-payload
[22]	OPRF + DDH	two-party	PIW-Sum
[23]	DOPRF	two-party	PSI-CA

Table 3. Bits sent by the leader and the clients.

Communication Direction	Total Bits Transmitted
$P_{n} \to P_{i \in [n - 1]}$	$t w$
$P_{i \in [n - 1]} \to P_{n}$	$w (λ - 1)$
$P_{i \in [n - 2]} \to P_{n - 1}$	$t w$
$P_{n - 1} \to P_{n}$	$t l_{2}$

Table 4. Complexity of MPSI protocols.

Protocol	Communication (Leader)	Communication (Clients)	Computation (Leader)	Computation (Clients)	Security Model
[13]	$O (t n λ)$	$O (t λ k)$	$O (t n λ)$	$O (t n λ)$	Semi-Honest
[14]	$O (\log (n) t n λ k)$	$O (\log (n) t n λ k)$	$O (t n λ k)$	$O (t n λ k)$	Aug Semi-Honest
[15]	$O ((n^{2} + t n) λ)$	$O (t λ)$	$O (t n \log (t))$	$O (t \log (t))$	Malicious
Ours	$O (t n λ)$	$O (t λ)$	$O (t n λ)$	$O (t λ)$	One-sided Malicious
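To see how the asymptotic expressions in Table 4 scale with the number of parties, the leading terms can be evaluated numerically. This is a rough sketch only: every hidden constant is set to 1 and logarithms are taken base 2, both of which are assumptions, so the numbers indicate growth behaviour and are not measured costs. The function name `leading_terms` and the parameter choices are hypothetical.

```python
import math


def leading_terms(t: int, n: int, lam: int, k: int) -> dict:
    """Leading terms from Table 4 with all hidden constants set to 1 (an assumption)."""
    log_n, log_t = math.log2(n), math.log2(t)  # base-2 logarithm is an assumption
    return {
        "[13]": dict(comm_leader=t * n * lam, comm_clients=t * lam * k,
                     comp_leader=t * n * lam, comp_clients=t * n * lam),
        "[14]": dict(comm_leader=log_n * t * n * lam * k, comm_clients=log_n * t * n * lam * k,
                     comp_leader=t * n * lam * k, comp_clients=t * n * lam * k),
        "[15]": dict(comm_leader=(n**2 + t * n) * lam, comm_clients=t * lam,
                     comp_leader=t * n * log_t, comp_clients=t * log_t),
        "Ours": dict(comm_leader=t * n * lam, comm_clients=t * lam,
                     comp_leader=t * n * lam, comp_clients=t * lam),
    }


# Same set size and security parameters, two different numbers of parties.
small = leading_terms(t=2**12, n=10, lam=128, k=128)
large = leading_terms(t=2**12, n=1000, lam=128, k=128)
```

Comparing `small` and `large` shows that the client-side terms in the "Ours" row do not change with $n$, whereas the client computation term of [13] grows by the same factor as $n$.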

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
