6.1. Comparison of the Performance of Our Scheme
Proof of Theorem 1. In our scheme, the transcript is produced as follows. At each update request, data are stored as (address, value) pairs in the encrypted database EDB on the server; EDB is initially empty. The transcript also includes the lists of locations of encrypted values retrieved from the server when a keyword is searched for. We assume the total length of the transcript is fixed. The simulator operates as follows. On starting, it creates an empty EDB and an empty list. During an update query, the simulator computes the entry by sampling uniformly at random from the range of the function. Let t be the timestamp of the update; the entry for t is stored in the simulator's list, and null entries are recorded for timestamps that do not correspond to updates. During a keyword search, the simulator uses the leakage functions: from the search leakage it derives the timestamps of the previous updates for the queried keyword, looks up the addresses stored in its list for those timestamps, and sends them to the server. When the keyword search completes, the simulator infers from the leakage the identifiers of the documents containing the search keyword. This simulator is used to prove the security of our scheme. □
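The simulator's bookkeeping can be sketched in Java as follows. This is a minimal illustration, not the paper's construction: the class name, entry layout, and 16-byte address and value lengths are all assumptions.

```java
import java.security.SecureRandom;
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch of the transcript simulator: update entries are
 *  sampled uniformly at random, and searches replay stored addresses. */
public class TranscriptSimulator {
    /** One simulated transcript entry; null list slots mark non-update timestamps. */
    static class Entry {
        final byte[] addr, val;
        Entry(byte[] addr, byte[] val) { this.addr = addr; this.val = val; }
    }

    private final SecureRandom rng = new SecureRandom();
    private final List<Entry> list = new ArrayList<>();  // the simulator's list

    /** Update: sample a random (address, value) pair and record it. */
    public Entry simulateUpdate() {
        byte[] addr = new byte[16], val = new byte[16];
        rng.nextBytes(addr);
        rng.nextBytes(val);
        Entry e = new Entry(addr, val);
        list.add(e);
        return e;
    }

    /** Non-update operations store a null entry at their timestamp. */
    public void simulateOther() { list.add(null); }

    /** Search: return the addresses recorded at the leaked update timestamps. */
    public List<byte[]> simulateSearch(List<Integer> leakedTimestamps) {
        List<byte[]> addrs = new ArrayList<>();
        for (int t : leakedTimestamps) {
            Entry e = list.get(t);
            if (e != null) addrs.add(e.addr);
        }
        return addrs;
    }
}
```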
This hybrid is the same as the previous game, except for one difference: during an update for a keyword, the keyword's value is computed as either zero or one, and the associated values are chosen uniformly at random from the range of the function. The entries are maintained in a list. If the i-th operation is an update, the corresponding entry saves the sampled random values together with the operation's input; otherwise, the entry contains a null value. When a particular keyword is searched, the game scans the list for entries matching that keyword, sends the associated Addr values to the server, and awaits a response. Afterward, the game scans the list again to infer the set of documents that currently contain the keyword, and this set is transmitted to the server. The security of the pseudorandom function ensures that this game cannot be distinguished from the previous one, since the function is never evaluated twice on the same input during updates. The game, formally defined in Figure 3a, utilizes the Sim simulator described above. It is evident that the generated transcript follows the same probability distribution as the one created in the previous game: the leakage functions correspond to identical values computed in that game, and when a uniformly random value is selected, the resulting derived value is also uniformly distributed.
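The hybrid step above swaps PRF-derived addresses for fresh random ones. A minimal sketch follows, assuming HMAC-SHA-256 as the PRF instance and (keyword, counter) as its input; both are illustrative assumptions, since the paper does not fix them here.

```java
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;
import java.nio.charset.StandardCharsets;
import java.security.SecureRandom;

/** Sketch of the hybrid replacement: the real game derives addresses with a
 *  PRF, the hybrid samples them at random; since each (keyword, counter)
 *  input is used only once, PRF security makes the two indistinguishable. */
public class HybridGame {
    /** Real game: address = PRF_key(keyword || counter), here HMAC-SHA-256. */
    public static byte[] prfAddress(byte[] key, String keyword, int counter) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(new SecretKeySpec(key, "HmacSHA256"));
            mac.update(keyword.getBytes(StandardCharsets.UTF_8));
            mac.update((byte) 0);  // domain separator between keyword and counter
            mac.update(Integer.toString(counter).getBytes(StandardCharsets.UTF_8));
            return mac.doFinal();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    /** Hybrid game: a fresh uniformly random address of the same length. */
    public static byte[] randomAddress(SecureRandom rng) {
        byte[] addr = new byte[32];
        rng.nextBytes(addr);
        return addr;
    }
}
```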
Regarding the correctness of our approach, note that because the function used is a pseudorandom function rather than a truly random one (as in our implementation), collisions may arise when calculating the address and value for different input pairs. This probability can be made negligible by expanding the range of the function.
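The collision probability in question follows the standard birthday bound: for q outputs drawn from a b-bit range, it is at most q(q-1)/2^(b+1), which is why widening the range shrinks it rapidly. A small helper to illustrate (the bound is standard; the class name and parameter values are illustrative):

```java
/** Birthday-bound estimate for address collisions: with q pseudorandom
 *  outputs over a b-bit range, Pr[collision] <= q*(q-1) / 2^(b+1). */
public class CollisionBound {
    public static double upperBound(long q, int bits) {
        return (double) q * (q - 1) / Math.pow(2, bits + 1);
    }
}
```

For example, even a million entries over a 128-bit range stay far below any practically observable collision probability.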
6.2. Performance Evaluation
The SSMDO [23] and Najafi [24] schemes share similarities with our proposed approach in achieving privacy for multiple users, and we therefore compared the execution times of these algorithms. The comparison covered the time taken for index generation, client storage, and the search process. We carried out these experiments on a single computer, with the encrypted database kept in that computer's memory, without a WAN connection. Two computers in different countries (Iraq and the UAE) were used to measure the round-trip time for the data, which was approximately 2 ms. Our main focus was the time spent on search and update operations, in addition to the communication size and the amount of client-side storage required by each scheme.
To implement the proposed scheme and assess its efficiency, we used the Java language on the Windows 11 operating system with an x64-based processor. The comparison schemes were applied to a real dataset, Enron [31]. The ranges selected in the experimental evaluation, including dataset size, dictionary size, and other parameters, are pivotal for understanding the performance and generalizability of the proposed DSSE scheme in real-world scenarios. The dataset comprises an extensive collection of textual email exchanges among Enron employees; its considerable size offers a diverse and realistic sample that proves valuable for testing information retrieval systems. The test results were obtained on a computer equipped with an Intel(R) Core(TM) i5-10210U CPU @ 1.60 GHz (2.11 GHz), 8 GB of RAM, and a 512 GB SSD. To enable a comparative analysis of our proposed scheme with existing ones, the time cost and running time results for SSMDO and Najafi, as reported in [24], have been referenced.
Metrics include index generation time, measuring the efficiency of the setup process; search process efficiency, evaluating the time complexity of the search algorithm; communication overhead, quantifying data transmission during operations; storage requirements, analyzing client-side storage; and round-trip time for data transfer, simulating real-world conditions. The Enron [31] dataset served as the foundation for the evaluations, with experiments conducted on computers in different countries to realistically assess communication costs. Because Enron is an authentic database, this setup enabled a comprehensive and realistic evaluation of the system's performance in a real-world environment. The results, analyzed in the experimental evaluation, show the DSSE scheme's efficiency in index generation, outperforming existing methods. The search process efficiency, communication overhead, and storage requirements demonstrate the lightweight nature of the design, and the round-trip time for data transfer shows the scheme's practicality in a distributed environment. The scheme preserves privacy while exhibiting superior efficiency. The comprehensive experimental evaluation validates the approach, with future work focusing on enhancing system security through a verification mechanism for the authenticity and integrity of results.
Index generation. In the proposed scheme, the encrypted index is created by performing a continuous update process for each keyword. The index is represented as a set of ordered pairs, each consisting of a value and a key. We note that our scheme uses only symmetric encryption. The updating procedure runs in constant time per operation, so the time required per update does not grow as the encrypted index grows. This aspect is straightforward in our scheme. In the SSMDO and Najafi methodologies for constructing encrypted indexes, the index data structure comprises two numerical components along with a vector component, and it also tracks the size of the dictionary used in the system.
One of these approaches includes two additional numerical components in the index structure, meaning that more exponentiation operations need to be performed. The time taken to generate the index is influenced by two factors: the number of documents in the dataset and the total number of keywords defined in the system.
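The constant-time update step can be sketched as follows. The string-based address and value derivations below are placeholders standing in for the scheme's PRF and symmetric encryption; class and method names are illustrative.

```java
import java.util.HashMap;
import java.util.Map;

/** Sketch of index generation as repeated updates: each (keyword, docId)
 *  insertion derives one (address, value) pair in constant time, so total
 *  index-generation time is linear in the number of insertions. */
public class EncryptedIndex {
    private final Map<String, String> edb = new HashMap<>();       // encrypted database
    private final Map<String, Integer> counters = new HashMap<>(); // per-keyword counter

    /** One update: bump the keyword's counter and store a new pair.
     *  A real scheme would use a PRF for addr and encryption for val. */
    public void update(String keyword, String docId) {
        int c = counters.merge(keyword, 1, Integer::sum);
        String addr = "addr:" + keyword.hashCode() + ":" + c; // stands in for PRF output
        String val  = "enc:" + docId;                         // stands in for ciphertext
        edb.put(addr, val);
    }

    public int size() { return edb.size(); }
}
```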
From Figure 2, we can see how the index generation time varies with these two factors across the three schemes (SSMDO, Najafi, and our proposed scheme). Comparing the diagrams shows that our scheme performs index generation in a shorter amount of time than the other schemes, indicating that it is the most efficient in terms of index generation.
Figure 2 shows the time taken to generate an encrypted index in two different scenarios.
Figure 2a shows the impact of varying the dataset size while keeping the dictionary size constant at
n = 50, while
Figure 2b shows the effect of varying the dictionary size while maintaining a constant dataset size of
m = 262. The figures provide insights into how the index generation time changes in relation to these factors.
Trapdoor generation. It should be noted that the schemes under comparison outperform our proposed scheme in trapdoor generation. This is because our scheme computes, at the client, the locations of values in the encrypted database for a specific keyword before issuing the search query for that keyword. In exchange, the time spent by the server retrieving results in our scheme is notably shorter than in the comparable schemes, which underscores the efficiency of our scheme in the comparison that follows.
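This division of labor, with the client precomputing locations and the server doing only lookups, can be sketched as below. The address derivation is a placeholder for the scheme's PRF; all names are illustrative.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Sketch of the search flow: the client derives all addresses for a keyword
 *  (the costly part happens client-side), and the server merely looks up
 *  precomputed locations in the encrypted database. */
public class SearchFlow {
    private final Map<String, String> edb = new HashMap<>();

    /** Client side: derive the addresses for a keyword up to its counter. */
    public static List<String> clientTrapdoor(String keyword, int counter) {
        List<String> addrs = new ArrayList<>();
        for (int c = 1; c <= counter; c++)
            addrs.add("addr:" + keyword.hashCode() + ":" + c); // PRF stand-in
        return addrs;
    }

    /** Server side: constant-time lookups only, no cryptographic work. */
    public List<String> serverRetrieve(List<String> addrs) {
        List<String> results = new ArrayList<>();
        for (String a : addrs) {
            String v = edb.get(a);
            if (v != null) results.add(v);
        }
        return results;
    }

    public void put(String addr, String val) { edb.put(addr, val); }
}
```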
Search process. The selection of dataset size and dictionary size for experimental variations is a nuanced process influenced by practical considerations and relevance to real-world scenarios. The chosen ranges (100 to 800 for dataset size and 30 to 100 for the number of keywords) are motivated by several factors. They may reflect realistic expectations in the application domain, mirroring document or record quantities in a typical database, and the diversity of searchable terms. Additionally, the ranges enable scalability assessments, testing the system’s performance with varying dataset and dictionary sizes. Considerations for resource constraints, trade-off analysis, benchmarking against other schemes, and statistical significance underscore the deliberate and comprehensive nature of the experimental design.
The figures presented above illustrate a direct correlation between the size of the result set and the corresponding search time across all schemes, indicating a linear increase in search time as result size grows. Bilinear pairing plays an important role in the search algorithm for both the
SSMDO and
Najafi schemes; however, this imposes a significant computational overhead, and the frequency of its application directly impacts the time complexity of the search algorithm. From
Figure 3, we can observe how the time complexity changes for each scheme as we increase two key factors: the dataset size and the dictionary size. For all of the executions, our scheme consistently outperformed the
Najafi scheme, which is the only other scheme with the same type of information leakage. The predominant portion of the time is allocated to client-side operations within our proposed scheme, with the server primarily tasked with retrieving precomputed location results, thus minimizing its computational workload.
Figure 3a demonstrates that as the dataset size increases, the time cost of the search process also increases almost linearly. Interestingly, despite using the same number of pairings, the SSMDO and Najafi schemes show a steeper increase in search time cost than our scheme, suggesting that the search algorithm used in our approach is more efficient. Furthermore, Figure 3b shows that the time cost of the search algorithm remains largely independent of the dictionary size; in other words, the size of the dictionary does not significantly affect the time required for the search process. Overall, these findings highlight the efficiency and effectiveness of the proposed scheme's search algorithm in comparison to the SSMDO and Najafi schemes, given the computational overhead of the bilinear pairing map. This result is not surprising, since the SSMDO and Najafi schemes rely on bilinear pairings to achieve privacy in both directions, whereas our scheme uses symmetric encryption.
Figure 3 shows the time taken for a search operation, measured in terms of the time cost.
Figure 3 is divided into two parts:
Figure 3a represents the variations in search time with different dataset sizes while maintaining a constant dictionary size of
n = 50, and
Figure 3b showcases the changes in search time with different dictionary sizes while keeping the dataset size constant at
m = 262. These figures help us understand how the search time is influenced by these factors.
Figure 3.
(a) The relationship between search time and different dataset sizes, holding a constant dictionary size of n = 50. (b) The impact of various dictionary sizes on search time, with a fixed dataset size of m = 262.