Blockchain-Based Caching Architecture for DApp Data Security and Delivery

Kim, Daun; Park, Sejin

doi:10.3390/s24144559

Open AccessArticle

Blockchain-Based Caching Architecture for DApp Data Security and Delivery

by

Daun Kim

and

Sejin Park

^*

Department of Computer Engineering, Keimyung University, Daegu 1095, Republic of Korea

^*

Author to whom correspondence should be addressed.

Sensors 2024, 24(14), 4559; https://doi.org/10.3390/s24144559

Submission received: 17 June 2024 / Revised: 10 July 2024 / Accepted: 12 July 2024 / Published: 14 July 2024

(This article belongs to the Special Issue 6G and Blockchain for Advanced Future Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Decentralized applications (DApps) built on blockchain technology offer a promising solution to issues caused by centralization. However, traditional DApps leveraging off-chain storage face performance challenges due to factors such as storage location, network speed, and hardware conditions. For example, decentralized storage solutions such as IPFS suffer from diminished download performance due to I/O constraints influenced by data access patterns. Aiming to enhance the Quality of Service (QoS) in DApps built on blockchain technology, this paper proposes a blockchain node-based distributed caching architecture that guarantees real-time responsiveness for users. The proposed architecture ensures data integrity and user data ownership through blockchain while maintaining cache data consistency through local blockchain data. By implementing local cache clusters on blockchain nodes, our system achieves rapid response times. Additionally, attribute-based encryption is applied to stored content, enabling secure content sharing and access control, which prevents data leakage and unauthorized access in unreliable off-chain storage environments. Comparative analysis shows that our proposed system achieves a reduction in request processing latency of over 89% compared to existing off-chain solutions, maintaining cache data consistency and achieving response times within 65 ms. This demonstrates the model’s effectiveness in providing secure and high-performance DApp solutions.

Keywords:

blockchain; decentralized applications; data integrity; distributed system; user privacy; cache coherence

1. Introduction

In traditional centralized systems, users relinquish control over their data to centralized authorities, raising concerns about data misuse and unauthorized access. Furthermore, centralized architectures are susceptible to single points of failure, making them vulnerable to external attacks such as hacking and DDoS attacks as well as performance degradation due to high loads. The rise of decentralized services has revolutionized various industries, offering a compelling alternative to traditional centralized models [1].

To address these inherent issues in centralized systems, decentralized solutions based on blockchain technology [2] have been introduced. Blockchain relies on immutable distributed ledger technology, enabling trustworthy data management without central authorities. This characteristic eliminates single points of failure, enhancing security and reliability. The most important factor enabling decentralized applications is smart contract technology [3] operating on the blockchain. Smart contracts provide automated contract execution without third-party intervention, as contract terms are encoded into code and executed automatically on the blockchain network’s virtual machine. This capability enables secure and reliable operations across various domains, such as the Internet of Things (IoT), healthcare systems, digital rights management, financial systems, and real estate within the system [4].

1.1. Motivation

This section discusses the limitations present in existing research and provides an explanation for the necessity and motivation behind this research.

As blockchain technology continues to permeate diverse industrial sectors, the demand for real-time functionality is becoming increasingly evident. However, the inherent scalability issues in blockchain technology, particularly around storage, continue to pose challenges when utilizing blockchain in decentralized services. In decentralized applications, data operations rely only on smart contract functions, resulting in slow throughput and large resource requirements [5]. Smart contracts exist on the blockchain, making both their code and data publicly accessible. Consequently, malicious actors can exploit the code, and there are concerns about privacy infringement when sensitive information is exposed to anyone. Moreover, smart contracts face challenges in resolving performance issues in resource-constrained execution environments [6]. Therefore, addressing these issues requires approaches that consider security and privacy concerns while minimizing or efficiently managing resource utilization. To address these concerns, off-chain storage solutions have gained attention; data generated by actual services can be stored in external cloud storage, while only the hash values of the original data are maintained for integrity [7,8]. Utilizing shared cloud storage among devices for data operations can save resources required for smart contract execution. Thus, a hybrid approach utilizing both off-chain and on-chain storage offers a promising solution by combining real-time interaction with the advantages of blockchain [9].

However, the cloud services used for off-chain storage are typically limited to providing average throughput via large-scale expansion. End-to-end latency during cloud search operations mostly arises from communication delays. Traditional cloud-based infrastructure can lead to increased latency, particularly for geographically dispersed users. This latency can significantly degrade user experience. Additionally, decentralized storage solutions such as IPFS may suffer from diminished download performance due to I/O constraints influenced by data access patterns [10]. In summary, existing off-chain storage solutions such as IPFS and cloud storage impose performance limitations on real-time requirements. When applied to actual decentralized applications, the system’s performance is heavily reliant on the off-chain storage location, network speed, and hardware conditions, leading to reduced Quality of Service (QoS) due to off-chain storage latency [11]. Digital services are highly susceptible to latency issues, and even slight delays can significantly degrade user experience, leading to revenue loss [12].

Therefore, response latency is a critical factor to consider when designing decentralized services. Our research aims to address the latency issues inherent in existing decentralized application solutions by proposing a blockchain-based caching architecture. In order for decentralized services to be universally applicable across various fields, it is essential to consider the real-time capabilities provided by traditional centralized structures. We propose an architecture that utilizes a blockchain-based cache cluster to store and retrieve decentralized service data. By enabling direct queries to cache nodes holding the content, our approach eliminates the need for cloud requests, thereby minimizing latency and enhancing overall performance. Additionally, our system leverages off-chain storage on blockchain nodes to cache data, effectively reducing traffic for off-chain storage requests and ensuring data integrity, ultimately providing users with superior QoS. In addition, we adopt asset-based groups to enable secure sharing of content stored on the blockchain.

1.2. Contribution

In this section, the contributions of the proposed research are delineated. The proposed architecture contributes to ensuring the reliability of nodes and consistency of cached data through a blockchain node-based off-chain storage content caching system. Additionally, it supports granular access control for content sharing using attribute-based encryption, contributing to data protection in untrusted cloud storage and distributed caching system environments. Lastly, by utilizing blockchain nodes as cache nodes and forming regional cache clusters, it optimizes request performance, reduces latency, and enhances system responsiveness.

The rest of this paper is structured as follows: Section 2 delves into research pertinent to blockchain-based caching and access control mechanisms; Section 3 elucidates the background technology that underpins our proposed system; Section 4 details the specific design of the proposed system; Section 5 discusses the experiments conducted and the evaluations performed to assess the efficacy of our approach; finally, Section 6 presents the conclusions.

2. Related Work

In this section, we describe research on caching techniques and access control research utilizing blockchain. Following this, a comparative analysis contrasts the distinguishing features between our proposed approach and existing systems in this field.

Research on systems that introduce blockchain technology to address the data security issues of traditional centralized systems is gaining attention. Alrebdi et al. [13] incorporated blockchain into electronic medical systems to ensure decentralization, security, anonymity, immutability, and tamper-resistance. The system provides storage, verification, and retrieval through smart contracts and employs IPFS and cloud computing to store patient data. Zhang et al. [14] proposed a blockchain-based privacy-preserving e-health system to solve the security issues of cloud-based Electronic Health Records (EHRs). By introducing pairing-based encryption, they created tamper-proof EHR managed through contracts.

2.1. Caching Research for Distributed Environments

In this section, we introduce caching in distributed environments, discuss related works, and compare each work, with a focus on caching architectures. The importance of caching research applicable to distributed environments, such as edge computing to improve the performance of central servers or cloud environments, has been emphasized in several works. Yamanaka et al. [15] proposed a user-centric caching mechanism within an off-chain storage network based on information-centric networking, which overcomes the limitations of existing off-chain solutions by providing efficient data transmission and high access performance. Bai et al. [16] proposed a pre-caching strategy and a distributed caching technique in a blockchain-based mobile edge computing environment. Performance improvements were achieved through caching optimization utilizing node utilities, and a penalty incentive mechanism for caching operations was introduced to increase system utility and stability in a distributed environment. However, offline proof between the data provider and the cache node causes oracle problems and does not guarantee the stability of the system utilizing smart contracts. Guo et al. [17] proposed a caching optimization model and data storage method that utilizes blockchain in an edge environment to cache data while meeting user requirements and increasing the cache hit rate by predicting the cache contents. Zhang et al. [18] proposed a three-layer caching mechanism that adds a cache node layer through a smart contract, allowing users to efficiently obtain content through a three-layer cache in a large-scale network environment while providing increased response speed and reduced network overhead. Large files are transformed into a Merkle tree structure, encrypted, and stored using an IPFS system, then a Third-Party Auditor (TPA) performs integrity verification. Heo et al. [19] designed Multi-Level Distributed Caching (MLDC), a multi-level distributed caching strategy based on node availability for blockchain storage optimization. It reduces blockchain storage requirements by proposing that one node caches only those data that it needs, rather than holding the data of the entire blockchain. Wang et al. [20] proposed a distributed framework for dictionary caching in blockchain-based hierarchical wireless networks. An autonomous content caching transaction is constructed using smart contracts, and an incentive mechanism is used to activate cache helpers by combining blockchain and game theory perspectives. Freedman et al. [21] designed CoralCDN, a web content distribution network that distributes web content by leveraging the bandwidth of volunteers to reduce the burden on origin servers. Saturn [22] implements a content delivery network that accelerates content delivery by designing L1 nodes that cache data stored in the InterPlanetary File System (IPFS), promoting content delivery network activation through incentive mechanisms. Table 1 presents a summary and comparison of the proposed studies.

2.2. Research on Blockchain-Based Access Control Mechanisms for Security

This section introduces research related to blockchain-based access control mechanisms that ensure system stability.

In security, access control is a critical component that restricts unauthorized users from accessing resources or accessing resources in unauthorized ways. Preventing such access is especially crucial in distributed environments. In distributed environments, access control not only prevents unauthorized access, it also fosters trust among users and stakeholders by ensuring that sensitive information remains secure across diverse network nodes and interconnected systems. Research on solutions enabling management of private keys and performing access control without third-party intervention in distributed environments is actively being pursued.

Horvath et al. [23] proposed methods for making attribute-based encryption more suitable for access control to data stored in the cloud. Sukhodolskiy et al. [24] designed a prototype of a multi-user system capable of controlling access to datasets stored in untrusted cloud environments using dynamic attribute-based encryption policies. Sharma et al. [25] proposed a blockchain-based framework with a CP-ABE algorithm to provide access control and user revocation methods for cloud storage systems. Wang et al. [26] proposed a cloud storage framework combining the Ethereum blockchain and CP-ABE. Li et al. [27] proposed a fog computing-based blockchain-supported access control system to address privacy issues in a Smart Healthcare System (SHS). This system adopts BC-assisted multi-authority attribute-based encryption (MA-ABE) with keyword search to implement fine-grained access control. Because the approach is not suitable for resource-constrained IoT environments, most computational tasks are outsourced to fog nodes. Alharbi [28] enhanced centralized Cybersecurity Information Sharing (CIS) using the Access Control-Enabled Blockchain (ACE-BC) framework to bolster data security. This framework restricts unauthorized access using attribute-based encryption technology. Yang et al. [29] proposed a blockchain-based access control framework called AuthPrivacyChain with enhanced cloud security features with the aim of protecting personal information. This framework designs access control, authorization, and revocation processes to prevent illegal resource access by hackers and cloud administrators. Han et al. [30] proposed an auditable access control model based on attribute-based access control (ABAC) to overcome security support issues in access control processes due to the limited performance of IoT devices. This model stores request logs, response logs, and access logs on the blockchain to manage access control policies and enable auditing.

Previous studies have adopted blockchain in caching technology to evaluate the integrity and usefulness of caching node roles and introduce incentive mechanisms. However, this paper is different in that it proposes a blockchain-based caching technique that focuses on ensuring the consistency of caching content and protecting content integrity in Web3.0 application environments. Through this architecture, performance improvement of blockchain-based distributed applications is achieved.

2.3. Research Gap Analysis

In this section, we discuss the differences between related previous research and the proposed research. Table 2 illustrates the distinctions between the described studies and the proposed research.

Research not focusing on system architecture is excluded from the comparison with the proposed system. The comparison criteria include decentralization, caching systems, preventing cache poisoning, data integrity, and access control. The first category assesses the degree of decentralization and the extent to which single points of failure are avoided; a “low” rating indicates the presence of central points of failure, a “middle” rating indicates the integration of blockchain networks with cloud storage, and a “high” rating indicates an architecture capable of integrating blockchain networks with distributed storage. Cache system indicates whether a distributed caching system utilizing edge nodes is implemented to delegate original data to other nodes for performance improvement. Preventing cache poisoning examines whether the system avoids cache poisoning risks among the edge nodes utilizing each cache system. Data integrity evaluates whether the system ensures the integrity of the original data stored in external storage. Finally, access control describes whether the system supports access control for the security of shared data. The proposed system can achieve a fully decentralized shared system by adopting distributed storage as an off-chain storage layer and utilizing blockchain nodes to form a distributed cache cluster. Additionally, it improves performance through a caching system and prevents cache poisoning threats by verifying the integrity of updated data and identifying modified data via the local blockchain ledger. The integrity of the data is ensured by storing the hash value of the original data on-chain. Furthermore, the proposed system supports access control for data stored in untrusted storage using attribute-based encryption algorithms. In contrast, existing research does not address the prevention of cache data corruption or access control of the data transmitted within the system. By considering all these aspects, we propose a reliable blockchain-based cache system that completely prevents data leakage and tampering.

3. Background

In this section, we provide explanations of the notation used in the paper and discuss two key background technologies, namely, blockchain and Ciphertext Policy–Attribute-Based Encryption (CP-ABE).

3.1. Blockchain-Based Decentralized Apps

In this section, we explain the characteristics of blockchain technology and its application in decentralized applications.

Blockchain, first introduced as the underlying technology of Bitcoin, has developed into a powerful and secure distributed ledger system that can be applied to a variety of fields beyond digital currency. A blockchain is a distributed immutable ledger that records transactions transparently. Each block in the blockchain contains a list of transactions and the hash of the previous block, creating a chain of linked blocks. Unlike centralized databases, blockchains operate network-wide, with each node maintaining a copy of the entire ledger. This makes data tampering virtually impossible, as changing a single block requires changing all subsequent blocks in the network. Blockchain technology uses cryptographic methods such as hashing and digital signatures to secure data and verify the authenticity of transactions without intermediaries.

Decentralized Applications (DApps) are applications that run on a blockchain, providing a trusted environment without the need for intermediaries or central authorities. DApps have spread across various fields, including finance, healthcare, and social networking. DApps leverage the security and decentralization of blockchain technology to provide features such as improved privacy and user control over data. Smart contracts, self-executing pieces of code deployed on a blockchain, are a core component of DApps. Contracts are distributed across the blockchain network and executed by multiple nodes, and all nodes equally verify the execution results of the contract. This means that unlike centralized systems, the entire network verifies and executes contracts without a single administrator or intermediary. These distributed characteristics increase reliability and transparency while preventing forgery, falsification, and data fraud.

3.2. Ciphertext Policy–Attribute-Based Encryption

This section provides a detailed explanation of the features and functions of attribute-based encryption for access control.

Public key encryption [31] allows for bidirectional communication in which data are encrypted for secure transmission to a single recipient by utilizing the recipient’s public key over a public channel. However, this bidirectional model presents limitations regarding encryption for multiple users, including unknown or potential users. To achieve scalable and secure data sharing in untrusted distributed environments, Ciphertext Policy–Attribute-Based Encryption (CP-ABE) [32] enables fine-grained access control over data. CP-ABE expresses access policies as attributes and encrypts data based on these attributes, determining decryption based on the match between user and data attributes. The encryption technique used for our access control is based on the CP-ABE method optimized for cloud computing, as proposed by Horvath [23]. The functionality of the proposed CP-ABE is outlined in Table 3.

Initially, the system is identified through circuit initialization, setting up global system parameters (

G P

). The

c o n f i g C A

function establishes the public key pair (

S K *, P K *

) to configure the Certification Authority (

C A

). The

C A

plays a role in assigning attributes in the system environment. Using

g e n U s e r P r i v K e y

, the

C A

generates individual private keys (K_{$G I D$}) for users participating in the system by utilizing its own private key (

S K

). Each

C A

can create public key pairs (

S K, P K

) for attributes through

g e n A t t r K e y

. Subsequently, users receive attribute private keys (K_i) and user identification public keys (K_{$G I D$}) using the attribute public key, their global ID (

G I D

), and the attribute index (i). By generating individual identification keys alongside attribute private keys, CP-ABE prevents situations in which multiple users collude to decrypt ciphertexts encrypted with multiple attributes. Encryption involves encrypting content M using the

G P

, attribute policy P, a set of

P K

s,

P K

*, and a revoked user list (

R L

). To decrypt ciphertexts, users require their attribute private key set ({K_i, K_{$G I D$}}) and user identification private key (K*_{$G I D$}). Trustworthy

C A

s issue attribute keys, allowing users to decrypt ciphertexts using attribute private keys obtained from

C A

s through the utilized attribute-based public keys. Attribute-based encryption utilizes various operators to classify users with diverse attributes, allowing only users possessing designated attributes to access data. This can be utilized to provide subsequent access for users who join later to ciphertexts encrypted with attribute keys, as issuance of the user’s attribute private key enables granular access for multiple users using a single ciphertext.

Table 4 presents the frequently used notation in this paper.

4. System Design

In this section, we propose a blockchain-based caching architecture and explain the caching system processes. Additionally, we define the specific processes of the key components constituting the system and describe the security systems that configure the access control.

4.1. Blockchain-Based Caching Architecture

This section provides an overall overview of the blockchain-based caching architecture, detailing its structure and the roles of each layer within the architecture. Furthermore, it includes descriptions of both the conductor and the cache node supporting the caching system.

The proposed system assumes that users interact with the service system through the application interface and that user data produced within the service are stored in cloud storage. Figure 1 illustrates the overall structure of the system operating in this environment. The system is broadly composed of three layers: the Application layer, the Blockchain layer, and the Cloud Network layer.

Users can upload content through the system and download desired content. Additionally, they are empowered to modify their uploaded data, thereby realizing data control authority. Comprehensively, the application is divided into seven main modules, each performing a specific function within the system. These modules are as follows:

Conductor linker: Handles essential connections with the conductor, whose role is elaborated later.
Content uploader: Users utilize this module to upload their desired content. The content is uploaded to the cloud network, and its hash value is also uploaded to the Blockchain Network ( $B C N$ ) to ensure data integrity.
Content updater: Allows users to update their data stored in cloud storage. To ensure the integrity of the updated content, the update must be reflected through the BCN.
Content requester: Users can request data stored in the cloud using this module. Using the content’s ID as the key, a request is sent to Cache Nodes ( $N_{c a c h e}$ ) within the BCN. The $N_{c a c h e}$ then delivers the requested data to the user.
Content encryption and decryption: Supports content encryption and decryption on a single user device while protecting data stored in external storage by ensuring that only authorized users can decrypt and access the content.
Integrity checker: This module verifies the integrity of received data through the blockchain. It enables the verification of the reliability of external storage and prevents incorrect data transmission by malicious $N_{c a c h e}$ .

The Blockchain Network (

B C N

) consists of a set of blockchain nodes (N) and multiple cache nodes (

N_{c a c h e}

). All network nodes, including users and the blockchain, must have accounts within the blockchain. One of the cache nodes serves as a conductor node (

N_{c d t}

).

To enhance performance on the public

B C N

, multiple

N_{c a c h e s}

are assumed to be available. The Cache Manager module, operated by

N_{c a c h e s}

, performs caching. Smart contracts for managing the system and

N_{c a c h e s}

are deployed. These contracts include:

Cache manager contract: Maintains and manages a list of $N_{c a c h e s}$ to ensure reliability. One node from $N_{c a c h e}$ is selected as the $N_{c d t}$ , which manages node departures and controls malicious nodes.
Access manager contract: Handles access control for each user. It maintains a list of Global IDs ( $G I D$ s) of users within the $B C N$ and maps the addresses of each access control contract. Users can assign properties to request users and set access ranges for shared content using attribute-based encryption.
Content hash store contract: Stores the hash values of content kept in off-chain storage and ensures the integrity of off-chain content. Data owners can sign updated data and update hash values when the content in off-chain storage changes.

In cloud networks, off-chain storage is provided to handle the storage of original data uploaded by users onto the system. The integrity of stored data is verified through its hash values managed on the blockchain, ensuring storage reliability. Moreover, all data are encrypted before storage, ensuring safety from leaks. This layer can employ distributed file storage systems or centralized servers, enabling fully decentralized system operation. Ultimately, the system is designed with a caching architecture comprising blockchain network layers, cloud network layers, applications, and cache clusters, ensuring reliability and performance improvement.

4.1.1. Location-Based Cache Clustering with Conductor

This section provides a detailed explanation of the specific roles performed by the conductor in supporting the caching system.

The conductor’s role is to select appropriate cache nodes that deliver content when users first request it. To achieve this, the conductor (

N_{c d t}

) maintains IP and location details for each cache node (

N_{c a c h e}

) in order to form cache clusters per region. The cache nodes configure these clusters by identifying nearby ones through the conductor, and also manage routing information. This setup allows physically adjacent nodes to form cache clusters, allowing for multiple clusters on a single Blockchain Network (BCN). Figure 2 illustrates how the conductor organizes cache clusters by region.

The clusters are operated by

N_{c a c h e s}

, and aim to reduce latency by processing requests from nearby users. In addition, they ensure cached data coherence and replicate data for availability, ensuring efficient and stable cache clusters.

4.1.2. Cache Node Architecture

This section includes a detailed explanation of the components and operations of the Cache Manager module, which runs on cache nodes within the blockchain network. The components of the cache nodes are depicted in Figure 3. The cache manager module includes a Routing Module and an Off-Chain Module that work in tandem to manage cache node information and cache data, respectively.

The Routing Module maintains a Node Table with Node IDs, IP addresses, and port numbers alongside Node Info detailing memory capacity. The Set Connect function facilitates the establishment of connections within the cluster, while the Check Node State and Update Table functions are responsible for continuously monitoring the status of nodes and updating the routing table to reflect any changes, ensuring that it remains current and accurate.

The Off-Chain Module utilizes a Distributed Hash Table (DHT) to manage cache data. It stores keys and encrypted data values in memory, ensuring quick access and efficient retrieval of requested data. Key functions within this module include:

Request: Initiates requests for specific data stored within the DHT.
Store: Stores data into the DHT, ensuring redundancy and availability across nodes.
Replicate: Copies data across multiple nodes to enhance resilience and availability.
Check Coherence: Maintains data coherence by verifying consistency and ensuring update events propagate across all nodes.

These functions collectively manage data requests, storage, replication, and coherence within the cache, ensuring data integrity and efficient retrieval.

Additionally, the On-Chain Module integrates with the blockchain to enhance data integrity and consistency. The Contract Module monitors update events from contracts deployed on the blockchain, and processes such as Check Update Event and Update Cache verify changes to the data and reload updated data from off-chain storage, ensuring the cache table is synchronized with the latest data. The Blockchain Network component further supports data verification and integrity through the Verification Contract, which stores hashes of data and verifies their integrity by comparing with the hashes stored on the blockchain. Off-chain storage serves as an external data storage solution, efficiently managing large datasets. Through these comprehensive modules, the Cache Manager establishes a robust system that enhances content delivery performance by utilizing both on-chain and off-chain storage solutions, ensuring rapid data access, high availability, and data consistency across the network.

4.2. Process of the Caching System

This section provides a detailed explanation of how the proposed architecture operates. The caching process in the proposed system activates when data are requested. Thus, when a data request is made, the system caches the requested data and returns the data if they were already cached. Figure 4 shows a flowchart of how the file request process operates. The sequence of the request flow is as follows:

File request to cache node: The process begins with the user requesting a file through the application. The user is connected to the nearest cache cluster by the conductor. After connection, the user can send their request to the cache node with the fastest response time in the cache cluster. Before sending the request, the user must verify that they have the attribute keys to access the file. If the user has the attribute key for the file, they send the request to the cache node. If not, the user must request access from the file owner to obtain the attribute key. Details regarding key generation and issuance are explained in Section 4.4. After control approval, the file request is sent to the cache network.
File search and delivery: The cache node that received the request searches for the node within the cache cluster that caches the requested file based on the file ID. If the file is found in the cache cluster, then it is delivered to the user. If the file is not found in the cache cluster, then the original file is retrieved from the off-chain storage and cached. The cached file is then delivered to the requester.
File integrity verification: After receiving the file, the user decrypts it and retrieves the hash value corresponding to the file ID from the blockchain network. The user verifies whether the received hash value matches the decrypted data. If the hash values do not match, the data are considered tampered with, and the request process terminates. If the data are not tampered with, the integrity of the data is ensured and the original data are output to the user, concluding the process. If the user does not receive the original data and the process terminates, it is possible that the cache node may have maliciously tampered with the data. In such cases, the user must verify the reliability of the cache node by directly retrieving the data from the off-chain storage.

4.3. Main Component

This section elaborates the detailed requirements and process of the main functionalities: connection of the cache nodes via the conductor, content uploading and cache integrity verification, content requesting, content updating, and maintaining content coherence.

4.3.1. Connection of Cache Nodes via Conductor

Figure 5 illustrates the flow diagram of the process.

N_{c a c h e s}

register themselves as cache nodes on the blockchain. Smart contracts proceed with registration when the registration

G I D

matches the

N_{c a c h e}

signing the transaction. Subsequently, the cache node proceeds with initial setup to join the cluster through the

N_{c d t}

. At this point, the

N_{c a c h e}

transmits its public key (

P K

) and the signed message

S i g (N_{c a c h e}

_{(ip, port)}) encrypted with

N_{c d t}

’s public key to the

N_{c d t}

. The

N_{c d t}

verifies the received

N_{c a c h e}

’s

P K

against the registered entries in

S C_{c a c h e M a n a g e r}

and authenticates the signed

N_{c a c h e}

_{(ip, port)} to confirm the node’s authenticity. It records the

N_{c a c h e}

’s information and timestamp based on its IP address in the corresponding region (defined as countries in the system).

Because each cluster is regionally distinguished, the

N_{c d t}

forwards the recorded cluster information to the

N_{c a c h e}

, allowing it to join the appropriate regional cluster. Subsequently, the

N_{c d t}

monitors the

N_{c a c h e}

’s status and periodically updates the region table.

For user connection requests, users initially transmit their public key

U_{pub}

and the signed message

S i g (U_{pub, ip, port})

to

N_{c d t}

.

N_{c d t}

verifies the user’s registration status and provides a set of

N_{c a c h e s} (i p)

closest to the user’s

U_{i p}

based on the nearest region. The Haversine formula is used to calculate the distance between the user’s city coordinates and the cluster capital coordinates to determine the proximity.

Users send arbitrary requests to the received set of

N_{c a c h e s}

and select the

N_{c a c h e}

with the fastest response time. The selected

N_{c a c h e}

then serves content delivery to the user. This process, facilitated through

N_{c d t}

, ensures the selection of the nearest and fastest

N_{c a c h e}

, thereby minimizing the transmission delay.

4.3.2. Content Uploading and Cache Integrity Verification

This section provides a detailed explanation of the requirements and processes involved in connecting cache nodes through the conductor. Users encrypt the content they intend to upload and register it in off-chain storage. To prove the integrity and ownership of the content, users record the hash value of the uploaded content through

S C

_{$h a s h S t o r e$}. Subsequently, other users can decrypt the content and verify its hash value using the verify function of

S C

_{$h a s h S t o r e$}. Algorithm 1 shows the operation process. This process ensures the trustworthiness of the content regardless of the trust level of the cache nodes or off-chain storage.

Algorithm 1: Content Upload and Cache Integrity Verification

4.3.3. Content Request

When a user requests content from the service

N_{c a c h e}

using the content ID, the

N_{c a c h e}

receiving the cache request initiates a search in the cache cluster to determine whether the requested content exists. This search involves querying nodes within the cluster based on the hash value of the content ID in order to identify the node holding the content. If the search reveals that the content is cached in the cluster, the

N_{c a c h e}

retrieves and returns the content. However, due to the limited storage capacity of each

N_{c a c h e}

, cache misses may occur. In case of a cache miss, the

N_{c a c h e}

loads the data directly from off-chain storage corresponding to the content ID for delivery to the requester. Subsequently, the requested data are stored in the cache table and requests are made to cache the data in two randomly chosen nodes, thereby ensuring the recent data are maintained in the cache and enhancing the availability of cache data. If the cache table becomes full, the Least Recently Used (LRU) policy is employed to replace data. Upon receiving a response to the request, the requester measures the response time. If the actual delay exceeds the processing time required by the

N_{c a c h e}

, then the requester can choose a new

N_{c a c h e}

. Moreover, if no response is received within a specified time frame, then a transaction is sent to the

B C N

to remove the designated

N_{c a c h e}

from the list maintained by

S C_{c a c h e M a n a g e r}

. Algorithm 2 shows the operation process. This process enables the monitoring of

N_{c a c h e}

performance and detects any abnormal behavior in an unreliable distributed environment.

Algorithm 2: Content Download Request Process

4.3.4. Content Update and Coherence Maintenance

Ensuring coherence in a distributed caching system is essential. A process is required to maintain coherence between off-chain data and cached data. Each

N_{c a c h e}

holds a ledger of the

B C N

locally, enabling it to ensure coherence of the cached data by comparing the data with its own maintained ledger data. However, delivering the content to users after checking each request for coherence results in additional overhead and performance degradation. Because content updates by users are optional, performing coherence checks for every request is unnecessary. Therefore, we design the system utilizing smart contract events to incur coherence-related overhead only when update events occur.

Initially, users update the content corresponding to the Content ID (

C_{i d}

) to the modified content and request storage in the

S C_{h a s h S t o r e}

by signing the transaction. After verifying the transaction, SChashStore triggers an update event upon confirming that the user who signed the transaction requested a change in the hash value corresponding to the Cid they authored. As each N executes transactions within its respective

S C

virtual machine, every N executing the transaction can immediately detect the occurring event. Consequently,

N_{c a c h e s}

listening to such events promptly detect the update and retrieve the updated data from off-chain storage if there is correspondence with their own cached data. Algorithm 3 shows the corresponding operation process. This process ensures rapid and efficient maintenance of coherence.

Algorithm 3: Content Update and Coherence Maintenance Process

4.4. Distributed Access Control from Off-Chain Storage

This section provides an overview of how distributed access control is performed from off-chain storage, including a description of its key functionalities. To securely store off-chain data, users must encrypt their data, ensuring that only the specified users authorized by the data owner can decrypt it. This section details the methods for granting access to system users. The access control system operates as depicted in Figure 6.

An access controller exists under the content upload and download modules of the application layer. The key manager within this controller performs public key encryption and attribute-based encryption. It generates keys for each encryption, stores the generated keys locally, and performs encryption and decryption functions. Data users encrypt their content through their device’s application before storing it in off-chain storage.

A user management contract is pre-established for system user authentication. The User Manager Contract manages system users, identified through their global IDs, which must be issued by a trusted public certificate authority. This ensures that

G I D

s are reliable markers of identity. Users authenticate their

G I D

s using a public–private key pair, ensuring that only those with a verified identity can register. When submitting access requests, users encrypt the

G I D

using their private key. The data owner decrypts this information using the corresponding public key to verify authenticity. If the decrypted GID does not match the claimed GID, then the request is discarded. This mechanism prevents identity spoofing by requiring both the GID and the corresponding private key, ensuring that only legitimate users can access the system.

Each user autonomously deploys their access control contract, registering its address within the user management contract. Through this registered address, users gain access to the public key associated with access control attributes designated by data owners. This mechanism facilitates selective decryption, permitting users to decrypt only the attribute group specified by the respective data owner, thereby enabling flexible utilization across varied contexts. Additionally, users are equipped to transmit access requests to content creators through the content hash store contract. The specific process is as follows:

$c e r t i f i c a t e U s e r$ ( $G I D$ ): To deploy an access control contract, the user’s GID must be verified as a registered user in the system through $S C_{u s e r M a n a g e r}$ . Registered users can then register their $S C_{a c c e s s C o n t r o l}$ .
$d e p l o y C o n t r a c t$ ( $G P$ )→ $M S K, M P K$ : The user generates a public–private key pair to control attribute keys through $G P$ and subsequently deploys the access control contract. The contract records the deployer’s $M P K$ and $G P$ and maps the contract address with the user’s $G I D$ in $S C$ _{$u s e r M a n a g e r$}. By managing attribute control through $S C_{a c c e s s C o n t r o l}$ , the user can define attributes and control access to them. Users within a specific group can access the contract and perform encryption with the attribute public key, and only the group members with the corresponding attribute keys can decrypt it. This public key management ensures reliable access control, as only the contract deployer can record it.
$g e n A t t r P u b K e y (G P)$ → $S K_{i}, P K_{i}$ : After contract deployment, the user generates public key pairs for defined attributes. Multiple attributes can be created and used to define access policies. The generated $P K_{i}$ is recorded in the contract, while $S K_{i}$ is stored locally by the attribute creator ( $D O$ ). Only $D O$ can record valid attribute keys in the contract, ensuring control over attribute management.
$r e q u e s t (G I D)$ : $D U$ requests access by invoking the transaction of the $D O$ ’s contract, providing their $P K$ and signed $G I D$ to $D O$ . $D O$ verifies the identity of $D U$ and sets access permissions to their data.
$g e n U s e r P r i v K e y (G I D, M S K)$ : Upon detecting a request, $D O$ generates an identification key ( $K *$ _{$G I D$}) for $D U$ , preventing unauthorized access through collusion among multiple $D U$ s.
$g e n A t t r P r i v K e y (G P, S K_{i}, GID)$ → $K_{i}, K_{GID}$ : $D O$ assigns attributes to $D U$ and uses the relevant $S K_{i}$ and the requester’s $G I D$ to generate user-specific attribute private keys ( $K_{i}$ ) and identification keys (K_{$G I D$}).
$s e n d K e y s (e n c r y p t e d P K (K_{i}, K_{GID}, K *_{GID}))$ : After key generation, $D O$ encrypts the keys with $D U$ ’s $P K$ and sends them to $D U$ .
$e n c r y p t (G P, M, P, {P K}, M P K)$ → $C T$ : $D O$ encrypts the content (M) using a set of defined attributes, combining them into a single ciphertext. Only users with attribute private keys matching the policy(P) can decrypt the data.
$d e c r y p t (G P, C T, P, {K_{i}, K_{GID}},$ K $*_{G I D}) \to M$ : $D U$ attempts to decrypt the ciphertext using their set of attribute keys ( $K_{i}$ ) and $K_{G I D}$ . If the attribute set matches the access matrix and $D U$ has the correct $K *$ _{$G I D$}, the decryption succeeds.

Through this blockchain-based access control process, user access policies can be managed reliably. CP-ABE encryption ensures that data stored in untrusted off-chain storage is protected while allowing the data owner to control access through a single ciphertext in a distributed environment. Without a centralized key server, the data user directly controls access, preventing data breaches.

N_{c a c h e}

caches only encrypted content, preventing the risk of content manipulation or unauthorized decryption. Overall, a secure distributed caching system is established through a secure content sharing access control system in a distributed environment.

5. Experiments and Evaluation

In this section, we describe the experimental environment for conducting simulations, evaluate the system’s security aspects, and assess the proposed system’s performance to demonstrate its contributions.

5.1. Experimental Environment

This section describes the configuration of the experimental environment, including node specifications and parameters.

We evaluate the performance of the proposed system through actual simulations. We adopt Firebase [33] for the off-chain storage layer used for simulation. The blockchain network consists of one conductor node and three physically distinct nodes composing regional cache clusters. Each regional cache node is implemented to form cache cluster with three nodes using Docker v26.1.2, with the regions identified as Korea, UK, and the USA. Thus, the blockchain network comprises a total of ten nodes. To configure the blockchain network, we used geth v1.10.26 to ensure that each cache node forms a private Ethereum [34] blockchain network. For cache cluster configuration, we employed Redis v4.1.3 to set up cache node servers and perform clustering. Smart contracts were written in Solidity and deployed through Truffle v5.11.5. Cache nodes within the blockchain ran the cache manager module to process requests, interact with the cache clusters, and communicate with the blockchain module to interact with the blockchain network. The cache manager module was developed using Python v3.8.10. We used Maxmind [35]’s GeoIP database to obtain the geographic location coordinates corresponding to the IP of the cache node by calculating the distance between user city coordinates based on the city coordinates to which the cache node belongs. The specifications of each node used in the experimental environment are listed in Table 5. We evaluate the system from various aspects by applying various parameters in the simulation. The parameters used in the experiments are shown in Table 6. In order to consider more realistic scenarios, we designed the content request distribution using the Zipf function, allowing us to evaluate the system in terms of its content size, cache size, user request count, cache hit ratio, and off-chain storage aspects.

5.2. Security Analysis

In this section, we analyze the security of the system by discussing potential security issues using STRIDE threat modeling. We evaluate how the system addresses or mitigates these issues to ensure robust security.

Spoofing: An attacker could impersonate a user by exploiting the user’s GID and sending authorization requests to the data owner. Both users and cache nodes in the blockchain network register the GID required for signing with the smart contract for identity verification. Authorization requests in the system are signed with the private key of the user’s authenticated GID in the smart contract. Therefore, any request from an attacker who cannot obtain the private key is discarded. Only the user who signed with the private key can modify the blockchain data corresponding to the GID. Thus, each node signs information with the private key corresponding to the GID and communicates with other authenticated users, preventing spoofing attacks by attackers attempting to impersonate other nodes.
Tampering: Because blockchain data cannot be altered, there are two tampering risks in the system. The first is the risk of tampering with data in off-chain storage. An attacker could gain access to off-chain storage and modify the data. The proposed system prevents tampering by verifying the integrity of the data retrieved from off-chain storage. The hash of the original data is recorded on the blockchain, so the integrity is verified by comparing the requested data with the blockchain data. The second risk is that cache nodes could arbitrarily tamper with cached data. This tampering can also be detected through user verification post-request, and malicious cache nodes can be removed to mitigate the risk. Therefore, the system ensures the trustworthiness of nodes and the integrity of the data, preventing unauthorized data modifications.
Repudiation: An attacker could undermine the system’s reliability by claiming that updated content was not actually updated. The system requires transactions to be signed with a private key to modify blockchain values through the smart contract. Because blockchain values must be updated during the update process, users cannot deny that they performed the update if it was successful. Only the user who signed the transaction can perform records on the smart contract, so all information regarding content registration and updates is recorded and verifiable on the ledger. Additionally, a malicious cache node could claim that it did not receive a request or send manipulated data. If the cache node does not respond within a certain time frame or sends incorrect content, its authority, recorded in the smart contract, can be revoked; thus, the system’s security is ensured by blocking malicious cache nodes.
Information Disclosure: An attacker could exploit the public ledger recorded on the blockchain to obtain and manipulate user information. However, the main information recorded on the blockchain in the system is the user’s global ID and the hash value of the content, making it impossible to attack system users based solely on the disclosed information. Additionally, an attacker could attack off-chain storage and cache nodes where the original data are stored in order to leak the stored data. However, the original content is encrypted based on attributes, and only users with the attribute key can decrypt it. The attribute key is also issued directly by the data user and encrypted with the requester’s public key, meaning that no one other than the involved parties can obtain the key. Therefore, security regarding leaks is enhanced, as the data owner directly manages access rights.
Denial of Service: An attacker could paralyze the system by discovering and attacking the information of the conductor that configures the cache cluster. However, because the conductor is randomly selected from the cache nodes in the system, it is difficult to attack. Additionally, cache nodes continuously check the conductor’s status, and the conductor also continuously monitors the status of the cache nodes, allowing continuous management of the cluster’s state. Furthermore, cache nodes can verify and reject unauthorized users within the system. In order for an attacker to paralyze the entire cache cluster, they would need to gather information on and attack all distributed cache nodes in the blockchain network, which is difficult to execute.
Elevation of Privilege: Data owners can directly grant data access rights to users based on specified attributes. Because data owners directly deliver the attribute key to the users, attackers cannot obtain the rights to the data. However, an attacker with attribute A and another attacker with attribute B in the system might attempt to combine their attribute keys to decrypt content encrypted with an “A AND B” attribute policy. However, since different identification keys are generated for attributes per user, if the two users are differently identified, they cannot satisfy the attribute policy. Therefore, colluding attacks that attempt to satisfy access policies are impossible. Only authenticated users can request authorization, and the owner directly approves it, preventing privilege escalation attacks.

5.3. Performance Evaluation and Analysis

This section approaches the analysis of system performance using various parameters.

When forming cache clusters with blockchain nodes distributed globally, without considering locality, the distance between the cache nodes caching user data and the users themselves may be physically distant. Figure 7 illustrates the comparison of average request processing latency between a scenario where blockchain nodes are distributed globally and one where cache clusters are formed based on locality. The experiments were divided into two scenarios; the first involved randomly requesting one of the 1000 stored contents, while the second considered a Zipf distribution based on content popularity to send requests, with 5000 requests sent to measure the average latency. The average size of the content was 64 KB. In the case where locality was not considered, network delays occur due to cache nodes based on globally distributed blockchain nodes. In contrast, when considering locality, clients send requests to cache nodes in their closest region, resulting in similar or improved latency, demonstrating the efficiency of regional cache clustering. This was effective in both the random and Zipf scenarios, showing fast request processing performance of 95 ms in the Zipf scenario, which reflects realistic biased content popularity situations. These experimental results demonstrate that performance optimization through cache clustering when considering locality in the blockchain network is achievable.

Figure 8 illustrates the average request latency according to the requested content size. It shows the difference in latency between requesting uncached content and cached content. When content is uncached, the measured latency represents the time taken to fetch data directly from cloud storage; when content is cached, it represents the time taken to fetch data directly from the cache cluster. For 1 KB of data, the average latency when hitting the cache was 28 ms, whereas fetching directly from the cloud resulted in an average latency of 262 ms, a difference of more than nine times. On the other hand, for 128 KB data, the average latency when hitting the cache was 67 ms, while downloading directly from the cloud took 989 ms, a difference of over fifteen times. This demonstrates that applying caching mechanisms results in a latency reduction of over 89%, proving particularly effective in large-scale data transfers.

Figure 9 illustrates the latency for concurrent user requests on cached data. The average latency remains relatively stable up to an initial 300 concurrent requests, after which it sharply increases. While the minimum latency values remain relatively stable, the maximum latency values fluctuate greatly, and increase sharply with higher concurrent request numbers. When measuring the latency for requests to a single cache node, we observed bottlenecks in the load both of that node and of the network bandwidth, leading to a sharp increase in latency at specific intervals. However, In a distributed environment with multiple cache nodes, increasing the number of cache nodes can provide faster latency to a larger number of users. Consequently, there is a need to establish optimization strategies for cache node specifications and quantities according to user scale.

The proposed system utilizes the smart contract’s update event logs to validate the consistency of data cached by each cache node. By defining update events and listening to them, cache nodes can track updates and verify the consistency of their cached data for delivery to users. Figure 10 illustrates the results of measuring response latency based on the frequency of event occurrences. Without leveraging event logs, the process of comparing blockchain data with caching data for consistency verification resulted in a latency of 89 ms per request. However, when processing requests by detecting update events defined in the smart contract, the maximum latency was reduced to 65 ms. Despite frequent updates, the system demonstrates minimal performance degradation or impact on responsiveness. These findings indicate a 26% reduction in latency compared to reading local ledger data, ensuring consistency with relatively low overhead.

Figure 11 illustrates the relationship between cache hit ratio and average latency, demonstrating that the data retrieval time decreases as the cache hit ratio increases, resulting in reduced response latency. With a higher cache hit ratio, a maximum reduction in latency of up to 81% is observed. Hence, it is imperative to consider policies that can effectively increase cache hit ratios. Figure 12 illustrates the latency across different off-chain storage layers based on the cache hit ratio. Firebase, MongoDB Atlas, and CouchDB Cappella were utilized as off-chain storage layers, all of which are located in the United States. The results indicate that CouchDB Cappella exhibits the lowest average latency; across all cloud storage options, the average latency decreases as the hit ratio increases. This suggests that applying the caching system on any cloud storage layer can yield performance benefits. Additionally, the experimental results underscore the importance of optimizing system performance by selecting an appropriate storage solution while considering the performance changes with varying cache hit ratios.

In addition, to validate the effectiveness of the system it is necessary to apply the appropriate cache replacement policy in real-world constrained environments. Figure 13 illustrates the cache hit ratio and latency when employing the Least Recently Used (LRU) replacement policy in cache replacement processes while considering limited cache sizes for each cache cluster. The experiment comprises 5000 requests, with request patterns defined using Zipf distribution for realistic simulation. Each content has a size of 64 KB, totaling 64 MB of content in cloud storage. The results demonstrate a gradual decrease in average latency with increasing cache size. For instance, with a cache size of 9 MB (approximately 14% of the total storage size) the cache hit ratio is around 21%, while for a cache size of 21 MB (approximately 32% of the total storage size) the hit ratio increases to 64%. These findings highlight the efficacy of the cache system in resource-constrained cache clusters. By appropriately adjusting the cache size, it becomes feasible to achieve high cache hit ratios and low response latency. Thus, the experiment provides empirical evidence for determining the optimal cache size. Furthermore, leveraging cache replacement policies that enhance cache hit ratios can further maximize performance improvements.

Figure 14 illustrates the process of detecting intentional delivery of corrupted content by cache nodes that directly provide content and selecting new cache nodes. In the sixth request sequence, a cache node intentionally delivers corrupted content. Upon receiving the corrupted content, users verify the integrity of the content to determine whether the cache node delivered the correct content. If corruption is detected, users report the cache node to the cache manager contract. Consequently, the cache manager contract records the malicious cache node in the list of malicious cache nodes within the blockchain, preventing it from further cache node registrations. After reporting, the users request a new connection from the conductor and select a new cache node according to the existing process. Subsequently, the selected cache node receives cached data from the actual cluster, verifies data integrity, and receives the correct data. Typically, there is a delay of 0.1 s between receiving and verifying the requested data. The process of removing a malicious cache node and recovering the cache node incurs an additional delay of approximately 0.22 s, indicating that the overhead required for system recovery is not significant.

The overarching goal of the proposed system is to enhance the performance of existing blockchain-based off-chain storage solutions. To assess the potential improvement in performance compared to previous research, a comparative analysis was conducted. The study extracted retrieval latency times from the literature for systems utilizing IPFS as off-chain storage; the retrieval latency of the system using cloud storage was directly measured using Firebase as the cloud storage, mimicking an environment in which only the caching system of the proposed system is removed. Table 7 presents a comparison of latency times across each system. The proposed system demonstrated a latency of 0.067 s under 100% cache hit rate conditions and 0.526 s when the cache hit rate was 10%. Even with a lower hit rate setting, the proposed system showed a latency difference more than twice that in the literature. This confirms that the proposed system effectively mitigates performance overhead for off-chain storage that has not been accessed previously.

Collectively, by considering factors such as locality in cache clustering, leveraging smart contract update events, and optimizing cache hit ratios, the proposed system demonstrates substantial reductions in latency along with improved performance. Indeed, the combination of performance enhancements with the implementation of robust data integrity validation mechanisms holds the potential to significantly bolster the efficiency and reliability of caching mechanisms within blockchain-based applications. By leveraging these improvements, blockchain applications can effectively mitigate latency issues, enhance responsiveness, and ensure the consistency and reliability of data storage and retrieval processes.

6. Conclusions

The proposed system introduces a novel approach for enhancing the performance of DApps by integrating a caching architecture into existing off-chain storage solutions, thereby ensuring the integrity and secure management of application data. Through the cache manager, the system constructs regional cache clusters based on blockchain nodes to deliver fast request processing latency to users. Additionally, leveraging attribute-based encryption protects data stored in untrusted cloud storage from leakage and unauthorized decryption enables access control for multiple users with a single ciphertext. The proposed system underwent a security evaluation and performance comparison simulations. The results confirm a significant improvement in request processing speed, with a reduction of over 89% compared to the pre-application system in terms of regional cache clustering and cache consistency. Considering the direct correlation between cache hit ratio and system performance, future research should focus on securing cache storage resources and developing cache replacement policies to maximize performance.

Ultimately, the proposed system ensures the secure storage and transmission of user content in distributed environments while overcoming the performance issues inherent in existing decentralized applications. This design can be applied across various domains operating in the Web3.0 environment, potentially fulfilling the role of CDNs (Content Delivery Networks) in improving request performance for centralized servers. Consequently, it fosters a decentralized application environment that guarantees real-time responsiveness, providing users with a seamless experience.

Author Contributions

D.K.: conceptualization, methodology, programming, writing—original draft preparation. S.P.: supervision, review, project administration, funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by an Institute of Information and Communications Technology Planning and Evaluation (IITP) grant funded by the Korean Government (MSIT) (No. 2021-0-00484, Core technologies for hybrid P2P network-based blockchain services).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article material, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Twesige, R.L. A simple explanation of Bitcoin and Blockchain technology. Comput. Sci 2015, 1, 1–5. [Google Scholar]
Nakamoto, S. Bitcoin: A peer-to-peer electronic cash system. Decentralized Bus. Rev. 2007. [Google Scholar]
Zheng, Z.; Xie, S.; Dai, H.N.; Chen, X.; Wang, H. Blockchain challenges and opportunities: A survey. Int. J. Web Grid Serv. 2018, 14, 352–375. [Google Scholar] [CrossRef]
Mohanta, B.K.; Panda, S.S.; Jena, D. An overview of smart contract and use cases in blockchain technology. In Proceedings of the 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Bangalore, India, 10–12 July 2018. [Google Scholar]
Kina-Kina, K.M.; Cutipa-Arias, H.E.; Shiguihara-Juárez, P. A comparison of performance between fully and partially decentralized applications. In Proceedings of the 2019 IEEE XXVI International Conference on Electronics, Electrical Engineering and Computing (INTERCON), Lima, Peru, 12–14 August 2019. [Google Scholar]
Zou, W.; Lo, D.; Kochhar, P.S.; Le, X.B.D.; Xia, X.; Feng, Y.; Chen, Z.; Xu, B. Smart contract development: Challenges and opportunities. IEEE Trans. Softw. Eng. 2019, 47, 2084–2106. [Google Scholar] [CrossRef]
Jayabalan, J.; Jeyanthi, N. Scalable blockchain model using off-chain IPFS storage for healthcare data security and privacy. J. Parallel Distrib. Comput. 2022, 164, 152–167. [Google Scholar] [CrossRef]
Solaiman, E.; Wike, T.; Sfyrakis, I. Implementation and evaluation of smart contracts using a hybrid on-and off-blockchain architecture. Concurr. Comput. Pract. Exp. 2021, 33, e5811. [Google Scholar] [CrossRef]
Reyna, A.; Martín, C.; Chen, J.; Soler, E.; Díaz, M. On blockchain and its integration with IoT. Challenges and opportunities. Future Gener. Comput. Syst. 2018, 88, 173–190. [Google Scholar] [CrossRef]
Shen, J.; Li, Y.; Zhou, Y.; Wang, X. Understanding I/O performance of IPFS storage: A client’s perspective. In Proceedings of the International Symposium on Quality of Service, Phoenix, AZ, USA, 24–25 June 2019. [Google Scholar]
Zilberman, P.D.A.N.; Moore, A. Characterizing the Impact of Network Latency on Cloud-Based Applications’ Performance; University of Cambridge: Cambridge, UK, 2017. [Google Scholar]
Gigaspaces. Amazon Found Every 100 ms of Latency Cost Them 1% in Sales. 2023. Available online: https://www.gigaspaces.com/blog/amazon-found-every-100ms-of-latency-cost-them-1-in-sales/ (accessed on 20 February 2024).
Alrebdi, N.; Alabdulatif, A.; Iwendi, C.; Lian, Z. SVBE: Searchable and verifiable blockchain-based electronic medical records system. Sci. Rep. 2022, 12, 266. [Google Scholar] [CrossRef] [PubMed]
Zhang, G.; Yang, Z.; Liu, W. Blockchain-based privacy preserving e-health system for healthcare data in cloud. Comput. Netw. 2022, 203, 108586. [Google Scholar] [CrossRef]
Yamanaka, H.; Teranishi, Y.; Hayamizu, Y.; Ooka, A.; Matsuzono, K.; Li, R.; Asaeda, H. User-centric In-network Caching Mechanism for Off-chain Storage with Blockchain. In Proceedings of the ICC 2022-IEEE International Conference on Communications, IEEE, Seoul, Republic of Korea, 16–20 May 2022. [Google Scholar]
Bai, J.; Zhu, S.; Ji, H. Blockchain Based Decentralized and Proactive Caching Strategy in Mobile Edge Computing Environment. Sensors 2024, 24, 2279. [Google Scholar] [CrossRef] [PubMed]
Guo, J.; Li, C.; Luo, Y. Blockchain-assisted caching optimization and data storage methods in edge environment. J. Supercomput. 2022, 78, 18225–18257. [Google Scholar] [CrossRef]
Zhang, Q.; Li, C.; Du, T.; Luo, Y. Multi-level caching and data verification based on ethereum blockchain. Wirel. Netw. 2023, 29, 713–727. [Google Scholar] [CrossRef]
Heo, J.W.; Ramachandran, G.S.; Dorri, A.; Jurdak, R. Blockchain storage optimisation with multi-level distributed caching. IEEE Trans. Netw. Serv. Manag. 2022, 19, 3724–3736. [Google Scholar] [CrossRef]
Wang, W.; Niyato, D.; Wang, P.; Leshem, A. Decentralized caching for content delivery based on blockchain: A game theoretic perspective. In Proceedings of the 2018 IEEE iNternational Conference On Communications (ICC), Kansas City, MO, USA, 20–24 May 2018; pp. 1–6. [Google Scholar]
Freedman, M.J.; Freudenthal, E.; Mazières, D. Democratizing content publication with coral. In Proceedings of the Networked Systems Design and Implementation (NSDI), San Francisco, CA, USA, 29–31 March 2004; Volume 4. [Google Scholar]
Saturn Doc. 2024. Available online: https://docs.saturn.tech/how-does-saturn-work (accessed on 24 March 2024).
Horváth, M. Attribute-based encryption optimized for cloud computing. In Proceedings of the 41st International Conference on Current Trends in Theory and Practice of Computer Science, Pec pod Snezkou, Czech Republic, 24–29 January 2015. [Google Scholar]
Sukhodolskiy, I.; Zapechnikov, S. A blockchain-based access control system for cloud storage. In Proceedings of the 2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus), Moscow, Russia, 29–31 January 2018; pp. 1575–1578. [Google Scholar]
Sharma, P.; Jindal, R.; Borah, M.D. Blockchain-based cloud storage system with CP-ABE-based access control and revocation process. J. Supercomput. 2022, 78, 7700–7728. [Google Scholar] [CrossRef]
Wang, S.; Wang, X.; Zhang, Y. A secure cloud storage framework with access control based on blockchain. IEEE Access 2019, 7, 112713–112725. [Google Scholar] [CrossRef]
Li, J.; Li, D.; Zhang, X. A secure blockchain-assisted access control scheme for smart healthcare system in fog computing. IEEE Internet Things J. 2023, 10, 15980–15989. [Google Scholar] [CrossRef]
Alharbi, A. Applying Access Control Enabled Blockchain (ACE-BC) Framework to Manage Data Security in the CIS System. Sensors 2023, 23, 3020. [Google Scholar] [CrossRef]
Yang, C.; Tan, L.; Shi, N.; Xu, B.; Cao, Y.; Yu, K. AuthPrivacyChain: A blockchain-based access control framework with privacy protection in cloud. IEEE Access 2020, 8, 70604–70615. [Google Scholar] [CrossRef]
Han, D.; Zhu, Y.; Li, D.; Liang, W.; Souri, A.; Li, K.C. A blockchain-based auditable access control system for private data in service-centric IoT environments. IEEE Trans. Ind. Inform. 2021, 18, 3530–3540. [Google Scholar] [CrossRef]
Bellare1, M.; Boldyreva, A.; Desai, A.; Pointcheval, D. Key-privacy in public-key encryption. In Proceedings of the International Conference on the Theory and Application of Cryptology and Information Security, Gold Coast, Australia, 9–13 December 2001; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
Bethencourt, J.; Sahai, A.; Waters, B. Ciphertext-policy attribute-based encryption. In Proceedings of the 2007 IEEE Symposium on Security and Privacy (SP’07), Oakland, CA, USA, 20–23 May 2007; pp. 321–334. [Google Scholar]
Firebase Doc. 2024. Available online: https://firebase.google.com/docs (accessed on 16 December 2023).
Wood, G. Ethereum: A secure decentralised generalised transaction ledger. Ethereum Proj. Yellow Pap. 2014, 151, 1–32. [Google Scholar]
Maxmind. 2024. Available online: https://www.maxmind.com/en/home (accessed on 7 April 2024).

Figure 1. Overview of blockchain based caching architecture.

Figure 2. Cache cluster formation by the conductor.

Figure 3. Cache node architecture combining blockchain and off-chain storage.

Figure 4. Caching system workflow.

Figure 5. Cache node connection flow diagram.

Figure 6. Overview of blockchain-based caching architecture.

Figure 7. Comparison of request processing latency based on request patterns and locality-aware cache cluster configuration; (Random) indicates that the requested content is randomly selected and requested, while (Zipf) is a method for randomly changing the requested content according to its priority.

Figure 8. Request processing latency by content size based on caching status; (Caching) indicates that the requested content is cached, while (Non Caching) indicates that the requested content is stored only in external storage.

Figure 9. Request processing latency based on number of concurrent users.

Figure 10. Request processing latency for different update frequencies.

Figure 11. Average response latency based on cache hit ratio.

Figure 12. Response latency based on cache hit ratio for off-chain storage.

Figure 13. Average response latency and cache hit ratio based on request and cache data replacement simulation.

Figure 14. Time required for issue detection and cache node replacement to occur. After attempting to cause a failure in the sixth request sequence, the failure recovery time and the proportion of each process delay are measured.

Table 1. Comparison of caching research for distributed environments.

	Abstract	Contribution	Differences
Ymanaka et al. [15]	A user-centric in-network caching mechanism for ICN-based off-chain storage to address the limitations of existing off-chain solutions.	Caches data based on user location to shorten response time. Utilizes ICN routing protocols for efficient data transmission between network nodes.	Presence of a key server managing keys in a distributed environment poses security threats due to centralization. ICN is in its early stages, making it difficult to apply complex structures and processes.
Ymanaka et al. [15]	A distributed and proactive caching strategy based on blockchain for mobile edge computing environments.	Introduces a proactive caching strategy based on node utility to optimize caching. Addresses the trade-off between blockchain consensus process delay and content caching delay. Achieves optimal caching strategy using linear relaxation techniques and internal point methods.	Delay in cache transaction process through blockchain between users and cache nodes. Oracle issues arise due to offline proofs between data providers and cache nodes, leading to decreased system reliability.
Bai et al. [16]	Propose a distributed and proactive caching strategy based on blockchain for mobile edge computing environments.	Introduces a proactive caching strategy based on node utility to optimize caching. Addresses the trade-off between blockchain consensus process delay and content caching delay. Achieves optimal caching strategy using linear relaxation techniques and internal point methods.	Delay in cache transaction process through blockchain between users and cache nodes. Oracle issues arise due to offline proofs between data providers and cache nodes, leading to decreased system reliability.
Guo et al. [17]	Improves data security and caching hit rate in edge computing environments using blockchain technology.	Ensures data security in edge environments using blockchain. Improves search performance through caching optimization models.	Specifically designed for edge environments. Delays occur in IPFS search and backup processes between edge nodes and blockchain nodes.
Zhang et al. [18]	A three-layer caching mechanism including cache node layers in large-scale distributed networks.	Provides faster response to users through hotness-based dynamic caching. Converts large files into Merkle tree structures and stores them using IPFS systems after encryption.	Relies on a TPA (Third-Party Auditor) for data integrity verification, posing issues around data reliability and single points of failure.
Heo et al. [19]	A blockchain-based decentralized framework for proactive caching in hierarchical wireless networks.	Combines blockchain and game-theoretic perspectives to design a distributed caching system. Provides approaches to activate cache helpers through incentive mechanisms.	Lacks a method for ensuring the integrity and consistency of stored cache data.
Wang et al. [20]	Proposes MLDC multi-level distributed caching strategy for blockchain storage optimization.	Reduces storage requirements by storing only necessary data instead of full data replication. Enables data integrity verification and requesting of necessary data using hash values.	Increase in network traffic when requesting data from other class nodes. Difficulty in guaranteeing minimum nodes occupying the upper layer among all nodes, leading to system instability.
Freedman et al. [21]	Proposes CoralCDN, a web content distribution network utilizing bandwidth from volunteer resources.	Minimizes hotspot congestion through two-stage insertion algorithms. Improves content latency using hierarchical cluster systems.	Delays occur due to use of proxy and DNS servers for requests between clients and Coral clusters, leading to delays in inter-cluster exploration.
Saturn [22]	Proposes blockchain-based L1 nodes for caching IPFS-stored data.	Implements a content delivery network consisting of L1 nodes to accelerate content delivery. Encourages L1 node availability through incentive mechanisms.	Lacks universality due to supporting only IPFS-based content delivery networks.

Table 2. Comparison of related research on blockchain-based systems.

	Decentralized	Cache System	Preventing Cache Poisoning	Data Integrity	Access Control
[15]	low	✓	×	✓	✓
[16]	middle	✓	×	✓	×
[17]	high	✓	×	✓	×
[18]	high	✓	✓	✓	×
[20]	middle	✓	✓	×	×
[24]	middle	×	×	✓	✓
[25]	middle	×	×	×	✓
[26]	middle	×	×	×	✓
[27]	middle	✓	×	✓	✓
[28]	middle	×	×	✓	✓
Proposed	high	✓	✓	✓	✓

Table 3. CP-ABE function descriptions.

Function	Description
$i n i t C i r c u i t (λ) \to G P$	Circuit initialization
$c o n f i g C A (G P) \to (S K , P K )$	Certification Authority configuring
$g e n U s e r P r i v K e y (G P, R L, G I D, S K ) \to K _{GID}$	Private key generation for user certification
$g e n A t t r K e y (G P) \to (S K, P K)$	Attribute key pair generation
$g e n A t t r P r i v K e y (G P, S K, G I D, i) \to K_{i}, K_{GID}$	Attribute private key generation for data user
$e n c r y p t (G P, M, P, {P K}, P K^{*}, R L) \to C T$	Message encryption to generate ciphertext
$d e c r y p t (G P, C T, P, {K_{i}, K_{GID}}, K *_{GID}, R L) \to M$	Ciphertext decryption to recover the original message

Table 4. Frequently used notation.

Notation	Description
$B C N$	Blockchain network
$S C$	Smart contract
U	System user
N	Node in blockchain network
$N_{c a c h e}$	Cache node in blockchain network
$N_{c d t}$	Conductor node for cache cluster
$D O$	Data owner of system
$D U$	Data user of system
$P K$	User’s public Key
$S K$	User’s private Key
$M P K$	Data owner’s master public Key
$M S K$	Data owner’s master private Key
$K *$	User identification private key
$K_{i}$	private key for attribute i

Table 5. Experimental environment.

Type	Location	Specifications
Conductor	KR	Window 11, CPU Intel Core i9-13900KF, RAM 64 GB, SSD 1 TB
Cache node	KR	Ubuntu 20.04, vCPU 2 core, RAM 4 GB, SSD 50 GB
	US	Ubuntu 18.04, vCPU 2 core, RAM 4 GB, SSD 50 GB
	UK	Ubuntu 22.04, vCPU 1 core, RAM 1 GB, SSD 30 GB
Client	KR	Ubuntu 22.04, vCPU 1 core, RAM 1 GB, SSD 30 GB
	US	Ubuntu 22.04, vCPU 1 core, RAM 1 GB, SSD 30 GB
	UK	Ubuntu 22.04, vCPU 1 core, RAM 1 GB, SSD 30 GB

Table 6. Experiment parameters.

Parameter	Value
Zipf factor	1
Content size	1–128 KB
Cache size	9–24 MB
Number of users	1–1000
Total number of contents	1000
Number of caching contents	100–900

Table 7. Performance comparison of research on blockchain-based systems; request file size = 100 KB.

System	Retrieval Latency (s)
Blockchain based system with IPFS [13]	3
Blockchain based system with cloud storage [14]	1.155
Proposed system	0.067–0.526

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kim, D.; Park, S. Blockchain-Based Caching Architecture for DApp Data Security and Delivery. Sensors 2024, 24, 4559. https://doi.org/10.3390/s24144559

AMA Style

Kim D, Park S. Blockchain-Based Caching Architecture for DApp Data Security and Delivery. Sensors. 2024; 24(14):4559. https://doi.org/10.3390/s24144559

Chicago/Turabian Style

Kim, Daun, and Sejin Park. 2024. "Blockchain-Based Caching Architecture for DApp Data Security and Delivery" Sensors 24, no. 14: 4559. https://doi.org/10.3390/s24144559

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Blockchain-Based Caching Architecture for DApp Data Security and Delivery

Abstract

1. Introduction

1.1. Motivation

1.2. Contribution

2. Related Work

2.1. Caching Research for Distributed Environments

2.2. Research on Blockchain-Based Access Control Mechanisms for Security

2.3. Research Gap Analysis

3. Background

3.1. Blockchain-Based Decentralized Apps

3.2. Ciphertext Policy–Attribute-Based Encryption

4. System Design

4.1. Blockchain-Based Caching Architecture

4.1.1. Location-Based Cache Clustering with Conductor

4.1.2. Cache Node Architecture

4.2. Process of the Caching System

4.3. Main Component

4.3.1. Connection of Cache Nodes via Conductor

4.3.2. Content Uploading and Cache Integrity Verification

4.3.3. Content Request

4.3.4. Content Update and Coherence Maintenance

4.4. Distributed Access Control from Off-Chain Storage

5. Experiments and Evaluation

5.1. Experimental Environment

5.2. Security Analysis

5.3. Performance Evaluation and Analysis

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI