Next Article in Journal
Fast Radius Outlier Filter Variant for Large Point Clouds
Previous Article in Journal
A Retinal Oct-Angiography and Cardiovascular STAtus (RASTA) Dataset of Swept-Source Microvascular Imaging for Cardiovascular Risk Assessment
Previous Article in Special Issue
Blockchain Payment Services in the Hospitality Sector: The Mediating Role of Data Security on Utilisation Efficiency of the Customer
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Towards Data Storage, Scalability, and Availability in Blockchain Systems: A Bibliometric Analysis

by
Meenakshi Kandpal
1,
Veena Goswami
2,
Rojalina Priyadarshini
3 and
Rabindra Kumar Barik
2,*
1
School of Computer Engineering, Kalinga Institute of Industrial Technology Deemed to Be University, Bhubaneswar 751024, India
2
School of Computer Applications, Kalinga Institute of Industrial Technology Deemed to Be University, Bhubaneswar 751024, India
3
Department of CSE, C.V. Raman Global University, Bhubaneswar 752054, India
*
Author to whom correspondence should be addressed.
Data 2023, 8(10), 148; https://doi.org/10.3390/data8100148
Submission received: 17 August 2023 / Revised: 21 September 2023 / Accepted: 26 September 2023 / Published: 2 October 2023
(This article belongs to the Special Issue Blockchain Applications in Data Management and Governance)

Abstract

:
In recent years, blockchain research has drawn attention from all across the world. It is a decentralized competence that is spread out and uncertain. Several nations and scholars have already successfully applied blockchain in numerous arenas. Blockchain is essential in delicate situations because it secures data and keeps it from being altered or forged. In addition, the market’s increased demand for data is driving demand for data scaling across all industries. Researchers from many nations have used blockchain in various sectors over time, thus bringing extreme focus to this newly escalating blockchain domain. Every research project begins with in-depth knowledge about the working domain, and new interest information about blockchain is quite scattered. This study analyzes academic literature on blockchain technology, emphasizing three key aspects: blockchain storage, scalability, and availability. These are critical areas within the broader field of blockchain technology. This study employs CiteSpace and VOSviewer to understand the current state of research in these areas comprehensively. These are bibliometric analysis tools commonly used in academic research to examine patterns and relationships within scientific literature. Thus, to visualize a way to store data with scalability and availability while keeping the security of the blockchain in sync, the required research has been performed on the storage, scalability, and availability of data in the blockchain environment. The ultimate goal is to contribute to developing secure and efficient data storage solutions within blockchain technology.

1. Introduction

Blockchain, the new emerging technology, has captivated the interest of many researchers. It is a technology that can solve many different issues of security, safety, storage, confidentiality, and accessibility within diverse sectors of medicine, land, finance, IoT, etc., [1]. This technology has proved itself in the health sector which is one of the most concerned sectors. It helps maintain patient information confidentiality while providing secure access to required authorities. A blockchain with IoT can take technology use to another level. Blockchain is a digitally distributed database that stores data electronically in blocks linked together by cryptography (a method of protecting information and communication through code) [2]. The blocks are repositories of information having specific storage capacities. These blocks maintain a chronology in the chain by adding each fresh bit of knowledge that arrives after a newly added block in that block. When the block is filled, it is then sealed and joined to the block that came before it to complete the chain of data on the blockchain network [3,4,5].
Blockchains are also known as distributed ledger technology (DLT) because they function as immutable ledgers or transaction records that cannot be edited, erased, or annihilated. This infrastructure allows concurrent access, validation, and record updating across a distributed and connected database [6,7,8,9]. A network of nodes maintains the ledgers, each with a copy of the ledger, verifies the data, and aids in establishing consensus. All blockchains are distributed ledgers, but all distributed ledgers are not blockchains. Although it is simple to read the records and add fresh data to the chain of transactions, every fresh transaction must pass numerous security checks before being put in the blockchain. Nobody can alter or delete existing data. Any effort to interfere with the ledger may be easily tracked back to the prospective hacker, who normally loses network access. Blockchains manage a large-scale record of transactions and data protected by various security levels. As a result, these systems are often considered to be reliable and secure. The most common utilization of blockchains is in cryptocurrency systems such as Bitcoin, which have a significant function as decentralized blockchains [10]. In this case, the control is shared by all users, not just a select few. Every transaction leaves a permanent record accessible, making the data entered into the system irreversible.
There are two types of individuals involved in processing a blockchain transaction; one is the user, while the other is the miner or validator. Users are regular people who want to initiate a trade through blockchain. Miners or validators are people who validate or verify the authenticity of the incoming transaction of the blockchain system. Every transaction is first initiated by a user and broadcasted to all the computers in a network that operates on a peer-to-peer basis [11]. These computers are known as nodes. Further, the transaction is sent to wait in the memory pool in the pending state. This is where the miners come into play. The miners or validators verify the transactions by checking them against some transaction rules established by the network’s designers on each computer. Every fresh block must be validated by a specified number of validator nodes, achieving a consensus and creating a new token to correspond with the newly minted data block. They are rewarded for doing so by the gas fees paid by the user during the initiation of the transaction. After being validated, transactions are kept in locked blocks (hash). The permanent history is then created by connecting these blocks together. As a result, the transaction is concluded [12].
As a new technology, it has some parts that can be improvised to exponentially increase the use of blockchain in the future. These aspects are mainly blockchain storage, scalability, and availability. Many researchers have considered blockchain a new technology, including its storage, scalability, and availability as a scope [13].
Blockchain technology provides a solid data storage foundation. Unlike a traditional centralized server controlled by a single entity, blockchain data storage offers a technique that breaks data into pieces and distributes it over multiple cloud-based storage devices. As a result, a robust blockchain-based data storage system is being developed. In short, using blockchain as data storage makes it easier to transfer files without having to bounce requests all over the network to only a few data centers. Data duplication among nodes causes storage issues, worsening as networks expand. As a result, there are issues with availability, performance, and scalability. One of the most frequently brought-up difficulties facing blockchains today is storage [14].
Scalability in blockchain is the capability of any object to handle increasing numbers of data. It refers to handling increasing transactions and network nodes in a blockchain network. Scaling options are nonexistent due to the capacity investments made in security and decentralization. As a result, blockchains experience slow throughput and lengthy queues. The problem of scalability has been introduced previously. Since then, developers have developed scaling solutions to increase transactional speed [15,16]. The leading cause of this scalability problem is that all the participants in the blockchain must concurrently agree to the validity of a transaction.
Dependability properties, like availability, are critical for many applications, but the guarantees offered by blockchain technology still need to be clarified, especially from an application perspective. We demonstrate that while the read availability of blockchains is typically high, write availability—for transaction management—is low. One of the hottest topics of recent times, blockchain has a lot of scope for future expansion and improvement, including its storage, scalability, and availability. Many researchers from around the world have already been working on it to improve this new rising technology. A bibliometric study is necessary to understand the interest and progress in blockchain scalability, storage, and availability in recent years.
The bibliometric analysis is the quantitative study of research papers, books, and articles to give the researcher a clear idea and understanding of the research topic. This technology is recently being used in many fields for this purpose. Bibliometric analysis is a valuable research methodology for gaining insights into the evolving field of blockchain technology. This quantitative approach allows researchers to examine academic publications’ growth, impact, and interconnections in the blockchain domain. Tools like CiteSpace and VOSviewer enable researchers to visualize patterns, identify influential papers, and map the intellectual structure of blockchain research. This helps to draw a clear picture of the topic’s current status and allows the researcher to identify the direction of further investigation. This paper uses VOSviewer to plot the graphs for conducting detailed research on blockchain and its scalability, storage, and availability. Thus, it provides some clear directions for future research.

1.1. Research Questions

RQ1: Which authors actively participate in blockchain research and which country or institution do they belong to?
The answer to this question would provide researchers with the scope of collaboration with other researchers and professionals interested in this field. It may help them find answers to some of their deep questions by connecting with researchers or exploring other articles or papers under the same domain.
RQ2: What are the most discussed blockchain topics in recent years?
Answering this question would help researchers narrow their research direction by understanding the future scope and demand in the field.
RQ3: What recent trends and categories have emerged in this field during these years?
Answers to this question will give researchers insight into the prioritized trends and status of the research in this field.
RQ4: Which journals are cited the most in these recent blockchain publications?
Knowing the answer will help researchers narrow the search for accurate journals in this field.
RQ5: Which journals are cited highly as per the citation number?
This answer will help researchers know about particular papers and give further insights about countries and institutes that have been dedicated to this field of research.

1.2. Contributions

The current research paper is organized with the following contributions, which are as follows:
i.
It performs a systematic review to investigate the research trend and the current state of the art on blockchain storage, scalability, and availability.
ii.
It presents the popular research subjects and emerging trends related to blockchain storage, scalability, and availability using VOSviewer and CiteSpace tools.
iii.
Finally, it provides an in-depth analysis of blockchain storage, scalability, and availability solutions through a wide range of graphs with comparison analysis of existing work.

1.3. Organizations

The structure for the rest of this paper is as follows: Section 2 explains the data extraction and the methods used in the present research. Section 3 outlines the related work on data storage, scalability, and availability in a blockchain environment. Section 4 offers the results and discussion. Section 5 provides the popular research subjects and emerging trends on various characteristics of data storage—scalability and availability in a blockchain environment. Section 6 discusses the comparison analysis of existing work along with the present work. Section 7 draws the summary and concluding remarks of the present research paper.

2. Data Extraction and Methods

2.1. Data Extraction

We used Dimensions.ai to collect many professional and scientific publications to comprehensively cover information about blockchain storage, scalability, and availability worldwide. Exported records from dimensions.ai included extensive and detailed data on publication year, author, institution, and source journal (complete records and cited references exported to text files). Dimensions.ai has the setting of search types like exclusive data, title, and abstract or DOI that helps researchers to know whether we want to extract only data on blockchain technology and storage separately or data on blockchain storage, scalability, and availability. The different sorts of searches mostly focused on keywords, titles, and abstracts to study correlation hypotheses and research material. Dimensions.ai produced 2002 valid papers for blockchain storage, 1298 for blockchain scalability, and 282 for blockchain availability research. The data was exported to the Dimensions.ai export center of the user account in the form of CSV (VOSViewer, Citespace) and RIS format. It was then downloaded from the export center of the user account. We also collected the analysis of publication output from the overview of the analytical view in Dimensions.ai.

2.2. Methods

CiteSpace, a popular tool created by Dr. Chen Chaomei, is written in Java. CiteSpace was used in this paper to create visual knowledge maps that included countries, institutions, authors, journals, keyword grouping, and reference citation bursts. Furthermore, Eck and Waltman’s VOSviewer is a powerful visualization tool. The co-occurrence network was loaded into VOSviewer for further examination. Some knowledge map classification techniques and parameters were used in the bibliometric analysis results which will be detailed in subsequent paragraphs. Furthermore, a node denoted an item (country, institution, journal, author, or keyword) and links characterized by co-citation or co-occurrence between these nodes.
The terms “blockchain storage”, “blockchain scalability”, and “blockchain availability” were chosen through preliminary study and comparison, and the retrieval publication period was set to run from 2012 to 2022, along with the top 10 researcher’s publications for blockchain storage, blockchain scalability, and blockchain availability.
To obtain the visualization maps of essential authors, co-cited authors, journals, nations, institutions, etc., we could insert the data into CiteSpace and pick the data we needed to input.

3. Related Works

3.1. Blockchain as Storage

Blockchain storage is a decentralized storage system with no single point of failure that allows users to own their data fully. A decentralized architecture is an alternative to centralized cloud storage that can address several problems with centralized systems [1,2,3]. Blockchain data storage enhances the privacy and security of users’ data because it is decentralized [4,5]. As a result, this decentralized structure provides a vital substitute for standard storage systems. In layman’s terms, cryptography and blockchain technology power the decentralized storage system’s backend. As a result, the blockchain file storage network has been spread among several computers and servers. Blockchain-based data storage allows for more excellent storage and transparency by customizing data files as transactions and granting users access. Because of this, storing data on a blockchain provides immutable and more secure data. Creating the decentralized blockchain file storage allows for the global distribution of information storage across many nodes [15,16]. Some authors have contributed to related work on blockchain data storage.
Zhai et al. [1] focused on digital currency’s infrastructure, comprising the information, network, consensus, contract, and application layers. The fundamentals of technology for encryption, such as hash function, asymmetric cryptosystem, and digital signature, are briefly introduced by the author. It illustrates how the blockchain system as a whole has been imprinted with cryptography [4]. Many authors evaluated the difference between traditional central and decentralized cloud storage. Most importantly, authors now know how cryptography is integrated within a blockchain. Many of them have mentioned the usage of advanced encryption storage for implementing a security system and keeping the data in sequence using hashing [17]. In addition to this, many symmetric vital cryptographic algorithms (RC4, AES, Blowfish, RC2, DES, Skipjack, and Triple DES) have been contrasted and assessed based on the length of time needed to encrypt data while taking into account variations in the file’s data types, sizes, densities, and key sizes. According to several authors, the primary objective is to assess how long different cryptographic techniques take to encrypt data depending on criteria like data type, size, density, and critical size [3]. Further, authors are also focused on comparing Blowfish, DES, and AES, the three most widely used symmetric key cryptography algorithms. The performance of algorithms under varied conditions is the primary concern in this comparison; thus, it considers how the algorithm behaves and performs when different data loads are used. The comparison takes these elements—speeds, block dimension, and key size—into account [18]. Several authors elaborated on using blockchain in the healthcare sector for information security. Before setting medical data in a blockchain, which may be made secure and impenetrable by developing linked intelligent contracts, the process leverages attribute-based access management to grant individuals dynamic, granular access to that data [19]. Moreover, a blockchain-based decentralized storage solution is also implemented. It describes the user receiving a data integrity certificate from the storage provider. After confirming that the verification was successful, the user uses lightning network technology to pay the storage provider. The blockchain stores all proofs and payment details, ensuring the system’s security and dependability [12,13,17]. Numerous frameworks were proposed for distributed document storage built on a blockchain. The authors also stated that with its protected cryptographic methods and immutability, blockchain technology has so far been able to prohibit unauthorized access while making the data tamper-proof. It creates an access control environment where only those with the necessary permissions can access the data using cryptographic techniques to prohibit unauthorized access to all records [20,21,22].
Khalid et al. [23] emphasized the significance of decentralized storage networks, which are fundamentally based on blockchain technologies. The author also provided a brief description of blockchain-based storage systems and how they work, followed by comparing them with cloud-based storage networks and evaluating various decentralized storage networks existing on the market, such as SIA, File Coin, and Storj. Moreover, the author’s study addresses a novel method of storing data to ensure privacy and security. Ren et al. [24] proposed a blockchain-based secure storage mechanism that uses an on-chain and off-chain collaborative storage approach to address the shortage of blockchain storage capacity. The update sub vector commitment proposed in this paper was utilized to build an on-chain and off-chain safe authorization protocol that ensures the accuracy of on-chain and off-chain data and enables batch processing. Further, the author proposed a protocol to support batch processing, which allowed data verifiers to retrieve and validate data in batches and data owners to update data in storage servers in batches, which reduced computational costs. Finally, the protocol’s soundness and security were demonstrated, and the performance was evaluated. Table 1 summarizes the research and review articles on blockchain as storage.

3.2. Blockchain as Scalability

Blockchain is no longer a technology, as in coming years, it will replace the existing process of every sector. It is a unique system designed for storing information in a distributed manner where manipulating or tampering is extremely difficult or nearly impossible. But it is limited to the Bitcoin and Ethereum crypto networks; others are yet to be acknowledged. Among all the obstacles, scalability is the key hindrance in the execution of blockchain for existent industries. Scalability is the capability to handle increasing transactions and nodes in a blockchain network [21,22,23,24]. According to the “blockchain trilemma” or “scalability trilemma,” traditional blockchain can only maximally utilize any two of the three properties, namely scalability, decentralization, and security. Bitcoin and Ethereum are the crypto networks that have to sacrifice scalability [26,27,28], whereas Ripple chooses scalability [27,29] over other parameters. Blockchain experiences slow throughput and lengthy queues due to scalability issues. Here, we have systematically formulated the literature review part that mainly focuses on scalability in blockchain networks.
Zhou et al. [20] covered the classification and explanation of current solutions to the scalability of blockchain networks. There are four types of scalability solutions: On-chain (Layer-1), Off-chain (Layer-2), Distributed Ledgers, and Consensus Mechanism. Off-chain solutions are still in the early stages of development. Several researchers focused on and compared different consensus mechanisms like PoW, PoS, Hybrid Form of PoW and PoS, PBFT, and Raft used in blockchain-based on energy saving, miner selection, scalability, 51% attack possibility, double spending attack possibility, transaction fees, etc. It also identified the issues and the algorithms of consensus mechanisms [29]. Yadav and Kushwaha [30] proposed a blockchain-based e-registry framework that uses the novel proposed consensus algorithm for choosing the system’s leader. This framework was suggested for the better management of land registration.
Moreover, many surveys are carried out in the Systematic Literature Review about the blockchain scalability issues, challenges, and solutions [22]. Many new scalability solutions, including Side-chain, Child-chain, and Inter-chain, have been introduced by Kim et al. [31]. All the findings also show how blockchain technology will be most commonly used in industries like energy, finance, resource management, healthcare, education, and agriculture. Recently, authors published and evaluated the issue of scalability from the perspectives of throughput, storage, and networking and put forth the existing enabling technologies for scalable blockchain systems based on the number of transactions in each block, block interval time, data storage technologies, and data transmission. Eklund and Beck [26] focused on the significant differences between blockchain and traditional computer systems: performance and scalability factors. Another framework is G-PBFT (Geographic-PBFT), a brand-new scalable consensus protocol created for IoT-blockchain applications to solve the poor scalability and high overhead [32,33]. Through experimenting with G-PBFT against currently used consensus protocols with over 200 participating nodes, they discovered that G-PBFT dramatically lowers consensus time and network overhead and is scalable for IoT applications. Several authors reviewed and evaluated solutions for the healthcare field by looking into the scalability issues of blockchain in detail [34]. Many authors [1,35,36] have surveyed and concluded on the sharding technique, a first-level solution to blockchain’s scalability issue. Below is the table that lists some surveys and reviews on blockchain scalability. Sharma et al. [37] discussed the requirements of a secure network in IoT by which it can commute information and services. This requirement is fulfilled by the TMS (Trust Management System) which ensures security by observing the system’s dynamic behavior. But, in the case of systems with many low-power devices, the accuracy reduces, giving rise to the BLAST-IoT solution (blockchain-assisted scalable trust model).
Gangwal et al. [38] focused on the arrival of other cryptocurrencies after the rise of Bitcoin. At the same time, they discussed the major issue of blockchain scalability and the Layer 2 solutions in detail that came into existence in recent years. Zhou et al. [39] suggested an architecture of scalable consortium blockchain designed with the working principle of world state collaborative storage technique. Finally, the efficiency and possibility of this model are measured to solve the scalability of blockchain storage. Table 2 presents all the review articles related to blockchain as scalability.

3.3. Blockchain as Availability

Dependability properties, like availability, are significant for various applications, but the guarantees offered by blockchain technology still need to be clarified, especially from an application perspective. Weber et al. [47] in their book On Availability for Blockchain-Based Systems write about how a blockchain database does not require human involvement. The authors examined the limitations of two popular blockchains, Ethereum and Bitcoin. Finally, approaches for overcoming availability constraints were presented [33]. Many frameworks have been proposed for availability, like ProvChain, BDUA, and AutAvailChain. Liang et al. [48] gave a review on ProvChain. They proposed a decentralized and trusted cloud data provenance architecture for data accountability, forensics, and privacy using blockchain technology. This provenance data provides tamper-proof records, enables the transparency of data accountability in the cloud, and helps to enhance the confidentiality and availability of the provenance data. Kaaniche and Laurent [49] discussed BDUA, a Blockchain-based Data Usage Auditing abbreviation. With the advent of distributed applications, suppliers can correlate their records to deliver client services. BDUA is a revolutionary blockchain-based data usage auditing system that ensures controlled and secure data transit. Camilo et al. [50] reviewed AutAvailChain. They created an intervention solution to authorize their data access for each request, making frequent access to popular data less tedious. AutAvailChain is a software-defined networking (SDN) architecture that allows for the secure, autonomous, and widespread sharing of IoT data. Kaaniche and Laurent [51] presented a Blockchain-based Data Usage Auditing Architecture with Improved Privacy and Availability. Combining hierarchical identity-based cryptographic mechanisms with emerging blockchain infrastructures provided a solution to security breaches that jeopardized users’ privacy, and it proposed a blockchain-based data usage auditing architecture that ensured availability and accountability while protecting users’ privacy [52,53,54,55]. As a result, data access, exchange, and processing are transparent and controlled. Recently, Liu et al. [56] recommended a data availability scheme with robust dataset confidentiality, associating a zero-knowledge accumulator with advanced productivity and safety through local repair coding. This scheme decreases the possibility of invaders producing fraudulent data by copying the information of data blocks on the blockchain. Table 3 outlines all the articles related to blockchain availability.

4. Results and Discussion

4.1. Analysis of Publication Outputs

Exploring the development phase, knowledge accumulation, and maturity of blockchain is made more accessible by the annual pattern of publishing activity that characterizes blockchain research. Table 4 presents the yearly research paper publications from 2012 to 2022. As presented in Figure 1a, the number of publications about blockchain storage was at an initial stage between 2012 and 2015, but there was a slight increase in the number of publications between 2016 and 2018, and it reached its peak in 2020. Later, the number of publications decreased slightly during 2021–2022. Similarly, in Figure 1b, the number of publications about blockchain scalability is at an initial stage between 2012 and 2015. Still, there was a slight increase in publications between 2016 and 2018, which peaked from 2012 to 2022. Moreover, in Figure 1c, the number of publications about blockchain availability was at an initial stage between 2012 and 2016. Still, there was a slight increase in the number of publications between 2017 and 2018, and it peaked in 2021. Later, the number of publications will decrease in 2022. In the present studies, we have taken the data source from Scopus database for the bibliometric analysis.

4.2. Analysis of Countries and Institutions

Table 5 had 406 publications with India having the highest number of publications in blockchain storage research. However, blockchain scalability had 225 publications, whereas India had the highest number of publications. Moreover, blockchain availability had 65 publications, whereas the United States had the highest number of publications.
According to Table 6, the most productive institutions based on several publications were shown. Nirma University was found to be the most influential institution related to blockchain storage and blockchain scalability; however, the Federal University of Pernambuco was the most productive institution from a blockchain availability research perspective.
The country cooperation network of blockchain storage research is displayed in Figure 2a, blockchain scalability in Figure 2b, and blockchain availability in Figure 2c (parameter settings: year(s) per slice: 1; node type: country; pruning: pathfinder and pruning the merged network; top N per slice: 50; top N%: 10%), with the size of nodes denoting the number of articles that were published in each country. The more nodes there were, the more papers were published.
Several research institutions were substantially concentrated, as seen in Figure 3a–c, (factor settings: year(s) per slice: 1; node type: institution; pruning: pathfinder and pruning the merged network; top N per slice: 50; top N%: 10%). The visualization map of institutions/countries involved in blockchain storage, blockchain scalability, and blockchain availability research: Mapping of central countries involved in blockchain storage, blockchain scalability, and blockchain availability research; Mapping of institutions engaged in blockchain storage, blockchain scalability, and blockchain availability research (the table above lists the top 10 publishing institutions).

4.3. Analysis of Author Cooperative and Author Co-Citation Network

According to Table 7, related to blockchain storage, the most productive author was Salah Khaled, with 54 publications from Khalifa University of Science and Technology, UAE. The second most productive author was Tanwar Sudeep from Nirma University, India, with 47 publications. It was then preceded by Jayaramam Raja, Yaqoob Ibrar, Javaid Nadeem, Kumar Neeraj, Kanhere Salil, Omar Mohammed, and Jurdak Raja, and at the tenth position was Gupta Rajesh from Nirma University, India. The most productive author in blockchain scalability was Gupta Rajesh, with 25 publications from Nirma University, India. The second most productive author was Dorri Ali from the Queensland University of Technology, Australia, with 24 publications. It was then preceded by Kumari Aparna, Wang Yuyi, Tanwar Sudeep, Bhattacharya Pronaya, Kakkar Riya, and Xi, and at the tenth position was Decker Christian from Cornell University, United States. However, regarding blockchain availability, the most productive author was Park Jong Hyuk, with nine publications from Seoul National University of Science and Technology, South Korea. The second most productive author was Tauz Lev from the University of California, United States, with eight publications. It was then preceded by Dolecek Lara, Mitra Debarnab, Aloqaily Moayad, Guizani Mohsen, Mahmoud Mohamed, Maciel Paulo, Dantas Jamilson, and at the tenth position was Melo Carlos from the Federal University of Pernambuco, Brazil.
Based on extracted data for Figure 4a–c (parameter settings: year(s) per slice: 1; node type: author; pruning: pathfinder and pruning the merged network; top N per slice: 50; top N%: 10%) from 2012 to 2022, the topmost productive authors were found in the form of mapping. Each node includes the name of a specific author, and this diagram shows Salah Khaled as the most influential author related to blockchain storage research. Moreover, Gupta Rajesh is the most influential author of blockchain scalability research, and Guizani Mohsen is the most influential author of blockchain availability research.
Table 8 shows the most cited authors and their highly cited articles on blockchain storage, blockchain scalability, and blockchain availability for 2012–2022. In blockchain storage, the most cited author was Gupta Rajesh with 28 articles, and this author’s most highly cited article was in IEEE Systems Journal. Zhang Zijian was the second most cited author with 22 articles, and their highest cited article was in IEEE Transactions on Network Science and Engineering. It was preceded by Kumari Aparna, R. Hasan Haya, Dorri Ali, Tanwar Sudeep, Zhang Yan, O. Novo, Salah Khaled, and S. Wang. Moreover, in the top cited authors and their highly cited articles on blockchain scalability for 2012–2022, the most cited author was Kakkar Riya, with 13 articles, and this author’s most highly cited article was in IEEE Systems Journal. Decker Christian was the second most cited author with nine articles, and their highest cited article was in Lecture Notes in Computer Science. It was preceded by Gupta Rajesh, Dorri Ali, Kumari Aparna, Luu Loi, Croman Kyle, Liu Mengting, and Kang Jieng. However, in the top cited authors and their highly cited articles on blockchain availability for 2012–2022, the most cited author was Anonymous with ten articles, and this author’s most highly cited article was in IEEE Access. Dorri Ali was the second most cited author with six articles, and their highest cited article was in Journal for General Philosophy of Science. It was preceded by Garg Shiva Raj, Sharma Pradip Kumar, Hu Xiao-Yu, Ridhawi Ismaeel, Yuntao Mondal, Christidis Konstantiros, Baza Mohamed, and Aitzhan NZ.
In terms of the co-authorship analysis in Figure 5a–c, each node represents one author (parameter settings: year(s) per slice: 1; node type: cited author; pruning: pathfinder and pruning the merged network; top N per slice: 50; top N%: 10%). In the charts generated by CiteSpace, mapping showed the most co-cited authors related to blockchain storage as Tanwar Sudeep and Gupta Rajesh between 2012 to 2022, along with their journals as Computers and Industrial Engineering and 2022 IEEE Globecom Workshops (GC Wkshps). Mapping in blockchain scalability showed the most co-cited authors as Kakkar Riya and Gupta Rajesh between 2012 to 2022 along with their journals as IEEE Systems Journal and IET Communications. Mapping in blockchain availability showed the most co-cited authors as Anonymous and Dorri Ali between 2012 to 2022 along with their journals as IEEE Access and Journal for General Philosophy of Science.

4.4. Analysis of Co-Citation of Journals

As per Table 9, the most productive journals in blockchain storage for 2012–2022 saw IEEE Access leading with 134 papers that covered 6.69% of papers. Future Generation Computer Systems, IEEE Internet of Things Journal, IEEE Transactions on Industrial Informatics, and Lecture Notes in Computer Science preceded it. Also, the most productive journals in blockchain scalability for 2012–2022 saw IEEE Access leading with 67 papers that covered a total of 5.16% of papers. It was preceded by the IEEE Internet of Things Journal, IEEE Transactions on Industrial Informatics, IEEE Network, and Lecture Notes in Computer Science. Here too, the most productive journals in blockchain availability for 2012–2022 saw IEEE Access leading with 23 papers that covered 8.15% of papers. It was preceded by Lecture Notes in Computer Science, IEEE Transactions on Industrial Informatics, IEEE Communications Magazine, and IEEE Transactions on Dependable and Secure Computing.
The time zone perspective of the co-citation journals network is portrayed in Figure 6a–c (parameter settings: year(s) per slice: 1; node type: cited journal; pruning: pathfinder and pruning the merged network; top N per slice: 50; top N%: 10%). IEEE Access was the leading journal and preceded by Future Generation Computer Systems, IEEE Internet of Things Journal, IEEE Transactions on Industrial Informatics, and Lecture Notes in Computer Science in blockchain storage. IEEE Access was the leading journal preceded by IEEE Internet of Things Journal, IEEE Transactions on Industrial Informatics, IEEE Network, and Lecture Notes in Computer Science in blockchain scalability. IEEE Access was the leading journal preceded by Lecture Notes in Computer Science, IEEE Transactions on Industrial Informatics, IEEE Communications Magazine, and IEEE Transactions on Dependable and Secure Computing in blockchain availability.

4.5. Analysis of Cited References

The method of burst detection is a powerful analytical tool that may be used to spot urgent situations or significant information within a given window of time. Figure 7 shows the top 30 strongest references identified by CiteSpace from 2012 to 2022 (parameter settings: year(s) per slice: 1; node type: reference; pruning: pathfinder; top N per slice: 50; top N%: 10%). In 2017, the largest citation burst strength, 4.87, came from a paper by Dorri Ali. The burst initiated in 2017 and completed in 2019. The author worked on the Proceedings of the Second International Conference on Internet-of-Things Design and Implementation. In 2018, the two largest citations burst strengths derived from the papers by Tschorsch Florian (7.67) and Christidis Konstantinos (7.21), respectively.
As noticed, ten references were cited from 2017 to 2019. It was documented that the most extended citation period was two years, and the shortest was one year. It was surprising, however, to notice that all the papers were available after 2014, considering newly published studies tend to display citation burst analysis.
Table 10 lists the top ten reference articles in terms of citation frequency. Examining cited references helped academics understand the internal linkages between countries, organizations, and authors. Since blockchain research was quite mature and its application was broadened in 2018, the most widely referenced references were published in 2018. In the field of blockchain research, Khan Minhaj Ahmad’s article “IoT Security: Review, Blockchain Solutions, and Open Challenges” was a reference that was quoted rather frequently.

5. Popular Research Subjects and Emerging Trends

5.1. Author’s Keyword Search

This section categorizes the authors who were dynamic and active in blockchain study. For achieving this result, a top five author’s keyword search has been employed. Table 11 includes the authors related to blockchain and the keywords related to blockchain. As per understood from Table 11, Kim-Kwang Raymond Choo, Dusit Tao Niyato, Mohsen Mokhtar Guizani, and Mohsen Mokhtar Guizani take a lead in journals associated with blockchain. There are several other authors who are working on areas related to blockchain like data storage, scalability, availability, confidentiality, privacy, security, transparency, and traceability, etc. Blockchain is also highly correlated with bitcoin, smart contract, ethereum, cryptography, and IOT. These keywords are searched from the Scopus webpage, wherein we can use AND if both the keywords are required and OR if any one of the keywords is required.
Figure 8 represents a bag of keywords which is created using a word cloud from a given text in Python and involves a few steps such as preprocessing the text, generating the word frequencies, and then creating the word cloud. It should be ensured that we have the necessary libraries installed such as matplotlib and wordcloud. Depending on the specific use case, we may want to preprocess the text to remove unnecessary elements like punctuation, stopwords, etc.

5.2. Popular Research Subjects

This paper created numerous keyword co-occurrence networks that roughly delineate the domains of blockchain research. This was performed because term frequency analysis may directly and effectively characterize study areas and vital substances of a certain subject. A knowledge domain map was drawn for the co-occurrence network of keywords as observed in Figure 9a using the VOSviewer software (parameter settings: type of analysis: co-occurrence; counting method: complete counting; unit of study: title and abstract fields; node type: blockchain). Figure 9b demonstrated the hot research topics using a density visualization map based on article weights.
Network: 1 (red cluster): The largest node was industry, which connected 185 keywords such as smart contract, survey, healthcare, traceability, supply chain, vehicle, miner, node, proof, and blockchain network. This cluster paid more attention to the application of blockchain.
Network: 2 (blue cluster): The blue cluster was the largest and mainly related to smart contracts, traceability, transparency, supply chain, and stakeholder. In the past few years, they have emerged to be popular topics. Although each issue had a distinct focus, the contents were interconnected.
Network: 3 (green cluster): The largest node was a vehicle, which connected 160 keywords in cluster 3. This branch of research was interested in blockchain’s utility, proof, verification, and accuracy.
Network: 4 (yellow cluster): The largest node was the smart grid, which mainly focused on digital technologies, sensors, and software. Implementing blockchain through a smart grid enables the community to retain system transactions in a consensus manner.
As observed in Table 12, the smart contract was the most popular subject in the work of blockchain research with a frequency of 164, which was followed by vehicle (133), industry (119), and node (113). We can also analyze the popular research subjects and emerging trends where you can find the recent emerging key topics and some of the explanations of the critical issues like an intelligent contract, traceability, transparency, etc., in Table 12.
In the previous studies of blockchain technology, there was a significant rise in curiosity about Bitcoin, IoT, security, smart contracts, etc. But after 2012, there was a rise in studies based on smart contracts, traceability, transparency, etc., which led researchers to think beyond the capabilities of blockchain technology.

5.3. Smart Contract

Nick Szabo, an American computer scientist who devised a virtual currency called “Bit Gold” in 1998 ten years before Bitcoin, advocated intelligent contracts in 1994. In reality, Szabo is frequently mistaken for Satoshi Nakamoto, the anonymous Bitcoin creator, a claim which he has disputed.
As a first exposure to blockchain technology, smart contracts are critical to making transactions safer, secure, and organized.
Smart contracts are computer programs or protocols that streamline transactions kept on a blockchain and executed in response to specific criteria being met. In other words, smart contracts automate the implementation of agreements so that all players can know the outcome as quickly as feasible without needing an intermediary or a time delay. The overlay visualization of an intelligent contract, complete with weighted linkages and citation scores, is depicted in Figure 10.

5.4. Traceability

Blockchain traceability, the subject of university dissertations in the 1990s, was encapsulated by Satoshi Nakamoto’s 2008 scholarly publication. The first significant application was secure money transfer via Bitcoin tokens without using a central bank.
Traceability can be described as the collection of data on all product movements across the supply chain. Today, the pharmaceutical, food, cigarette, alcohol, and cosmetics industries employ traceability applications.
Traceability in the blockchain is also heavily reliant on the simplicity with which critical data points such as claims and certificates can be captured. As a result, it can aid in building confidence and facilitating open access to data. Third-party attesters validate the legitimacy of data points after they are registered on the blockchain. It is also worth noting that traceability in blockchain includes real-time updates and information authentication benefits. The overlay visualization of traceability with weighted linkages and citation scores is shown in Figure 11.

5.5. Transparency

Chen Li, the CEO and founder of Youbi Capital, elaborated on transparency as the greatest feature of blockchain. This is because, in theory, the technology powers detectable and immutable transactions, allowing parties to trade with total confidence without the involvement of an intermediary. The framework eliminates conventional sources of authority, such as banks and governments, in order to decentralize modern life. In their opinion, transparency in the blockchain is game-changing—preferably in theory.
Blockchain might be utilized as the new norm for transparency since it makes data transparent in ways that have not existed in financial institutions. A block explorer, which is used to search the blocks of a blockchain, their contents, and their essential details, allows network participants to access the holdings and transactions of public addresses.
In cybersecurity, this means that decentralized threat data can be made available. While some claim that in-depth analyses and reports provide sufficient assurance that security systems are working as expected, bias may be present because these corporations pay for the study reports, certifications, and other acknowledgments in the beginning. The overlay visualization of transparency with weighted linkages and citation scores is presented in Figure 12.

6. Comparative Analysis

In this section, the findings of the present bibliometric study are compared to the findings of earlier bibliometric analyses in order to offer information regarding the tools employed for both CiteSpace and VOSviewer, data storage, scalability, and availability of the blockchain system [67,68,69,70]. Recently, Sanka and Cheung [40] reviewed blockchain scalability challenges and solutions, analyzes research trends, and discusses adoption, serving as a guide for blockchain scalability research. Liu et al. [56] analyzes blockchain research from 2013 to 2020, highlighting key topics and trends. China leads in this field, focusing on computer science and engineering. Emerging areas include management, blockchain technology, energy, machine learning, and smart homes, with a call for regulatory standards. Firdaus et al. [71] analyzes blockchain research from 2013 to 2018, emphasizing its diverse applications in IoT and healthcare, top research countries, and its versatility beyond digital currency.
Guo et al. [72] presented the bibliometric analysis and visualization of blockchain and discussed in many areas. Further, Luo et al. [73] uses bibliometrics to analyze blockchain research evolution from 2014 to 2020. It highlights key contributors, like China, and reveals that blockchain research is expanding into various areas. Future directions include integrating blockchain with cloud tech, smart contracts, and authentication. Whereas Kuzior and Sira [74] focused on how Scopus-based bibliometric analysis of blockchain literature from 2007 to 2021 highlights the rapid growth in publications since 2016, China’s leading role in blockchain research, and shifting research priorities from Bitcoin to various blockchain applications. It also identifies three primary clusters of topics. Alam et al. [75] analyzes blockchain research from 2012–2020 using Web of Science data. Major findings include China and the USA as leaders in blockchain research, a focus on computer science, and a growing interest in blockchain research in Pakistan. It underscores blockchain’s multidisciplinary impact and the importance of global collaboration and language diversity in research.
In our research work, VOSviewer and CiteSpace were used for bibliometric analysis. We obtained the data in CSV and RIS formats from the Dimensions.ai database. From 2012 to 2022, the search includes blockchain storage, scalability, and availability, as well as the top ten authors of each search result. While extracting visualization maps from CiteSpace, we had to export the CSV data in the form of Web of Science data in the CiteSpace Java application, and all of the selection criteria for extracting visualization maps were given in the application, including author, institution, country, reference, cited author, journal, and so on. We utilized VOSviewer software for bibliometric networks, importing data from RIS files and selecting the method of visualization of bibliometric networks.

7. Conclusions

In conclusion, this paper represents a crucial bridge between theoretical insights and practical applications in the ever-evolving realm of blockchain technology. Throughout our research, we have meticulously examined the complex interplay of blockchain storage, scalability, and availability, shedding light on the current research landscape and the pressing challenges practitioners and researchers face.
Sia, Storj, and IPFS are some prominent companies that have launched their cryptocurrencies, like Siacoin, Storjcoin X, and Filecoin, to create a market for buying and selling decentralized storage and encouraging its use. In recent years, several trends have emerged, and innovations to enhance transaction throughput, reduce latency, and improve user experience can shape the future of blockchain scalability. Blockchain availability can provide the finance sector with a transparent ledger system that may eliminate the need for intermediaries and lower transfer costs.
Using bibliometric analysis tools, CiteSpace and VOSviewer, has allowed us to distill a wealth of academic literature into meaningful insights. By identifying the prevailing research priorities and mapping the intellectual structure of the field, we have contributed to theoretical advancements and provided actionable takeaways for real-world implementations. The practical significance of our work lies in its potential to guide and inform the development of blockchain-based solutions that seamlessly balance data storage, scalability, and availability while upholding the paramount importance of security. By visualizing pathways to achieve these objectives, we have opened doors for innovators, businesses, and policymakers to make informed decisions in the rapidly evolving blockchain landscape.
Using Dimensions.ai data from 2012 to 2022, this bibliometric study examined 2002 valid papers on blockchain storage, 1298 proper papers on blockchain scalability, and 282 valid papers on blockchain availability. Since 2016, the quantity of publications on blockchain storage and scalability has steadily expanded, while the number of publications has steadily grown since 2017 on blockchain availability. India held a conspicuous leadership position among all nations, according to the examination of national and institutional cooperation. At the same time, China, South Korea, UAE, Australia, and the United States accomplished worthwhile projects and made significant contributions. Most productive institutions were from India, making up the leading institutions in the study of blockchain storage, blockchain scalability, and blockchain availability. Nirma University was the most influential institution in this field.
According to the authors’ analysis, several authors tended to work with a limited group of colleagues, resulting in numerous prominent author groups, such as Gupta Rajesh and Kakkar Riya. The number of articles and citations found in the journal IEEE Access indicated that it was both an influential and productive publication, as stated by the co-citation journals network. In recent years, the IEEE Internet of Things Journal has attracted significant attention and encouraged the production of articles related to blockchain technology. Most journals focused on the IEEE Transactions on Industrial Informatics publication, attracting the most innovative academics and practitioners worldwide.
The research hotspots provided by VOSviewer have supplied specific information on the relevant literature. Conceptual knowledge and popular research subjects on blockchain were primarily distributed in the following categories: (a) intelligent contracts, (b) vehicle, (c) industry, and (d) proof. Further comprehension of conceptual knowledge and hot research themes is required, which will be a promising answer to different types of blockchain challenges. CiteSpace’s evolving trends and patterns have provided a fresh, engaging, and complete understanding of how to perform research subject searches. More emphasis should be placed on the potential of smart contracts and machine learning, which will be promising blockchain research fields. Overall, this study used CiteSpace and VOSviewer to understand blockchain research fields better and stay current with the blockchain sector.
While using CiteSpace, the limitations occur in the case of the amount of data being extracted because if we take data of only blockchain storage or scalability or availability, then the references with the most robust citation bursts might not be possible to extract due to a lack of data availability. While showing visualization maps, it might only be possible to show some countries, institutions, journals, primary authors, etc. Still, it is possible only to show the value of a single node and its nearby nodes. The analysis also varies concerning the g-index, as each insight becomes a different map when its value changes. However, it could be more accurate and thorough on how research is performed and publishing works. Therefore, all the limitations were reduced by taking the value of the g-index to be constant, leading us to obtain valid results.

Author Contributions

Conceptualization, M.K. and V.G.; writing—review and editing, V.G., R.K.B., and R.P.; Methodology, M.K. and R.K.B.; Software, M.K. and V.G.; Validation, R.K.B. and V.G.; Formal analysis, V.G. and R.P.; Resources, V.G. and R.K.B.; Visualization, M.K. and R.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors gratefully credit the Research Lab, School of Computer Applications, KIIT Deemed to be University, Bhubaneswar, India for providing computational resources.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhai, S.; Yang, Y.; Li, J.; Qiu, C.; Zhao, J. Research on the Application of Cryptography on the Blockchain. J. Phys. Conf. Ser. 2019, 1168, 032077. [Google Scholar] [CrossRef]
  2. Pise, R.; Patil, S. Enhancing security of data in cloud storage using decentralized blockchain. In Proceedings of the 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), Tirunelveli, India, 4–6 February 2021; IEEE: New York, NY, USA; pp. 161–167. [Google Scholar]
  3. Masram, R.; Shahare, V.; Abraham, J.; Moona, R. Analysis and comparison of symmetric key cryptographic algorithms based on various file features. Int. J. Netw. Secur. Its Appl. 2014, 6, 43–52. [Google Scholar] [CrossRef]
  4. Thakur, J.; Kumar, N. DES, AES and Blowfish: Symmetric key cryptography algorithms simulation-based performance analysis. Int. J. Emerg. Technol. Adv. Eng. 2011, 1, 6–12. [Google Scholar]
  5. Princy, P. A comparison of symmetric key algorithms DES, AES, Blowfish, RC4, RC6: A survey. Int. J. Comput. Sci. Eng. Technol. (IJCSET) 2015, 6, 328–331. [Google Scholar]
  6. Abdullah, A.M. Advanced encryption standard (AES) algorithm to encrypt and decrypt data. Cryptogr. Netw. Secur. 2017, 16, 11. [Google Scholar]
  7. Zhou, L.; Wang, L.; Sun, Y. MIStore: A blockchain-based medical insurance storage system. J. Med. Syst. 2018, 42, 149. [Google Scholar] [CrossRef]
  8. Zhu, Y.; Lv, C.; Zeng, Z.; Wang, J.; Pei, B. Blockchain-based decentralized storage scheme. J. Phys. Conf. Ser. 1237, 2019, 042008. [Google Scholar] [CrossRef]
  9. Zhao, Y.; Li, Q.; Yi, W.; Xiong, H. Agricultural IoT Data Storage Optimization and Information Security Method Based on Blockchain. Agriculture 2023, 13, 274. [Google Scholar] [CrossRef]
  10. Wang, S.; Wang, Y.; Zhang, Y. Blockchain-based fair payment protocol for deduplication cloud storage system. IEEE Access 2019, 7, 127652–127668. [Google Scholar] [CrossRef]
  11. Khan, N.; Aljoaey, H.; Tabassum, M.; Farzamnia, A.; Sharma, T.; Tung, Y.H. Proposed Model for Secured Data Storage in Decentralized Cloud by Blockchain Ethereum. Electronics 2022, 11, 3686. [Google Scholar] [CrossRef]
  12. Alizadeh, M.; Andersson, K.; Schelén, O. Efficient decentralized data storage based on public blockchain and IPFS. In Proceedings of the 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE), Gold Coast, Australia, 16–18 December 2020; IEEE: New York, NY, USA; pp. 1–8. [Google Scholar]
  13. Wadile, N.; Shamdasani, J.; Deshmukh, S.; Sayyed, M.; Khandare, S. Decentralized File Storage (Interplanetary File System) using Blockchain. Int. J. Eng. Res. Technol. (IJERT) 2023, 2023, 12. [Google Scholar]
  14. Bhalibar, K.; Singh, A.; Sharma, H.; Uphadyay, A.; Gupta, H. Centralize Storage System with Encryption vs Decentralize Storage System Using Blockchain. Available at SSRN 4119952. 2022. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4119952 (accessed on 23 January 2023).
  15. Ren, Y.; Liu, X.; Sharma, P.K.; Alfarraj, O.; Tolba, A.; Wang, S.; Wang, J. Data storage mechanism of industrial IoT based on LRC sharding blockchain. Sci. Rep. 2023, 13, 2746. [Google Scholar] [CrossRef]
  16. Warke, K.; Mahindre, D.; Patil, V.; Shinde, V.; Hapse, S.; Shevade, V.A. Block chain based secure data storage on cloud. Int. Res. J. Mod. Eng. Technol. Sci. 2022, 4, 991–998. [Google Scholar]
  17. Singh, S.; Gite, S.; Bahare, N.; Raut, A.V. Secured Cloud Storage Using Blockchain Technology. IJARIIE 2018, 4, 434–438. [Google Scholar]
  18. Babrekar, A.; Pise, R.G. Public key encryption for cloud storage attack using blockchain. Int. J. Recent Technol. Eng. (IJRTE) 2020, 9, 862–867. [Google Scholar] [CrossRef]
  19. Jain, A.; Jain, A.; Chauhan, N.; Singh, V.; Thakur, N. Seguro digital storage of documents using blockchain. Int. Res. J. Eng. Technol. 2018, 5, 4951–4954. [Google Scholar]
  20. Zhou, Q.; Huang, H.; Zheng, Z.; Bian, J. Solutions to scalability of blockchain: A survey. IEEE Access 2020, 8, 16440–16455. [Google Scholar] [CrossRef]
  21. Xie, J.; Yu, F.R.; Huang, T.; Xie, R.; Liu, J.; Liu, Y. A survey on the scalability of blockchain systems. IEEE Netw. 2019, 33, 166–173. [Google Scholar] [CrossRef]
  22. Khan, D.; Jung, L.T.; Hashmani, M.A. Systematic literature review of challenges in blockchain scalability. Appl. Sci. 2021, 11, 9372. [Google Scholar] [CrossRef]
  23. Khalid, M.I.; Ehsan, I.; Al-Ani, A.K.; Iqbal, J.; Hussain, S.; Ullah, S.S. A Comprehensive Survey on Blockchain-Based Decentralized Storage Networks. IEEE Access 2023, 11, 10995–11015. [Google Scholar] [CrossRef]
  24. Ren, Y.; Huang, D.; Wang, W.; Yu, X. BSMD: A blockchain-based secure storage mechanism for big spatio-temporal data. Future Gener. Comput. Syst. 2023, 138, 328–338. [Google Scholar] [CrossRef]
  25. Sun, Z.; Han, D.; Li, D.; Wang, X.; Chang, C.C.; Wu, Z. A blockchain-based secure storage scheme for medical information. EURASIP J. Wirel. Commun. Netw. 2022, 2022, 40. [Google Scholar] [CrossRef]
  26. Eklund, P.W.; Beck, R. Factors that impact blockchain scalability. In Proceedings of the 11th International Conference on Management of Digital Ecosystems, Limassol, Cyprus, 12–14 November 2019; pp. 126–133. [Google Scholar]
  27. Dang, H.; Dinh, T.T.A.; Loghin, D.; Chang, E.C.; Lin, Q.; Ooi, B.C. Towards scaling blockchain systems via sharding. In Proceedings of the 2019 International Conference on Management of Data, New York, NY, USA, 30 June–5 July 2019; pp. 123–140. [Google Scholar]
  28. Chauhan, A.; Malviya, O.P.; Verma, M.; Mor, T.S. Blockchain and scalability. In Proceedings of the 2018 IEEE International Conference on Software Quality, Reliability and Security Companion (QRS-C), Lisbon, Portugal, 16–20 July 2018; IEEE: New York, NY, USA; pp. 122–128. [Google Scholar]
  29. Das, S.; Rout, J.; Mishra, M. Blockchain Technology: Applications and Open Issues. In Proceedings of the 2022 International Conference on Communication, Computing and Internet of Things (IC3IoT), Chennai, India, 10–11 March 2022; IEEE: New York, NY, USA; pp. 1–6. [Google Scholar]
  30. Yadav, A.S.; Kushwaha, D.S. Digitization of land record through blockchain-based consensus algorithm. IETE Tech. Rev. 2022, 39, 799–816. [Google Scholar] [CrossRef]
  31. Kim, S.; Kwon, Y.; Cho, S. A survey of scalability solutions on blockchain. In Proceedings of the 2018 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Republic of Korea, 17–19 October 2018; IEEE: New York, NY, USA; pp. 1204–1207. [Google Scholar]
  32. Lao, L.; Dai, X.; Xiao, B.; Guo, S. G-pbft: A location-based and scalable consensus protocol for iot-blockchain applications. In Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, LA, USA, 18–22 May 2020; IEEE: New York, NY, USA; pp. 664–673. [Google Scholar]
  33. Boudguiga, A.; Bouzerna, N.; Granboulan, L.; Olivereau, A.; Quesnel, F.; Roger, A.; Sirdey, R. Towards better availability and accountability for iot updates by means of a blockchain. In Proceedings of the 2017 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW), Paris, France, 26–28 April 2017; IEEE: New York, NY, USA; pp. 50–58. [Google Scholar]
  34. Hafid, A.; Hafid, A.S.; Samih, M. Scaling blockchains: A comprehensive survey. IEEE Access 2020, 8, 125244–125262. [Google Scholar] [CrossRef]
  35. Yu, G.; Wang, X.; Yu, K.; Ni, W.; Zhang, J.A.; Liu, R.P. Survey: Sharding in blockchains. IEEE Access 2020, 8, 14155–14181. [Google Scholar] [CrossRef]
  36. Oh, B.; Kim, D. Serverless-enabled permissioned blockchain for elastic transaction processing. In Proceedings of the 20th International Middleware Conference Demos and Posters, Davis, CA, USA, 9–13 December 2019; pp. 9–10. [Google Scholar]
  37. Sharma, A.; Pilli, E.S.; Mazumdar, A.P.; Jain, A. BLAST-IoT: BLockchain Assisted Scalable Trust in the Internet of Things. Comput. Electr. Eng. 2023, 109, 108752. [Google Scholar] [CrossRef]
  38. Gangwal, A.; Gangavalli, H.R.; Thirupathi, A. A survey of Layer-two blockchain protocols. J. Netw. Comput. Appl. 2023, 209, 103539. [Google Scholar] [CrossRef]
  39. Zhou, J.; Wang, N.; Liu, A.; Wang, W.; Du, X. CBCS: A Scalable Consortium Blockchain Architecture Based on World State Collaborative Storage. Electronics 2023, 12, 735. [Google Scholar] [CrossRef]
  40. Sanka, A.I.; Cheung, R.C. A systematic review of blockchain scalability: Issues, solutions, analysis and future research. J. Netw. Comput. Appl. 2021, 195, 103232. [Google Scholar] [CrossRef]
  41. Yang, D.; Long, C.; Xu, H.; Peng, S. A review on scalability of blockchain. In Proceedings of the 2020 the 2nd International Conference on Blockchain Technology, New York, NY, USA, 12–14 March 2020; pp. 1–6. [Google Scholar]
  42. Yadav, A.S.; Kushwaha, D.S. Blockchain-based digitization of land record through trust value-based consensus algorithm. Peer-Peer Netw. Appl. 2021, 14, 3540–3558. [Google Scholar] [CrossRef]
  43. Kaplunovich, A.; Joshi, K.P.; Yesha, Y. Scalability analysis of blockchain on a serverless cloud. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; IEEE: New York, NY, USA; pp. 4214–4222. [Google Scholar]
  44. Shelper, P.; Lowe, A.; Kanhere, S.S. Experiences from the Field: Unify Rewards—A Cryptocurrency Loyalty Program. In Proceedings of the Symposium on Foundations and Applications of Blockchain, Los Angeles, CA, USA, 9 March 2018. [Google Scholar]
  45. Singh, S.; Rathore, S.; Alfarraj, O.; Tolba, A.; Yoon, B. A framework for privacy-preservation of IoT healthcare data using Federated Learning and blockchain technology. Future Gener. Comput. Syst. 2022, 129, 380–388. [Google Scholar] [CrossRef]
  46. Mazlan, A.A.; Daud, S.M.; Sam, S.M.; Abas, H.; Rasid, S.Z.A.; Yusof, M.F. Scalability challenges in healthcare blockchain system—A systematic review. IEEE Access 2020, 8, 23663–23673. [Google Scholar] [CrossRef]
  47. Weber, I.; Gramoli, V.; Ponomarev, A.; Staples, M.; Holz, R.; Tran, A.B.; Rimba, P. On availability for blockchain-based systems. In Proceedings of the 2017 IEEE 36th Symposium on Reliable Distributed Systems (SRDS), Hong Kong, China, 26–29 September 2017; IEEE: New York, NY, USA; pp. 64–73. [Google Scholar]
  48. Liang, X.; Shetty, S.; Tosh, D.; Kamhoua, C.; Kwiat, K.; Njilla, L. Provchain: A blockchain-based data provenance architecture in cloud environment with enhanced privacy and availability. In Proceedings of the 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), Madrid, Spain, 14–17 May 2017; IEEE: New York, NY, USA; pp. 468–477. [Google Scholar]
  49. Kaaniche, N.; Laurent, M. BDUA: Blockchain-based data usage auditing. In Proceedings of the 2018 IEEE 11th International Conference on Cloud Computing (CLOUD), San Francisco, CA, USA, 2–7 July 2018; IEEE: New York, NY, USA; pp. 630–637. [Google Scholar]
  50. Camilo, G.F.; Rebello, G.A.F.; de Souza, L.A.C.; Duarte, O.C.M. AutAvailChain: Automatic and secure data availability through blockchain. In Proceedings of the GLOBECOM 2020—2020 IEEE Global Communications Conference, Taipei, Taiwan, 7–11 December 2020; IEEE: New York, NY, USA; pp. 1–6. [Google Scholar]
  51. Kaaniche, N.; Laurent, M. A blockchain-based data usage auditing architecture with enhanced privacy and availability. In Proceedings of the 2017 IEEE 16th International Symposium on Network Computing and Applications (NCA), Cambridge, MA, USA, 30 October–1 November 2017; IEEE: New York, NY, USA; pp. 1–5. [Google Scholar]
  52. Cao, S.; Kadhe, S.; Ramchandran, K. CoVer: Collaborative light-node-only verification and data availability for blockchains. In Proceedings of the 2020 IEEE International Conference on Blockchain (Blockchain), Rhodes, Greece, 2–6 November 2020; IEEE: New York, NY, USA; pp. 45–52. [Google Scholar]
  53. Nakamoto, S. Bitcoin: A Peer-to-Peer Electronic Cash System. Decentralized Bus. Rev. 2008. Available online: https://bitcoin.org/bitcoin.Pdf (accessed on 23 January 2023).
  54. Blazy, O.; Kiltz, E.; Pan, J. (Hierarchical) identity-based encryption from affine message authentication. In Advances in Cryptology–CRYPTO 2014, Proceedings of the 34th Annual Cryptology Conference, Santa Barbara, CA, USA, 17–21 August 2014; Proceedings, Part I 34; Springer: Berlin/Heidelberg, Germany, 2014; pp. 408–425. [Google Scholar]
  55. Linn, L.A.; Koo, M.B. Blockchain for health data and its potential use in health it and health care related research. In Proceedings of the ONC/NIST Use of Blockchain for Healthcare and Research Workshop, Gaithersburg, MD, USA, 26–27 September 2016; pp. 1–10. [Google Scholar]
  56. Liu, X.; Ji, S.; Wang, X.; Liu, L.; Ren, Y. Blockchain Data Availability Scheme with Strong Data Privacy Protection. Information 2023, 14, 88. [Google Scholar] [CrossRef]
  57. Paillisse, J.; Subira, J.; Lopez, A.; Rodriguez-Natal, A.; Ermagan, V.; Maino, F.; Cabellos, A. Distributed access control with blockchain. In Proceedings of the ICC 2019—2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; IEEE: New York, NY, USA; pp. 1–6. [Google Scholar]
  58. Androulaki, E.; Barger, A.; Bortnikov, V.; Cachin, C.; Christidis, K.; De Caro, A.; Enyeart, D.; Ferris, C.; Laventman, G.; Manevich, Y.; et al. Hyperledger fabric: A distributed operating system for permissioned blockchains. In Proceedings of the Thirteenth EuroSys Conference, Porto, Portugal, 23–26 April 2018; pp. 1–15. [Google Scholar]
  59. Gallager, R. Low-density parity-check codes. IRE Trans. Inf. Theory 1962, 8, 21–28. [Google Scholar] [CrossRef]
  60. Patgiri, R.; Nayak, S.; Borgohain, S.K. Preventing ddos using bloom filter: A survey. arXiv 2018, arXiv:1810.06689. [Google Scholar] [CrossRef]
  61. Carlson, S.; Anderson, B. What are data? The many kinds of data and their implications for data re-use. J. Comput.-Mediat. Commun. 2007, 12, 635–651. [Google Scholar] [CrossRef]
  62. Muniswamy-Reddy, K.K.; Holland, D.A.; Braun, U.; Seltzer, M.I. Provenance-aware storage systems. In Proceedings of the Usenix Annual Technical Conference, General Track, Boston, MA, USA, 30 May–3 June 2006; pp. 43–56. [Google Scholar]
  63. Lalitha, V.L.; Raju, S.H.; Sonti, V.K.; Mohan, V.M. Customized smart object detection: Statistics of detected objects using IoT. In Proceedings of the 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), Coimbatore, India, 25–27 March 2021; IEEE: New York, NY, USA; pp. 1397–1405. [Google Scholar]
  64. Kim, J.E.; Boulos, G.; Yackovich, J.; Barth, T.; Beckel, C.; Mosse, D. Seamless integration of heterogeneous devices and access control in smart homes. In Proceedings of the 2012 Eighth International Conference on Intelligent Environments, Guanajuato, Mexico, 26–29 June 2012; IEEE: New York, NY, USA; pp. 206–213. [Google Scholar]
  65. Zyskind, G.; Nathan, O.; Pentland, A. Enigma: Decentralized computation platform with guaranteed privacy. arXiv 2015, arXiv:1506.03471. [Google Scholar]
  66. Anand, D.; Khemchandani, V.; Sharma, R.K. Identity-based cryptography techniques and applications (a review). In Proceedings of the 2013 IEEE International Conference and Computational Intelligence and Communication Networks, Mathura, India, 27–29 September 2013; IEEE: New York, NY, USA; pp. 343–348. [Google Scholar]
  67. Reis-Marques, C.; Figueiredo, R.; de Castro Neto, M. Applications of Blockchain Technology to Higher Education Arena: A Bibliometric Analysis. Eur. J. Investig. Health Psychol. Educ. 2021, 11, 1406–1421. [Google Scholar] [CrossRef] [PubMed]
  68. Chen, C. CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. J. Am. Soc. Inf. Sci. Technol. 2006, 57, 359–377. [Google Scholar] [CrossRef]
  69. Van Eck, N.J.; Waltman, L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics 2010, 84, 523–538. [Google Scholar] [CrossRef]
  70. Yu, J.; Wen, Y.; Yang, L.; Zhao, Z.; Guo, Y.; Guo, X. Monitoring on triboelectric nanogenerator and deep learning method. Nano Energy 2022, 92, 106698. [Google Scholar] [CrossRef]
  71. Firdaus, A.; Razak, M.F.A.; Feizollah, A.; Hashem, I.A.T.; Hazim, M.; Anuar, N.B. The rise of “blockchain”: Bibliometric analysis of blockchain study. Scientometrics 2019, 120, 1289–1331. [Google Scholar] [CrossRef]
  72. Guo, Y.M.; Huang, Z.L.; Guo, J.; Guo, X.R.; Li, H.; Liu, M.Y.; Ezzeddine, S.; Nkeli, M.J. A bibliometric analysis and visualization of blockchain. Future Gener. Comput. Syst. 2021, 116, 316–332. [Google Scholar] [CrossRef]
  73. Luo, J.; Hu, Y.; Bai, Y. Bibliometric analysis of the blockchain scientific evolution: 2014–2020. IEEE Access 2021, 9, 120227–120246. [Google Scholar] [CrossRef]
  74. Kuzior, A.; Sira, M. A bibliometric analysis of blockchain technology research using VOSviewer. Sustainability 2022, 14, 8206. [Google Scholar] [CrossRef]
  75. Alam, S.; Zardari, S.; Shamsi, J. Comprehensive three-phase bibliometric assessment on the blockchain (2012–2020). Libr. Hi Tech 2023, 41, 287–308. [Google Scholar] [CrossRef]
Figure 1. (a): Annual output of blockchain storage research per year; (b): Annual output of blockchain scalability research papers; (c): Annual output of blockchain availability research papers.
Figure 1. (a): Annual output of blockchain storage research per year; (b): Annual output of blockchain scalability research papers; (c): Annual output of blockchain availability research papers.
Data 08 00148 g001aData 08 00148 g001b
Figure 2. (a): Mapping of major countries involved in blockchain storage research; (b): Mapping of major countries involved in blockchain scalability research; (c): Mapping of major countries involved in blockchain availability research.
Figure 2. (a): Mapping of major countries involved in blockchain storage research; (b): Mapping of major countries involved in blockchain scalability research; (c): Mapping of major countries involved in blockchain availability research.
Data 08 00148 g002
Figure 3. (a): Mapping of institutions involved in blockchain storage research; (b): Mapping of institutions involved in blockchain scalability research; (c): Mapping of institutions involved in blockchain availability research.
Figure 3. (a): Mapping of institutions involved in blockchain storage research; (b): Mapping of institutions involved in blockchain scalability research; (c): Mapping of institutions involved in blockchain availability research.
Data 08 00148 g003aData 08 00148 g003b
Figure 4. (a): Mapping of prominent authors in blockchain storage research; (b): Mapping of prominent authors in blockchain scalability research; (c): Mapping of prominent authors in blockchain availability research.
Figure 4. (a): Mapping of prominent authors in blockchain storage research; (b): Mapping of prominent authors in blockchain scalability research; (c): Mapping of prominent authors in blockchain availability research.
Data 08 00148 g004aData 08 00148 g004b
Figure 5. (a): Mapping of co-cited authors contributing in blockchain storage; (b): Mapping of co-cited authors contributing in blockchain scalability; (c): Mapping of co-cited authors contributing in blockchain availability.
Figure 5. (a): Mapping of co-cited authors contributing in blockchain storage; (b): Mapping of co-cited authors contributing in blockchain scalability; (c): Mapping of co-cited authors contributing in blockchain availability.
Data 08 00148 g005aData 08 00148 g005b
Figure 6. (a): Co-citation journal time zone view in blockchain storage for 2012–2022; (b): Co-citation journal time zone view in blockchain scalability for 2012–2022; (c): Co-citation journal time zone view in blockchain availability for 2012–2022.
Figure 6. (a): Co-citation journal time zone view in blockchain storage for 2012–2022; (b): Co-citation journal time zone view in blockchain scalability for 2012–2022; (c): Co-citation journal time zone view in blockchain availability for 2012–2022.
Data 08 00148 g006aData 08 00148 g006b
Figure 7. List of top 10 references with the strongest citation bursts.
Figure 7. List of top 10 references with the strongest citation bursts.
Data 08 00148 g007
Figure 8. Bag of keywords.
Figure 8. Bag of keywords.
Data 08 00148 g008
Figure 9. Knowledge domain map showing a network of keywords associated with blockchain research: (a) Article-weighted network visualization map; (b) Article-weighted density visualization map.
Figure 9. Knowledge domain map showing a network of keywords associated with blockchain research: (a) Article-weighted network visualization map; (b) Article-weighted density visualization map.
Data 08 00148 g009
Figure 10. Overlay visualization of smart contracts with weighted links and citation scores.
Figure 10. Overlay visualization of smart contracts with weighted links and citation scores.
Data 08 00148 g010
Figure 11. Overlay visualization of traceability with weighted links and citation scores.
Figure 11. Overlay visualization of traceability with weighted links and citation scores.
Data 08 00148 g011
Figure 12. Overlay visualization of transparency with weighted links and citation scores.
Figure 12. Overlay visualization of transparency with weighted links and citation scores.
Data 08 00148 g012
Table 1. Research and review articles on blockchain as storage.
Table 1. Research and review articles on blockchain as storage.
ReferencesYearCiteTitle
[1]201985Research on the Application of Cryptography on the Blockchain
[2]20215Enhancing Security of Data in Cloud Storage using Decentralized Blockchain
[3]201472Analysis and Comparison of Symmetric Key Cryptographic Algorithms Based on Various File Features
[4]2011439DES, AES, and Blowfish: Symmetric Key Cryptography Algorithms Simulation Based Performance Analysis
[5]201523A Comparison of Symmetric Key Algorithms Des, Aes, Blowfish, Rc4, Rc6: A Survey
[6]2017228Advanced Encryption Standard (AES) Algorithm to Encrypt and Decrypt Data
[7]2018189MIStore: a Blockchain-Based Medical Insurance Storage System
[8]201924Blockchain-based Decentralized Storage Scheme
[9]20236Agricultural IoT Data Storage Optimization and Information Security Method
[10]201928Blockchain-Based Fair Payment Protocol for Deduplication Cloud Storage System
[11]20193Proposed Model for Secured Data Storage in Decentralized Cloud by Blockchain Ethereum
[12]202016Efficient Decentralized Data Storage Based on Public Blockchain and IPFS
[13]2023354Decentralized File Storage (Interplanetary File System) using Blockchain
[14]20221Centralize storage system with encryption vs. decentralized storage system using blockchain
[15]20221Data storage mechanism of industrial IoT based on LRC sharding blockchain
[16]20221Blockchain-based secure data storage on the cloud
[25]202219A blockchain-based secure storage scheme for medical information
[17]20181Secured cloud storage using blockchain technology
[18]20222Public Key Encryption for Cloud Storage Attack Using Blockchain
[19]201816Seguro Digital storage of documents using Blockchain
[23]20234A Comprehensive Survey on Blockchain-Based Decentralized Storage Networks
[24]202317BSMD: A blockchain-based secure storage mechanism for big spatiotemporal data
Table 2. Research and review articles on blockchain as scalability.
Table 2. Research and review articles on blockchain as scalability.
ReferencesYearCiteTitle
[20]2020490Solutions to Scalability of Blockchain: A Survey
[21]2019224A Survey on the Scalability of Blockchain Systems
[22]202174Systematic Literature Review of Challenges in Blockchain Scalability
[40]202167A systematic review of blockchain scalability: Issues, solutions, analysis, and future research
[26]201928Factors that Impact Blockchain Scalability
[27]2019390Towards Scaling Blockchain Systems via Sharding
[29]20222A Comparative Analysis of the Consensus Algorithms in Blockchain Technology
[41]202054A Review on Scalability of Blockchain
[30]202115Digitization of Land Record Through Blockchain-based Consensus Algorithm
[42]202122Blockchain-based digitization of land records through trust value-based consensus algorithm
[31]2022204A Survey of Scalability Solutions on Blockchain
[28]2018206Blockchain and Scalability
[43]20199Scalability Analysis of Blockchain on a Serverless Cloud
[44]20188Experiences from the Field: Unify Rewards—A Cryptocurrency Loyalty Program
[32]202077G-PBFT: A Location-based and Scalable Consensus Protocol for IoT-Blockchain Applications
[45]202279A framework for privacy-preservation of IoT healthcare data using Federated Learning and blockchain technology
[46]202091Scalability Challenges in Healthcare Blockchain System—A Systematic Review
[34]2020172Scaling Blockchains: A Comprehensive Survey
[36]20192Serverless-Enabled Permissioned Blockchain for Elastic Transaction Processing
[37]20230BLAST-IoT: Blockchain Assisted Scalable Trust in Internet of Things
[38]20230A survey of Layer-two blockchain protocols
[39]20230CBCS: A Scalable Consortium Blockchain Architecture Based on World State Collaborative Storage
Table 3. Research and review articles on blockchain as availability.
Table 3. Research and review articles on blockchain as availability.
ReferencesYearCiteTitle
[33]2017257Towards Better Availability and Accountability for IoT Updates by means of a Blockchain
[36]20192Serverless-Enabled Permissioned Blockchain for Elastic Transaction Processing
[47]2017184On Availability for Blockchain-Based Systems
[48]2017791ProvChain: A Blockchain-based Data Provenance Architecture in Cloud Environment with Enhanced Privacy and Availability
[49]201810BDUA: Blockchain-based data usage auditing
[50]202020AutAvailChain: Automatic and Secure Data Availability through Blockchain
[51]201781A Blockchain-based Data Usage Auditing Architecture with Enhanced Privacy and Availability
[52]202015CoVer: Collaborative Light-Node-Only Verification and Data Availability for Blockchains
[53]200828,450Bitcoin: A Peer-to-Peer Electronic Cash System
[54]201499(Hierarchical) Identity-Based Encryption from Affine Message Authentication
[55]2016371Blockchain For Health Data and Its Potential Use in Health IT and Health Care Related Research
[57]201933Distributed Access Control with Blockchain
[58]20183925Hyperledger Fabric: A Distributed Operating System for Permissioned Blockchains
[59]196215,843Low-Density Parity-Check Codes
[60]201828Preventing DDoS using Bloom Filter: A Survey
[61]2007169What Are Data? The Many Kinds of Data and Their Implications for Data Re-Use
[62]2006751Provenance-Aware Storage Systems
[63]202115Customized Smart Object Detection: Statistics of detected objects using IoT
[64]2012166Seamless Integration of Heterogeneous Devices and Access Control in Smart Homes
[65]2015585Enigma: Decentralized Computation Platform with Guaranteed Privacy
[66]201341Identity-based cryptography techniques and applications
[56]20230Blockchain Data Availability Scheme with Strong DataPrivacy Protection
Table 4. Annual publication per year.
Table 4. Annual publication per year.
Storage Scalability Availability
CategoryPublication (Total)CategoryPublication (Total)CategoryPublication (Total)
201202012020120
201302013020130
201402014020140
201502015020150
201612016120160
201712017320173
201872018720183
201916201917201910
202062202026202010
202154202126202115
20224120223820226
Table 5. List of top 10 countries based on publications.
Table 5. List of top 10 countries based on publications.
StorageScalabilityAvailability
RankCountryPublications% of PapersCentralityRankCountryPublications% of PapersCentralityRankCountryPublications% of PapersCentrality
1India5814.28%1.081India4319.11%0.131United States1116.92%0.31
2UAE5212.80%0.522China2611.55%0.632India812.30%0.00
3Pakistan307.38%0.703Australia2410.66%0.133South Korea710.76%0.07
4Australia297.14%0.784Canada156.66%0.554Qatar69.23%0.53
5Saudi Arabia266.40%0.915Saudi Arabia135.77%0.665Brazil69.23%0.00
6United States204.92%0.386United States104.44%1.466UAE69.23%0.40
7China184.43%0.147Taiwan104.44%0.377Kuwait57.69%0.18
8Qatar122.95%0.398Qatar83.55%0.468Canada34.61%0.00
9United Kingdom102.46%0.009UAE83.55%0.259Egypt34.61%0.00
10Taiwan92.21%0.0010Switzerland73.11%0.4610Saudi Arabia34.61%0.00
Table 6. List of top 10 institutions based on publications.
Table 6. List of top 10 institutions based on publications.
StorageScalabilityAvailability
RankInstitutionsPublicationsCentralityCountryRankInstitutionsPublicationsCentralityCountryRankInstitutionsPublicationsCentralityCountry
1Nirma University470.64India1Nirma University350.22India1Federal University of Pernambuco60.00Brazil
2Khalifa University of Science and Technology450.34UAE2UNSW Sydney180.81Australia2Seoul National University of Science and Technology60.02South Korea
3COMSATS University Islamabad280.28Pakistan3Thapar University160.38India3National Institute of Technology Raipur50.00India
4Thapar University271.27India4University of Petroleum and Energy Studies150.13India4Kuwait College of Science and Technology50.27Kuwait
5UNSW Sydney170.49Australia5Carleton University140.58Canada5Qatar University40.39Qatar
6University of Petroleum and Energy Studies140.17India6Queensland University of Technology100.43Australia6Tennessee Technological University40.41United States
7King Saud University130.22Saudi Arabia7Asian University100.73India7University of California40.00United States
8Qatar University110.14Qatar8Qatar University81.26Qatar8Al Ain University40.00UAE
9Queensland University of Technology110.21Australia9ETH Zurich70.00Switzerland9Umm al-Qura University30.01Saudi Arabia
10Cleveland Clinic Abu Dhabi80.04UAE10Commonwealth Scientific and Industrial Research Organization40.22Australia10Mohamed bin Zayed University of Artificial Intelligence30.63UAE
Table 7. List of top 10 most productive authors participating in blockchain storage, blockchain scalability, and blockchain availability research perspectives.
Table 7. List of top 10 most productive authors participating in blockchain storage, blockchain scalability, and blockchain availability research perspectives.
StorageScalabilityAvailability
RankPublicationAuthorCountryInstituteRankPublicationAuthorCountryInstituteRankPublicationAuthorCountryInstitute
154Salah KhaledUAEKhalifa University of Science and Technology125Gupta RajeshIndiaNirma University19Park Jong HyukSouth KoreaSeoul National University of Science and Technology
247Tanwar SudeepIndiaNirma University224Dorri AliAustraliaQueensland University of Technology28Tauz LevUnited StatesUniversity of California
345Jayaramam RajUAEKhalifa University of Science and Technology317Kumari AparnaIndiaGanpat University38Dolecek LaraUnited StatesUniversity of California
432Yaqoob IbrarUAEKhalifa University of Science and Technology415Wang YuyiSwitzerlandETH Zurich48Mitra DebarnabUnited StatesUniversity of California
528Javaid NadeemPakistanCOMSATS University Islamabad514Tanwar SudeepIndiaNirma University57Aloqaily MoayadKuwaitKuwait College of Science and Technology
628Kumar NeerajIndiaNirma University613Bhattacharya PronayaIndiaBirla Institute of Technology and Science66Guizani MohsenIndiaAnna University
721Kanhere SalilAustraliaUNSW Sydney713Kakkar RiyaIndiaNirma University76Mahmoud MohamedUnited StatesTennessee Technological University
821Omar MohammedUAEKhalifa University of Science and Technology812Li XiChinaMinistry of Education of People’s Republic of China86Maciel PauloBrazilFederal University of Pernambuco
919Jurdak RajaAustraliaQueensland University of Technology911Bodkhe UmeshIndiaNirma University96Dantas JamilsonBrazilFederal University of Pernambuco
1019Gupta RajeshIndiaNirma University1011Decker ChristianUnited StatesCornell University106Melo CarlosBrazilFederal University of Pernambuco
Table 8. List of most cited authors and their highly cited articles on blockchain storage, blockchain scalability, and blockchain availability for 2012–2022.
Table 8. List of most cited authors and their highly cited articles on blockchain storage, blockchain scalability, and blockchain availability for 2012–2022.
StorageScalabilityAvailability
RankFrequencyCentralityYearAuthorHighly Cited ReferenceRankFrequencyCentralityYearAuthorHighly Cited ReferenceRankFrequencyCentralityYearAuthorHighly Cited Reference
1280.102022Gupta Rajesh2022 IEEE Globecom Workshops1130.062021Kakkar RiyaIEEE Systems Journal1100.942017AnonymouIEEE Access
2220.012020Zhang ZijianIEEE Transactions on Network Science and Engineering290.092015Decker ChristianLecture Notes in Computer Science260.872019Dorri AliJournal for General Philosophy of Science
3220.002020Kumari AparnaSensors380.112021Gupta RajeshIET Communications360.032018Garg Shiva RajInternal Journal of Energy Research
4200.072020Hasan Haya RIEEE Access480.132019Dorri AliIEEE Communications Magazine450.052017Sharma Pradip KumarIEEE Access
5200.222020Dorri AliIEEE Access570.392020Kumari AparnaComputer Communications540.012021Hu Xiao-YuIEEE Transactions on Information Theory
6180.062020Tanwar SudeepComputers and Industrial Engineering670.002019Dorri AliJournal of Parallel and Distributed Computing640.052021Ridhawi IsmaeelACM Transactions on Internet Technology
7170.052020Zhang YanIEEE Transactions on Power Systems770.212016Luu LoiSIG SAC Conference on Computer and Communication740.012021Yuntao MondalInternal Journal of Energy Research
8170.002020Novo OTelecommunication Systems870.002016Croman KyleLecture Notes in Computer Science840.032017Christidis KonstantirosIEEE Access
9170.302022Salah KhaledPeer-to-Peer Networking and Applications970.322019Liu MengtingIEEE Transactions on Industrial Informatics930.092019Baza Mohamed2019 International Conference on Smart Applications, Communications and Networking
10140.082020Wang SIEEE Communications Surveys and Tutorials1070.062017Kang jiengIEEE Transactions on Industrial Informatics1030.392017Aitzhan NZAd Hoc Networks
Table 9. List of top five productive journals in blockchain storage, blockchain scalability, and blockchain availability for 2012–2022.
Table 9. List of top five productive journals in blockchain storage, blockchain scalability, and blockchain availability for 2012–2022.
StorageScalabilityAvailability
RankJournalCount% of PaperCentralityRankJournalCount% of PaperCentralityRankJournalCount% of PaperCentrality
1IEEE Access1346.69%0.011IEEE Access675.16%0.011IEEE Access238.15%0.30
2Future generation Computer Systems723.59%0.002IEEE Internet of Things Journal503.85%0.052Lecture Notes in Computer Science155.31%0.26
3IEEE Internet of Things Journal703.49%0.043IEEE Transactions on Industrial Informatics473.62%0.003IEEE Transactions on Industrial Informatics144.96%0.07
4IEEE Transactions on Industrial Informatics582.89%0.004IEEE Network433.31%0.114IEEE Communications Magazine124.25%0.13
5Lecture Notes in Computer Science582.89%0.005Lecture Notes in Computer Science433.31%0.015IEEE Transactions on Dependable and Secure Computing124.25%0.45
Table 10. List of top 10 citation references.
Table 10. List of top 10 citation references.
CitationYearAuthor(s)TitleSourceCountry
16502018Khan Minhaj AhmadIoT security: Review, blockchain solutions, and open challengesFuture Generation Computer SystemsPakistan
11932017Dorri AliBlockchain for IoT Security and Privacy: The Case Study of a Smart Home2017 IEEE International Conference on Pervasive Computing and Communications WorkshopsAustralia
8022020Chamola VinayA Comprehensive Review of the COVID-19 Pandemic and the Role of IoT, Drones, AI, Blockchain, and 5G in Managing its ImpactIEEE AccessIndia
7572017Xia QiMeDShare: Trust-Less Medical Data Sharing Among Cloud Service Providers via BlockchainIEEE AccessChina
5852017Steger MarcoBlockChain: A Distributed Solution to Automotive Security and PrivacyIEEE Communications MagazineAustralia
5642019Wang WenboA Survey on Consensus Mechanisms and Mining Strategy Management in Blockchain NetworksIEEE AccessSingapore
5602018Esposito ChristianCloud Message Queueing and Notification: Challenges and (Blockchain) OpportunitiesIEEE Cloud ComputingItaly
5412017Kanhere Salil Towards an Optimized BlockChain for IoTProceedings of the Second International Conference on Internet-of-Things Design and ImplementationAustralia
5342019Salah KhaledBlockchain for AI: Review and Open Research ChallengesIEEE AccessUAE
5242020Tanwar SudeepBlockchain-based electronic healthcare record system for healthcare 4.0 applicationsJournal of Information Security and ApplicationsIndia
Table 11. List of author’s keywords.
Table 11. List of author’s keywords.
S.No.KeywordCommonTop 5 Author NameCitations
1Blockchain Kim-Kwang Raymond Choo376
Dusit Tao Niyato366
Mohsen Mokhtar Guizani363
Neeraj Kumar338
Sudeep Tanwar277
2Data storageBlockchainKim-Kwang Raymond Choo262
Dusit Tao Niyato248
Mohsen Mokhtar Guizani243
Neeraj Kumar240
Sudeep Tanwar197
3ScalabilityBlockchainSudeep Tanwar179
Neeraj Kumar170
Andreja Pucihar159
Dusit Tao Niyato157
Roger W H Bons156
4IOT SecurityBlockchainMohsen Mokhtar Guizani317
Neeraj Kumar304
Kim-Kwang Raymond Choo292
Dusit Tao Niyato270
Gautam Srivastava223
5CryptographyBlockchainKim-Kwang Raymond Choo236
Neeraj Kumar168
Lie-Huang Zhu151
Gautam Srivastava142
De-Biao He139
6AvailabilityBlockchainSudeep Tanwar178
Neeraj Kumar167
Andreja Pucihar165
Roger W H Bons155
Mohsen Mokhtar Guizani152
7ConfidentialityBlockchainAndreja Pucihar162
Roger W H Bons154
Kim-Kwang Raymond Choo141
Neeraj Kumar121
Mirjana Kljajic Borstnar113
8Distributed ledger technology (DLT)BlockchainPolona Šprajc49
Iztok Podbregar49
Damjan Maletič48
Mirjana T Radovanović48
Umit Cali40
10Consensus algorithmsBlockchainDusit Tao Niyato177
Andreja Pucihar162
Roger W H Bons154
Neeraj Kumar148
Sudeep Tanwar143
11Bibliometric analysisBlockchainAndreja Pucihar116
Mirjana Kljajic Borstnar112
Roger W H Bons108
Doroteja Vidmar61
Pascal Ravesteijn52
12Data extraction methodsBlockchainAndreja Pucihar164
Roger W H Bons154
Mirjana Kljajic Borstnar113
Kim-Kwang Raymond Choo105
Pascal Ravesteijn103
13RIS formatBlockchainPolona Šprajc49
Iztok Podbregar49
Damjan Maletič48
Mirjana T Radovanović48
Kim-Kwang Raymond Choo20
14Analytical viewBlockchainAndreja Pucihar163
Roger W H Bons154
Mirjana Kljajic Borstnar114
Pascal Ravesteijn103
Doroteja Vidmar62
15CiteSpaceBlockchainZeshui S Xu11
Shou-Yang Wang8
Karim Rejeb8
Abderahman Rejeb8
Qiang Wang7
16Visual knowledge mapsBlockchainAndreja Pucihar165
Roger W H Bons154
Mirjana Kljajic Borstnar114
Pascal Ravesteijn103
Doroteja Vidmar63
17VOSviewerBlockchainSatish Kumar30
Abderahman Rejeb18
Karim Rejeb17
Weng Marc Lim16
Horst Treiblmaier13
18Co-occurrence networkBlockchainAndreja Pucihar163
Roger W H Bons154
Mirjana Kljajic Borstnar113
Pascal Ravesteijn100
Doroteja Vidmar62
19Retrieval publication periodBlockchainAndreja Pucihar162
Roger W H Bons154
Mirjana Kljajic Borstnar113
Pascal Ravesteijn103
Doroteja Vidmar62
20Data ownershipBlockchainAndreja Pucihar166
Roger W H Bons154
Mirjana Kljajic Borstnar113
Pascal Ravesteijn104
Khaled Salah73
21Immutable dataBlockchainSudeep Tanwar130
Khaled Salah112
Roger W H Bons104
Andreja Pucihar104
Neeraj Kumar99
22Smart contractsBlockchainDusit Tao Niyato191
Sudeep Tanwar191
Andreja Pucihar164
Roger W H Bons157
Neeraj Kumar156
23Scalability trilemmaBlockchainPolona Šprajc49
Iztok Podbregar49
Damjan Maletič48
Mirjana T Radovanović48
Paolo Tasca6
24ShardingBlockchainDusit Tao Niyato50
Zi-Bin Zheng31
Song Guo23
Khaled Salah22
Jiawen Kang19
25TraceabilityBlockchainKhaled Salah104
Kim-Kwang Raymond Choo91
Raja Jayaraman84
Sudeep Tanwar69
Lie-Huang Zhu66
26TransparencyBlockchainAndreja Pucihar165
Roger W H Bons155
Sudeep Tanwar137
Neeraj Kumar118
Mirjana Kljajic Borstnar114
Table 12. List of top 10 keywords of blockchain research based on frequency.
Table 12. List of top 10 keywords of blockchain research based on frequency.
RankFrequencyYearKeyword
11642022Smart contract
21332017Vehicle
31192019Industry
41132019Node
5912019Proof
6912018Miner
7882021Transparency
8852021Traceability
9852020Healthcare
10822020Stakeholder
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kandpal, M.; Goswami, V.; Priyadarshini, R.; Barik, R.K. Towards Data Storage, Scalability, and Availability in Blockchain Systems: A Bibliometric Analysis. Data 2023, 8, 148. https://doi.org/10.3390/data8100148

AMA Style

Kandpal M, Goswami V, Priyadarshini R, Barik RK. Towards Data Storage, Scalability, and Availability in Blockchain Systems: A Bibliometric Analysis. Data. 2023; 8(10):148. https://doi.org/10.3390/data8100148

Chicago/Turabian Style

Kandpal, Meenakshi, Veena Goswami, Rojalina Priyadarshini, and Rabindra Kumar Barik. 2023. "Towards Data Storage, Scalability, and Availability in Blockchain Systems: A Bibliometric Analysis" Data 8, no. 10: 148. https://doi.org/10.3390/data8100148

Article Metrics

Back to TopTop