Next Article in Journal
Is Sustainability Reporting Promoting a Circular Economy? Analysis of Companies’ Sustainability Reports in the Agri-Food Sector in the Scope of Corporate Sustainability Reporting Directive and EU Taxonomy Regulation
Previous Article in Journal
The Effect of Perceived Corporate Social Responsibility (PCSR) on Subjective Well-Being
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Digital Transformation Strategies, Practices, and Trends: A Large-Scale Retrospective Study Based on Machine Learning

by
Fatih Gurcan
1,
Gizem Dilan Boztas
2,
Gonca Gokce Menekse Dalveren
3 and
Mohammad Derawi
4,*
1
Department of Management Information Systems, Faculty of Economics and Administrative Sciences, Karadeniz Technical University, 61080 Trabzon, Turkey
2
Digital Transformation Office, Karadeniz Technical University, 61080 Trabzon, Turkey
3
Department of Software Engineering, Faculty of Engineering, Atilim University, 06830 Ankara, Turkey
4
Department of Electronic Systems, Faculty of Information Technology and Electrical Engineering, Norwegian University of Science and Technology, 7034 Gjøvik, Norway
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(9), 7496; https://doi.org/10.3390/su15097496
Submission received: 14 March 2023 / Revised: 29 April 2023 / Accepted: 1 May 2023 / Published: 3 May 2023

Abstract

:
The purpose of this research is to identify the areas of interest, research topics, and application areas that reflect the research nature of digital transformation (DT), as well as the strategies, practices, and trends of DT. To accomplish this, the Latent Dirichlet allocation algorithm, a probabilistic topic modeling technique, was applied to 5350 peer-reviewed journal articles on DT published in the last ten years, from 2013 to 2022. The analysis resulted in the discovery of 34 topics. These topics were classified, and a systematic taxonomy for DT was presented, including four sub-categories: implementation, technology, process, and human. As a result of time-based trend analysis, “Sustainable Energy”, “DT in Health”, “E-Government”, “DT in Education”, and “Supply Chain” emerged as top topics with an increasing trend. Our findings indicate that research interests are focused on specific applications of digital transformation in industrial and public settings. Based on our findings, we anticipate that the next phase of DT research and practice will concentrate on specific DT applications in government, health, education, and economics. “Sustainable Energy” and “Supply Chain” have been identified as the most prominent topics in current DT processes and applications. This study can help researchers and practitioners in the field by providing insights and implications about the evolution and applications of DT. Our findings are intended to serve as a guide for DT in understanding current research gaps and potential future research topics.

1. Introduction

In today’s world, rapidly developing digital technologies and increasingly adopted innovative and competitive business models have compelled many businesses to undergo digital transformation (DT) in recent years. DT refers to businesses of all sizes leveraging online resources and technology to improve the efficiency of their products, services, customer experiences, workflows, and decision-making processes [1,2]. Virtual reality, digital literacy, the Internet of things, digital employees, big data, machine learning, cloud services, and blockchain have increased the adoption of DT, allowing organizations to improve service quality, efficiency, profitability, and customer satisfaction [1,3]. With the increase in online resources and the acceleration of technological developments, DT has become a strategic process that closely concerns businesses [2].
This process has resulted in radical shifts in the strategic decision-making paradigms, business models, and services of businesses in every country and industry. Emerging digital technologies such as artificial intelligence, robotics, big data analytics, the Internet of things, and blockchain play an important role in increasing company efficiency and lowering costs by enabling data-driven decision-making processes [4]. Furthermore, the problems encountered during the COVID-19 pandemic process clearly demonstrated the need for DT in many public and industrial areas. This case has accelerated the process of organizational DT. According to Statista, spending on DT was USD 1.18 trillion in 2019, with a projected increase to USD 1.8 trillion by 2022. This figure is expected to rise to USD 2.8 trillion by 2025 [5]. To be successful in global competition, businesses must adapt quickly to competitive business strategies and be constantly renewed. The most significant of these innovations is DT. The strategic significance of DT in today’s global competitive environment is growing by the day [6].
Academic research on DT is becoming increasingly important as the importance of DT for businesses and societies grows. DT research has recently increased as a result of the increased interest in this field [4]. Academic research on DT covers a wide range of research topics and draws important conclusions about business success in DT processes and global competition [3]. Academic research on the applications and effects of DT in various fields includes efforts to investigate how this transformation affects social, cultural, economic, and technological developments. Furthermore, numerous studies on DT processes and applications have been conducted in the context of various countries, governments, regions, organizations, or industries [7,8]. While the increase in DT studies provides important inferences and insights into the field, the exponential increase makes understanding the themes and focal points of the field-specific research landscape from a holistic perspective difficult [1]. Understanding DT research trends, themes, and application areas will make it easier for new researchers to recognize this field and identify potential research gaps [4]. Understanding these trends, issues, and changes is also important for developing a vision for the future of DT. Scientific literature is a rich and important source of information in this context. Using automated text mining procedures based on machine learning is one of the most effective ways to analyze the scientific literature [9,10]. Because automated text mining efficiently summarizes a large number of documents, it enables the discovery of hidden semantic topics in that collection [10,11,12].
Studies based on a DT literature analysis, on the other hand, are mostly of the systematic review and bibliometric analysis variety, with a relatively small number of studies. These studies primarily concentrated on the DT of various industries, including education [13], construction [14], and manufacturing [15]. It has been observed that studies dealing with DT in general have a small number of articles, such as systematic literature reviews [16] or bibliometric analysis [17]. Automated text mining research focuses on the temporal effects of DT [18] and its industrial applications [19].
Despite these commendable efforts, no study based on automated text mining has addressed DT as a whole. This study’s goal is to identify the research and implementation topics and their temporal trends by analyzing research on DT over the last decade using automated text mining and topic modeling procedures. The field of DT has been approached from a broad perspective for this purpose, and themes and trends have been identified using a machine learning-based topic modeling algorithm. A systematic taxonomy map for DT associated with the obtained topics has been created. Furthermore, the discovered topics’ trends and temporal developments are revealed. Because current reviews for DT have focused on a small number of articles or on specific industries, it is expected that this study will contribute to the literature with more specific and deep insights than the others. Answers to the following research questions (RQs) will be sought in the context of evaluating the previous decade of DT.
RQ1: What are the bibliometric characteristics of DT research?
RQ2: What are the main topics of DT research?
RQ3: How do DT topics fit into a taxonomy?
RQ4: What are the temporal trends in DT research topics?

2. Background

The concept of DT, its sub-contexts, and industrial application areas will be discussed in this section. The topic modeling method we used in our analysis, as well as its applications, will then be discussed.

2.1. Digital Transformation

The concept of digital transformation first appeared in the literature in the late 1990s and has since become a popular research topic due to the COVID-19 pandemic [20]. There are numerous definitions of DT in the literature, but no agreement has been reached on any of them. DT is defined as “an essential process of change driven by the innovative use of digital technologies, combined with the strategic leverage of key resources and capabilities, aimed at radically improving an organization and redefining the value proposition for its stakeholders” [21]. This process revolves around four basic elements: the unit where the DT will occur, the scope of the transformation, the tools, and the transformation’s outcome [3].
Information and communication technologies (ICT), which allow for the rapid acquisition, storage, processing, and distribution of information, are becoming more prevalent, increasing their importance in social and industrial fields [22]. This rapid development of ICT has provided various opportunities for organizations to innovate, but it has also necessitated this renewal. Organizations must adapt their business processes to SMACIT (social, mobile, analytics, cloud, and Internet of things) technologies in order to capitalize on emerging opportunities and obligations. This has resulted in the emergence of smarter and more effective new business models as a result of the DT of many small and medium-sized businesses (SMEs) [23]. In today’s global economy, digital transformation processes and practices provide significant benefits for SMEs to maintain competitiveness, efficiency, and profitability. A successful DT for organizations is only possible when technology is backed up by process and people [24].
Technology in the context of DT can be defined as a tool that assists people in moving forward more efficiently by automating work. When it comes to DT, technology is usually the first thing that comes to mind. Despite all of its benefits, thinking of DT solely in terms of technology can lead to organizations making serious mistakes and failing their DT initiatives [25]. At this point, the process component enters the picture to assist the technology component. A process is a collection of actions that must be carried out in order to achieve a goal [26]. Processes, on the other hand, are complex activities that must be completed during the transformation phase [27] because a process should be designed in such a way that it can meet the needs of the organization’s stakeholders while also generating values that contribute to the organization’s operational excellence. In process design, it is necessary to establish a collaborative DT vision in the organization, understand the organization’s business model and processes, and evaluate the organization’s DT performance and capabilities. Furthermore, reviewing organizational policies, taking into account organizational culture, and developing and testing processes [28] are also necessary. Aside from the other two components, one of the most important aspects of DT is the human component. Internal stakeholders are also directly linked to an organization’s DT. According to one study, two of the most important reasons for a failed DT process are related to the human component. The first is due to a lack of managerial support, while the second is due to a lack of skills in developing and implementing the organization’s digital strategy [27,29]. As a result, for an effective DT, managers must focus on their employees and prepare them for this transformation in terms of skills and mindset. This preliminary planning is critical not only for the success of the DT process, but also for the long-term viability of the DT activity.
A number of studies have been conducted to examine the DT literature in various contexts. In terms of scope, these studies can be divided into two categories. The first category includes studies that evaluate DT in general, while the second includes studies that focus on specific industries. When the literature is examined, it is discovered that systematic literature review studies on DT were not conducted until 2014 [30]. Henriette et al. examined 13 articles and conducted a managerial literature review on DT [31]. Reis et al. conducted a systematic review of the DT literature, reviewing 206 articles [4]. Gebayew et al. examined DT in the context of research by reviewing 30 articles [30]. Teichert examined 24 studies on DT maturity in 2019 and revealed the characteristics of 22 digital maturity models [32]. Hanelt et al. examined 279 studies in order to assess DT from the standpoint of organizational differentiation [16]. Zhu et al., on the other hand, used bibliometric analysis on 865 studies to provide an overview of DT [8]. Verhoef et al. conducted a scoping study on 84 studies in order to identify the three stages of DT [6]. Researchers conducted a systematic literature review in 2022 to evaluate Latin America’s DT [33]. In a similar vein, Chawla and Goyal conducted a bibliometric analysis on 234 studies to provide an overview of DT [23]. Facin et al. also conducted a bibliometric analysis on 294 studies [17].
On the other hand, DT is a fascinating topic for researchers from various disciplines. As a result, some studies in the literature focus on the applications of DT in various industries. Martin-Pena (2018), for example, investigated DT business models in the manufacturing industry and reviewed 65 studies to better understand the relationship between service and digitalization [15]. Cortellazzo, Bruni, and Zampieri (2019) examined 54 leadership-related DT studies [34]. Benavides et al. examined 19 studies to identify the characteristics that distinguish the DT processes in higher education from one another [13]. Olanipekun and Sutrisna (2021) examined 151 studies to determine the effects and challenges of DT applications in the construction industry [14]. The tourism industry is another area in which DT focuses. A bibliometric analysis of 535 studies was performed in order to better understand the DT in cultural heritage management [35].

2.2. Topic Modeling

Systematic literature studies using manual methods necessitate more labour and time. Bibliometric studies offer the benefit of a quick examination of the subject as well as a general evaluation. However, it does not provide detailed information about the content [36]. As a result, automated methodologies based on machine learning, such as topic modeling, enable effective analysis that can provide much deeper insights.
Topic modeling is a technique for detecting hidden semantic structures known as “topics” in large, unstructured document collections [10,11]. Latent Dirichlet allocation (LDA) is a three-level hierarchical Bayesian model-based generative probabilistic topic modeling approach [37]. In topic modeling research, the LDA model is frequently used [38]. LDA is based on the notion that each document is likely to contain one or more topics, and that collections of words that reflect a topic are used [11]. The topic distribution for each document and the word distribution for each topic are calculated using iterations of the Dirichlet distribution in this algorithm [37].
Topic modeling is also widely used in social science research due to the methodological advantages it provides. Gurcan et al. (2021) analyzed 41,720 articles using topic modeling to determine the trends and knowledge-domains of human–computer interaction [39]. Another case study conducted by Aziz et al. (2022) used topic modeling analysis on 5942 articles to search the finance literature [40]. However, there are only a few studies in the field of DT that use the topic modeling approach. Lopez and Wilfredo (2022) used a topic modeling approach to analyze 700 articles in order to better understand DT in a crisis situation [18]. Lee et al. used topic modeling and bibliometric analysis on 99 studies to identify strategic boundaries in advanced manufacturing and engineering and to understand DT trends [19]. Similarly, Chen et al. used topic modeling and bibliometric analysis to identify the main components of effective academic–industrial collaboration topics and patterns [41]. To the best of our knowledge, our study is the first to use a topic modeling approach to examine DT from a holistic perspective. As a result, it is expected that this research will contribute to future DT research.

3. Materials and Methods

This study’s methodology consisted of five sequential stages (see Figure 1) in order to investigate up-to-date strategies, practices, and trends of DT. First, a search strategy was developed in accordance with the theoretical background of DT in order to obtain the most consistent data within the scope of the study. With the articles obtained through this search, an empirical corpus for DT was then compiled. Applying preprocessing tasks to this textual corpus made it suitable for topic modeling analysis in the subsequent step. At this stage, the corpus was analyzed using a topic modeling technique based on Latent Dirichlet allocation (LDA). Finally, the content and background of the topics generated by LDA were interpreted, the topics were labeled, and the time-based accelerations of the topics were revealed. These procedures are illustrated in Figure 1.

3.1. Search Strategy and Data Collection

One of the most important steps in topic modeling research is the creation of the appropriate corpus; because the empirical corpus to be created is directly related to the performance of the results, the data collection should be done methodically [10,36]. In content analysis studies on scientific articles, various bibliometric databases can be used. Scopus, published by Elsevier, is one of the most widely used bibliographic databases [42]. With its large number of journals and advanced search filters, this bibliometric database provides access to more articles. As a result, Elsevier’s the Scopus bibliographic database is preferred in this study. This bibliometric database indexes a wide range of publication sources, including book reviews, editorial materials, book chapters, conference proceedings, and peer-reviewed journal articles. Peer-reviewed journal articles, on the other hand, consistently reflect the research environments of the disciplines. To demonstrate an objective approach in the relevant study, only peer-reviewed journal articles were included in the experimental data set in this context. In this context, a Scopus database query was made with “digital transform*” in the title, abstract, and keywords to create an experimental data set for DT. The desired articles were extracted from the Scopus database as a result of this search query, and an experimental data set was created that included bibliometric indicators, author keywords, and abstract and title sections of each article [11]. As a result, the experimental corpus of DT created includes 5350 peer-reviewed journal articles (4927 research articles and 423 reviews) published between 2013 and 2022.

3.2. Data Preprocessing

Data preprocessing is a fundamental data mining step that includes optimization processes to clean up noisy data on the raw data set prior to the construction of a model. This step is critical for text mining and natural language processing studies because it has a large impact on the performance of the analysis results [9]. To filter out noisy information from the extracted DT experimental corpus, we perform data preprocessing in five steps. To begin, each word was converted to lowercase. Tags, web links, punctuation, publisher information, symbols, and numeric expressions were then removed from the dataset. After that, tags, web links, punctuation, publisher information, symbols, and numeric expressions were removed from the dataset. All English stop words (how, what, and, is, or, the, a, an, for, etc.) were deleted in the fourth stage [11,43]. Finally, the remaining words were subjected to the lemmatization process, which resulted in the reduction of the words from their derived form to their plain form [44]. Following data preprocessing, each article in the dataset was converted into a word vector using the “bag of words” method to provide a numerical representation of the words in the corpus [10]. By combining these vectors, a document-term matrix in numerical matrix format representing the entire corpus and required for topic modeling analysis was created [45].

3.3. Implementation of the LDA-Based Topic Modeling

Topic modeling is a probabilistic approach based on machine learning that is used to manage, understand, and summarize large collections of text information [10]. It enables the detection of hidden dependencies between various patterns associated with specific topics in textual document sets. As a result, it is possible to identify word groups that best represent the information embedded in documents known as topics [38]. We use experimental compilation topic modeling with DT articles in this article. Our goal is to generate abstractions of semantic mappings of articles stored in topic sets. To obtain the patterns and topics included in the DT research corpus, we use the Latent Dirichlet allocation (LDA) algorithm. The LDA topic model is a probabilistic approach. LDA topics are less likely to have semantic overflow from documents and are more interpretable than topics produced by other algorithms such as Latent Semantic Indexing (LSI) or Probabilistic Latent Semantic Analysis (PLSA) [11]. LDA does not require a predefined tag or training set because it is based on an unsupervised machine learning model [45].
We used the tmtoolkit package [43], which includes the LDA model for topic modeling, to apply LDA-based topic modeling to the experimental corpus of DT research. Given our dataset’s preprocessed articles as input, the LDA algorithm generates a topic list that groups the articles under K topic labels. To determine the optimal number of K topics, we used the CV consistency score, a standard practice recommended in previous studies. First, we ran the LDA model on our dataset with 1000 iterations, increasing the number of K topics by one each time from 5 to 60. We calculated the CV consistency score of the generated topic model for each K at the same time [46]. A higher consistency score indicates better topic separation. Finally, the topic model with the highest consistency score was chosen. Our analysis revealed that the consistency scores for K = 34 reached the maximum CV =0.6982, so we decided that 34 was the best number of topics [11,46]. As a result of the topic modeling implemented with the above-mentioned dataset and parameters, we created 34 topics. For each topic, the model displays the following data: (1) descriptive keywords: a list of the first N words with the highest frequency describing the topic, along with the probability of each word indicating the relative descriptive power of the word for the topic. We identify the top 20 words for each topic. (2) List of related articles: each related article is assigned a correlation value between 0 and 1. The higher the correlation of an article, the more “on-topic” the article is described. Following previous research, we assign the article with the highest correlation values to that topic. After the articles were assigned to the dominant topics, the labels for each topic were determined and assigned by three field experts, taking into account the descriptive keywords of the topics. Finally, taking into account the context and background of the 34 topics identified as a result of the analysis, these topics were categorized and a systematic taxonomic map was created, modeling the research and practice perspectives of the DT field.

4. Results

4.1. Descriptive Analysis (RQ1)

In order to answer the RQ1 question, this section provides bibliometric characteristics of DT research. Table 1 shows the numbers and percentages of articles by year. The low number of studies conducted in the first four years (2013–2016) is highlighted in Table 1. Despite an increase in the number of studies conducted during the process, the majority of the studies (80.32%) were conducted during and after the pandemic period (2020–2022).
Table 2 lists the most productive journals for DT. The majority of the DT articles were published in the journal Sustainability Switzerland, as shown in Table 2. This journal is followed by Technological Forecasting and Social Change and IEEE Access.
Table 3 shows which subject areas are covered by the DT studies. Because the articles cover a wide range of topics, the sum of the distribution percentages exceeds 100% in Table 3. Further, this table demonstrates the multidisciplinary nature of DT. According to this table, the fields of “Business, Management, and Accounting”, “Social Science”, and “Computer Science” stand out in DT studies.
The keywords most frequently used in DT research are shown in Table 4. “Digital Transformation” is the most frequently used keyword, according to Table 4. “Digitalization” is the second most frequently used keyword, and “Industry 4.0” is the third.
Table 2 lists the most productive journals for DT. The majority of the DT articles were published in the journal Sustainability Switzerland, as shown in Table 2. This journal is followed by Technological Forecasting and Social Change and IEEE Access.
Table 3 shows which subject areas are covered by the DT studies. Because the articles cover a wide range of topics, the sum of the distribution percentages exceeds 100% in Table 3. In addition, this table demonstrates the multidisciplinary nature of DT. According to this table, the fields of “Business, Management, and Accounting”, “Social Science”, and “Computer Science” stand out in DT studies.
The keywords most frequently used in DT research are shown in Table 4. “Digital Transformation” is the most frequently used keyword, according to Table 4. “Digitalization” is the second most frequently used keyword, and “Industry 4.0” is the third.

4.2. Topic Modeling Analysis (RQ2)

The topics identified by LDA and their citation rates are presented in this section. The LDA topic model was applied to the DT experimental corpus, and 34 topics were discovered, as shown in Table 5. Topic ratios expressed as percentages indicate the frequency with which they occur in articles. Appendix A, Table A1 lists the descriptive keywords for each topic. The keywords for each topic are listed in descending order of frequency in Table A1. The top three topics with the highest percentages, as shown in Table 5, are “Management”, “DT in Economy”, and “SDCL” (software development life cycle). “CIO-CDO Roles in DT”, “Smart City”, and “Cultural Entrepreneurship” received the lowest percentages.
Table 6 also shows the citation rates (CR) of the discovered topics. The “Management” topic with the highest percentage is also the most cited topic, according to Table 6. This is followed by the “Business Model” and “Industry 4.0” topics. Although the percentages of both topics are low (Table 5), the citation rates are high. Similarly, “Digital Technologies”, “Innovation Capability”, and “COVID-19” topics have high citation rates despite having a low percentage. “Cultural Entrepreneurship” and “DT in Tourism” have the lowest citation rates as the topics with the lowest percentages. “Digital Literacy”, on the other hand, stands out as the topic with the lowest citation rate.

4.3. Taxonomy of Digital Transformation Topics (RQ3)

The topics were classified, and a taxonomy map was created in this section, which included four major categories: implementation, process, technology, and people (see Table 7). As shown in Table 7, the first category is implementation (45.76%). The implementation category includes specific areas where digital transformation is used.
This category contains 16 topics, including “DT in Economy”, “DT in Education”, “DT in Health”, “DT of SME”, “E-Government”, “Industry 4.0”, “Sustainable Energy”, “E-Commerce”, “COVID-19”, “Smart Grid”, “DT in Construction”, “Supply Chain”, “DT in Finance”, “DT in Tourism”, “Smart City”, and “Cultural Entrepreneurship”. The second category, process (26.08%), includes elements such as developing, understanding, improving, implementing, and managing organizational business processes for a successful DT. The process category contains six topics: “Management”, “SDLC”, “Performance Analysis”, “Economic Policy”, “Business Model”, and “Innovation Capability”. The third category is technology (22.48%). This category is about DT technologies and tools. “Social Media”, “Digital Technologies”, “Big Data”, “Internet of Things”, “Service Provider”, “Artificial Intelligence”, “Cybersecurity”, “Cloud Computing”, and “Virtual Reality” are all topics in the technology category. People (5.63%) are the fourth category, and they play an important role in DT. Topics in the people category cover the roles of human expertise in organizational DT. “Digital Literacy”, “HRM” (Human Resources Management), and “CIO-CDO Roles in DT” are the topics.

4.4. Time-Based Trend Analysis of the Topics (RQ4)

To reveal the changes in trends and evolutions of 34 topics reflecting the DT research landscape over time, annual changes in percentages of topics were identified, and trend values for each topic were calculated. Appendix A, Table A2 displays the percentages, trend values, and trend directions of the topics by year. The topics in this table are listed in descending order (from green to red) based on the overall trend values in the last column (see Appendix A, Table A2). Annual percentages for the last five years were used to calculate the overall trend shown in the last column, because in the first five years, the percentage weight of all the topics has not been formed yet. Appendix A, Table A2 is presented in the form of a colorized heatmap table. Each row of this table demonstrates the annual percentage changes for a specific topic as well as descriptive information about how that topic has evolved over time. In this table, the green colors indicate percentages above the topic’s average in a row, the red colors indicate percentages below the topic’s average, and their colors approach white as these annual values approach the average value (see Appendix A, Table A2). Figure 2 also depicts the top ten topics with the highest increasing and decreasing trends. According to Figure 2, “Sustainable Energy” appears to be the topic with the highest increasing trend. This topic is followed by “DT in Health”, “E-Government”, “DT in Education”, and “Supply Chain”, respectively. “Big Data”, “Digital Technologies”, and “Social Media” have the most decreasing trends. On the other hand, despite the low percentage of topics (see Table 4) and citation rate, the topic “Sustainable Energy” shows an increasing trend (see Table 5). “Social Media” is another fascinating topic in this context. The topic “Social Media” has the most decreasing trend, despite having a relatively high percentage of topics and citations.

5. Discussion

5.1. The Interdisciplinary Background of Digital Transformation

DT, the focal point of today’s technological life cycle, has common application areas for all disciplines. Technological advancements, digital infrastructure, and globalizing business models enable large-scale DT projects based on interdisciplinary collaboration by bringing together multi-disciplinary approaches and ideas from disparate disciplines. DT is a process that affects all disciplines and has various application areas based on the collaboration of various disciplines in each sector [1,6]. Business (Industry 4.0, Business Intelligence, Smart Innovation), Health (E-Health, Smart Drug Management), Energy (Smart Grids, Cloud Energy, Sustainable Energy), Technology (Virtual Reality, Big Data, Machine Learning, Deep Learning, IoT), and so on are examples of common applications [3,6].
In line with previous studies, our findings confirmed that DT has an interdisciplinary structure. Furthermore, the contribution rates of other disciplines contributing to the field are shown in our findings (Table 3). According to our findings, the fields of “Business, Management, and Accounting”, “Social Sciences”, “Computer Science”, and “Engineering” made the greatest contribution [6]. This multidisciplinary background of DT is clearly reflected in our findings [6,47,48]. For example, “Management”, “Business Model”, and “DT in Finance” topics are related to business, management, and accounting, “DT in Education” and “E-Government” topics are related to social sciences, “SDCL”, “Big Data”, “Artificial Intelligence”, and “Internet of Things” topics are related to computer science, and “Smart Grid” and “DT in Construction” topics are related to engineering. Our findings regarding the multidisciplinary nature of DT (economics, health, education, management, etc.) can be considered as an indication that interdisciplinary collaboration produces more innovative and effective solutions for DT processes [6].

5.2. The Four Pillars of Digital Transformation

For the 34 topics obtained as a result of this study, a systematic taxonomy with four basic categories was created. In a systematic taxonomy, DT is divided into four major categories: application, process, technology, and people. Within these four categories, DT has an interacting life cycle and a working flow that combines various elements working together. The categories we presented are similar to those presented in previous studies. The dominance of process, technology, and people categories in DT is a phenomenon that researchers have previously highlighted [24]. In addition to these categories, the study created the implementation category, which evaluates DT in the context of industrial applications.
As previously stated, the spread of digital technologies, the pandemic, and other similar factors have accelerated global DT. As a result of this acceleration, organizations, industries, and government agencies were forced to transform their existing business models in order to keep up with the changing world ushered in by information technologies [49]. The transformation of a business model that occurred in one industry triggered the participation of other industries in this transformation. This is supported by our findings in the implementation category. The implementation category accounts for nearly half of the total category weights (45.76%). DT applications are used in a variety of fields, including the economy, manufacturing, energy, and cultural entrepreneurship. As a result, the spread of field applications to a broad spectrum is considered a natural result. Furthermore, the most striking topic of the implementation category (Table 7) is “DT in Economy”, which is ranked second in the order of topic percentages (Table 6). This situation indicates that there is scientific interest in comprehending the theoretical and practical aspects of the economy’s DT in general, as well as its implications for economic units [49]. It can be seen that the other prominent implementation category topics are “DT in Education” and “DT in Health”, which reflect sectors that can be considered a start in DT (see Appendix A, Table A2). The fact that researchers are turning to the applications of these industries can be viewed as a natural result of the widespread impact of education and health on society [50].
Processes are critical paradigms for improving workflow efficiency [26]. By definition, DT entails either improving existing processes or developing new, more efficient processes. As a result, effective process management is closely related to successful DT [51]. Because of this relationship, the “Management” topic is the most important topic in the process category. Furthermore, the fact that the topic “Management” ranks first in terms of frequency of occurrence (Table 5) and citation rates (Table 6) supports this finding. Our findings, on the other hand, show that the “SDLC” topic ranks first in both the process category and the frequency of occurrence. This finding indicates that businesses that previously did not focus on software but produced basic products and services are now adopting software-oriented services and applications more frequently [52]. Furthermore, the fact that software is one of the most important driving forces for DT explains why the “SDLC” topic is at the top of the list [52,53].
Technology is a tool that can help organizations in their DT process [27]. Social media is an effective subset of this tool. Social media has capabilities that are important for DT, such as improving stakeholder communication, education, and information sharing. Furthermore, it is less expensive than other DT technologies [54]. According to studies, the use of social media accelerates the DT of institutions ranging from small businesses to government agencies [55,56]. The fact that “Social Media” is the most important topic in the technology category confirms this situation. Another intriguing finding in the technology category is that the topic “Virtual Reality” is ranked last. Virtual reality is one of the rapidly evolving, low-cost, and widely available technologies [57]. However, our findings indicate that virtual reality technology is only just getting started in the context of DT.
Employees play an important role in achieving the goal of DT in an organization [25]. As a result, understanding employee roles and skills is critical for DT. As a result, there is a direct relationship between employee digital literacy and the success of DT. Digital literacy is increasingly being used in practice and theory to address employees’ knowledge, skills, and ability to interact with digital technologies in this context [58,59]. Our discovery that the most important human category topic is “Digital Literacy” emphasizes this point. Human–machine collaboration, on the other hand, must be effectively ensured for a successful DT. In this regard, close collaboration between the IT and HR departments will significantly aid the organization’s DT [60]. As can be seen from this, the active role of human resources in DT necessitates placing the “HRM” topic at the top of the human category. The broad spectrum of DT’s applications indicates its widespread adoption in various industry and public domains. For the DT process to be carried out successfully in organizations, it is necessary to consider DT holistically, not only from a technological standpoint, but also by evaluating the process and human mechanisms.

5.3. Existing Trends in Digital Transformation from Past to Present

The COVID-19 pandemic has been a driving force for digital transformation, particularly in critical areas of society. In other words, the COVID-19 outbreak has accelerated the trend toward topics directly related to social life, such as “Sustainable Energy”, “DT in Health”, “E-Government”, “DT in Education”, and “Supply Chain” [61,62]. Energy, in particular, plays an important role in the global dimension and is central to many sustainable development goals that are closely related to societies [63]. As a result of the pandemic, the DT of energy has become critical for the development of sustainable energy systems [64]. One of the most effective ways to address current issues such as energy crises and climate change is to ensure the long-term viability of clean, renewable energy. All of these arguments have distinguished “Sustainable Energy” for researchers. The fact that the topic “Sustainable Energy” is the fastest growing topic can also explain the critical developments in the energy field.
Another significant finding of the study is slowdown in the momentum of “Social Media”, “Digital Technologies”, and “Big Data” topics over the last five years (see Figure 2). The increase in the diversity of DT’s research contexts and application areas in recent years has caused the core topics studied in the first years to be divided into sub-contexts, and as a result, a slowdown has been observed in such core topics of DT. Although there was a lot of emphasis on these topics in the early stages of DT, the COVID-19 outbreak resulted in a rapid decline in interest in these topics (see Appendix A, Table A2). Furthermore, “Industry 4.0” was identified as one of the low acceleration topics. Unlike previous industrial revolutions, Industry 4.0 has a technology-driven connection to the changes that have occurred [65]. As a result, it is believed that the low acceleration in the “Industry 4.0” topic is due to trends in post-COVID-19 technology topics [66].

6. Conclusions

This research aims to reveal the research and application topics of DT, as well as their temporal trends and hierarchical taxonomy. Semantic content analysis was performed on 5350 peer-reviewed journal articles specific to the field for this purpose, using the LDA topic modeling algorithm, which is commonly used in text mining. The research themes of the field, their taxonomy, and their evolution over time were determined using this analysis. According to the topic percentages and citation rates of topics in our study, transformation management holds an important place in DT research. As another finding, four subcategories of DT were identified: implementation, process, technology, and people. More specifically, our taxonomy findings revealed that nearly half of the studies covered DT implementations in various fields. Furthermore, contrary to expectations, our research found that there is little interest in virtual reality technology, which is one of the most popular technologies. Furthermore, when the number of topics, the frequency of occurrence of the topics, and citation rates are considered, it is possible to conclude that the human component of DT has not yet received sufficient attention. Our findings also show that internal stakeholders are generally focused on the human component, while research trends for external stakeholders are low. Furthermore, our research shows that in the last five years, researchers have become more interested in industrial applications of DT. In this respect, “Sustainable Energy”, “DT in Health”, “E-Government”, “DT in Education” and “Supply Chain” were identified as topics with the most increasing trend. Another important finding from our research is that DT research, particularly in energy, is still in its early stages and evolving rapidly.
This study, like all others, has limitations. In this study, only peer-reviewed journal articles were used. The study can be repeated in future studies by including different types of studies, such as books and conference papers from various databases. Furthermore, different topic modeling algorithms can be used to perform analyses, and the results can be compared. More experimental research is needed for comparison and validation of our results in the field of DT. It is expected that our methodology and findings will be useful in future research. Furthermore, this study is expected to serve as a guide for researchers to investigate potential opportunities in the field. One of the potential benefits of this study is that it will aid strategic decision-making processes by increasing practitioners’ awareness of DT.

Author Contributions

Conceptualization, F.G. and G.D.B.; methodology, G.G.M.D. and F.G.; software, G.G.M.D.; validation, G.G.M.D. and G.D.B.; investigation, G.G.M.D. and M.D.; resources, G.G.M.D. and M.D.; data curation, F.G. and G.D.B.; writing—original draft preparation, G.G.M.D. and M.D.; writing—review and editing, M.D.; visualization, F.G. and G.D.B.; supervision, M.D.; project administration, M.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available and can be shared upon request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Keywords and percentages of the topics.
Table A1. Keywords and percentages of the topics.
Topic NameKeywords%
Managementorganization management organizational process change knowledge practice project strategy transformation strategic manager leadership organization role theory support interview leader need6.13
DT in Economydigitalization digital development economy process technology economic information problem system management analysis state modern level enterprise main activity consider determine5.88
SDLCsystem process model design quality management framework develop support information development evaluation software tool implementation improve project assessment requirement integrate5.63
DT in Educationeducation learn student university higher educational teacher online school teach institution learning skill course teaching training technology e-learning support environment5.29
Social Mediasocial media communication society theory news journalism content community platform discourse political practice online interaction network change transformation actor argue4.19
DT in Healthhealth care healthcare patient medical clinical hospital medicine record disease implementation system service electronic include information outcome improve conclusion quality4.15
Performance Analysisperformance effect relationship impact factor model influence data analysis leadership positive test role structural variable equation affect sample orientation investigate3.79
Economic Policycountry economy economic policy european ict sector growth global regional region development level market eu trade analysis international index impact3.77
Business Modelbusiness model company strategy management industry customer market strategic process change competitive advantage innovation transformation sale experience corporate need product3.75
DT of SMEsme adoption maturity factor enterprise readiness model level small technology digitalization medium-sized develop barrier company success adopt support framework assess3.41
Digital Technologiesdigital technology transformation change information development role emerge society need age develop infrastructure create transform challenge digitization opportunity require innovation3.18
E-Governmentpublic government governance policy service sector administration legal e-government regulation law citizen tax audit state private need electronic national authority3.16
Industry 4.0industry industrial manufacturing manufacture production technology revolution company process product lean factory smart implementation system sector development environment manufacturer challenge3.12
Innovation Capabilityinnovation capability firm dynamic digitalization framework business technological knowledge process digital theory role resource organizational collaboration develop open disruptive managerial3.01
Sustainable Energysustainable enterprise sustainability development environmental energy impact efficiency green effect innovation improve mechanism promote economy economic model consumption achieve analysis2.91
E-Commercecustomer marketing online retail mobile consumer intention service experience user acceptance perceive app satisfaction quality brand market behavior retailer purchase2.80
Big Datadata big analytical analysis information collect mining source data-driven collection database predictive available decision analyze challenge generate open time large2.55
Internet of Thingsinternet iot technology thing blockchain smart agriculture application system agricultural sector data food challenge solution adoption farm sensor distribute agri -food2.47
COVID-19pandemic crisis change impact resilience time response global disruption world accelerate remote event emergency work increase rapid coronavirus situation challenge2.45
Service Providerservice value platform ecosystem creation model servitization network create customer provider proposition share design delivery framework co-creation capture actor role2.40
Smart Gridnetwork energy power system device communication application grid time control wireless sensor smart mobile frequency achieve demand filter electricity electric2.28
DT in Constructionconstruction twin bim engineering model management information building industry maintenance project application build control operation site water infrastructure technology modeling2.17
Digital Literacysurvey literacy perception professional interview qualitative data technology attitude group conduct analysis work level participant survey people gender social2.09
HRMwork employee human skill resource job management professional workplace change worker talent competency workforce accounting role demand labor capital individual2.09
Supply Chainchain supply management logistics circular procurement economy supplier industry sc technology integration food material information value firm dsc performance digitalization2.02
Artificial Intelligenceintelligence artificial machine automation learn technology process robotic knowledge human intelligent learning application system ml ethical cognitive robot deep decision1.96
Cybersecurityrisk security information cyber cybersecurity threat privacy gas oil management protection data asset system dx attack personal disaster organization control1.96
Cloud Servicescloud service architecture compute library enterprise information computing resource user algorithm framework application erp center global ea board solution web1.94
Virtual Realityvirtual reality design lab laboratory technology ar device augment game fashion vr tool interactive application experience simulation augmented interaction environment1.83
DT in Financefinancial bank banking fintech insurance service payment market finance risk institution transaction vietnam system technology sector commercial chatbot investment industry1.81
DT in Tourismtourism port sector maritime travel destination industry tourist hotel spanish hospitality ship spain startup brazilian international start-up sustainability container transport1.45
CIO-CDO Roles in DTcontrol information price officer report global chief market rm cio stock cost investment cdo switch warehouse condition firm stability return1.45
Smart Citysmart city urban rural pharmaceutical area development plan service spatial space village drug pss pattern human activity pharmacy citizen environment1.43
Cultural Entrepreneurshipentrepreneurship cultural entrepreneurial museum heritage art culture forensic entrepreneur story archive collection online work source experience evidence institution creation film1.43
Table A2. Annual percentages and trends of the topics.
Table A2. Annual percentages and trends of the topics.
Topic Name2013201420152016201720182019202020212022Trend
Sustainable Energy0.000.000.000.000.000.000.850.482.395.901.18
DT in Health8.336.256.6712.070.932.335.314.443.434.330.68
E-Government0.006.253.331.720.003.263.614.083.282.710.54
DT in Education8.330.006.675.172.783.265.734.805.825.420.53
Supply Chain0.006.253.330.000.000.930.642.042.312.470.49
Innovation Capability0.000.003.331.720.931.861.272.643.883.370.49
Performance Analysis0.000.000.000.002.781.402.762.524.035.180.48
COVID-190.000.000.003.450.001.860.853.603.062.110.42
DT of SME0.000.000.000.001.852.791.493.963.813.790.39
Digital Literacy0.006.253.330.000.932.790.641.921.792.830.38
Internet of Things0.000.000.003.450.932.331.492.882.542.650.34
DT in Construction8.330.000.000.000.931.402.341.922.242.470.31
E-Commerce8.330.0010.001.721.852.332.972.402.393.310.29
Management0.006.256.676.904.636.515.316.126.576.080.29
Artificial Intelligence0.006.250.000.000.931.862.121.202.242.230.26
Economic Policy0.000.000.001.722.780.934.254.083.884.030.25
HRM0.000.000.001.720.931.401.492.402.392.110.24
Service Provider8.330.000.000.001.852.332.342.521.942.890.21
DT in Tourism8.330.000.000.000.931.860.421.201.641.750.16
DT in Finance0.006.250.000.000.932.331.912.641.871.380.09
Cybersecurity0.000.000.003.451.851.861.492.521.722.050.04
SDLC0.000.006.671.726.486.514.885.405.526.08−0.08
Smart Grid16.676.250.005.172.782.791.491.922.462.23−0.11
CIO-CDO Roles in DT0.006.250.006.901.852.791.911.201.191.26−0.12
Virtual Reality0.000.000.000.002.781.863.181.081.492.17−0.12
DT in Economy0.000.000.000.004.635.129.138.166.423.97−0.13
Cultural Entrepreneurship8.3312.506.673.451.854.191.911.201.190.90−0.19
Smart City0.000.000.000.002.783.722.341.201.790.72−0.41
Cloud Services0.006.253.3310.344.633.261.911.681.941.38−0.65
Business Model0.000.000.0010.345.565.586.794.083.732.29−0.65
Industry 4.00.000.000.000.006.484.652.764.442.542.83−0.73
Big Data0.000.003.335.177.414.193.612.401.872.29−1.02
Digital Technologies0.006.2513.336.9010.195.585.523.002.911.75−1.69
Social Media25.0018.7523.336.9013.894.195.313.843.733.07−2.16
“In this table, the green colors indicate percentages above the topic’s average in a row, the red colors indicate percentages below the topic’s average, and their colors approach white as these annual values approach the average value”.

References

  1. Kraus, S.; Jones, P.; Kailer, N.; Weinmann, A.; Chaparro-Banegas, N.; Roig-Tierno, N. Digital Transformation: An Overview of the Current State of the Art of Research. SAGE Open 2021, 11, 21582440211047576. [Google Scholar] [CrossRef]
  2. Schallmo, D.R.A.; Williams, C.A. History of Digital Transformation. In Digital Transformation Now! Guiding the Successful Digitalization of Your Business Model; Spring: Berlin/Heidelberg, Germany, 2018; pp. 3–8. [Google Scholar]
  3. Vial, G. Understanding Digital Transformation: A Review and a Research Agenda. In Managing Digital Transformation: Understanding the Strategic Process; Routledge: Oxfordshire, UK, 2021; pp. 13–66. ISBN 9781003008637. [Google Scholar]
  4. Reis, J.; Amorim, M.; Melão, N.; Matos, P. Digital Transformation: A Literature Review and Guidelines for Future Research. In Proceedings of the World Conference on Information Systems and Technologies 2018; Springer: Cham, Switzerland; Berlin, Germany, 2018; pp. 411–421. [Google Scholar]
  5. Sava, J.A. Spending on Digital Transformation Technologies and Services Worldwide from 2017 to 2025. Available online: https://www.statista.com/statistics/870924/worldwide-digital-transformation-market-size/ (accessed on 1 April 2023).
  6. Verhoef, P.C.; Broekhuizen, T.; Bart, Y.; Bhattacharya, A.; Dong, J.Q.; Fabian, N.; Haenlein, M. Digital Transformation: A Multidisciplinary Reflection and Research Agenda. J. Bus. Res. 2021, 122, 889–901. [Google Scholar] [CrossRef]
  7. Holand, A.; Svadberg, S.; Breunig, K.J. Beyond the Hype: A Bibliometric Analysis Deconstructing Research on Digitalization. Technol. Innov. Manag. Rev. 2019, 9, 38–50. [Google Scholar] [CrossRef]
  8. Zhu, X.; Ge, S.; Wang, N. Digital Transformation: A Systematic Literature Review. Comput. Ind. Eng. 2021, 162, 260–265. [Google Scholar] [CrossRef]
  9. Aggarwal, C.C.; Zhai, C.X. Mining Text Data; Springer: New York, NY, USA, 2013. [Google Scholar]
  10. Blei, D.M. Probabilistic Topic Models. Commun. ACM 2012, 55, 77–84. [Google Scholar] [CrossRef]
  11. Gurcan, F.; Cagiltay, N.E. Exploratory Analysis of Topic Interests and Their Evolution in Bioinformatics Research Using Semantic Text Mining and Probabilistic Topic Modeling. IEEE Access 2022, 10, 31480–31493. [Google Scholar] [CrossRef]
  12. Gurcan, F.; Dalveren, G.G.M.; Cagiltay, N.E.; Soylu, A. Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling. IEEE Access 2022, 10, 74638–74654. [Google Scholar] [CrossRef]
  13. Benavides, L.M.C.; Arias, J.A.T.; Serna, M.D.A.; Bedoya, J.W.B.; Burgos, D. Digital Transformation in Higher Education Institutions: A Systematic Literature Review. Sensors 2020, 20, 3291. [Google Scholar] [CrossRef]
  14. Olanipekun, A.O.; Sutrisna, M. Facilitating Digital Transformation in Construction—A Systematic Review of the Current State of the Art. Front. Built Environ. 2021, 7, 660758. [Google Scholar] [CrossRef]
  15. Martín-Peña, M.L.; Sánchez-López, J.M.; Díaz-Garrido, E. Servitization and Digitalization in Manufacturing: The Influence on Firm Performance. J. Bus. Ind. Mark. 2020, 35, 564–574. [Google Scholar] [CrossRef]
  16. Hanelt, A.; Bohnsack, R.; Marz, D.; Antunes Marante, C. A Systematic Review of the Literature on Digital Transformation: Insights and Implications for Strategy and Organizational Change. J. Manag. Stud. 2021, 58, 1159–1197. [Google Scholar] [CrossRef]
  17. Facin, A.L.F.; Barbosa, A.P.F.P.L.; Matsumoto, C.; da Gama Cruz, A.F.S.; Salerno, M.S. Featured Topics in Research on Digital Transformation: Evidence from a Bibliometric Study and Content Analysis. RAE Rev. Adm. Empresas 2022, 62, 1–22. [Google Scholar] [CrossRef]
  18. Lopez, B.; Wilfredo, V. Digital Transformation in Crisis Situations. Literature Review Using Topic Modeling and Grounded Theory. Cuad. Adm. 2022, 35, 01203592. [Google Scholar] [CrossRef]
  19. Lee, C.H.; Liu, C.L.; Trappey, A.J.C.; Mo, J.P.T.; Desouza, K.C. Understanding Digital Transformation in Advanced Manufacturing and Engineering: A Bibliometric Analysis, Topic Modeling and Research Trend Discovery. Adv. Eng. Inform. 2021, 50, 101428. [Google Scholar] [CrossRef]
  20. Schallmo, D.R.A.; Williams, C.A. History of Digital Transformation. Int. J. Innov. Manag. 2017, 21, 1–17. [Google Scholar] [CrossRef]
  21. Gong, C.; Ribiere, V. Developing a Unified Definition of Digital Transformation. Technovation 2021, 102, 102217. [Google Scholar] [CrossRef]
  22. Savastano, M.; Amendola, C.; Bellini, F.; D’Ascenzo, F. Contextual Impacts on Industrial Processes Brought by the Digital Transformation of Manufacturing: A Systematic Review. Sustainability 2019, 11, 891. [Google Scholar] [CrossRef]
  23. Chawla, R.N.; Goyal, P. Emerging Trends in Digital Transformation: A Bibliometric Analysis. Benchmarking 2022, 29, 1069–1112. [Google Scholar] [CrossRef]
  24. Verina, N.; Titko, J. Digital Transformation: Conceptual Framework. In Proceedings of the International Scientific Conference Contemporary Issues in Business, Management and Economics Engineering’ 2019, Vilnius, Lithuania, 9–10 May 2019. [Google Scholar] [CrossRef]
  25. Kane, G. The Technology Fallacy: People Are the Real Key to Digital Transformation. Res. Technol. Manag. 2019, 62, 44–49. [Google Scholar] [CrossRef]
  26. Basu, S. Manufacturing and Factory Automation. In Plant Intelligent Automation and Digital Transformation: Volume I: Process and Factory Automation; Academic Press: Cambridge, MA, USA, 2022; pp. 243–272. [Google Scholar]
  27. Costich, T. What Is Digital Transformation and What Is It Not? In Excelling on a Digital Transformation Journey: A Field Guide to Help You Define Success; ASQ Quality Press: Milwaukee, WI, USA, 2021; pp. 3–9. [Google Scholar]
  28. Romero, D.; Flores, M.; Herrera, M.; Resendez, H. Five Management Pillars for Digital Transformation Integrating the Lean Thinking Philosophy. In Proceedings of the 25th IEEE International Conference on Engineering, Technology and Innovation, ICE/ITMC 2019, Sophia Antipolis, France, 17–19 June 2019. [Google Scholar] [CrossRef]
  29. Pearce, A. People, Process, and Technology: 3 Steps to Digital Transformation. Available online: https://www.plutora.com/blog/people-process-technology-ppt-framework-explained (accessed on 1 April 2023).
  30. Gebayew, C.; Hardini, I.R.; Panjaitan, G.H.A.; Kurniawan, N.B.; Suhardi. A Systematic Literature Review on Digital Transformation. In Proceedings of the 2018 International Conference on Information Technology Systems and Innovation (ICITSI), Bandung, Indonesia, 22–26 October 2018; pp. 260–265. [Google Scholar]
  31. Henriette, E.; Feki, M.; Boughzala, I. Association for Information Systems AIS Electronic Library (AISeL) The Shape of Digital Transformation: A Systematic Literature Review. In Proceedings of the 9th Mediterranean Conference on Information Systems, Samos, Greece, 3–5 October 2015; Volume 10. Available online: https://aisel.aisnet.org/mcis2015/10 (accessed on 1 April 2023).
  32. Teichert, R. Digital Transformation Maturity: A Systematic Review of Literature. Acta Univ. Agric. Silvic. Mendel. Brun. 2019, 67, 1673–1687. [Google Scholar] [CrossRef]
  33. Trujillo Valdiviezo, G.; Rodríguez Alegre, L.; Mejía Ayala, D.; Padilla, L.; Pilar, R. Del Digital Transformation in Latin America: A Systematic Review. Rev. Venez. Gerenc. 2022, 27, 1519–1536. [Google Scholar] [CrossRef]
  34. Cortellazzo, L.; Bruni, E.; Zampieri, R. The Role of Leadership in a Digitalized World: A Review. Front. Psychol. 2019, 10, 1938. [Google Scholar] [CrossRef] [PubMed]
  35. Zhou, X.; Zhang, C.; Pan, Z.; Wei, P. Tourism Industry Is Leading the Digital Transformation of Cultural Heritage Management: Bibliometric Analysis Based on Web of Science Database. In Proceedings of the 2020 Management Science Informatization and Economic Innovation Development Conference (MSIEID), Guangzhou, China, 18–20 December 2020; pp. 273–276. [Google Scholar] [CrossRef]
  36. Gurcan, F.; Dalveren, G.G.M.; Cagiltay, N.E.; Roman, D.; Soylu, A. Evolution of Software Testing Strategies and Trends: Semantic Content Analysis of Software Research Corpus of the Last 40 Years. IEEE Access 2022, 10, 106093–106109. [Google Scholar] [CrossRef]
  37. Campbell, J.C.; Hindle, A.; Stroulia, E. Latent Dirichlet Allocation: Extracting Topics from Software Engineering Data. In The Art and Science of Analyzing Software Data; Elsevier: Amsterdam, The Netherlands, 2015; pp. 139–159. [Google Scholar] [CrossRef]
  38. Vayansky, I.; Kumar, S.A.P. A Review of Topic Modeling Methods. Inf. Syst. 2020, 94, 101582. [Google Scholar] [CrossRef]
  39. Gurcan, F.; Cagiltay, N.E.; Cagiltay, K. Mapping Human–Computer Interaction Research Themes and Trends from Its Existence to Today: A Topic Modeling-Based Review of Past 60 Years. Int. J. Hum. Comput. Interact. 2021, 37, 267–280. [Google Scholar] [CrossRef]
  40. Aziz, S.; Dowling, M.; Hammami, H.; Piepenbrink, A. Machine Learning in Finance: A Topic Modeling Approach. Eur. Financ. Manag. 2022, 28, 744–770. [Google Scholar] [CrossRef]
  41. Chen, H.; Zhang, Y.; Jin, Q.; Wang, X. Exploring Patterns of Academic-Industrial Collaboration for Digital Transformation Research: A Bibliometric-Enhanced Topic Modeling Method. In Proceedings of the 2022 Portland International Conference on Management of Engineering and Technology (PICMET), Portland, OR, USA, 7–11 August 2022; pp. 1–9. [Google Scholar] [CrossRef]
  42. Sahoo, B.; Kumar, A. Indexing and Abstracting Bibliographic Electronic Database: A Comparative Analysis. Int. J. Inf. Dissem. Technol. 2018, 8, 99. [Google Scholar] [CrossRef]
  43. Konrad, M. Text Mining and Topic Modeling Toolkit. Available online: https://pypi.org/project/tmtoolkit/ (accessed on 21 January 2022).
  44. Gurcan, F.; Cagiltay, N.E. Research Trends on Distance Learning: A Text Mining-Based Literature Review from 2008 to 2018. Interact. Learn. Environ. 2020, 31, 1007–1028. [Google Scholar] [CrossRef]
  45. Ozyurt, O.; Gurcan, F.; Dalveren, G.G.M.; Derawi, M. Career in Cloud Computing: Exploratory Analysis of In-Demand Competency Areas and Skill Sets. Appl. Sci. 2022, 12, 9787. [Google Scholar] [CrossRef]
  46. Mimno, D.; Wallach, H.M.; Talley, E.; Leenders, M.; McCallum, A. Optimizing Semantic Coherence in Topic Models. In Proceedings of the EMNLP 2011—Conference on Empirical Methods in Natural Language Processing, Edinburgh, UK, 27–31 July 2011. [Google Scholar]
  47. Kapidani, N.; Bauk, S.; Davidson, I.E. Digitalization in Developing Maritime Business Environments towards Ensuring Sustainability. Sustain. 2020, 12, 9235. [Google Scholar] [CrossRef]
  48. Gurcan, F.; Ozyurt, O. Emerging Trends and Knowledge Domains in E-Learning Researches: Topic Modeling Analysis with the Article Published between 2008-2018. J. Comput. Educ. Res. 2020, 8, 738–756. [Google Scholar] [CrossRef]
  49. Shkarlet, S.; Dubyna, M.; Shtyrkhun, K.; Verbivska, L. Transformation of the Paradigm of the Economic Entities Development in Digital Economy. WSEAS Trans. Environ. Dev. 2020, 16, 413–422. [Google Scholar] [CrossRef]
  50. Gurcan, F.; Dalveren, G.G.M.; Derawi, M. COVID-19 and E-Learning: An Exploratory Analysis of Research Topics and Interests in E-Learning During the Pandemic. IEEE Access 2022, 10, 123349–123357. [Google Scholar] [CrossRef]
  51. Stjepić, A.-M.; Ivančić, L.; Vugec, D.S. Mastering Digital Transformation through Business Process Management: Investigating Alignments, Goals, Orchestration, and Roles. J. Entrep. Manag. Innov. 2020, 16, 41–74. [Google Scholar] [CrossRef]
  52. Laato, S.; Mäntymäki, M.; Islam, A.K.M.N.; Hyrynsalmi, S.; Birkstedt, T. Trends and Trajectories in the Software Industry: Implications for the Future of Work. Inf. Syst. Front. 2023, 25, 929–944. [Google Scholar] [CrossRef]
  53. Gurcan, F.; Sevik, S. Expertise Roles and Skills Required by the Software Development Industry. In Proceedings of the 1st International Informatics and Software Engineering Conference: Innovative Technologies for Digital Transformation, Ankara, Turkey, 6–7 November 2019. [Google Scholar]
  54. Ferrer, E.; Bousoño, C.; Jorge, J.; Lora, L.; Miranda, E.; Natalizio, N. Enriching Social Capital and Improving Organizational Performance in the Age of Social Networking. Bus. Manag. 2013, 5, 94–281. [Google Scholar]
  55. Jorge, L.F.; Mosconi, E.; Santa-Eulalia, L.A. Enterprise Social Media Platforms for Coping with an Accelerated Digital Transformation. J. Syst. Inf. Technol. 2022, 24, 221–245. [Google Scholar] [CrossRef]
  56. Yuan, Y.-P.; Dwivedi, Y.K.; Tan, G.W.-H.; Cham, T.-H.; Ooi, K.-B.; Aw, E.C.-X.; Currie, W. Government Digital Transformation: Understanding the Role of Government Social Media. Gov. Inf. Q. 2022, 40, 101775. [Google Scholar] [CrossRef]
  57. Bellalouna, F. Industrial Case Studies for Digital Transformation of Engineering Processes Using the Virtual Reality Technology. Procedia CIRP 2020, 90, 636–641. [Google Scholar] [CrossRef]
  58. Cetindamar, D.; Abedin, B.; Shirahada, K. The Role of Employees in Digital Transformation: A Preliminary Study on How Employees’ Digital Literacy Impacts Use of Digital Technologies. IEEE Trans. Eng. Manag. 2021, 1–12. [Google Scholar] [CrossRef]
  59. Khan, F.; Vuopala, E. Digital Competence Assessment across Generations: A Finnish Sample Using the Digcomp Framework. Int. J. Digit. Lit. Digit. Competence 2019, 10, 15–28. [Google Scholar] [CrossRef]
  60. Fenech, R.; Baguant, P.; Ivanov, D. The Changing Role of Human Resource Management in an Era of Digital Transformation. J. Manag. Inf. Decis. Sci. 2019, 22, 176–180. [Google Scholar]
  61. Delardas, O.; Kechagias, K.S.; Pontikos, P.N.; Giannos, P. Socio-Economic Impacts and Challenges of the Coronavirus Pandemic (COVID-19): An Updated Review. Sustainability 2022, 14, 9699. [Google Scholar] [CrossRef]
  62. Yu, Z.; Razzaq, A.; Rehman, A.; Shah, A.; Jameel, K.; Mor, R.S. Disruption in Global Supply Chain and Socio-Economic Shocks: A Lesson from COVID-19 for Sustainable Production and Consumption. Oper. Manag. Res. 2022, 15, 233–248. [Google Scholar] [CrossRef]
  63. Kabeyi, M.J.B.; Olanrewaju, O.A. Sustainable Energy Transition for Renewable and Low Carbon Grid Electricity Generation and Supply. Front. Energy Res. 2022, 9, 1032. [Google Scholar] [CrossRef]
  64. Zhang, S.; Ma, X.; Cui, Q. Assessing the Impact of the Digital Economy on Green Total Factor Energy Efficiency in the Post-COVID-19 Era. Front. Energy Res. 2021, 9, 798922. [Google Scholar] [CrossRef]
  65. Rangaswamy, U.S. Industry 4.0 Disruption: Assessing the Need for Adaptive Capability. Dev. Learn. Organ. 2021, 35, 7–10. [Google Scholar] [CrossRef]
  66. Asokan, D.R.; Huq, F.A.; Smith, C.M.; Stevenson, M. Socially Responsible Operations in the Industry 4.0 Era: Post-COVID-19 Technology Adoption and Perspectives on Future Research. Int. J. Oper. Prod. Manag. 2022, 42, 185–217. [Google Scholar] [CrossRef]
Figure 1. An overview of our methodology.
Figure 1. An overview of our methodology.
Sustainability 15 07496 g001
Figure 2. Top ten topics with the most increasing and decreasing trends.
Figure 2. Top ten topics with the most increasing and decreasing trends.
Sustainability 15 07496 g002
Table 1. Annual distribution of DT articles.
Table 1. Annual distribution of DT articles.
YearN%
2022186534.86
2021149928.02
202093317.44
20195229.76
20182444.56
20171212.26
2016771.44
2015410.77
2014260.49
2013220.41
Total5350100.00
Table 2. The most productive publication sources.
Table 2. The most productive publication sources.
Source TitleN%
Sustainability Switzerland2885.38
Technological Forecasting and Social Change681.27
IEEE Access611.14
Journal of Business Research561.05
Applied Sciences Switzerland520.97
Frontiers in Psychology380.71
IEEE Transactions on Engineering Management300.56
International Journal of Environmental Research and Public Health270.50
Sensors270.50
Journal of Medical Internet Research260.49
International Journal of Innovation Management250.47
Technology in Society250.47
Energies240.45
Journal of Open Innovation Technology Market and Complexity240.45
Journal of Manufacturing Technology Management230.43
Table 3. Distribution of DT articles by subject areas.
Table 3. Distribution of DT articles by subject areas.
Subject AreaN%
Business, Management, and Accounting187635.07
Social Sciences169231.63
Computer Science166631.14
Engineering146727.42
Economics, Econometrics, and Finance58911.01
Environmental Science54010.09
Decision Sciences4748.86
Energy4358.13
Medicine3626.77
Mathematics2304.30
Materials Science2073.87
Arts and Humanities1983.70
Psychology1863.48
Physics and Astronomy1673.12
Chemical Engineering1162.17
Biochemistry, Genetics, and Molecular Biology1061.98
Chemistry801.50
Earth and Planetary Sciences801.50
Agricultural and Biological Sciences791.48
Health Professions741.38
Multidisciplinary400.75
Pharmacology, Toxicology, and Pharmaceutics320.60
Nursing240.45
Immunology and Microbiology190.36
Neuroscience160.30
Dentistry50.09
Veterinary40.07
Table 4. Top 30 most used keywords.
Table 4. Top 30 most used keywords.
KeywordsN%
Digital Transformation251446.99
Digitalization5279.85
Industry 4.04318.06
Digitization3336.22
Artificial Intelligence3025.64
Innovation2805.23
COVID-192534.73
Digital Technologies2464.60
Metadata2123.96
Sustainability2033.79
Manufacturing1883.51
Internet of Things1803.36
Big Data1793.35
Decision Making1663.10
Digital Economy1572.93
Sustainable Development1522.84
Digital Technology1432.67
Machine Learning1302.43
Pandemic1061.98
Information Technology1051.96
Blockchain1041.94
E-learning1041.94
Higher Education1031.93
Automation941.76
Competition911.70
Information Management891.66
Education831.55
Health Care831.55
Social Media811.51
Technological Development791.48
Table 5. The 34 topics discovered by LDA.
Table 5. The 34 topics discovered by LDA.
NoTopic NameRate (%)No Topic NameRate (%)
1Management6.1318Internet of Things2.47
2DT in Economy5.8819COVID-192.45
3SDLC5.6320Service Provider2.40
4DT in Education5.2921Smart Grid2.28
5Social Media4.1922DT in Construction2.17
6DT in Health4.1523Digital Literacy2.09
7Performance Analysis3.7924HRM2.09
8Economic Policy3.7725Supply Chain2.02
9Business Model3.7526Artificial Intelligence1.96
10DT of SME3.4127Cybersecurity1.96
11Digital Technologies3.1828Cloud Services1.94
12E-Government3.1629Virtual Reality1.83
13Industry 4.03.1230DT in Finance1.81
14Innovation Capability3.0131DT in Tourism1.45
15Sustainable Energy2.9132CIO-CDO Roles in DT1.45
16E-Commerce2.8033Smart City1.43
17Big Data2.5534Cultural Entrepreneurship1.43
Table 6. Citation rates of the topics.
Table 6. Citation rates of the topics.
NoTopic NameCR (%)No Topic NameCR (%)
1Management9.6718Economic Policy2.37
2Business Model8.4319Service Provider2.32
3Industry 4.06.5320DT in Construction1.84
4Digital Technologies6.3121Big Data1.83
5Innovation Capability5.4122CIO-CDO Roles in DT1.67
6COVID-194.1623Smart Grid1.63
7DT in Health4.0824Artificial Intelligence1.58
8Social Media3.9925Sustainable Energy1.47
9SDLC3.6026HRM1.35
10DT in Education3.5227Smart City1.24
11Performance Analysis3.4528Cybersecurity1.17
12DT of SME3.4429DT in Finance1.08
13Internet of Things2.9230Cloud Services1.05
14E-Government2.6131Virtual Reality0.96
15DT in Economy2.5832Cultural Entrepreneurship0.93
16Supply Chain2.5833DT in Tourism0.91
17E-Commerce2.5634Digital Literacy0.74
Table 7. Taxonomy of the DT topics.
Table 7. Taxonomy of the DT topics.
NoSub-CategoryTopicRate %Total %
1ImplementationDT in Economy5.8845.76
DT in Education5.29
DT in Health4.15
DT of SME3.41
E-Government3.16
Industry 4.03.12
Sustainable Energy2.91
E-Commerce2.80
COVID-192.45
Smart Grid2.28
DT in Construction2.17
Supply Chain2.02
DT in Finance1.81
DT in Tourism1.45
Smart City1.43
Cultural Entrepreneurship1.43
2ProcessManagement6.1326.08
SDLC5.63
Performance Analysis3.79
Economic Policy3.77
Business Model3.75
Innovation Capability3.01
3TechnologySocial Media4.1922.48
Digital Technologies3.18
Big Data2.55
Internet of Things2.47
Service Provider2.40
Artificial Intelligence1.96
Cybersecurity1.96
Cloud Computing1.94
Virtual Reality1.83
4PeopleDigital Literacy2.095.63
HRM2.09
CIO-CDO Roles in DT1.45
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gurcan, F.; Boztas, G.D.; Dalveren, G.G.M.; Derawi, M. Digital Transformation Strategies, Practices, and Trends: A Large-Scale Retrospective Study Based on Machine Learning. Sustainability 2023, 15, 7496. https://doi.org/10.3390/su15097496

AMA Style

Gurcan F, Boztas GD, Dalveren GGM, Derawi M. Digital Transformation Strategies, Practices, and Trends: A Large-Scale Retrospective Study Based on Machine Learning. Sustainability. 2023; 15(9):7496. https://doi.org/10.3390/su15097496

Chicago/Turabian Style

Gurcan, Fatih, Gizem Dilan Boztas, Gonca Gokce Menekse Dalveren, and Mohammad Derawi. 2023. "Digital Transformation Strategies, Practices, and Trends: A Large-Scale Retrospective Study Based on Machine Learning" Sustainability 15, no. 9: 7496. https://doi.org/10.3390/su15097496

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop