Next Article in Journal
Digital Finance and Corporate Sustainability Performance: Promoting or Restricting? Evidence from China’s Listed Companies
Previous Article in Journal
A Novel Fractional-Order Active Disturbance Rejection Load Frequency Control Based on An Improved Marine Predator Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Business Intelligence Strategies, Best Practices, and Latest Trends: Analysis of Scientometric Data from 2003 to 2023 Using Machine Learning

1
Department of Management Information Systems, Faculty of Economics and Administrative Sciences, Karadeniz Technical University, Trabzon 61080, Turkey
2
Digital Transformation Office, Karadeniz Technical University, Trabzon 61080, Turkey
3
Department of Software Engineering, Faculty of Engineering, Atilim University, Ankara 06830, Turkey
4
Department of Electronic Systems, Faculty of Information Technology and Electrical Engineering, Norwegian University of Science and Technology, 7034 Gjøvik, Norway
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(13), 9854; https://doi.org/10.3390/su15139854
Submission received: 30 May 2023 / Revised: 17 June 2023 / Accepted: 18 June 2023 / Published: 21 June 2023
(This article belongs to the Section Sustainable Management)

Abstract

:
The widespread use of business intelligence products, services, and applications piques the interest of researchers in this field. The interest of researchers in business intelligence increases the number of studies significantly. Identifying domain-specific research patterns and trends is thus a significant research problem. This study employs a topic modeling approach to analyze domain-specific articles in order to identify research patterns and trends in the business intelligence field over the last 20 years. As a result, 36 topics were discovered that reflect the field’s research landscape and trends. Topics such as “Organizational Capability”, “AI Applications”, “Data Mining”, “Big Data Analytics”, and “Visualization” have recently gained popularity. A systematic taxonomic map was also created, revealing the research background and BI perspectives based on the topics. This study may be useful to researchers and practitioners interested in learning about the most recent developments in the field. Topics generated by topic modeling can also be used to identify gaps in current research or potential future research directions.

1. Introduction

Business intelligence (BI) is a discipline that combines business analytics, data mining, data visualization, data reporting, data tools, infrastructure, and best practices to empower organizations to make more data-driven decisions. Business operations and processes can generate massive amounts of structured data, such as emails, memos, call center notes, news, user groups, chats, reports, corporate documentation, web pages, presentations, image files, video files, and marketing materials [1,2]. In practice, to have modern BI, organizations must have a comprehensive data infrastructure and be able to use this data effectively to drive development and competition, eliminate inefficiencies, and quickly adapt to market or supply changes [3,4]. Businesses and organizations have questions and goals. To answer these questions and define the roadmap towards these goals, they collect and analyze the necessary data using BI functions and determine what actions to take to achieve these goals [3,5,6,7]. One of the most important achievements of BI is to support the sustainable development of enterprises. The unequal distribution of resources creates a serious competitive inequality between large and small-scale enterprises in the context of sustainable development and makes it difficult to implement BI effectively in all enterprises [8].
Common functions of BI applications include data mining, data visualization and reporting, business performance metrics and benchmarking, online analytics processing, descriptive analytics, dashboard development, and predictive analytics [1,9]. BI uses these processes to transform raw data into meaningful and valuable information to ensure more effective strategic, tactical, and operational insights and decision-making. BI can provide essential insights to support various business decisions, from operational to strategic [3,9].
Although BI has a very modern definition similar to the one above, it should be emphasized that BI, as a fashionable word, has a suppressed past. Traditional BI emerged in the 1960s as a system of information sharing between organizations. Luhn touched upon the need for BI systems, emphasizing the importance of automating information categorization, retrieval, and dissemination within the organization [10]. In 1989, Howard Dresner proposed the concept of BI as an umbrella term. He defined it as “concepts and methods for improving business decisions using information and information technology-based support systems” [11]. This definition has become increasingly common since the late 1990s. The rapid transformation in information and communication technologies in recent years has enabled businesses to adopt BI more [12].
Thus, services, tools, and BI applications are more frequently being used every day. With the spread of BI, the interest of researchers in this field has increased day by day. This increasing research interest in BI has also led to a significant increase in academic research in this field [12]. The number of publications, only 45 in 2003, has increased exponentially in recent years and will reach over 300 in 2022. Scientific literature is a rich source of information, especially for dynamic fields such as BI, where ever-changing research and applications are indexed. Analysis of the scientific literature can provide important insights to better understand the research background, implementations, scope, development, and future trends of the BI field. However, since there are many studies in the BI field, it is difficult to analyze all these studies and to create an epistemological research map of the field with traditional systematic review methods [5,12,13,14]. This situation is because there are thousands of field-specific studies in the literature. In this context, using automated text mining procedures based on machine learning can provide an effective methodology for analyzing the scientific literature in this field.
Several attempts to analyze BI studies using this approach have been made in the literature [12,15,16]. Despite these commendable efforts, machine learning-based topic modeling approaches are still in limited use to reveal a comprehensive map of the BI research landscape. This study aims to fill this gap in the literature through a semantic content analysis based on a topic modeling approach applied to 3749 peer-reviewed journal articles published in the last 20 years in the BI field between 2003 and 2022. Specifically, the methodology of this study is designed to seek answers to the following research questions (RQ) based on the previous 20 years of the BI field:
RQ1. What are the bibliometric features of BI research?
RQ2. What are the emerging topics in BI research?
RQ3. How have BI topics changed in the last 20 years?
RQ4. What is the taxonomy of BI topics?

2. Research Background and Related Work

The concept of “BI systems” was introduced by H. P Luhn in 1958 [10]. Luhn touched upon the need for these systems and the importance of automating the organization’s sorting, retrieval, and dissemination of information. He also argued that systems could be developed that accepts information, promptly transmit data and information to users, and respond to related requests. However, he reported that the institution, with its communication infrastructure and input-output units, is a living system that focuses on solving all information-related problems [10]. Václav et al. stated that the primary purpose of BI systems is to obtain information that will help decision-making by filtering it from valuable data sets [17].
The perspective on BI systems in today’s studies is also fed from previous studies. Chen & Lin consider BI systems as providing the transformation from data to information or knowledge to provide solutions for corporate applications [18]. At this point, these authors reported that these systems could be advanced decision support systems. Combita Niño et al. stated that the primary purpose of BI implementations is to ensure the effectiveness of information in the strategies and targets to be followed in order to provide a competitive advantage [19]. Muntean et al. defined BI systems as tools used for processing raw data and information to make decisions about the future. They reported that BI is to turn to the future through the question, “What happened in the past?” [20].
Various studies have tried to identify BI research’s bibliometric features and trends [12]. These studies are generally based on bibliometric analysis. For example, Purnomo et al. analyzed 244 studies published between 1975 and 2020 in the Scopus database and grouped BI research into six themes. They named their theme titles “Business analytics”, “Element of marketing”, “Competitive intelligence”, “Intelligence system”, “Industry”, and “Business” [21]. In another study, Dorti & Akdemir discovered research trends by analyzing the keywords of articles on BI from the Web of Science database [22]. Similarly, Vanani & Jalali aimed to examine research trends by focusing on titles, abstracts, and keywords in BI research. For this purpose, they used the scientometric method “burst detection” algorithm to represent emerging and disappearing trends in the BI field between 1980 and 2014 [23]. In another study, Zou et al. [24] conducted a bibliometric analysis to evaluate the annual amount of publications, the trend in the research, and citation variables of BI research between 1997 and 2017. The results revealed that the “Expert Systems with Applications” journal stands out in BI research, and the United States of America leads in the number of publications. In addition, “cloud computing” is the main keyword in hot research topics. Today, BI software is used in almost all sectors, including finance, telecommunications, retail, energy, public, and health. Therefore, studies examining the use of BI in different sectors are also striking. For example, Brum et al. conducted a bibliometric analysis of studies investigating the current use of BI technologies in the agricultural sector [25]. There are also studies examining BI with different trends. For example, Liang & Liu used bibliometric analysis to explore research trends in “Big Data” and “BI” between 1990 and 2017. They also applied common word analysis on keywords to show the distribution of topics over time. The most used keywords were determined as “data mining”, “social media”, and “information system”. However, the keywords “cloud computing”, “data warehouse”, and “knowledge management” were used more after 2016 [5].
Although bibliometric analysis studies provide the general lines of research, these studies are insufficient for machine learning-based semantic content analysis of extensive textual collections [26,27]. One way to solve this problem is to use topic modeling, a text-mining technique based on machine learning. Topic modeling analysis has been used effectively and widely in recent years to define the scope of a field [28]. It is noteworthy that there are few topic modeling studies on BI. For example, Gallinucci et al. [29] applied the topic modeling method to social BI research. However, this study was limited as it only included “social BI” research. In another study, Moro et al. [15] used the LDA method, a topic modeling approach, in 2019 research published between 2002 and 2013 to identify trends in BI applications in the banking sector. Although this study uses topic modeling, it does not reflect the holistic view of the BI field as it is directed toward BI implementations in the banking sector. Only one study focuses directly on BI research and applies the topic modeling method.
Previous BI research was generally examined through bibliometric methods. Although keyword analysis is often applied to reveal trends in BI research, this research offers a limited view of the field or analyzes different sub-contexts of BI. As far as we know, no study applied topic modeling to BI research except for [16]. Therefore, this study aims to analyze BI research by topic modeling to fill this gap in the literature and present a broader picture of research and applications in the field. Therefore, this study will contribute valuably to the BI literature.

3. Method

The study method is organized under three sub-headings: data collection, data preprocessing, and implementing topic modeling on the empirical corpus.

3.1. Data Collection

Creating an empirical corpus for a textual content analysis based on topic modeling is one of the most critical steps [26,30]. The operations applied during corpus creation directly affect the analysis performance [31]. First, to create an experimental corpus about BI, a query was made in the Scopus database in the form of “BI” in the title, summary, and keyword fields. Scopus is the most comprehensive bibliometric database covering more than 7000 publishers and 240 disciplines, including Elsevier, Emerald, IEEE, Sage, Springer, Taylor & Francis, and Wiley Blackwell [26,32]. Only research articles and review articles in English were included in the corpus of the studies filtered by this query. Other studies, such as editorial materials and book reviews, were excluded from the dataset. The filtered articles were extracted from the Scopus database, and an experimental data set was created that includes data such as bibliometric indicators, author keywords, abstract, and title of each article [26,32]. This empirical BI corpus includes 3749 peer-reviewed journal articles (3487 research papers and 262 reviews) published in the last 20 years between 2003 and 2022.

3.2. Data Preprocessing

Data preprocessing is a fundamental task that directly affects the success of the analysis, especially for text mining and natural language processing studies [33]. The data mining process does not proceed in a fixed order, and some processes go back and forth between each stage [13]. In the first step of data preprocessing, each word was converted to lowercase, and tags, web links, punctuation, publisher information, symbols, and numeric information in the dataset was deleted. In the second stage, the word tokenization was applied to decompose each article text into individual terms. In the third stage, all English stop words (how what, and, is, or, the, a, an, for, etc.) that made no sense on their own were deleted [34]. In the fourth stage, the remaining words were reduced from their derived form to their plain form by applying the Lemmatization process [26]. Finally, each article in the dataset was converted into a word vector using the “word bag” approach to provide a numerical representation of the words in the corpus [35]. Accordingly, these vectors were combined to form a document-term matrix representing the entire corpus and providing the necessary matrix form for topic modeling analysis [33,36,37].

3.3. Fitting and Implementation of Topic Modeling

The topic modeling method is used to reduce large volumes of data to a small number of topics and to discover hidden topics [38]. Topic modeling is a text mining tool that has been used frequently recently [13,27,39,40]. Its purpose is to detect confidential topics covered in a collection of documents automatically. The document production process is reversed, and the document-topic and topic-word distributions are revealed with the help of observable documents and the words that make them up. There are many subject modeling algorithms in the literature. However, Latent Dirichlet Allocation (LDA), which forms the basis of topic modeling algorithms and is widely used, stands out among these algorithms [31]. Therefore, LDA is a more preferred topic modeling approach in research [13,36]. Sing an unsupervised machine learning method, LDA needs no predetermined label or training set [31]. The LDA algorithm can be applied to discover semantic patterns in an extensive collection of documents [31,41]. In this study, the LDA algorithm was adopted for topic modeling analysis.
We used the tmtoolkit package for text preprocessing and topic modeling to adapt and apply the LDA-based topic modeling procedure to the preprocessed BI corpus [34]. In order to fit the LDA model to the empirical corpus, firstly, the values of the prior parameters that allow the optimization of the model were determined [13]. Considering the prior parameter values recommended for topic modeling of short texts in previous studies, the α parameter that determines the topic distribution in the documents was chosen as α = 0.1, and the β parameter that determines the word distribution in the topics was β = 0.01 [13,36]. Next, the LDA model was adapted for the number of K topics ranging from 10 to 60 to determine the optimal number of topics (K) that best depicted the BI research landscape. A consistency measure (CV) was calculated for each trial run with each K value using the semantic consistency approach during this process [42]. Four field experts evaluated each K topic number, distribution, and semantic consistency of discovered topics. As a result of this evaluation, the topics that define the BI research field at the most optimal level were obtained with the model adapted to the number of topics K = 36. After examining the consistency of the topics, two field experts determined and assigned each topic’s label, considering the topics’ descriptive keywords. In addition, the percentage distribution per topic document, the percentages of descriptive words of each topic, and the percentages of the topics in the entire corpus were calculated [43]. At the end of this process, 20 representative keywords with the highest frequency were determined for each of the 36 topics. These 36 topics discovered as a result of this analysis were used in all subsequent analyses.

4. Findings

In order to answer the research questions stated in the introduction, the study’s findings were evaluated under five main sections. In the first stage, the bibliometric features of BI research were analyzed. Secondly, research topics were determined by applying topic modeling analysis in the BI field. Third, the change in the topics of BI research over the past 20 years has been examined. Finally, a taxonomy map was created by considering the background of the emerging issues in the BI field.

4.1. Bibliometric Analysis (RQ1)

A descriptive analysis was applied to answer the first research question (RQ1). Thus, the number of articles, topic areas, publication sources, countries, and keywords in the BI research corpus were determined. Figure 1 shows the total number of articles (N) and their percentages (%) by year.
As shown in Figure 1, the number of articles increased from approximately 1% in 2003 (45 articles) to approximately 8% in 2022 (316 articles). In other words, the number of articles has increased significantly over the past two decades. The number and percentage of BI articles by subject area are shown in Table 1.
Table 1 reveals the interdisciplinary nature of BI research. The majority of contributions to BI research come from “Computer Science” (52.09%), “Business, Management, And Accounting” (30.73%), and “Engineering” (26.09%). The top 15 publication resources for BI research are displayed in Table 2.
According to Table 2, many sources have published BI research. This may be due to the multidisciplinary nature of BI. The top three journals that publish these studies the most are the Journal of Intelligence Studies in Business (1.39%), Expert Systems with Applications (1.36%), and Decision Support Systems (1.09%). Table 3 shows the top ten countries that contributed the most to BI research.
According to Table 3, the USA (20.78%), India (8.75%), and China (8.59%) contribute to BI research. Finally, Table 4 lists the most preferred keywords in BI research.
According to Table 4, the most used keywords are “Business Intelligence” (35.82%), “Competitive Intelligence” (15.04%), “Data Mining” (11.52%), “Decision Making” (10.83%), and “Information Analysis” (10.22%).

4.2. Topic Modeling Analysis (RQ2)

The LDA topic modeling algorithm was applied to the experimental corpus to explore emerging topics in BI research (RQ2). The 36 main topics identified by LDA are given in Table 5 in decreasing order according to their percentages. In addition, Appendix A, Table A1, gives descriptive keywords that make up the topics.
As seen in Table 5, the five most studied topics are “SDLC”, “Big Data Analytics”, “Data Warehousing”, “Success Factors”, and “MIS”, respectively. The five least studied topics are “HRM”, “Government Services”, “Risk Analysis”, “Information Security”, and “Graph Streams”, respectively. In addition, Table 6 lists the topics according to the topic citation rates (TCR). Table 6 shows that the most cited topics are “Big Data Analytics”, “TAM”, “Intelligent Systems”, “DSS”, and “Success Factors”, respectively. The least cited topics are “Risk Analysis”, “Government Services”, “Database”, “Graph Streams”, and “Cost of Crime Index”, respectively.

4.3. Temporal Topic Trends (RQ3)

In order to better understand the changes in the trends and evolutions of 36 topics over time, the trend values of each topic were calculated by taking into account the annual changes in the distribution of the topics. The percentages of topics by year, trend values, and trend directions are shown in Appendix A, Table A2, where topics are listed in descending order of overall trend. Appendix A, Table A2 has a colorized heatmap table format. Each line shows the annual percentage changes for a topic and provides many inferences about how that topic has evolved. Green indicates percentages above the average percentage of the topic in a row, and red indicates percentages below that. As the annual values approach the average value, their colors become whiter (See Appendix A, Table A2). Moreover, Figure 2 is a visualization of the top ten topics with the most increasing and decreasing acceleration.
As seen in Figure 2, the trends of “Organizational Capability”, “AI Applications”, “Data Mining”, “Big Data Analytics”, and “Visualization” are increasing. On the other hand, the topics of “Knowledge Management”, “Semantic Web”, “SDLC”, “Intelligent Systems”, and “Database” have a decreasing trend.

4.4. Taxonomy Developed for BI (RQ4)

Taking into account the context and scope of the 36 main topics identified in the analysis, a systematic taxonomic map was developed that classifies the research background and perspectives of BI [44]. This taxonomy presented in Figure 3 shows the BI field as two classes: business and information technologies (IT). The business class was also divided into two subclasses, domain and managerial. The IT class was divided into two subclasses, implementation, and data. The taxonomy in Figure 3 was created by distributing the 36 main topics obtained in the study to these subclasses based on expert opinions.
According to Figure 3, the research topics are mostly related to IT (19 topics: 53.39%). Data is the most studied subclass in IT subclasses (10 topics: 27.91%). The topics in the business field (17 topics: 46.61%) showed a higher distribution compared to the IT class. The most studied subclass in business subclasses is managerial (9 topics: 26.87%).

5. Discussion

This study analyzed all significant research published in the BI field from 2003 to 2022 to understand better the bibliometric characteristics, main topics, trends, and future directions of the field. This study presented valuable findings to researchers and decision-makers in the BI field. The following headings summarize and discuss these findings.

5.1. Bibliometric Features of BI

Analysis of bibliometric data in BI publications can provide insights into the growing impact of publications on academia. Specifically, the number of research published on BI from 2003 to 2022 has increased daily. In addition, publications during this time period were published in 27 research areas. Among these research areas, the most striking was the field of “Computer Science”, with a rate of approximately 53%. This area was followed by “Business, Management and Accounting”, “Engineering” and “Decision Sciences”, respectively. BI is a synergy between computer science and decision science. BI provides computer science-based decision-making for business management purposes [21]. In addition, artificial intelligence is created with neural networks and deep learning methods in computers today, and data is analyzed more “intelligently”. Therefore, the prominence of “Computer Science”, “Business, Management and Accounting”, “Engineering”, and “Decision Sciences” in BI research confirms our findings.
Among the BI publication resources, prominent journals are “The Journal of Intelligence Studies in Business”, “Expert Systems with Applications”, “Decision Support Systems”, and “Sustainability”. It is recommended that researchers who want to follow the field of BI and publish in the field should follow these journals. Moreover, according to country/region analysis, the three most productive countries were the USA, India, and China.
Finally, the most used keywords in BI research were analyzed. Accordingly, the keywords “Business Intelligence”, “Competitive Intelligence”, “Data Mining”, and “Decision Making” are used more. In scientific research, words are often derived from “author keywords”. Therefore, it is not enough to interpret the field by focusing only on these keywords. There are some disadvantages to using keywords for analysis. Donthu et al. [45] stated that keywords could be used in many contexts, and thus re-reading publications may become necessary to understand the relationships between keywords. They also stated that the keywords used could be very general, and it could be challenging to classify them in a thematic cluster. As a result, the increase in BI publications shows that it will find more place in different disciplines, and its use in different disciplines will continue to increase.

5.2. Prominent Topics of BI

The findings obtained from the topic analysis in this study provided inferences about important topics related to the field of BI, and the most studied and cited topics were especially emphasized. First, the most studied topics are “SDLC”, “Big Data Analytics”, “Data Warehousing”, “Success Factors” and “MIS”, respectively. The most cited topics were “Big Data Analytics”, “TAM”, “Intelligent Systems”, “DSS”, and “Success Factors”, respectively. “SDLC” is in first place among the most studied topics and in 16th place among the most cited topics. This may mean that the relevant topics are not studied as much as before and have lost their popularity. As a classical concept, the System Life Cycle (SLC) concept uses the systems approach to analyze and develop relative applications. It provides a methodical approach and logical framework for a system’s development, operation, and sustainability. As an alternative, System Development Life Cycle (SDLC) or Software Development Life Cycle (SDLC) is used on occasion [46]. “SDLC” was frequently preferred during the development of BI software. However, with the effect of cloud-based BI software, LEAN, and Agile-like methods have been preferred recently instead of classical “SDLC” [47,48,49,50].
The second most studied topic is “Big Data Analytics”. This topic ranks first among the most cited topics. Therefore, it has been the most striking topic among the 36 main topics in the BI field. Big data analytics is the process of applying advanced analytical techniques to large data sets. “Big Data Analytics” is a trending topic in the BI field by bringing together the concepts of big data and analytics [51]. Today, the data produced by sensors in mobile devices, web pages, personal blogs, social media applications, and fixed or mobile devices has reached an incredible size [52]. It is not easy to manage, store, process, analyze for scientific or commercial use and interpret the results of this data, which is thousands of exabytes in size and increasing exponentially every day. Therefore, “Big Data Analytics” is used in data-related transactions in many fields, such as customer relations [53], marketing [9], human resources [54], supply chain [55], tourism [56], and health [57].
The third most studied topic was “Data Warehousing”. This topic was ranked 9th among the most cited topics. The vast majority of BI applications are included as a part of data warehouse projects. The term entered our lives with the data warehouse projects. Since BI is frequently used in data warehouse applications, the word BIDW, which consists of the initials of the concepts of data warehousing and BI, has been included in the literature [58]. We can see BI projects as a result of data warehouse applications in some cases and as a reason for data warehouse implementation in others [59].
The fourth most studied topic was “Success Factors”. This topic draws attention in fifth place in the most cited studies. BI applications enable the prediction of risks by analyzing data and, as a result, increase the success of business management. Many information technology project ideas are rejected and remain as concepts or are blocked at the management level. The chances of success of implemented projects are very risky, or some are used below capacity. According to the Gartner 2019 report [60], senior management still prefers BI applications highly, and investments in this field are constantly increasing. Provisioning BI applications is much easier than implementing them. The main problem lies in the successful implementation of such systems. Therefore, it is unsurprising that research on “Success Factors” is so prominent. These studies generally examined the critical success factors required for implementing BI software in businesses [61].
The fifth most studied topic is “MIS”. This topic ranks 13th among the most cited topics. MIS refers to a system that uses the information required by all levels of management in making operational, tactical, and strategic decisions [62]. In the past, BI applications were developed for strategic use only by senior management. However, this has changed recently, and BI applications have started to be utilized at both tactical and operational levels. Therefore, it is not surprising that the “MIS” field, which mainly supports the tactical level in organizations, is one of the most studied topics.
Some topics are outside the top five most studied topics, although they are among the top five most cited topics in the BI field. Therefore, “TAM”, “Intelligent Systems”, and “DSS”, which are among the top five most cited topics, need to be discussed in more detail.
The second most cited topic is “TAM”. Namely, “TAM” can provide important insights to researchers and practitioners in explaining and, more importantly, predicting the behavior of individuals in general and their behavior on using or not using technology in particular. Today, as data becomes more important, BI software for analyzing, sharing, and managing data is increasing [63]. Therefore, the factors affecting users’ adoption of this BI software have become important. Many studies examining the adoption of BI software draw attention [64]. In addition, Chang et al. examined the factors that affect managers’ intention to use BI software in decision-making [65]. Among the studies on the adoption of BI, the fields of higher education [66], health [67], and the electronics industry [68] draw attention.
The third most cited topic is “Intelligent Systems”. More precisely, “Intelligent Systems” allow for solving complex problems with a standardized method approach and obtaining reliable and consistent results over time [69]. It also refers to the systems enabling decision-makers to make decisions using different software tools in information and decision processes [70]. The leading technologies to create intelligent systems are fuzzy systems, artificial neural networks, expert systems, and evolutionary computing. Intelligent systems are typically used for diagnosis, optimization, classification, clustering, selection, prediction, and control [71].
The fourth most cited topic is “DSS”. George defined DSS as a combination of computer and communication technology designed to coordinate the decision-making process across functional areas and hierarchical layers to align decisions with business objectives [72]. DSS has become more advanced with the emergence of data warehouse, OLAP, and BI concepts in the late 1980s. The term BI began to be used more instead of DSS in the 1990s [73]. BI is the latest version of DSS with data processing and analysis capabilities [74]. However, it is stated that DSS is an essential set of systems that increase BI [75]. In this context, there is an essential relationship between DSS and BI. BI refers to technologies and actions that enable a business to process valuable information to support management and decision-making. In order to do this as quickly as possible, BI systems are responsible for processing huge amounts of data and turning it into visuals, reports, graphics, and presentations that can be easily perceived by decision-makers. Thus, the BI system is a decision support system that increases the ability of users to make the right decision.

5.3. Emerging Trends in BI Research

In order to better understand how trends and developments in each topic have changed over time (RQ3), details of the identified topics from 2003 to 2022 were calculated and presented for each year. As shown in Figure 2, while some topics such as “Organizational Capability”, “AI Applications”, “Data Mining”, “Big Data Analytics”, and “Visualization” are increasing, the topics of “Knowledge Management”, “Semantic Web”, “SDLC”, “Intelligent Systems” and “Database” have decreased.
Although “Organizational Capability”, which has the highest positive acceleration value, is not among the most studied and cited topics, it has shown a significant increase in the positive direction. Today, since knowledge acquisition and evaluation is accepted as crucial organizational capability, the organizations of the information society are turning into structures that profit by turning knowledge into value. Organizational capabilities refer to organizations’ ability to assemble, integrate and distribute resources, often in combination or co-existence [76]. Due to the increasing importance of “Big Data” in the modern business world, data, information, and information have become the most valuable resources for organizations. As the need for real-time data collection, analysis, and decision-making grows, BI capabilities that collect, integrate, and distribute data help thread an organization’s future path.
Other topics with the highest positive acceleration value are “AI Applications”, “Data Mining”, “Big Data Analytics”, and “Visualization”, respectively. These issues are broadly interconnected, and their increasing momentum is unsurprising. It focuses on data analysis and report management through BI, data mining, and data warehouse procedures. BI is the use of various technologies and tools to collect and analyze business data [77]. As BI tools begin to be applied to daily operations, empowering them with Artificial Intelligence (AI) has become a top priority. Therefore, AI applications affected the way BI worked in the past. Because AI can analyze vast amounts of data and provide recommendations based on that data, it makes analytics and big data insights accessible and understandable to data scientists and the average user [77,78].
On the other hand, “Knowledge Management” (KM) is the topic that attracts the most attention among the topics with the lowest acceleration value. BI is a broad category of applications and technologies used to collect, access, and analyze large volumes of data for organizations to make effective business decisions. On the other hand, KM is a set of practices for creating, developing, and applying knowledge to improve organizational performance [79]. Similar to BI, knowledge management improves how an organization uses existing knowledge. However, KM differs from BI in many ways. In general, knowledge management deals with human subjective information rather than data or objective information [80]. Therefore, the negative momentum of the “knowledge management” topic is probably due to the recent significant increase in data, the increasing importance of the concept of “Big Data”, and the significant investments in data processing by organizations. In addition, it can be stated that organizations realize that the management of data, rather than the management of information, gives more vital results.

5.4. Insights into the Taxonomy of BI Research

BI is a system that transforms raw data into usable information and supports decision-making processes at operational, tactical, and strategic levels. These systems allow managers to measure and improve the current performance of businesses. BI systems divide the large data sets they use into parts, report, summarize, analyze in real-time, visualize with graphs, and make predictions for the future [81]. Venter & Tustin also stated that information technology is crucial in enabling BI technology to collect, store, analyze, and present business information in a simple and useful way. Therefore, we classified BI topics under two groups, business (46.61%) and IT (53.39%), in the taxonomy we presented in the study [82].
Later, the business category of BI topics was divided into two sub-categories as domain (19.74%) and managerial (26.87%). As the taxonomy in Figure 3 shows, eight topics were identified under the domain category. The most studied topics are “HIS”, “Industry 4.0”, and “Social Media”, respectively. BI plays an increasingly critical role in various industries. Since knowledge is defined as the most valuable asset of a business, it is a fundamental resource for its development [3]. This technological tool can provide many benefits, such as architecture, efficient information, and customer data management when implemented in a business [83]. HIS stands out among the fields where BI is used. The healthcare industry has historically generated a significant amount of data due to record keeping, regulatory requirements, and demand for patient care [84]. “Success Factors”, “MIS”, and “CRM”, are the most studied topics in the managerial category. The decision-making process covers all processes, from which tools to use in businesses to which questions need to be answered. Therefore, topics related to decision-making came to the fore under the managerial category.
The IT category, the other BI category, was divided into two sub-categories: Implementation (25.48%) and Data (27.91%). According to the taxonomy, nine topics were identified under the implementation category. “SDLC”, “Competitive Intelligence”, and “AI Applications” were among the most studied topics. Topics in this category are usually related to developing and implementing the BI tools. Data (27.91%), the other sub-category of BI, draws attention as the most studied sub-category. This category covers “Data Mining”, “Big Data Analytics”, “Visualization”, “Sentiment Analysis”, and “Event Processing”, which have been trending topics recently.

6. Conclusions

This study aimed to examine the research interests and trends of articles in the BI field between 2003 and 2022. The results of the analysis have shown that 36 topics have business emerged in the BI field. The most studied topics are “SDLC”, “Big Data Analytics”, and “Data Warehousing”. Moreover, “Big Data Analytics” has come to the fore by far among the most cited topics. The acceleration of “Knowledge Management”, “Semantic Web”, and “SDLC” topics in the BI field has slowed over the years. The topics with the highest momentum in recent years are “Organizational Capability”, “AI Applications”, “Data Mining”, and “Big Data Analytics”. We have also developed a taxonomy for emerging topics in the BI field. The taxonomy is divided into two subcategories, business, and IT. The Business category was divided into two subcategories: domain and managerial and consisted of 17 topics in total. The IT category was divided into two subcategories, implementation and data, and consisted of 19 topics. In the taxonomy, the IT category from the main categories and the data category from the subcategories came to the fore.
The most innovative aspect of this study is that it is the most comprehensive study of content analysis based on topic modeling in the BI field. The LDA algorithm, which is widely used in the literature, was used for topic modeling analysis. Different algorithms may be used in future research for comparative studies. It is anticipated that the findings to be obtained from future studies in this perspective will make important contributions to the understanding of the evolution of BI or its subtopics in the near future. Thus, it will be possible to see how the findings of this study evolve over time and how the trend of increasing/decreasing the topics has changed. Finally, it is suggested that future studies conduct a similar analysis on more specific sub-headings in the BI field.

Author Contributions

Conceptualization, F.G. and A.A.; methodology, G.G.M.D. and F.G.; software, G.G.M.D.; validation, G.G.M.D. and A.A.; investigation, G.G.M.D. and M.D.; resources, G.G.M.D. and M.D.; data curation, F.G. and A.A.; writing—original draft preparation, G.G.M.D. and M.D.; writing—review and editing, M.D.; visualization, F.G. and A.A.; supervision, M.D.; project administration, M.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available and can be shared upon request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Discovered topics, keywords, and percentages.
Table A1. Discovered topics, keywords, and percentages.
Topic NameLDA Keywords%
SDLCsystem design software solution project requirement development support application need tool process report develop evaluation user engineering implementation component analysis4.73
Big Data Analyticsdatum analytic big business analysis insight predictive technology organization data-driven decision opportunity tool understand ba industry technique advance analyze lead4.62
Data Warehousingdatum warehouse analysis olap quality multidimensional source process analytical integration information dw cube warehousing decision spatial integrate support architecture dimension4.24
Success Factorsfactor implementation success organization critical project organisation organizational analysis strategy benefit adoption qualitative develop successful interview business technology important industry3.88
MISsystem information management enterprise resource support erp strategic technology plan decision planning company integration integrate control decision-making application process corporate3.77
Competitive Intelligencecompany competitive business market technology advantage strategy information industry innovation decision opportunity intelligence development environment change growth economic strategic patent3.72
CRMcustomer marketing product sale business relationship market consumer service company retail online e-commerce crm strategy brand help store satisfaction price3.55
AI Applicationsintelligence business artificial technology digital application learn development trend analysis bibliometric machine discuss automation perspective world innovation human ecosystem increase3.34
BPMbusiness process performance management indicator improve operational tool analysis measurement improvement measure metric organization time efficiency monitoring monitor operation objective3.31
Knowledge Managementknowledge management organization business information need role process creation technology create organizational km change design development structure asset tool share3.25
HIShealth healthcare care clinical patient medical hospital datum quality improve treatment information electronic cost disease informatics tool improvement increase medicine3.25
Data Miningdatum mining analysis information technique mine application tool knowledge process large amount analyze decision extract apply help various generate technology3.14
Organizational Capabilityperformance capability impact organizational relationship effect role datum examine positive organisational innovation orientation influence show structural manager questionnaire empirical dynamic3.14
Predictionlearn prediction machine algorithm network learning predict neural classification deep accuracy datum dataset feature technique performance predictive classifier regression time3.14
Intelligent Systemsintelligent cloud system mobile application network service business compute technology platform agent computing architecture internet environment device user solution sensor3.12
Industry 4.0business industry enterprise sme digital small production process product manufacturing intelligent manufacture sustainable technology sustainability industrial company development develop medium2.98
Sentiment Analysissentiment text analysis online recommendation news feature language word image user opinion video aspect term product natural topic classification recommender2.76
Social Mediasocial media network analysis content share opinion relation information user community twitter public platform online open number consumer topic digital2.65
TAMsystem user information quality adoption influence intention technology usage perceive satisfaction acceptance individual effect determinant factor perception significant impact find2.57
Query Optimizationquery performance time datum workload optimization system algorithm cost parallel reduce benchmark execution memory flow task resource application large mapreduce2.54
DSSdecision support system fuzzy decision-making dss selection information criterion theory process analysis environment maker technique set evaluation group cognitive expert2.52
Supply Chainchain supply iot logistics management supplier demand rfid inventory network internet thing operation cost collaboration information technology industry collaborative port2.49
Higher Educationstudent education university learn course business higher educational academic program institution design teach curriculum school development training teaching technology system2.43
Smart Gridenergy smart power datum grid consumption city system facility electricity management vehicle location analysis cost building production transportation sensor transport2.24
Algorithmsalgorithm cluster rule association mining pattern datum classification set database clustering attribute utility churn genetic frequent mine item feature transaction2.21
Emotional Intelligenceintelligence business emotional leadership tourism cultural destination skill competency leader job south emotion entrepreneurial african hotel africa international relationship gender2.19
Visualizationdashboard visualization datum tool visual user map interactive design information analysis usability trajectory exploration report location interface display visualisation presentation2.16
Databasedatabase datum etl olap warehouse analytical report tool ibm query relational application technology processing solution process open sql xml platform2.10
Semantic Webweb information semantic search document ontology page extraction engine language application text structure user site website unstructured extract task access2.08
Cost of Crime Indexcost report crime index expect growth packaging law unit rate increase time unite receive global annual part device drift board2.00
Event Processingprocess event business log pattern workflow activity discovery detection mining discover complex detect execution audit system intelligence anomaly sequence real1.89
HRMhuman global resource practice environmental accounting crisis employee manager management modern potential opportunity workforce safety question change pandemic complex worker1.75
Risk Analysisrisk financial bank stock credit market banking analysis investment return economic management card price surveillance exchange show commercial application reduce1.69
Government Servicesservice public technology sector government architecture private policy provider governance solution cost soa infrastructure open quality collaboration resource local technological1.69
Information Securitysecurity privacy information blockchain attack threat contract signal cyber protect trust protection share risk weak protocol sensitive military detection secure1.53
Graph Streamsgraph stream object distribution dynamic time scenario global logic temporal node linkage form real-time real entity change define physical operator1.31
Table A2. Annual percentages and trends of the topics.
Table A2. Annual percentages and trends of the topics.
Topic Name20032004200520062007200820092010201120122013201420152016201720182019202020212022Trend
Organizational Capability0.001.670.001.110.000.000.000.000.730.550.000.963.563.603.783.863.425.054.6010.750.57
AI Applications0.000.001.354.442.242.560.762.733.653.311.080.480.441.352.525.023.085.057.366.840.36
Data Mining2.220.001.351.110.751.711.531.822.192.762.702.401.331.352.522.324.115.055.837.170.26
Big Data Analytics0.000.001.350.000.750.851.530.000.732.763.244.814.895.417.568.8810.274.735.834.560.24
Visualization0.005.002.702.222.240.853.050.000.732.211.622.881.334.050.841.162.052.522.153.910.21
Success Factors0.000.000.001.110.000.852.293.642.923.312.701.925.335.412.946.953.775.687.363.910.21
Sentiment Analysis0.000.004.051.111.490.850.764.551.466.631.083.372.222.253.783.863.421.893.073.260.17
Event Processing0.001.670.000.002.242.561.531.822.923.871.080.962.222.252.521.162.401.890.922.610.14
DSS0.000.002.700.000.752.561.530.912.924.423.244.332.674.503.362.702.051.891.842.280.12
Supply Chain0.003.334.050.002.243.424.584.552.193.872.161.443.111.353.781.541.372.213.072.280.12
Government Services0.001.671.351.110.750.850.760.912.921.101.620.962.223.152.101.541.712.521.231.950.10
TAM0.001.671.351.110.000.851.530.002.192.215.415.294.001.354.202.702.051.264.601.950.10
BPM0.008.338.117.785.977.697.632.732.921.102.163.852.673.602.104.251.032.522.761.630.09
Social Media0.000.000.001.110.001.710.761.822.192.763.782.885.331.804.202.322.745.053.071.300.07
Industry 4.04.445.002.706.673.733.422.290.911.461.102.160.960.891.352.103.475.483.473.075.540.06
Risk Analysis0.000.005.413.332.993.423.050.911.461.103.780.003.110.901.261.540.681.581.530.980.05
Information Security2.221.674.050.004.481.711.530.912.921.101.621.441.330.450.001.541.031.890.612.930.04
HIS2.220.002.702.223.735.133.059.093.652.214.325.291.783.152.524.253.082.212.452.930.04
Algorithms2.221.670.004.441.490.003.822.732.922.763.240.962.221.803.361.933.420.951.532.610.02
Higher Education2.220.000.001.110.000.851.532.731.461.661.622.404.443.153.783.092.742.842.762.610.02
HRM2.220.002.700.002.241.710.760.912.192.211.082.880.891.351.260.771.712.842.761.95−0.01
Emotional Intelligence2.225.004.053.332.240.852.291.821.461.101.082.402.222.251.263.092.401.893.071.95−0.01
Competitive Intelligence2.226.672.705.562.244.277.635.458.763.871.082.885.783.605.042.322.742.523.681.95−0.01
Smart Grid2.226.674.052.223.730.850.761.821.463.312.160.963.113.152.521.931.371.583.071.63−0.03
Data Warehousing2.221.672.703.334.484.273.827.272.192.214.869.624.444.505.043.095.145.683.371.30−0.05
Graph Streams2.225.000.001.112.242.561.532.732.191.101.620.960.000.451.261.541.711.261.230.33−0.10
CRM6.671.675.411.115.971.712.293.642.926.082.162.403.114.503.781.934.455.053.073.26−0.18
Cost of Crime Index4.446.676.764.446.720.853.050.913.652.761.081.442.223.150.420.392.050.630.920.98−0.18
Prediction4.445.002.707.782.245.984.583.643.651.664.322.882.226.314.623.093.421.890.920.65−0.20
MIS6.675.008.118.898.964.273.0510.007.306.632.704.812.672.702.943.861.371.261.532.28−0.23
Query Optimization6.671.670.000.000.754.273.053.640.732.764.323.373.562.704.204.631.371.581.531.30−0.28
Database6.676.671.352.224.487.694.582.734.383.314.861.440.891.350.421.160.680.950.610.98−0.30
Intelligent Systems8.893.334.053.335.972.560.764.554.383.313.242.402.672.701.262.323.423.793.072.93−0.31
SDLC11.111.671.356.676.722.566.113.645.843.878.116.255.335.864.203.473.424.732.764.89−0.33
Semantic Web6.676.674.054.442.993.423.052.732.922.762.162.883.111.801.680.771.371.260.920.00−0.35
Knowledge Management8.895.006.765.562.2410.269.161.821.462.216.494.812.671.350.841.543.422.841.841.63−0.38
In this heatmap table, the green colors specify percentages above the topic’s average in a row, the red colors specify percentages below the topic’s average, and their colors approach white as these annual values approach the average.

References

  1. Chen, H.; Chiang, R.H.L.; Storey, V.C. Business Intelligence and Analytics: From Big Data To Big Impact. Mis. Q. 2012, 36, 1165–1188. [Google Scholar] [CrossRef]
  2. Ain, N.U.; Vaia, G.; DeLone, W.H.; Waheed, M. Two Decades of Research on Business Intelligence System Adoption, Utilization and Success—A Systematic Literature Review. Decis. Support Syst. 2019, 125, 113113. [Google Scholar] [CrossRef]
  3. Tavera Romero, C.A.; Ortiz, J.H.; Khalaf, O.I.; Prado, A.R. Business Intelligence: Business Evolution after Industry 4.0. Sustainability 2021, 13, 10026. [Google Scholar] [CrossRef]
  4. Hedberg, J.G.; McNamara, S. Innovation and Re-Invention: A Brief Review of Educational Technology in Australia. EMI. Educ. Media Int. 2002, 39, 111–121. [Google Scholar] [CrossRef]
  5. Liang, T.P.; Liu, Y.H. Research Landscape of Business Intelligence and Big Data Analytics: A Bibliometrics Study. Expert Syst. Appl. 2018, 111, 2–10. [Google Scholar] [CrossRef]
  6. Cheng, C.; Zhong, H.; Cao, L. Facilitating Speed of Internationalization: The Roles of Business Intelligence and Organizational Agility. J. Bus. Res. 2020, 110, 95–103. [Google Scholar] [CrossRef]
  7. Herschel, R.T.; Jones, N.E. Knowledge Management and Business Intelligence: The Importance of Integration. J. Knowl. Manag. 2005, 9, 45–55. [Google Scholar] [CrossRef] [Green Version]
  8. Meramveliotakis, G.; Manioudis, M. Sustainable Development, COVİD-19 and Small Business in Greecep: Small Is Not Beautiful. Adm. Sci. 2021, 11, 90. [Google Scholar] [CrossRef]
  9. Fan, S.; Lau, R.Y.K.; Zhao, J.L. Demystifying Big Data Analytics for Business Intelligence Through the Lens of Marketing Mix. Big Data Res. 2015, 2, 28–32. [Google Scholar] [CrossRef]
  10. Luhn, H.P. A Business Intelligence System. IBM J. Res. Dev. 1958, 2, 314–319. [Google Scholar] [CrossRef] [Green Version]
  11. Power, D.J. A Brief History of Decision Support Systems. Decis. Support Syst. 2007, 1, 121–140. [Google Scholar] [CrossRef]
  12. López-Robles, J.R.; Otegi-Olaso, J.R.; Gamboa-Rosales, N.K.; Gamboa-Rosales, H.; Cobo, M.J. 60 Years of Business Intelligence: A Bibliometric Review from 1958 to 2017. In Proceedings of the Frontiers in Artificial Intelligence and Applications, Granada, Spain, 26–28 September 2018; Volume 303, pp. 395–408. [Google Scholar]
  13. Gurcan, F.; Cagiltay, N.E. Exploratory Analysis of Topic Interests and Their Evolution in Bioinformatics Research Using Semantic Text Mining and Probabilistic Topic Modeling. IEEE Access 2022, 10, 31480–31493. [Google Scholar] [CrossRef]
  14. Trieu, V.H. Getting Value from Business Intelligence Systems: A Review and Research Agenda. Decis. Support Syst. 2017, 93, 111–124. [Google Scholar] [CrossRef] [Green Version]
  15. Moro, S.; Cortez, P.; Rita, P. Business Intelligence in Banking: A Literature Analysis from 2002 to 2013 Using Text Mining and Latent Dirichlet Allocation. Expert Syst. Appl. 2015, 42, 1314–1324. [Google Scholar] [CrossRef] [Green Version]
  16. Wyskwarski, M. Business Intelligence—Publication Analysis Using the R Language. Sci. Pap. Silesian Univ. Technol. Organ. Manag. Ser. 2019, 2019, 263–274. [Google Scholar] [CrossRef]
  17. Václav, C.; Gabriel, F.; Blanka, K.; Libor, K.; Michal, T. Utilization of Business Intelligence Tools in Cargo Control. Proc. Transp. Res. Procedia 2021, 53, 212–223. [Google Scholar] [CrossRef]
  18. Chen, Y.; Lin, Z. Business Intelligence Capabilities and Firm Performance: A Study in China. Int. J. Inf. Manag. 2021, 57, 102232. [Google Scholar] [CrossRef]
  19. Combita Niño, H.A.; Cómbita Niño, J.P.; Morales Ortega, R. Business Intelligence Governance Framework in a University: Universidad de La Costa Case Study. Int. J. Inf. Manag. 2020, 50, 405–412. [Google Scholar] [CrossRef]
  20. Muntean, M.; Dănăiaţă, D.; Hurbean, L.; Jude, C. A Business Intelligence & Analytics Framework for Clean and Affordable Energy Data Analysis. Sustainability 2021, 13, 638. [Google Scholar] [CrossRef]
  21. Purnomo, A.; Firdaus, M.; Sutiksno, D.U.; Putra, R.S.; Hasanah, U. Mapping of Business Intelligence Research Themes: Four Decade Review. In Proceedings of the 2021 IEEE International Conference on Communication, Networks and Satellite (COMNETSAT), Purwokerto, Indonesia, 17–18 July 2021; pp. 32–37. [Google Scholar] [CrossRef]
  22. Dorti, A.; Akdemir, Ö. Trackıng Busıness Intellıgence Research Trends: A Bıblıometrıc Approach. Online J. Sci. Technol. 2021, 11, 82–88. [Google Scholar]
  23. Vanani, I.R.; Jalali, S.M.J. Analytical Evaluation of Emerging Scientific Trends in Business Intelligence through the Utilisation of Burst Detection Algorithm. Int. J. Bibliometr. Bus. Manag. 2017, 1, 70. [Google Scholar] [CrossRef]
  24. Zou, Z.; Cheng, J.; Huang, K.; Gui, Y. Research on the Developments of Business Intelligence and Its Enlightenment Based on Bibliometric Statistics and Knowledge Map Analysis. J. Phys. Conf. Ser. 2019, 1176, 042089. [Google Scholar] [CrossRef]
  25. da Brum, L.M.L.; do Lampert, V.N.; da Camargo, S.S. Business Intelligence and Data Warehouse in Agrarian Sector: A Bibliometric Study. J. Agric. Sci. 2019, 11, 353. [Google Scholar] [CrossRef] [Green Version]
  26. Gurcan, F.; Boztas, G.D.; Dalveren, G.G.M.; Derawi, M. Digital Transformation Strategies, Practices, and Trends: A Large-Scale Retrospective Study Based on Machine Learning. Sustainability 2023, 15, 7496. [Google Scholar] [CrossRef]
  27. Ozyurt, O.; Ayaz, A. Twenty-Five Years of Education and Information Technologies: Insights from a Topic Modeling Based Bibliometric Analysis. Educ. Inf. Technol. 2022, 27, 11025–11054. [Google Scholar] [CrossRef]
  28. Sharma, M.; Sharma, S.K. Theoretical Framework for Digital Payments in Rural India: Integrating UTAUT and Empowerment Theory. IFIP Adv. Inf. Commun. Technol. 2019, 558, 212–223. [Google Scholar] [CrossRef]
  29. Gallinucci, E.; Golfarelli, M.; Rizzi, S. Advanced Topic Modeling for Social Business Intelligence. Inf. Syst. 2015, 53, 87–106. [Google Scholar] [CrossRef]
  30. Gurcan, F. What Issues Are Data Scientists Talking about? Identification of Current Data Science Issues Using Semantic Content Analysis of Q&A Communities. PeerJ Comput. Sci. 2023, 9, e1361. [Google Scholar]
  31. Campbell, J.C.; Hindle, A.; Stroulia, E. Latent Dirichlet Allocation: Extracting Topics from Software Engineering Data. Art Sci. Anal. Softw. Data 2015, 139–159. [Google Scholar] [CrossRef]
  32. Gurcan, F.; Erdogdu, F.; Cagiltay, N.E.; Cagiltay, K. Student Engagement Research Trends of Past 10 Years: A Machine Learning-Based Analysis of 42,000 Research Articles. Educ. Inf. Technol. 2023, 1–25. [Google Scholar] [CrossRef]
  33. Gurcan, F.; Dalveren, G.G.M.; Derawi, M. COVID-19 and E-Learning: An Exploratory Analysis of Research Topics and Interests in E-Learning During the Pandemic. IEEE Access 2022, 10, 123349–123357. [Google Scholar] [CrossRef]
  34. Konrad, M. Text Mining and Topic Modeling Toolkit. Available online: https://pypi.org/project/tmtoolkit/ (accessed on 21 January 2022).
  35. Blei, D.M. Probabilistic Topic Models. Commun. ACM 2012, 55, 77–84. [Google Scholar] [CrossRef] [Green Version]
  36. Vayansky, I.; Kumar, S.A.P. A Review of Topic Modeling Methods. Inf. Syst. 2020, 94, 101582. [Google Scholar] [CrossRef]
  37. Gurcan, F.; Dalveren, G.G.M.; Cagiltay, N.E.; Soylu, A. Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling. IEEE Access 2022, 10, 74638–74654. [Google Scholar] [CrossRef]
  38. Kherwa, P.; Bansal, P. Topic Modeling: A Comprehensive Review. EAI Endorsed Trans. Scalable Inf. Syst. 2020, 7. [Google Scholar] [CrossRef] [Green Version]
  39. Liu, J.; Nie, H.; Li, S.; Chen, X.; Cao, H.; Ren, J.; Lee, I.; Xia, F. Tracing the Pace of COVID-19 Research: Topic Modeling and Evolution. Big Data Res. 2021, 5, 100236. [Google Scholar] [CrossRef]
  40. Gurcan, F.; Ozyurt, O. Emerging Trends and Knowledge Domains in E-Learning Researches: Topic Modeling Analysis with the Article Published between 2008-2018. J. Comput. Educ. Res. 2020, 8, 738–756. [Google Scholar] [CrossRef]
  41. Griffiths, T.L.; Steyvers, M. Finding Scientific Topics. Proc. Natl. Acad. Sci. USA 2004, 101, 5228–5235. [Google Scholar] [CrossRef] [Green Version]
  42. Mimno, D.; Wallach, H.M.; Talley, E.; Leenders, M.; McCallum, A. Optimizing Semantic Coherence in Topic Models. In Proceedings of the EMNLP 2011—Conference on Empirical Methods in Natural Language Processing, Scotland, UK, 26 July 2011. [Google Scholar]
  43. Gurcan, F.; Dalveren, G.G.M.; Cagiltay, N.E.; Roman, D.; Soylu, A. Evolution of Software Testing Strategies and Trends: Semantic Content Analysis of Software Research Corpus of the Last 40 Years. IEEE Access 2022, 10, 106093–106109. [Google Scholar] [CrossRef]
  44. Debortoli, S.; Müller, O.; Vom Brocke, J. Comparing Business Intelligence and Big Data Skills: A Text Mining Study Using Job Advertisements. Bus. Inf. Syst. Eng. 2014, 6, 289–300. [Google Scholar] [CrossRef]
  45. Donthu, N.; Kumar, S.; Mukherjee, D.; Pandey, N.; Lim, W.M. How to Conduct a Bibliometric Analysis: An Overview and Guidelines. J. Bus. Res. 2021, 133, 285–296. [Google Scholar] [CrossRef]
  46. Yasuda, K.; Huang, T. Review of the Concepts, Meanings, and Uses of Life Cycle. In Proceedings of the Asia Pacific Industrial Engineering & Management Systems Conference, Jeju, Republic of Korea, 14 October 2014; pp. 62–69. [Google Scholar]
  47. Mohan, K.K.; Harun, R.S.; Srividya, A.; Verma, A.K. Quality Framework for Reliability Improvement in SAP Netweaver Business Intelligence Environment through Lean Software Development—A Practical Perspective. Int. J. Syst. Assur. Eng. Manag. 2010, 1, 316–323. [Google Scholar] [CrossRef]
  48. Rehani, B. Agile Way of BI Implementation. In Proceedings of the Proceedings—2011 Annual IEEE India Conference: Engineering Sustainable Solutions, INDICON-2011, Hyderabad, India, 16–18 December 2011. [Google Scholar]
  49. Ozyurt, O.; Gurcan, F.; Dalveren, G.G.M.; Derawi, M. Career in Cloud Computing: Exploratory Analysis of In-Demand Competency Areas and Skill Sets. Appl. Sci. 2022, 12, 9787. [Google Scholar] [CrossRef]
  50. Gurcan, F.; Sevik, S. Expertise Roles and Skills Required by the Software Development Industry. In Proceedings of the 1st International Informatics and Software Engineering Conference: Innovative Technologies for Digital Transformation, IISEC 2019—Proceedings, Ankara, Turkey, 6–7 November 2019. [Google Scholar]
  51. Rajaraman, V. Big Data Analytics. Resonance 2016, 21, 695–716. [Google Scholar] [CrossRef]
  52. McAfee, A.; Brynjolfsson, E. Big Data: The Management Revolution. Harv. Bus. Rev. 2012, 90, 60–68. [Google Scholar] [PubMed]
  53. Lee, M.; Lee, S.; Koh, Y. Multisensory Experience for Enhancing Hotel Guest Experience: Empirical Evidence from Big Data Analytics. Int. J. Contemp. Hosp. Manag. 2019, 31, 4313–4337. [Google Scholar] [CrossRef]
  54. De Mauro, A.; Greco, M.; Grimaldi, M.; Ritala, P. Human Resources for Big Data Professions: A Systematic Classification of Job Roles and Required Skill Sets. Inf. Process. Manag. 2018, 54, 807–817. [Google Scholar] [CrossRef]
  55. Roßmann, B.; Canzaniello, A.; von der Gracht, H.; Hartmann, E. The Future and Social Impact of Big Data Analytics in Supply Chain Management: Results from a Delphi Study. Technol. Forecast. Soc. Chang. 2018, 130, 135–149. [Google Scholar] [CrossRef]
  56. Marine-Roig, E.; Anton Clavé, S. Tourism Analytics with Massive User-Generated Content: A Case Study of Barcelona. J. Destin. Mark. Manag. 2015, 4, 162–172. [Google Scholar] [CrossRef]
  57. Sousa, M.J.; Pesqueira, A.M.; Lemos, C.; Sousa, M.; Rocha, Á. Decision-Making Based on Big Data Analytics for People Management in Healthcare Organizations. J. Med. Syst. 2019, 43, 290. [Google Scholar] [CrossRef]
  58. Golden, B. Amazon Web Services for Dummies; John Wiley & Sons: Hoboken, NJ, USA, 2013; ISBN 9781118571835. [Google Scholar]
  59. Golfarelli, M.; Rizzi, S.; Cella, I. Beyond Data Warehousing: What’s next in Business Intelligence? In Proceedings of the DOLAP: ACM International Workshop on Data Warehousing and OLAP, Washington, DC, USA, 12–13 November 2004. [Google Scholar]
  60. Hare, J.; Gupta, N.; Woodward, A.; Hunter, E.; Quinn, K. Market Share: Analytics and Business Intelligence, Worldwide, 2019. Available online: https://www.gartner.com/en/documents/3985421 (accessed on 1 April 2023).
  61. Yeoh, W.; Popovič, A. Extending the Understanding of Critical Success Factors for Implementing Business Intelligence Systems. J. Assoc. Inf. Sci. Technol. 2016, 67, 134–147. [Google Scholar] [CrossRef]
  62. Shah, M. Impact of Management Information Systems (MIS) on School Administration: What the Literature Says. Procedia—Soc. Behav. Sci. 2014, 16, 2799–2804. [Google Scholar] [CrossRef] [Green Version]
  63. Gartner. Analytics and Business Intelligence Platforms Reviews and Ratings. Available online: https://www.gartner.com/reviews/market/analytics-business-intelligence-platforms (accessed on 23 April 2023).
  64. Bach, M.P.; Čeljo, A.; Zoroja, J. Technology Acceptance Model for Business Intelligence Systems: Preliminary Research. Procedia Comput. Sci. 2016, 100, 995–1001. [Google Scholar] [CrossRef] [Green Version]
  65. Chang, Y.W.; Hsu, P.Y.; Shiau, W.L. An Empirical Study of Managers’ Usage Intention in BI. Cogn. Technol. Work 2014, 16, 247–258. [Google Scholar] [CrossRef] [Green Version]
  66. Al-Ramahi, N.M.; Odeh, M.; Alrabie, Z.; Qozmar, N. The TOEQCC Framework for Sustainable Adoption of Cloud Computing at Higher Education Institutions in the Kingdom of Jordan. Sustainability 2022, 14, 12744. [Google Scholar] [CrossRef]
  67. Kapetaneas, N.; Kitsios, F. Digital Transformation Approach to Public Hospitals Environment: Technology Acceptance Model for Business Intelligence Applications. In Proceedings of the Lecture Notes in Business Information Processing; Springer: Berlin/Heidelberg, Germany, 2022. [Google Scholar]
  68. Hou, C.K. Understanding Business Intelligence System Continuance Intention: An Empirical Study of Taiwan’s Electronics Industry. Inf. Dev. 2016, 32, 1359–1371. [Google Scholar] [CrossRef]
  69. Byrd, T.A.; Hauser, R.D. Expert Systems in Production and Operations Management: Research Directions in Assessing Overall Impact. Int. J. Prod. Res. 1991, 29, 2471–2482. [Google Scholar] [CrossRef]
  70. Doucet, M.S.; Doucet, T.A. Public Accounting Firms. In Encyclopedia of Information Systems; Elsevier: Amsterdam, The Netherlands, 2003; pp. 601–606. [Google Scholar]
  71. Negnevitsky, M. Artificial Intelligence: A Guide to Intelligent Systems, Systems, 2nd ed.; CRC Press: Boca Raton, FL, USA, 2005; ISBN 0321204662. [Google Scholar]
  72. George, J.F. The Conceptualization and Development of Organizational Decision Support Systems. J. Manag. Inf. Syst. 1991, 8, 109–125. [Google Scholar] [CrossRef]
  73. Nedelcu, B. Business Intelligence Systems. Database Syst. J. 2013, 4, 12–20. [Google Scholar] [CrossRef]
  74. Shollo, A.; Galliers, R.D. Towards an Understanding of the Role of Business Intelligence Systems in Organisational Knowing. Inf. Syst. J. 2016, 26, 339–367. [Google Scholar] [CrossRef]
  75. Yurtay, Y.; Yurtay, N.; Akçetin, E.; Kılıç, A. Freight Optimization in Container Loading: Case Study. J. Manag. Econ. Res. 2014, 12, 228–247. [Google Scholar] [CrossRef]
  76. Bharadwaj, A.S. A Resource-Based Perspective on Information Technology Capability and Firm Performance: An Empirical Investigation. MIS Q. Manag. Inf. Syst. 2000, 24, 169. [Google Scholar] [CrossRef]
  77. Zohuri, B. From Business Intelligence to Artificial Intelligence. Mod. Approaches Mater. Sci. 2020, 2, 231–240. [Google Scholar] [CrossRef]
  78. Gürcan, F. Major Research Topics in Big Data: A Literature Analysis from 2013 to 2017 Using Probabilistic Topic Models. In Proceedings of the 2018 International Conference on Artificial Intelligence and Data Processing, IDAP, Malatya, Turkey, 28–30 September 2018; pp. 1–4. [Google Scholar]
  79. Lee, M.C.; Chang, T. Linking Knowledge Management and Innovation Management in E-Business. Int. J. Innov. Learn. 2007, 4, 145–159. [Google Scholar] [CrossRef]
  80. Davenport, T.H.; Seely, C.P.M. Meets Business Intelligence: Merging Knowledge and Information at Intel. Knowl. Manag. Rev. 2006, 10–15. [Google Scholar]
  81. Negash, S.; Gray, P. Business Intelligence. In Handbook on Decision Support Systems 2; Springer: Berlin/Heidelberg, Germany, 2008; pp. 175–193. [Google Scholar]
  82. Venter, P.; Tustin, D. The Availability and Use of Competitive and Business Intelligence in South African Business Organisations. South. Afr. Bus. Rev. 2009, 13, 88–117. [Google Scholar]
  83. Negro, A.R.; Mesia, R. The Business Intelligence and Its Influence on Decision Making. J. Appl. Bus. Econ. 2020, 22. [Google Scholar] [CrossRef]
  84. Raghupathi, W.; Raghupathi, V. An Overview of Health Analytics. J. Heal. Med. Inform. 2013, 4, 3. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Number of articles by year.
Figure 1. Number of articles by year.
Sustainability 15 09854 g001
Figure 2. Temporal topic trends.
Figure 2. Temporal topic trends.
Sustainability 15 09854 g002
Figure 3. Systematic taxonomy of the BI topics.
Figure 3. Systematic taxonomy of the BI topics.
Sustainability 15 09854 g003
Table 1. Subject areas of the BI articles.
Table 1. Subject areas of the BI articles.
Subject AreaN%
Computer Science195352.09
Business, Management, and Accounting115230.73
Engineering97826.09
Decision Sciences68818.35
Social Sciences63917.04
Mathematics3489.28
Medicine2145.71
Economics, Econometrics, and Finance2005.33
Materials Science1313.49
Environmental Science1283.41
Energy1122.99
Arts and Humanities1112.96
Psychology1022.72
Chemical Engineering601.60
Physics and Astronomy581.55
Multidisciplinary501.33
Biochemistry, Genetics, and Molecular Biology491.31
Agricultural and Biological Sciences471.25
Earth and Planetary Sciences461.23
Chemistry350.93
Health Professions290.77
Pharmacology, Toxicology, and Pharmaceutics270.72
Nursing190.51
Neuroscience90.24
Immunology and Microbiology80.21
Veterinary60.16
Dentistry20.05
Table 2. Publication sources.
Table 2. Publication sources.
Source TitleN%
Journal of Intelligence Studies in Business521.39
Expert Systems with Applications511.36
Decision Support Systems421.12
Sustainability Switzerland411.09
International Journal of Information Management360.96
International Journal of Business Intelligence and Data Mining330.88
Journal of Theoretical and Applied Information Technology330.88
IEEE Access300.80
Information Systems Management290.77
Journal of Computer Information Systems280.75
Journal of Decision Systems270.72
Studies in Computational Intelligence250.67
DB2 Magazine230.61
Journal of Advanced Research in Dynamical and Control Systems220.59
Healthcare Financial Management Journal of The Healthcare Financial Management Association210.56
Table 3. Top contributing countries.
Table 3. Top contributing countries.
Country N%
United States77920.78
India3288.75
China3228.59
United Kingdom2225.92
Australia1684.48
Spain1413.76
Germany1403.73
Canada1303.47
Italy1203.20
Malaysia1193.17
Taiwan1143.04
Iran902.40
France822.19
South Korea741.97
Portugal721.92
Sweden711.89
Saudi Arabia691.84
Indonesia661.76
Hong Kong631.68
Poland561.49
Table 4. The most used keywords.
Table 4. The most used keywords.
KeywordN%
Business Intelligence134335.82
Competitive Intelligence56415.04
Data Mining43211.52
Decision Making40610.83
Information Analysis38310.22
Big Data3048.11
Artificial Intelligence2837.55
Decision Support Systems2316.16
Data Warehouses2105.60
Information Management1925.12
Knowledge Management1463.89
Business Intelligence Systems1303.47
Data Warehouse1233.28
Information Systems1223.25
Information Technology1193.17
Machine Learning1173.12
Sales1173.12
Commerce1163.09
Competition1163.09
Data Analytics1143.04
Business Analytics1102.93
Intelligence982.61
Semantics892.37
Forecasting862.29
Data Handling852.27
Enterprise Resource Planning852.27
Marketing792.11
Data Visualization782.08
Social Media762.03
Social Networking762.03
Table 5. The 36 topics discovered by LDA.
Table 5. The 36 topics discovered by LDA.
Topic Name(%)
SDLC (System Development Life Cycle)4.73
Big Data Analytics4.62
Data Warehousing4.24
Success Factors3.88
MIS (Management Information Systems)3.77
Competitive Intelligence3.72
CRM (Customer Relationship Management)3.55
AI Applications3.34
BPM (Business Process Management)3.31
HIS (Health Information Systems)3.25
Knowledge Management3.25
Organizational Capability3.14
Data Mining3.14
Prediction3.14
Intelligent Systems3.12
Industry 4.02.98
Sentiment Analysis2.76
Social Media2.65
TAM (Technology Acceptance Model)2.57
Query Optimization2.54
DSS (Decision Support Systems)2.52
Supply Chain2.49
Higher Education2.43
Smart Grid2.24
Algorithms2.21
Emotional Intelligence2.19
Visualization2.16
Database2.10
Semantic Web2.08
Cost of Crime Index2.00
Event Processing1.89
HRM (Human Resource Management)1.75
Government Services1.69
Risk Analysis1.69
Information Security1.53
Graph Streams1.31
Table 6. Citation rates of the topics.
Table 6. Citation rates of the topics.
Topic NameTCR (%)
Big Data Analytics12.87
TAM4.74
Intelligent Systems4.72
DSS4.28
Success Factors4.13
BPM3.97
AI Applications3.95
Event Processing3.94
Data Warehousing3.56
Sentiment Analysis3.56
Organizational Capability3.35
CRM3.06
MIS2.99
Competitive Intelligence2.91
Social Media2.81
SDLC2.77
Supply Chain2.59
Knowledge Management2.46
Industry 4.02.46
HIS2.36
Data Mining2.28
Emotional Intelligence2.21
Prediction2.19
Semantic Web1.80
Algorithms1.73
Smart Grid1.68
Query Optimization1.56
HRM1.42
Higher Education1.36
Visualization1.13
Information Security1.02
Risk Analysis1.01
Government Services1.01
Database0.80
Graph Streams0.75
Cost of Crime Index0.59
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gurcan, F.; Ayaz, A.; Menekse Dalveren, G.G.; Derawi, M. Business Intelligence Strategies, Best Practices, and Latest Trends: Analysis of Scientometric Data from 2003 to 2023 Using Machine Learning. Sustainability 2023, 15, 9854. https://doi.org/10.3390/su15139854

AMA Style

Gurcan F, Ayaz A, Menekse Dalveren GG, Derawi M. Business Intelligence Strategies, Best Practices, and Latest Trends: Analysis of Scientometric Data from 2003 to 2023 Using Machine Learning. Sustainability. 2023; 15(13):9854. https://doi.org/10.3390/su15139854

Chicago/Turabian Style

Gurcan, Fatih, Ahmet Ayaz, Gonca Gokce Menekse Dalveren, and Mohammad Derawi. 2023. "Business Intelligence Strategies, Best Practices, and Latest Trends: Analysis of Scientometric Data from 2003 to 2023 Using Machine Learning" Sustainability 15, no. 13: 9854. https://doi.org/10.3390/su15139854

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop