Bayesian Networks for the Diagnosis and Prognosis of Diseases: A Scoping Review

Polotskaya, Kristina; Muñoz-Valencia, Carlos S.; Rabasa, Alejandro; Quesada-Rico, Jose A.; Orozco-Beltrán, Domingo; Barber, Xavier

doi:10.3390/make6020058


by Kristina Polotskaya ^1,†, Carlos S. Muñoz-Valencia ^1,†,‡, Alejandro Rabasa ^1, Jose A. Quesada-Rico ^2,3,4, Domingo Orozco-Beltrán ^2,3 and Xavier Barber ^1,4,*

^1 Center of Operations Research, Miguel Hernández University, 03202 Elche, Spain
^2 Department of Clinical Medicine, Miguel Hernández University, 03550 San Juan de Alicante, Spain
^3 Primary Care Research Center, 03550 San Juan de Alicante, Spain
^4 Joint Research Unit for Advanced Statistical Methods in Health Sciences UMH-FISABIO: STATSALUT, Miguel Hernández University, 03202 Elche, Spain
^* Author to whom correspondence should be addressed.
^† These authors contributed equally to this work.
^‡ Current address: Centro de Investigación Operativa, Universidad Miguel Hernández de Elche, Avda. de la Universidad s/n, Edificio Torretamarit, 03202 Elche, Spain.

Mach. Learn. Knowl. Extr. 2024, 6(2), 1243-1262; https://doi.org/10.3390/make6020058

Submission received: 4 April 2024 / Revised: 27 May 2024 / Accepted: 31 May 2024 / Published: 4 June 2024

(This article belongs to the Collection Extravaganza Feature Papers on Hot Topics in Machine Learning and Knowledge Extraction)


Abstract:

Bayesian networks (BNs) are probabilistic graphical models that leverage Bayes’ theorem to portray dependencies and cause-and-effect relationships between variables. These networks have gained prominence in the field of health sciences, particularly in diagnostic processes, by allowing the integration of medical knowledge into models and addressing uncertainty in a probabilistic manner. Objectives: This review aims to provide an exhaustive overview of the current state of Bayesian networks in disease diagnosis and prognosis. Additionally, it seeks to introduce readers to the fundamental methodology of BNs, emphasising their versatility and applicability across varied medical domains. Employing a meticulous search strategy with MeSH descriptors in diverse scientific databases, we identified 190 relevant references. These were subjected to a rigorous analysis, resulting in the retention of 60 papers for in-depth review. The robustness of our approach minimised the risk of selection bias. Results: The selected studies encompass a wide range of medical areas, providing insights into the statistical methodology, implementation feasibility, and predictive accuracy of BNs, as evidenced by an average area under the curve (AUC) exceeding 75%. The comprehensive analysis underscores the adaptability and efficacy of Bayesian networks in diverse clinical scenarios. The majority of the examined studies demonstrate the potential of BNs as reliable adjuncts to clinical decision-making. The findings of this review affirm the role of Bayesian networks as accessible and versatile artificial intelligence tools in healthcare. They offer a viable solution to address complex medical challenges, facilitating timely and informed decision-making under conditions of uncertainty. The extensive exploration of Bayesian networks presented in this review highlights their significance and growing impact in the realm of disease diagnosis and prognosis. 
It underscores the need for further research and development to optimise their capabilities and broaden their applicability in addressing diverse and intricate healthcare challenges.

Keywords:

Bayesian networks; disease diagnosis; disease prognosis; directed acyclic graph; Bayesian classifier; scoping review

1. Introduction

In medicine, determining the cause of an event is often not straightforward. Experts find numerous sources of possible uncertainty, unclear and incomplete measurements, and events for which it is difficult to establish causal relationships. Accordingly, probabilistic graphs are useful tools to understand, in a visual and intuitive way, how each factor affects the rest of the system.

In this review, we delve into the application of Bayesian networks (BNs) for diagnosis and prognosis in healthcare. The focus on BNs is driven by the potential to integrate these models into hospital decision support systems, particularly for specific diagnostic applications. Understanding the fundamentals and implications of BNs in these crucial areas is essential for their successful implementation. This review aims to provide a comprehensive background on the capabilities of BNs, enhancing healthcare delivery by improving diagnostic accuracy and prognostic assessments, thereby benefiting patient management and outcomes.

In this scoping review, we analyse various articles related to diagnosis, as mentioned in the previous paragraph. The objective of these publications is to examine how Bayesian networks can serve as an effective system for decision-making in medicine, although they are currently limited to being a decision support system. While the reader may be aware of the significant advancements in artificial intelligence and artificial neural networks (deep learning), it is important to note that not all diseases are diagnosed through image analysis. Bayesian networks, as we will explain in this section, use diagrams to identify probabilistic causality based on Bayes’ rule.

A Bayesian network is a probabilistic graph that tries to reproduce a real system. Used in basic and applied research, as well as in artificial intelligence, BNs are based on Bayes’ theorem and visualised with a directed acyclic graph (DAG), which is used to represent dependence or cause-and-effect relationships between variables. In this way, new knowledge is generated under conditions of uncertainty, which makes it easier to decide and reason through probability theory [1]. BNs were introduced in the 1980s as a formalism for representation and reasoning through problems involving uncertainty, by adopting probability theory as a basic framework [2]. Since then, they have not only been used successfully in medicine; they have also been central to the development of numerous applications in other fields, including environmental sciences [3], production processes [4], finance [5], the debugging of artificial intelligence programs [6], and genetics [7]. In the health sciences, BNs are applied to improve treatments, diagnosis, and prognosis, helping physicians to make reliable decisions by enabling faster and more accurate prediction [8]. According to Kyrimi et al. [9], the reason behind the wide application of BNs in medical diagnosis lies in their ability to express expert knowledge, model uncertainty, and deal with incomplete data. This literature review aims to provide a comprehensive description of BNs and their applications in medicine in general, as in [9]. In particular, this review will focus on diagnostic and prognostic processes, as well as seeking to show the advantages of this method compared to others from the fields of artificial intelligence or machine learning, areas that have been attracting more attention in recent times. Before presenting the conducted searches and analysing the content of the articles, we will provide a concise overview of Bayesian networks and their types, classifiers, and evaluation methods. The aim is to introduce the reader to the subject matter, thereby facilitating a better understanding of the results and terminology used in the articles that will be analysed subsequently.

1.1. Definition of Bayesian Networks

We will now describe the basic concepts around BNs, the types of networks, and the unique structures known as Bayesian classifiers. We also briefly address the topic of learning about parametric and structural BNs, and finally we present the computation of a BN, describing the software available for BN applications.

Let us start by looking at the example in Figure 1, which defines a causal relationship between the variables flu, cough, and fever, where flu is expected to causally influence the presence of a cough or fever. The variables cough and fever are not independent. If a person has a cough, they may have the flu, and then they probably have a fever.

However, consider that, if a person has the flu, it is reasonable to conclude that the presence of a fever does not depend on the presence of a cough. Therefore, we assume that the variables cough $(X_{2})$ and fever $(X_{3})$ are conditionally independent given the variable flu $(X_{1})$. Formally, this can be expressed as follows:

\begin{matrix} P (X_{2} | X_{1}, X_{3}) = P (X_{2} | X_{1}) \\ P (X_{3} | X_{1}, X_{2}) = P (X_{3} | X_{1}) . \end{matrix}

Therefore,

P (X_{1}, X_{2}, X_{3}) = P (X_{1}) P (X_{2} | X_{1}) P (X_{3} | X_{1}) .

The variable $X_{1}$ has an influence on both $X_{2}$ and $X_{3}$, but it is assumed that there is no direct relationship between $X_{2}$ and $X_{3}$. The representation of these relationships is usually achieved via a node and arrow diagram, connecting the influencing variables (primary variables) with the influenced variables (secondary variables). The structure shown in Figure 1 is a BN.

Considering the graphic structure of Figure 1 and, more precisely, the parents of each variable, the joint probability distribution can be written as follows:

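P (X_{1}, X_{2}, X_{3}) = P (X_{1} | Pa (X_{1})) \cdot P (X_{2} | Pa (X_{2})) \cdot P (X_{3} | Pa (X_{3}))

where $Pa (X_{i})$ denotes the set of parent variables of $X_{i}$ in the graph. Since $X_{1}$ has no parents in Figure 1, $P (X_{1} | Pa (X_{1}))$ is simply $P (X_{1})$.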

This equation is the formal definition of a BN, in the case of three variables. By analysing and classifying the conditional independence relationships between the three variables, we have converted $P (X_{1}, X_{2}, X_{3})$ into a product of three conditional probabilities.
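The factorisation for the flu, cough, and fever example can be checked numerically. The following is a minimal sketch in Python, assuming purely hypothetical probability values (they are not taken from any reviewed study) and the illustrative function names `joint` and `cough_given`. It builds the joint distribution as $P (X_{1}) P (X_{2} | X_{1}) P (X_{3} | X_{1})$ and confirms that, once flu is known, the probability of a cough no longer depends on fever.

```python
from itertools import product

# Hypothetical probabilities, chosen only for illustration.
p_flu = {True: 0.1, False: 0.9}                 # P(X1)
p_cough_given_flu = {True: 0.8, False: 0.2}     # P(X2 = True | X1)
p_fever_given_flu = {True: 0.7, False: 0.05}    # P(X3 = True | X1)


def bernoulli(p_true, value):
    """Return P(V = value) for a binary variable with P(V = True) = p_true."""
    return p_true if value else 1.0 - p_true


def joint(flu, cough, fever):
    """P(X1, X2, X3) = P(X1) * P(X2 | X1) * P(X3 | X1)."""
    return (p_flu[flu]
            * bernoulli(p_cough_given_flu[flu], cough)
            * bernoulli(p_fever_given_flu[flu], fever))


# The eight joint probabilities form a proper distribution.
total = sum(joint(*state) for state in product([True, False], repeat=3))


def cough_given(flu, fever):
    """P(X2 = True | X1 = flu, X3 = fever), computed from the joint."""
    numerator = joint(flu, True, fever)
    denominator = joint(flu, True, fever) + joint(flu, False, fever)
    return numerator / denominator


# Conditional independence: the value of fever does not change the result.
print(cough_given(True, True), cough_given(True, False))
```

With binary variables, the unrestricted joint distribution needs 2^3 - 1 = 7 free parameters, whereas this factorised form needs only 1 + 2 + 2 = 5, which illustrates how independence relationships reduce the number of parameters.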

Probabilistic networks are visual depictions of the variables and their connections that define a particular scenario [10]. One of the most popular types is the BN [11], which provides information on the conditional dependencies and independencies among variables. The existence of independent relationships in a network makes BNs a valuable instrument in portraying knowledge consistently, as they decrease the required number of parameters. These relationships simplify the expression of the joint probability function by representing it as the multiplication of the conditional probability functions of each variable.

Because they represent a probability distribution, BNs confer a clear meaning, so they can be adapted for diagnostics, learning, explanation, and inferences [12]. Depending on the meaning given to them, they can represent causality [13] or correlations [14].

It could thus be said that a Bayesian network is a DAG whose nodes represent random variables and whose arcs represent direct dependencies—for example, causal relationships; see Figure 2. Each node has a conditional probability table, which quantifies the relationships between the connected variables [15].

A BN is a probabilistic model composed of two different parts: on the one hand, there is the graphical structure (DAG), which defines the relationships between the variables, and, on the other hand, it considers the probabilities established between these variables [1]. The elements of a BN are as follows [16]:

A set of variables (continuous or discrete) that form the vertices or nodes of the network;
A set of directed links connecting a pair of vertices—if there is a relationship with the direction $X \to Y$ , then X is said to be the parent of Y.

The network fulfills the following conditions:

There is an association between each vertex $X_{i}$ and a conditional probability function $P (X_{i} | Pa (X_{i}))$ , which takes as input a specific set of values for the parent variables of the vertex and gives the probability of each value of $X_{i}$ ;
The graph does not have directed cycles, i.e., it does not have directed trajectories or paths that start and end at the same node.
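The acyclicity condition can be verified mechanically before any probabilities are attached to a graph. The sketch below is one possible check, using Kahn's topological sorting algorithm on an edge list; the helper names `is_acyclic` and `parents` and the example edges are assumptions introduced for illustration only.

```python
from collections import deque


def is_acyclic(nodes, edges):
    """Return True if the directed graph has no directed cycle (Kahn's algorithm)."""
    children = {n: [] for n in nodes}
    in_degree = {n: 0 for n in nodes}
    for parent, child in edges:
        children[parent].append(child)
        in_degree[child] += 1

    # Start from the root nodes (no parents) and peel the graph layer by layer.
    queue = deque(n for n in nodes if in_degree[n] == 0)
    visited = 0
    while queue:
        node = queue.popleft()
        visited += 1
        for child in children[node]:
            in_degree[child] -= 1
            if in_degree[child] == 0:
                queue.append(child)
    # Every node can be removed only when the graph contains no directed cycle.
    return visited == len(nodes)


def parents(node, edges):
    """Return the sorted list of parents of a node, i.e. the set Pa(node)."""
    return sorted(p for p, c in edges if c == node)


nodes = ["flu", "cough", "fever"]
dag_edges = [("flu", "cough"), ("flu", "fever")]
cyclic_edges = dag_edges + [("cough", "flu")]   # deliberately invalid structure

print(is_acyclic(nodes, dag_edges), is_acyclic(nodes, cyclic_edges))
```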

BNs support inference algorithms based on Bayes’ theorem that both describe and forecast: the posterior probabilities of unknown variables are calculated from the known variables. A BN provides an inference system that updates the probability distributions as soon as new evidence is found about some vertices, and the updated probabilities are propagated to the remaining nodes. This propagation is known as probabilistic inference, which means that the probability of some variables can be determined from the evidence provided on other variables. Before the evidence is presented, these probabilities are called prior probabilities; afterwards, they are called posterior probabilities [16,17].
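To make the move from prior to posterior probabilities concrete, the following sketch performs inference by enumeration in the three-node flu network. All numerical values are hypothetical, and the function names `posterior_flu` and `prob_fever_given` are illustrative assumptions; dedicated BN software uses more efficient propagation algorithms than this brute-force calculation.

```python
# Hypothetical conditional probability tables for the network flu -> cough, flu -> fever.
P_FLU = 0.1                                  # prior P(flu = True)
P_COUGH = {True: 0.8, False: 0.2}            # P(cough = True | flu)
P_FEVER = {True: 0.7, False: 0.05}           # P(fever = True | flu)


def posterior_flu(evidence):
    """Return P(flu = True | evidence).

    `evidence` maps 'cough' and/or 'fever' to True/False; variables that are
    absent from the dictionary are unobserved and sum out to one.
    """
    weights = {}
    for flu in (True, False):
        w = P_FLU if flu else 1.0 - P_FLU
        if "cough" in evidence:
            w *= P_COUGH[flu] if evidence["cough"] else 1.0 - P_COUGH[flu]
        if "fever" in evidence:
            w *= P_FEVER[flu] if evidence["fever"] else 1.0 - P_FEVER[flu]
        weights[flu] = w
    return weights[True] / (weights[True] + weights[False])


def prob_fever_given(evidence):
    """Return P(fever = True | evidence); only valid when fever itself is unobserved."""
    p = posterior_flu(evidence)
    return p * P_FEVER[True] + (1.0 - p) * P_FEVER[False]


prior = posterior_flu({})                                  # no evidence yet
after_cough = posterior_flu({"cough": True})               # one finding observed
after_both = posterior_flu({"cough": True, "fever": True}) # two findings observed
print(prior, after_cough, after_both, prob_fever_given({"cough": True}))
```

In this toy setting, observing a cough raises the probability of flu, and that change is in turn transferred to the unobserved fever node.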

1.1.1. Types of Bayesian Networks

Depending on the type of random variable used in the construction of a BN, there are discrete BNs, where each variable takes a finite number of values, and the probability distribution associated with each variable is multinomial. If the variables used to build the Bayesian networks are continuous, one method is to discretise the variables, i.e., to divide them into nominal intervals. However, discretisation implies a certain loss of information and the assignment of many parameters [18]. Modelling with continuous variables is a good alternative in this case. Gaussian BNs are used when continuous variables are used in the BN construction process, and each of them has a multivariate normal distribution [19]. When discrete and continuous variables are introduced in the construction process, they are called mixed BNs.

In discrete BNs, the probability distribution is determined by the probability tables, while, in Gaussian BNs [20], it is determined by the joint density function.

According to [21], dynamic BNs are necessary in scenarios where conditions evolve over time (such as the sequential activation of brain areas during cognitive decision-making). A standard BN sets the initial conditions. In dynamic BNs, the structures representing time intervals are consistent, and the conditional probabilities remain unchanged over time. Thus, dynamic BNs are time-invariant models, where ‘dynamic’ refers to their capability to model systems with temporal dynamics. However, three years later, Dagum [22] refined the concept of dynamic Bayesian networks to unify and extend them to forecast time-dependent systems.
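The idea of a time-invariant dynamic BN can be sketched with its simplest special case, a hidden Markov model with one hidden state and one observation per time slice. The example below is an assumption-laden illustration rather than a method from the cited works: the states, the probabilities, and the function name `filter_states` are all hypothetical. An initial distribution plays the role of the standard BN for the first slice, and the same transition and observation tables are reused at every step.

```python
STATES = ("well", "ill")

# Hypothetical parameters, for illustration only.
INITIAL = {"well": 0.9, "ill": 0.1}                  # first time slice
TRANSITION = {                                       # P(state_t | state_{t-1}), same for all t
    "well": {"well": 0.9, "ill": 0.1},
    "ill": {"well": 0.3, "ill": 0.7},
}
P_FEVER = {"well": 0.05, "ill": 0.7}                 # P(fever observed | state_t)


def filter_states(observations):
    """Return P(state_t | observations up to t) for each time step t."""
    beliefs = []
    belief = dict(INITIAL)
    for t, fever in enumerate(observations):
        if t > 0:
            # Prediction: push the belief through the time-invariant transition model.
            belief = {s: sum(belief[r] * TRANSITION[r][s] for r in STATES)
                      for s in STATES}
        # Update: weight each state by the likelihood of the current observation.
        weighted = {s: belief[s] * (P_FEVER[s] if fever else 1.0 - P_FEVER[s])
                    for s in STATES}
        norm = sum(weighted.values())
        belief = {s: w / norm for s, w in weighted.items()}
        beliefs.append(belief)
    return beliefs


# Fever on two consecutive days, then no fever on the third day.
beliefs = filter_states([True, True, False])
for day, b in enumerate(beliefs, start=1):
    print(day, round(b["ill"], 3))
```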

1.1.2. Bayesian Classifiers

Bayesian classifiers are a particular type of BN in which one specific variable is the class and the other variables are attributes or characteristics. A Bayesian classifier determines the posterior probability of each class $C_{i}$ (e.g., ill or not ill given some symptoms), given the evidence $E$, using Bayes’ theorem:

P (C_{i} | E) = P (C_{i}) P (E | C_{i}) / P (E)

Many Bayesian classifiers have been proposed for the estimation of probabilities [23], such as the naive Bayesian classifier (NBC), the tree-augmented naive Bayesian classifier (TAN), and the Bayesian network-augmented naive Bayes (BAN) classifier.

The NBC assumes that the features are independent of each other given the class, so the probability is found by multiplying the individual conditional probabilities of each feature given the class vertex [24]:

P (C | E_{1}, \ldots, E_{n}) = P (C) P (E_{1} | C) \cdots P (E_{n} | C) / P (E)

where $P (C)$ is the vector of the prior probabilities of each class, and $P (E_{i} | C)$ is the conditional probability matrix of each feature. The features are conditionally independent given the class, so that there are no edges between them, as shown in Figure 3.
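The naive Bayes calculation can be sketched in a few lines. The example below assumes hypothetical priors and feature likelihoods for a two-class problem (ill versus not ill) and an illustrative function name, `nbc_posterior`. Sums of logarithms replace the product of probabilities, which is a common way to avoid numerical underflow when there are many features.

```python
import math

# Hypothetical parameters, for illustration only.
PRIOR = {"ill": 0.1, "not ill": 0.9}                         # P(C)
LIKELIHOOD = {                                               # P(E_i = present | C)
    "ill": {"cough": 0.8, "fever": 0.7, "headache": 0.6},
    "not ill": {"cough": 0.2, "fever": 0.05, "headache": 0.1},
}


def nbc_posterior(findings):
    """Return P(C | E) for every class, given a dict feature -> True/False."""
    log_scores = {}
    for c, prior in PRIOR.items():
        score = math.log(prior)
        for feature, present in findings.items():
            p = LIKELIHOOD[c][feature]
            # Features are treated as conditionally independent given the class.
            score += math.log(p if present else 1.0 - p)
        log_scores[c] = score
    # Normalising over the classes plays the role of dividing by P(E).
    top = max(log_scores.values())
    unnormalised = {c: math.exp(s - top) for c, s in log_scores.items()}
    z = sum(unnormalised.values())
    return {c: u / z for c, u in unnormalised.items()}


post = nbc_posterior({"cough": True, "fever": True, "headache": False})
predicted = max(post, key=post.get)
print(predicted, round(post[predicted], 3))
```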

The NBC presents high classification precision in many areas [25]. However, sometimes, its performance is affected by the dependence of the features or attributes. One way to resolve this situation is to expand the basic NBC structure by adding edges between features. Two basic options are known [23]:

The Tree-Augmented Naive Bayes (TAN), with a tree structure added between the characteristics, so that, in principle, there are few links and the complexity of the structure does not increase substantially;
The Bayesian network-Augmented Naive Bayes (BAN), which adds a general dependency structure between features, without restrictions.

1.1.3. Learning Bayesian Networks

Finding the structure that best fits the data is one of the key problems to solve when using BNs. Learning a BN involves two tasks: structural learning, which obtains the network structure (trees, poly-trees, or multi-connected networks) from the data; and parametric learning, which estimates the associated probabilities (probability tables) of each root node and of the other variables given their parents. In order to determine the probability distributions linked to each vertex of the network, it is necessary to know the structure of the network, i.e., these two learning approaches cannot be carried out independently [24].
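Parametric learning for a discrete BN with a known structure reduces, in its simplest form, to counting. The sketch below assumes a fixed structure flu -> cough, a small set of made-up records, and an illustrative function name, `estimate_cpt`. It estimates $P (cough | flu)$ by relative frequencies, with optional Laplace smoothing (the `alpha` parameter, also an assumption of this example) to avoid zero probabilities in small samples. Structural learning is a harder search problem and is not shown here.

```python
from collections import Counter

# Hypothetical records of (flu, cough) for eight patients.
records = [
    (True, True), (True, True), (True, False),
    (False, False), (False, False), (False, True),
    (False, False), (False, False),
]


def estimate_cpt(data, alpha=1.0):
    """Estimate P(cough = True | flu) from (flu, cough) pairs.

    alpha = 0 gives the maximum likelihood estimate (relative frequencies);
    alpha > 0 applies Laplace smoothing for a binary child variable.
    """
    counts = Counter(data)
    cpt = {}
    for flu in (True, False):
        n_true = counts[(flu, True)]
        n_total = n_true + counts[(flu, False)]
        cpt[flu] = (n_true + alpha) / (n_total + 2 * alpha)
    return cpt


mle = estimate_cpt(records, alpha=0.0)
smoothed = estimate_cpt(records, alpha=1.0)
print(mle, smoothed)
```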

1.1.4. Computation of a BN

The use of software plays an important role when building the structure of a BN, since it allows the graphic representation of a network so that it is understandable to those involved in its development. Ref. [26] describes the software programs Netica v.5.0 (Norsys Software Corp., Vancouver, BC, Canada), Elvira v.2.0 (Department of Statistics and Applied Mathematics University of Almería, Almería, Spain), and Hugin v.9.4 (HUGIN EXPERT A/S, Aalborg, Denmark). Demo versions of Netica and Hugin are available free of charge, but some features are locked. The demo version of the Hugin software allows for a maximum of 20 variables, which is not enough to solve sequence analysis problems or even to compare the results with other tools. Elvira is a Java-based software tool that emerged from a collaboration among several Spanish universities, aimed at building and evaluating graphical probabilistic models. The software offers both exact and approximate algorithms to handle discrete and continuous variables and includes features for explanation, learning from databases, and network fusion [27].

The study by Charles River Analytics [11] describes the BNet program, which applies BNs to predict and display weather forecasts. Murphy et al. [28] provide different software applications for BNs. There are many private and free software programs available on the Internet.

Kenet et al. [29] present a list of software packages used to generate and analyse BNs, including the following.

GeNIe v.4.1.R2 (BayesFusion, LLC, Pittsburgh, PA, USA) is a development environment for reasoning in graphical probabilistic models, and SMILE is its inference engine. GeNIe is freely available for any use and has several useful features, such as a module for the exploration and analysis of a data set and the learning of BNs and their numerical parameters from data. It has a special module that addresses problems related to diagnosis [30].
The R bnlearn package [31] is powerful and free. Compared to other available Bayesian network software, it can implement more constraint-based and score-based methods [31]. This R package is our preferred choice.
Bayesia has also developed a proprietary technology for the analysis of BNs. In collaboration with laboratories and large research projects, the company develops innovative technological solutions. Its products include (1) BayesiaLab, a learning program; (2) Bayesia Market Simulator, a simulation software package that can be used to compare the influences of a set of competing offerings relative to a defined population; (3) Bayesia Engines, a library of software components through which the modelling and use of Bayesian networks can be integrated; and (4) Bayesia Graph Layout Engine, a library of software components used to integrate the automatic positioning of graphs into specific applications.

1.1.5. Model Validation

The validation of a model justifies its correctness. Validation usually involves checking the performance of the model and its further improvement. Model performance can be expressed through several measures [30], as shown in Table 1.

Now, using the information of the confusion matrix, some basic performance measures will be defined as follows:

\begin{matrix} \text{Sensitivity (Recall)} & = \frac{TP}{TP + FN} \\ \text{Specificity} & = \frac{TN}{TN + FP} \\ \text{Accuracy} & = \frac{TP + TN}{TP + TN + FP + FN} \\ \text{Positive predictive value (PPV)} = \text{Precision} & = \frac{TP}{TP + FP} \\ \text{False Discovery Rate (FDR)} & = 1 - PPV \\ \text{F1 Score} & = 2 \times \frac{PPV \times \text{Sensitivity}}{PPV + \text{Sensitivity}} \end{matrix}

Sensitivity, also known as recall or the true positive rate, is a measure used to evaluate the performance of a classification model. It is defined as the proportion of actual positives (e.g., diseases, positive cases) that are correctly identified by the model.
Specificity, also referred to as the true negative rate, assesses the ability of a classification model to correctly identify negative instances. It represents the proportion of actual negatives (for example, cases where a condition is not present) that the model correctly recognises as such.
Accuracy is a measure used in a confusion matrix to evaluate the overall performance of a classification model. It is defined as the proportion of correct predictions (both positive and negative) made by the model out of all predictions. In essence, the accuracy assesses how often the model makes the correct decision for both positive and negative instances.
The F1 score is a metric used within a confusion matrix to gauge the balance between precision and sensitivity (recall) in a classification model. It harmonises both measures into a single metric by calculating their harmonic mean. Essentially, the F1 score provides insights into the model’s accuracy in predicting positive instances while considering both false positives and false negatives.

Two other metrics that are commonly used in the context of diagnostic testing are the positive predictive value (PPV) and the negative predictive value (NPV).

The positive predictive value (PPV), or precision, is a statistical measure that quantifies the proportion of patients or test subjects who are correctly identified as having a condition (true positives) out of all of those who test positive for the condition, whether correctly or not (true positives and false positives). It reflects the likelihood that a positive test result accurately indicates the presence of the disease or condition in question. The PPV is crucial in evaluating the effectiveness of diagnostic tests, particularly in determining their reliability in correctly diagnosing patients within specific populations or under certain conditions.

\begin{matrix} \text{Precision or PPV} & = \frac{TP}{TP + FP} \end{matrix}

The negative predictive value (NPV) represents the proportion of individuals who test negative for a condition and are correctly identified as not having the condition (true negatives), out of all of those who receive a negative test result (both true negatives and false negatives). It is an essential measure in assessing the accuracy of diagnostic tests, indicating the probability that a negative test result genuinely reflects the absence of the disease or condition. The NPV is particularly important in confirming the efficacy of diagnostic procedures in ruling out diseases in patients.

\begin{matrix} \text{NPV} & = \frac{TN}{TN + FN} \end{matrix}
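The measures defined above follow directly from the four cells of the confusion matrix. The following sketch computes them for a hypothetical screening scenario with 1000 subjects and a prevalence of 10%; the counts and the function name `diagnostic_metrics` are assumptions made for illustration, and no guard against empty denominators is included.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Compute the basic performance measures from confusion matrix counts."""
    sensitivity = tp / (tp + fn)              # recall, true positive rate
    specificity = tn / (tn + fp)              # true negative rate
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    ppv = tp / (tp + fp)                      # precision
    npv = tn / (tn + fn)
    fdr = 1.0 - ppv                           # false discovery rate
    f1 = 2 * ppv * sensitivity / (ppv + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "ppv": ppv,
        "npv": npv,
        "fdr": fdr,
        "f1": f1,
    }


# Hypothetical counts: 100 diseased and 900 healthy subjects.
m = diagnostic_metrics(tp=80, fp=30, tn=870, fn=20)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

In this made-up example, the PPV is noticeably lower than both the sensitivity and the specificity, which shows why predictive values should be read together with the prevalence of the condition in the population being tested.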

In some cases, in addition to detecting the types of errors produced by the model, the test must meet high sensitivity and specificity requirements. In this regard, it is useful to take measurements of the receiver operating characteristic curve (ROC) [32], graphically observing the relationship between the two measurements. The area under the curve (AUC) provides a good measure of how well the BN model or classifier works. The closer to unity the area is, the more closely the model or classifier will conform to an ideal model or classifier (100% true positives and 0% false positives).
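The AUC has a useful probabilistic reading: it equals the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The sketch below computes the AUC in this pairwise way for a small set of hypothetical scores; the function name `roc_auc` and the data are assumptions, and the quadratic loop is meant for clarity rather than efficiency.

```python
def roc_auc(labels, scores):
    """Return the area under the ROC curve via pairwise comparison.

    labels: 1 for diseased, 0 for healthy.
    scores: predicted probability of disease for each subject.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0      # positive ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(positives) * len(negatives))


# Hypothetical posterior probabilities produced by a classifier.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
auc = roc_auc(labels, scores)
print(round(auc, 3))
```

A value of 1 corresponds to the ideal classifier described above, whereas a classifier that assigns the same score to every subject obtains 0.5.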

1.2. Bayesian Networks in the Health Sciences

Since their creation, BNs have been linked to medical research, as probabilistic graphs are well suited to solving clinical problems. Understanding the causal relationships that lead to physiological processes is a challenge in clinical procedures; moreover, the health sciences frequently require reasoning through difficult problems under conditions of uncertainty.

Probabilistic diagnosis has gained popularity in the medical literature in recent decades. According to [33,34], Bayesian methods began to be adopted in diagnostic procedures in the USA and the UK during the 1960s. These diagnoses used frequentist probabilistic methods to select variables that included possible diagnoses, as well as several variables that typically corresponded to symptoms. In the 1980s, some authors [35,36] described the development of BNs and influence diagrams in the medical field, from their definition to the construction of algorithms for the efficient processing of uncertainty.

In applying probabilistic graph models, researchers encounter uncertainty and inaccuracy due to three main reasons: randomness, a lack of precision in data collection, and model flaws. In clinical procedures, this manifests as incomplete clinical histories, subjective components presented by physicians or patients, and inaccuracies in measurements [37].

Two approaches are taken into account when building BNs: starting from a data set, through the application of techniques from network learning; or with the help of experts, where health professionals transfer their experience to include variables with cause-and-effect relationships in the model. The two approaches can be mixed by creating models using learning techniques, removing and imposing relationships based on personal criteria.

The first approach is the fastest way to build Bayesian networks in medicine, provided that the number of observations is sufficiently large. However, it has drawbacks: each observation often includes only a few measurements in addition to the diagnosis. These variables may not be sufficient to determine their conditional independence, and they might not contribute to the health professional’s diagnosis.

In line with the above, there is usually an expert opinion on the model. The construction of BNs based on personal judgement can be divided into two stages: in the first, qualitative data are collected, clinically relevant variables are identified, and their relationships are used to build causal networks. In the second stage, quantitative information is collected: prior, conditional, and marginal probabilities.

Among specialists in statistics and information technology (the data science team), interest has been growing in training doctors so that they can adopt these models. Accordingly, the BNs used in medicine could face challenges, such as the construction of algorithms that reduce the difficulty of handling growing BNs [38].

Notably, probabilistic graph models are frequently used to forecast dependencies within a genetic test [7]. In parallel, Correa et al. [39] have developed BNs that explain assimilation networks through the modification and action exerted reciprocally between metabolites. Analytical techniques such as Raman spectroscopy, liquid and gas chromatography, and Curie-point pyrolysis mass spectrometry require the examination of a large number of features, including various mass spectra, and this extensive number of features complicates the interpretation of the data. Consequently, the data analysis begins with dimensionality reduction, selecting a smaller subset of crucial masses for further analysis while discarding less significant ones. The authors then employ a feature selection process that integrates Bayesian network learning methods with genetic algorithms, facilitating the identification of bacterial spores and the classification of Bacillus species. Similarly, in the area of neurology, dependency modelling in cellular tissues has been widely applied by means of BNs [18].

1.3. Bayesian Networks in the Diagnosis and Prognosis of Diseases

Medical diagnosis is often simplified to reasoning that entails the construction of hypotheses for each disease given the set of observed findings in a patient. The diagnosis results from choosing the most probable hypothesis for a set of observations [30]. Formally, it can be expressed by the equation

\begin{matrix} \text{Diagnosis} = \max_{i} P (D_{i} | E) \end{matrix}

where $P (D_{i} | E)$ is the probability of the disease $D_{i}$ given the evidence $E$, which represents the set of observed findings, including signs and symptoms, along with laboratory results (e.g., $P (D_{i} | E_{1}, E_{2}) = P (\text{flu} | \text{fever and cough})$).

In the medical field, prognosis aims to forecast the future condition of a patient by considering a combination of the observed symptoms and the treatment provided. Formally, it can be expressed by the equation

\begin{matrix} \text{Prognosis} = P (O | E, T) \end{matrix}

Variable E is the evidence, i.e., a set of observed findings, such as signs, symptoms, and laboratory results; T represents a prescribed treatment for a patient; and the variable O is the outcome, which can represent, for example, life expectancy, the health status, or the spread of a disease.
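The diagnosis and prognosis formulas can be combined in a small numerical sketch. The example assumes a hypothetical four-node network in which the disease D influences both the evidence E and the outcome O, while the treatment T influences only the outcome. Under that assumed structure, $P (O | E, T)$ is obtained by averaging $P (O | D, T)$ over the posterior $P (D | E)$. The disease labels, the probabilities, and the function names are illustrative and carry no clinical meaning.

```python
# Hypothetical parameters, for illustration only.
PRIOR = {"bacterial": 0.3, "viral": 0.7}             # P(D)
P_EVIDENCE = {"bacterial": 0.6, "viral": 0.2}        # P(E = observed findings | D)
P_RECOVERY = {                                       # P(O = recovery | D, T)
    ("bacterial", "antibiotic"): 0.9,
    ("bacterial", "none"): 0.5,
    ("viral", "antibiotic"): 0.8,
    ("viral", "none"): 0.8,
}


def disease_posterior():
    """Return P(D | E) for the observed findings, using Bayes' theorem."""
    weights = {d: PRIOR[d] * P_EVIDENCE[d] for d in PRIOR}
    z = sum(weights.values())
    return {d: w / z for d, w in weights.items()}


def diagnosis(posterior):
    """Return the most probable disease, i.e. the D_i maximising P(D_i | E)."""
    return max(posterior, key=posterior.get)


def prognosis(posterior, treatment):
    """Return P(O = recovery | E, T) by summing over the unknown disease."""
    return sum(posterior[d] * P_RECOVERY[(d, treatment)] for d in posterior)


post = disease_posterior()
print(diagnosis(post), post)
print(prognosis(post, "antibiotic"), prognosis(post, "none"))
```

The same posterior serves both tasks: its maximum gives the diagnosis, and its weighted average over the outcome table gives the prognosis under each candidate treatment.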

Based on the information provided above, we are confident that the reader will be able to comprehend all of the articles that will be presented in the review of the diagnosis/prognosis literature.

2. Materials and Methods

The objective of this review is to comprehensively examine and synthesise the existing research on Bayesian networks (BNs) and their applications within the medical field, with a particular focus on their roles in disease diagnosis and prognosis.

2.1. Registration Statement

This review has been registered with PROSPERO, the international prospective register of systematic reviews. The registration details can be accessed through the PROSPERO database under the registration number 535260. The review protocol was registered before the start of the literature search to ensure transparency and reduce the risk of reporting bias. The protocol outlines the methods for the selection of the literature, data extraction, analysis, and the specific criteria used to assess the application of Bayesian networks in diagnosis and prognosis in the health sciences. The aim of this review is to synthesise the existing studies and provide a comprehensive evaluation of the effectiveness and accuracy of Bayesian networks in clinical settings.

2.2. PRISMA Statement

This review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. The methods of the literature search, selection, and data synthesis were structured to ensure the comprehensive coverage and unbiased analysis of studies involving Bayesian networks for diagnosis and prognosis in the health sciences. A PRISMA flow diagram has been included to illustrate the search process and selection of studies (Figure 4). Additionally, the PRISMA checklist has been completed and is available in the Supplementary Materials to this article, providing detailed information on the adherence to these guidelines throughout the review process.

2.3. Data Sources

Web of Science, Scopus, ProQuest Central, Google Scholar, and PubMed Central were searched for published articles containing the following keywords: Bayesian networks, disease prediction, medical diagnosis, applications in medicine, prognosis of disease. The search strategies implemented in this study were constructed utilising predefined keywords and aligning them with the pertinent descriptors of the chosen databases. The searches covered the literature from 1983 to the present to ensure the exhaustive inclusion of pertinent studies throughout this extensive timeframe. This methodology was designed to enable an in-depth review of the existing body of knowledge, thereby facilitating a nuanced understanding of the advancements and discoveries in the field.

MeSH terms: "Medical Informatics"; "Decision Making, Computer-Assisted"; "Diagnosis, Computer-Assisted"; and "Prognosis".

2.4. Data Extraction

The search period was from 1983 to 2023. The references of the included articles were hand-searched to identify additional relevant records based on the title, abstract, and full text.

2.5. Synthesis

The included studies were classified according to their main focus: (1) BNs, (2) the application of BNs in the health sciences, and (3) applications of BNs in the diagnosis and/or prognosis of diseases. Groupings were defined based on the data.

Publications considered for this scoping review had to meet two requirements. First, the disease had to be relatively common, and the article had to report the variables involved in the diagnosis and/or prognosis together with the directed acyclic graph (DAG) of the Bayesian network, or at least indicate that a DAG had been used to explore the causal pathways leading to the diagnosis. Second, some measure of precision had to be provided. This second requirement was relaxed if the article was considered to provide sufficient information about the system, as in the case of [40], which describes a web-based decision-making system.

3. Results

The searches yielded a total of 190 records, and, following the screening process, 60 articles focusing on BNs applied to medicine for diagnosis and prognosis were finally included in the review (Figure 4). Among the excluded studies, after close reading, seven articles were found that did not use BNs, fifteen considered BNs but did not include any examples or references to disease diagnosis or prognosis, and eight did not provide any measure (‘non-measure’) of the accuracy of the network.

Many models of BNs have been developed for medical diagnosis. They can be built using data, knowledge, the medical literature, ontologies, other sources of medical evidence, or a combination of these. For example, Moreira et al. [41] presented a Bayesian network model for the diagnosis of dementia, constructed based on expert knowledge. Seixas et al. [42] compared multiple models of BNs (developed from knowledge and data) with other classifiers fully learned from data to diagnose Alzheimer’s disease, dementia, and mild cognitive impairment. For each of these conditions, the performance of the knowledge-based BN models, with AUC values of 0.86, 0.96, and 0.97, respectively, was comparable to that of the best data-based classifier, with AUCs of 0.90, 0.98, and 0.97, respectively. In contrast, Yin et al. [43] analysed the data to identify the most important variables and then trained a BN to diagnose Alzheimer’s disease based on this information. Sa-Ngamuang et al. [44] presented a BN model to differentiate dengue from other acute febrile illnesses, combining expert knowledge, published evidence, and data. Farmer [45] constructed a BN for the diagnosis of musculoskeletal disorders of the shoulder, using expert knowledge and the medical literature, along with retrospective data. The expert analysis was performed in two stages and an initial model was updated following a review by a panel of orthopaedic specialists. Refai et al. [46] proposed a maintenance strategy for BNs that involves implementing policies to update a fixed structure and considering its reorganisation by defining additional variables, known as maintenance actions. These actions could involve adding or deleting variables and editing their values.

Some authors have suggested extracting medical knowledge from dictionaries or medical ontologies. Bucci et al. [47] built a two-level BN model for diagnosis that automatically extracted medical knowledge from an ontology; the diagnostic part was supplemented by a decision network to evaluate the available tests using utility values specified by a clinician. In Zhang et al. [48], several models, including a BN model, were automatically developed to diagnose mild cognitive impairment using a combination of approaches, including insights drawn from the SNOMED ontology. Shen et al. [49] explored the construction of a clinical BN for the probabilistic inference of medical ontology based on electronic medical records. By evaluating the learned topology against the opinions of expert clinicians and entropy calculations, and by calculating the diagnostic classification based on ontology, the study demonstrated that the direct and automated construction of a high-quality health topology and ontology is feasible using medical records.

Verduijn et al. [50] presented a prognostic BN that implements a process-oriented, dynamic view of prognosis. The learning procedure optimises outcome prediction and accounts for patient dropout at an early stage. The paper describes how prognostic BNs can be applied to solve a series of information problems related to medical prognosis.

Other studies have used only data to build BN models. For example, Moreira and Namen [41] combined structured data with information extracted from medical notes (i.e., text) to build a BN model for the diagnosis of dementia. In another study, Somnay et al. [51] trained four data-based classifier models (rules, logistic regression, trees, and Bayesian networks) to recognise primary hyperparathyroidism.

Comprehensive Review of Bayesian Networks on Some Diseases

Velikova et al. [52] presented the decomposition of a BN to model the detection of breast cancer, demonstrating some advantages of the method: the natural and more intuitive representation of breast abnormalities and their characteristics; the compact representation and efficient manipulation of large conditional probability tables; and a possible improvement in the processes of knowledge acquisition and representation.

Cruz-Ramirez et al. [53] evaluated the effectiveness of seven BN classifiers as potential tools for the diagnosis of breast cancer, using two real-world databases of fine needle aspirates of breast lesions, one collected by a single observer and the other by multiple observers. The results revealed an element of subjectivity in the data: the mean accuracy was 93.0% for the single-observer data set, compared with 83.3% for the multiple-observer data set. The findings suggest that observers identify different aspects when examining samples under a microscope, which significantly decreases the performance of these classifiers in diagnosing the disease.

Elazmeh et al. [54] built a reliable BN model for the early assessment of emergency paediatric asthma exacerbation. Their predictive model was capable of distinguishing between patients with mild or moderate/severe asthma attacks at a medically acceptable level of performance.

Yaneli et al. [55] used thermography for the pre-diagnosis of breast cancer, assessing the predictive value of its attributes with a BN built from a database of 98 patients. The results suggested that these attributes were not sufficient to produce good results in the pre-diagnosis of breast cancer. The models also showed unexpected interactions between thermographic attributes, especially those directly related to the class variable.

Djebbar and Merouani [56] used BNs to model case-based reasoning and applied it to the diagnosis of liver disease. They found the BN to be an excellent tool for modelling uncertainty, owing to its clear graphical representation and the laws of conditional probability based on the similarity function, and useful for selecting the most similar cases in the retrieval phase and optimising that phase. Two exact inference algorithms, JLO and Pearl, were used to calculate the conditional probabilities in order to improve the performance of case-based reasoning.

Park et al. [57] built a prognostic model for metabolic syndrome using BNs with an evolutionary optimisation ordering approach. The proposed model outperformed both the conventional BN model and neural network models.

Vila-Francés et al. [1] presented a decision support system that uses a BN to predict the probability of unstable angina based on clinical data. The final model was implemented as a web application, which was reported to be undergoing validation by clinical specialists.

Seixas et al. [42] proposed a BN model for the early detection of Alzheimer’s disease, dementia, and mild cognitive impairment. The structure was built by experts in the field, and its parameters were estimated from a real-world data set using learning algorithms. The proposed BN showed better accuracy in diagnosing Alzheimer’s disease, dementia, and mild cognitive impairment than most other known classifiers. In addition, it provided useful information to clinicians, such as the contribution of certain factors to the diagnosis.

Zhou et al. [40] described the development of BiotinNet, an Internet-based program that uses BNs to integrate published data on various aspects of biotin metabolism. Users can provide a combination of values for biotin-related metabolite levels to obtain predictions for other metabolites that are not specified. This system can help researchers to design future experiments while enabling the continuous incorporation of new data.

Sesen et al. [58] assessed the feasibility of BNs in providing accurate and personalized survival estimates and treatment selection recommendations in lung cancer care using the Lung Cancer Database (LUCADA).

Wang et al. [59] proposed a BN model to describe and predict the occurrence of brain metastases from lung cancer, which outperformed the naive Bayes, logistic regression, and support vector machine models in terms of the average sampled sensitivity. Furthermore, the proposed BN had advantages over the other approaches in interpreting how brain metastases develop from lung cancer.

Cai et al. [60] combined BNs with importance measures to identify key factors that have significant effects on the survival times of patients with hepatocellular carcinoma after hepatectomy. The accuracy of the BN model was 67.2%. The true positive rate (TPR) of the model was 83.22%, and the false positive rate (FPR) was 48.67%.
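The three figures reported above are linked by a simple identity: for a binary outcome evaluated on a single test set, accuracy is the prevalence-weighted average of the true positive rate and the true negative rate (1 − FPR). The following minimal Python sketch illustrates that identity using the reported values. It assumes (this is not stated in the source) that all three figures refer to the same binary evaluation; the function names are illustrative only.

```python
def balanced_accuracy(tpr, fpr):
    """Unweighted mean of the true positive rate and the true negative rate."""
    return (tpr + (1.0 - fpr)) / 2.0


def implied_positive_fraction(accuracy, tpr, fpr):
    """Solve accuracy = p * TPR + (1 - p) * TNR for the positive fraction p.

    Only meaningful if accuracy, TPR and FPR were computed on the same
    binary test set (an assumption made for this illustration).
    """
    tnr = 1.0 - fpr
    if tpr == tnr:
        raise ValueError("positive fraction is not identifiable when TPR equals TNR")
    return (accuracy - tnr) / (tpr - tnr)


# Values as reported for the BN model of Cai et al. [60].
acc, tpr, fpr = 0.672, 0.8322, 0.4867
print(f"balanced accuracy: {balanced_accuracy(tpr, fpr):.4f}")
print(f"implied positive fraction: {implied_positive_fraction(acc, tpr, fpr):.4f}")
```

Under this assumption, the reported accuracy sits close to the balanced accuracy, which is what one would expect when the two outcome classes are of roughly equal size.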

Kaewprag et al. [61] developed a BN-based predictive model that enabled clinicians to better understand and explore clinical patient data along with risk factors for pressure ulcers in intensive care unit patients based on electronic medical records. The BN model increased the sensitivity of the prediction compared to logistic regression models, without sacrificing the overall accuracy.

Takenaka et al. [62] used a BN to predict the postoperative clinical recovery from foot drop attributable to lumbar degenerative diseases. The results suggest that clinicians can intuitively understand the layered correlations between the predictors in the BN models, which successfully provide probability estimates of postoperative tibialis anterior muscle strength to inform treatment.

Spyroglou et al. [63] examined the performance of BN classifiers in predicting asthma exacerbation based on various patient parameters, including objective measurements and medical history data. The proposed semi-naive network classifier was able to predict whether a patient would experience disease exacerbation after the last evaluation with an accuracy of 93.8% and a sensitivity of 90.9%. In addition, the resulting structure and conditional probability tables gave a clear view of the probabilistic relationships between the factors. This network could help clinicians to identify patients at high risk of exacerbation after stopping medication and confirm which factors are the most important.

Agrahari et al. [64] presented a method based on the analysis of BNs to classify two types of hematological malignancies: acute myeloid leukemia and myelodysplastic syndrome. The BN classifier showed an accuracy of 93% and a precision of 98%, surpassing the results achieved by other investigators on the same data set.

More recently, several studies have incorporated Bayesian networks as a component of their methodologies, often in conjunction with artificial neural networks or other machine learning tools. Evidence from these studies is presented below, fulfilling the criteria for the selection of the sample of studies.

Cao et al. [65] compared Bayesian networks (BNs), convolutional neural networks, and multivariate logistic regression in predicting the long-term postoperative health status. The BN demonstrated an excellent predictive capacity for type 2 diabetes and dyslipidaemia (AUC 0.94 and 0.92, respectively), a good capacity for hypertension and sleep apnoea syndrome (AUC 0.89 and 0.83, respectively), and a fair capacity for depression (AUC 0.75). The study showed that BNs are useful tools in predicting the long-term health status and comorbidities in patients after bariatric surgery.
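Since the AUC is the performance measure quoted most often in this review, a short illustration of what it measures may be useful. The sketch below computes the AUC as the probability that a randomly chosen positive case receives a higher predicted score than a randomly chosen negative case, with ties counted as one half. The labels and scores are invented for illustration and do not come from any of the reviewed studies.

```python
def auc_from_scores(labels, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly.

    labels: iterable of 0/1 outcomes; scores: predicted probabilities.
    Ties between a positive and a negative score count as 0.5.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical predictions for six patients (three with the condition).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
print(f"AUC = {auc_from_scores(labels, scores):.3f}")
```

The pairwise form is quadratic in the number of cases and is meant only to make the definition concrete; rank-based implementations are preferable for large data sets.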

Siga et al. [66] proposed a BN model to predict all-cause mortality in haemodialysis patients. The model presented an AUC of 0.78 (standard deviation [SD] 0.01); a true positive rate (sensitivity) of 72% (SD 2%); a true negative rate (specificity) of 69% (SD 2%); a positive predictive value (PPV) of 70% (SD 1%); and a negative predictive value (NPV) of 71% (SD 2%). The BN provided the most reliable prediction when comparing the results with those acquired by logistic regression.
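For readers less familiar with these measures, the sketch below shows how sensitivity, specificity, PPV, and NPV are obtained from the four cells of a confusion matrix (TP, FP, TN, FN). The counts are hypothetical: they were chosen so that the rounded values resemble those reported above for a balanced sample, and they are not the data of Siga et al. [66].

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Standard diagnostic measures from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


# Hypothetical counts: 100 patients with the outcome, 100 without.
metrics = diagnostic_metrics(tp=72, fp=31, tn=69, fn=28)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Unlike sensitivity and specificity, the PPV and NPV depend on the proportion of positive cases in the sample, so they would change if the same classifier were applied to a population with a different outcome prevalence.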

Wu et al. [67] developed a BN model to guide individually targeted antibiotic therapy at the point of care by predicting the most likely causative pathogen in children with osteomyelitis and the antibiotic with the optimal expected utility. The BN explicitly modelled the complex relationship between the unobserved infectious pathogen, the observed culture results, and clinical and demographic variables, integrating data with critical expert knowledge within the framework of causal inference.

Prochazka et al. [68] presented BN modelling as a tool to predict early disease progression in patients with follicular lymphoma. This study showed that the BN model had better prognostic power than multivariable logistic regression in predicting progression of the disease within 24 months of first-line immunochemotherapy (POD24). Unlike the logistic regression model, the BN allowed the visualisation of the complex relationships between the predictors and the individualised prediction of the POD24 risk, even if some of the predictors were unknown.
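The ability to predict when some predictors are unknown follows from marginalisation: an unobserved variable is summed out, weighted by its probability. The following sketch illustrates this on a toy network with two binary predictors, A and B, that are both parents of a binary outcome Y. The structure and all probabilities are hypothetical and are not taken from the model of Prochazka et al. [68].

```python
# Toy network A -> Y <- B. A and B have no common ancestor, so they are
# marginally independent and P(B | A) = P(B). All numbers are hypothetical.
P_B = {True: 0.3, False: 0.7}
P_Y_GIVEN_AB = {
    (True, True): 0.8,
    (True, False): 0.5,
    (False, True): 0.4,
    (False, False): 0.1,
}


def risk(a, b=None):
    """P(Y = true | A = a, B = b); if b is None, B is summed out."""
    if b is not None:
        return P_Y_GIVEN_AB[(a, b)]
    return sum(P_B[b_val] * P_Y_GIVEN_AB[(a, b_val)] for b_val in (True, False))


print(f"risk with A and B observed: {risk(True, True):.2f}")
print(f"risk with B unknown:        {risk(True):.2f}")
```

A regression model fitted on complete cases would typically require the missing predictor to be imputed first, whereas here the network's own distribution over B is used directly.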

Derevitskii et al. [69] developed a BN-based approach to predict important clinical indicators in order to improve treatment for COVID-19-related pneumonia. The models were validated using expert knowledge, the current clinical recommendations, previous research, and classic predictive metrics. Following validation on COVID-19-related pneumonia data sets from other hospitals, the proposed models could be used within decision support systems to enhance the treatment of COVID-19 pneumonia.

Tian et al. [70] illustrated the multivariable probabilistic associations between cardiovascular disease (CVD) risk, metabolic health, and obesity status, together with the factors potentially influencing these relationships, among Chinese adults. The authors concluded that network modelling is beneficial for combining expert knowledge with observational data and makes it straightforward to recognise the probabilistic dependencies and conditional independencies among variables through the graphical representation.

Lee et al. [71] proposed a BN model to identify actionable factors contributing to racial differences in the breast cancer stage at diagnosis. They employed diverse Bayesian model evaluation tools to assess two hidden pathways, as well as each of the four observed variables, in elucidating racial disparities in the stage at diagnosis.

Wu et al. [72] presented an insight-based BN modelling approach to diagnose rheumatoid arthritis. The authors explained that the Bayesian network models exhibited strong predictive abilities in forecasting osteoporosis in postmenopausal women. Moreover, these networks provided a more intuitive representation of the complex interplay of the risk mechanisms between diseases and various factors. Another study with a similar methodology examined the bone mineral density using multiple linear regression and a Bayesian network model [73].

Table 2 summarises the most significant results and provides the reader with a comparative analysis of the various studies based on their characteristics.

4. Discussion

Bayesian networks (BNs) are highly valuable tools in the health sciences, providing substantial support to medical professionals with satisfactory accuracy. This section discusses the outcomes achieved by BNs in the diagnosis and prognosis of diseases, emphasising the pertinent considerations, benefits, and limitations. BNs have demonstrated efficacy in identifying causal genetic biomarkers across a wide array of diseases, with precision that rivals that of expert clinician assessments and entropy calculations. Moreover, these networks have facilitated the development of ontology-based diagnostic classifications, enabling the automated construction of high-quality health topologies and ontologies directly from medical records.

An important part of the research on disease diagnosis systems using Bayesian networks employs dynamic retraining, where new diagnoses are incorporated as new data, leading to the re-calibration of the network [9,40,42]. This approach improves the robustness and reliability through elicitation and expert learning phases. The prototype clinical decision support system for diagnosis has demonstrated high validity and reliability. However, it has also identified several areas that require further research before clinical application can be considered.
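One simple way to picture such re-calibration is to treat each entry of a conditional probability table as a count-based estimate that is updated whenever a newly confirmed case arrives. The sketch below uses Beta pseudo-counts for a binary node. This is an illustrative technique chosen for brevity, not the method used in [9,40,42], and the class name and parent states are hypothetical.

```python
class BinaryCPT:
    """Conditional probability table for a binary node, updated case by case.

    Each parent configuration keeps (positive, negative) pseudo-counts,
    starting from a prior that can encode expert opinion.
    """

    def __init__(self, prior_pos=1.0, prior_neg=1.0):
        self.prior = (prior_pos, prior_neg)
        self.counts = {}

    def update(self, parents, outcome):
        """Incorporate one confirmed case for the given parent configuration."""
        pos, neg = self.counts.get(parents, self.prior)
        if outcome:
            self.counts[parents] = (pos + 1, neg)
        else:
            self.counts[parents] = (pos, neg + 1)

    def prob(self, parents):
        """Current estimate of P(node = true | parents)."""
        pos, neg = self.counts.get(parents, self.prior)
        return pos / (pos + neg)


cpt = BinaryCPT()
state = ("fever", "rash")  # hypothetical parent configuration
print(f"before new cases: {cpt.prob(state):.2f}")
for confirmed in (True, True, False):
    cpt.update(state, confirmed)
print(f"after three cases: {cpt.prob(state):.2f}")
```

Larger prior pseudo-counts make the table slower to move away from the expert-elicited starting point, which is one way of balancing elicitation against learning from incoming data.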

Moreover, BNs can diagnose diseases with minimal human intervention, even in mild cases, and their integration into electronic medical record systems could aid in recognising underdiagnosed disorders. Djebbar and Merouani’s findings demonstrate that combining BNs with case-based reasoning enhances the efficiency of case retrieval in liver disease diagnosis [56].

A web-based medical decision support system optimised to maximise the negative predictive value (NPV) showed potential in reducing undetected risk cases in emergency room patients with nonspecific chest pain at risk of a heart attack [1]. The quantification of the uncertainty in predictions is as critical as the prediction itself, indicating a need for further experimentation on unmeasured metabolites [40].

Furthermore, BN classifiers provide advantages over traditional clinical prediction methods by using multiple symptom-related factors simultaneously [63]. However, a significant limitation of naive Bayes classifiers is their assumption that attributes are independent given the class, which may not hold in conditions like asthma, where symptoms can interact.
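The practical effect of the independence assumption can be seen in a small numerical example. The sketch below computes a naive Bayes posterior by multiplying per-symptom likelihoods and then feeds it the same symptom twice, as would happen if two recorded symptoms were perfectly correlated. All probabilities are hypothetical and chosen for easy arithmetic.

```python
def naive_bayes_posterior(prior, lik_given_disease, lik_given_healthy):
    """P(disease | symptoms), treating symptoms as independent given the class.

    prior: P(disease); the two lists hold P(symptom_i | disease) and
    P(symptom_i | healthy) for each observed symptom.
    """
    num = prior
    den = 1.0 - prior
    for l_d, l_h in zip(lik_given_disease, lik_given_healthy):
        num *= l_d
        den *= l_h
    return num / (num + den)


prior = 0.2
# One symptom: P(symptom | disease) = 0.8, P(symptom | healthy) = 0.2.
one = naive_bayes_posterior(prior, [0.8], [0.2])
# A second symptom that always co-occurs with the first adds no information,
# but the naive model counts its evidence again.
two = naive_bayes_posterior(prior, [0.8, 0.8], [0.2, 0.2])
print(f"posterior with one symptom:        {one:.2f}")
print(f"posterior with duplicated symptom: {two:.2f}")
```

In this example the correct posterior remains at the single-symptom value, so the naive model is overconfident; semi-naive and general BN structures reduce the problem by adding arcs between dependent attributes.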

In summary, while BNs offer considerable promise in enhancing disease prediction and understanding, their effectiveness depends on robust data collection and the ability to update the predictions as new data become available. Despite some limitations, their flexibility in handling incomplete information and the dynamic nature of their forecasting make BNs a powerful tool in medical decision-making.

5. Conclusions

This scoping review encompasses 60 articles, demonstrating that Bayesian networks (BNs) achieve high levels of accuracy in diagnosing and forecasting diseases. While recent advancements in machine learning may offer slight improvements in accuracy, the use of directed acyclic graphs (DAGs) in BNs significantly aids in understanding the results and studying causality, capabilities not offered by other classification methods. The increase in publications regarding BN development—54 articles identified on PubMed and over 90 on Web of Science in 2023 alone—suggests their growing importance in clinical procedures.

BNs are particularly effective in modelling complex medical issues that require reasoning under uncertainty. The extensive literature and ongoing research into medical BNs reflect the significant interest in their application for medical diagnosis. BNs offer a versatile model that can evaluate, predict, diagnose, and optimise decisions, enhancing the effectiveness of BN construction efforts and the quality of the available software solutions [9]. As expert support systems, BNs provide a powerful tool for medical decision-making. They utilise a knowledge representation methodology to express variable relationships, employing probability theory to manage medical uncertainty and making the decision process comprehensible to non-specialists.

Looking forward, the development of advanced preprocessing methods and alternative learning strategies is crucial. Additionally, the establishment of more publicly accessible databases should be prioritised to enhance the training of Bayesian networks. Integration with electronic health records (EHRs) and interoperability with existing healthcare IT systems could further enhance the utility of BNs, enabling real-time updates and more personalised patient care.

Moreover, BNs facilitate a more intuitive understanding of complex medical data, which can be particularly valuable in multidisciplinary healthcare teams, where not all members may have advanced data science training. Their ability to incorporate expert knowledge into the model structure makes them highly adaptable to various medical contexts and conditions, from rare diseases to common chronic illnesses.

Given these considerations and the rapid advancements in artificial intelligence over the past 2–3 years, continuous research in Bayesian networks is indispensable. Compared with other machine learning techniques, BNs achieve comparable or superior results in disease diagnosis and prognosis. The notable precision of these intelligent systems supports healthcare professionals by enabling quicker, more certain decisions, whether based on imaging or clinical data, across a wide range of pathologies. Furthermore, the transparency and interpretability of BN models can improve trust and adoption among healthcare practitioners, ultimately enhancing patient outcomes and advancing personalised medicine.

In conclusion, the continuous evolution of Bayesian networks, coupled with increasing computational power and more sophisticated algorithms, positions BNs as a cornerstone technology for future advancements in medical diagnostics and prognostics. Their integration into clinical workflows promises to elevate the standard of care, making healthcare delivery more efficient, accurate, and patient-centred.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/make6020058/s1. Reference [74] is cited in the supplementary materials.

Author Contributions

Conceptualisation, X.B. and A.R.; methodology, C.S.M.-V. and K.P.; investigation, C.S.M.-V. and K.P.; formal analysis, C.S.M.-V., K.P., D.O.-B. and J.A.Q.-R.; writing—original draft preparation, C.S.M.-V. and K.P.; writing—review and editing, X.B.; supervision, X.B., A.R. and D.O.-B.; project administration, X.B. and D.O.-B.; funding acquisition, X.B. and D.O.-B. All authors have read and agreed to the published version of the manuscript.

Funding

This paper is part of the project PID2022-136455NB-I00, funded by the Ministerio de Ciencia, Innovación y Universidades of Spain (MCIN/AEI/10.13039/501100011033/FEDER, UE) and the European Regional Development Fund, and of ILISABIO 2023 Modalidad de PROYECTOS DE INNOVACIÓN-ILISABIO23-PI03, funded by Universidad Miguel Hernández de Elche-Vicerrectorado de Planificación y Responsabilidad Social.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

BN	Bayesian network
TP	True positive
TN	True negative
FP	False positive
FN	False negative
NPV	Negative predictive value
PPV	Positive predictive value
ROC	Receiver operating characteristic curve

References

Vila-Francés, J.; Sanchis, J.; Soria-Olivas, E.; Serrano, A.J.; Martinez-Sober, M.; Bonanad, C.; Ventura, S. Expert system for predicting unstable angina based on Bayesian networks. Expert Syst. Appl. 2013, 40, 5004–5010.
Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Morgan Kaufmann: Burlington, MA, USA, 1988.
McCann, R.K.; Marcot, B.G.; Ellis, R. Bayesian belief networks: Applications in ecology and natural resource management. Can. J. Forest Res. 2006, 36, 3053–3062.
Nannapaneni, S.; Mahadevan, S.; Rachuri, S. Performance evaluation of a manufacturing process under uncertainty using Bayesian networks. J. Clean. Prod. 2016, 113, 947–959.
Demirer, R.; Mau, R.R.; Shenoy, C. Bayesian networks: A decision tool to improve portfolio risk analysis. J. Appl. Financ. 2006, 16, 106.
Ramírez, A.; Romero, J.R. Synergies Between Artificial Intelligence and Software Engineering: Evolution and Trends. In Handbook on Artificial Intelligence-Empowered Applied Software Engineering: Vol. 1: Novel Methodologies to Engineering Smart Software Systems; Springer: Cham, Switzerland, 2022; pp. 11–36.
Sachs, K.; Perez, O.; Pe’er, D.; Lauffenburger, D.A.; Nolan, G.P. Causal protein-signaling networks derived from multiparameter single-cell data. Science 2005, 308, 523–529.
Langarizadeh, M.; Moghbeli, F. Applying naive bayesian networks to disease prediction: A systematic review. Acta Inf. Med. 2016, 24, 364.
Kyrimi, E.; McLachlan, S.; Dube, K.; Neves, M.R.; Fahmi, A.; Fenton, N. A comprehensive scoping review of Bayesian networks in healthcare: Past, present and future. Artif. Intell. Med. 2021, 117, 102108.
Whittaker, J. Graphical Models in Applied Multivariate Statistics; Wiley Publishing: New York, NY, USA, 2009.
Analytics, I.C.R. About Bayesian Belief Networks; Charles River Laboratories International Inc.: Cambridge, UK, 2004.
Heckerman, D.; Breese, J.S. Causal independence for probability assessment and inference using Bayesian networks. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 1996, 26, 826–831.
Spirtes, P.; Glymour, C.; Scheines, R. Causation, Prediction and Search; Lecture Notes in Statistics; MIT Press: New York, NY, USA, 1993; Volume 81.
Pearl, J. Graphical Models, Causality, and Intervention. 2011. Available online: https://escholarship.org/content/qt8d93w51g/qt8d93w51g_noSplash_8eeb138fc6366dfd488ce635e12e8c13.pdf (accessed on 26 May 2024).
Flores, M.J.; Nicholson, A.E.; Brunskill, A.; Korb, K.B.; Mascaro, S. Incorporating expert knowledge when learning Bayesian network structure: A medical case study. Artif. Intell. Med. 2011, 53, 181–204.
Russell, S.J.; Norvig, P. Artificial Intelligence: A Modern Approach; Prentice Hall: Upper Saddle River, NJ, USA, 2009.
Huang, C.; Darwiche, A. Inference in belief networks: A procedural guide. Int. J. Approx. Reason. 1996, 15, 225–263.
Bielza, C.; Larrañaga, P. Bayesian networks in neuroscience: A survey. Front. Comput. Neurosci. 2014, 8, 131.
Shachter, R.D.; Kenley, C.R. Gaussian influence diagrams. Manag. Sci. 1989, 35, 527–550.
Geiger, D.; Heckerman, D. Learning Gaussian Networks in Uncertainty in Artificial Intelligence; Elsevier: San Francisco, CA, USA, 1994; pp. 235–243.
Dean, T.; Kanazawa, K. A model for reasoning about persistence and causation. Comput. Intell. 1989, 5, 142–150.
Dagum, P.; Galper, A.; Horvitz, E. Dynamic network models for forecasting. In Proceedings of the Uncertainty in Artificial Intelligence, Stanford, CA, USA, 17–19 July 1992; Elsevier: Stanford, CA, USA, 1992; pp. 41–48.
Friedman, N.; Geiger, D.; Goldszmidt, M. Bayesian network classifiers. Mach. Learn. 1997, 29, 131–163.
Neapolitan, R.E. Learning Bayesian Networks; Pearson Prentice Hall: Upper Saddle River, NJ, USA, 2004; Volume 38.
Frank, E.; Hall, M.; Pfahringer, B. Locally Weighted Naive Bayes; Morgan Kaufmann Publishers Inc.: Burlington, MA, USA, 2002; pp. 249–256.
Madsen, A.L.; Jensen, F.; Kjaerulff, U.B.; Lang, M. The Hugin tool for probabilistic graphical models. Int. J. Artif. Intell. Tools 2005, 14, 507–543.
Lacave, C.; Luque, M.; Diez, F.J. Explanation of Bayesian networks and influence diagrams in Elvira. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 2007, 37, 952–965.
Murphy, K. Software Packages for Graphical Models/Bayesian Networks. Int. Soc. Bayesian Anal. Bull. 2007, 14, 13–15.
Kenett, R.S. Applications of Bayesian Networks. 2012. Available online: https://ssrn.com/abstract=2172713 (accessed on 26 May 2024).
Pourret, O.; Naïm, P.; Marcot, B. Bayesian Networks: A Practical Guide to Applications; John Wiley & Sons: Chichester, UK, 2008.
Scutari, M.; Denis, J.B. Bayesian Networks with Examples in R, 2nd ed.; Chapman and Hall: Boca Raton, FL, USA, 2021; ISBN 978-0367366513.
Fawcett, T. ROC graphs: Notes and practical considerations for researchers. Mach. Learn. 2004, 31, 1–38.
Warner, H.R.; Toronto, A.F.; Veasey, L.G.; Stephenson, R. A mathematical approach to medical diagnosis: Application to congenital heart disease. JAMA 1961, 177, 177–183.
Gorry, G.A.; Barnett, G.O. Experience with a model of sequential diagnosis. Comput. Biomed. Res. 1968, 1, 490–507.
Schwartz, S.M.; Baron, J.; Clarke, J.R. A Causal Bayesian Model for the Diagnosis of Appendicitis; Elsevier: Amsterdam, The Netherlands, 1988; Volume 5, pp. 423–434.
Spiegelhalter, D.J.; Knill-Jones, R.P. Statistical and knowledge-based approaches to clinical decision-support systems, with an application in gastroenterology. J. R. Stat. Soc. Ser. A (Gen.) 1984, 147, 35–58.
Diez, F.J.; Mira, J.; Iturralde, E.; Zubillaga, S. DIAVAL, a Bayesian expert system for echocardiography. Artif. Intell. Med. 1997, 10, 59–73.
Arora, P.; Boyne, D.; Slater, J.J.; Gupta, A.; Brenner, D.R.; Druzdzel, M.J. Bayesian networks for risk prediction using real-world data: A tool for precision medicine. Value Health 2019, 22, 439–445.
Correa, E.; Goodacre, R. A genetic algorithm-Bayesian network approach for the analysis of metabolomics and spectroscopic data: Application to the rapid identification of Bacillus spores and classification of Bacillus species. BMC Bioinform. 2011, 12, 33.
Zhou, J.; Wang, D.; Schlegel, V.; Zempleni, J. Development of an internet based system for modeling biotin metabolism using Bayesian networks. Comput. Methods Programs Biomed. 2011, 104, 254–259.
Moreira, L.B.; Namen, A.A. A hybrid data mining model for diagnosis of patients with clinical suspicion of dementia. Comput. Methods Programs Biomed. 2018, 165, 139–149.
Seixas, F.L.; Zadrozny, B.; Laks, J.; Conci, A.; Saade, D.C.M. A Bayesian network decision model for supporting the diagnosis of dementia, Alzheimer’s disease and mild cognitive impairment. Comput. Biol. Med. 2014, 51, 140–158.
Yin, Z.; Zhao, Y.; Lu, X.; Duan, H. A hybrid intelligent diagnosis approach for quick screening of Alzheimer’s disease based on multiple neuropsychological rating scales. Comput. Math. Methods Med. 2015, 2015, 258761.
Sa-Ngamuang, C.; Haddawy, P.; Luvira, V.; Piyaphanee, W.; Iamsirithaworn, S.; Lawpoolsri, S. Accuracy of dengue clinical diagnosis with and without NS1 antigen rapid test: Comparison between human and Bayesian network model decision. PLoS Negl. Trop. Dis. 2018, 12, e0006573.
Farmer, N. An update and further testing of a knowledge-based diagnostic clinical decision support system for musculoskeletal disorders of the shoulder for use in a primary care setting. J. Eval. Clin. Pract. 2014, 20, 589–595.
Refai, A.; Merouani, H.F.; Aouras, H. Maintenance of a Bayesian network: Application using medical diagnosis. Evol. Syst. 2016, 7, 187–196.
Bucci, G.; Sandrucci, V.; Vicario, E. Ontologies and Bayesian networks in medical diagnosis. In Proceedings of the 2011 44th Hawaii International Conference on System Sciences, Kauai, HI, USA, 4–7 January 2011; pp. 1–8.
Zhang, X.; Hu, B.; Ma, X.; Moore, P.; Chen, J. Ontology driven decision support for the diagnosis of mild cognitive impairment. Comput. Methods Programs Biomed. 2014, 113, 781–791.
Shen, Y.; Zhang, L.; Zhang, J.; Yang, M.; Tang, B.; Li, Y.; Lei, K. CBN: Constructing a clinical Bayesian network based on data from the electronic medical record. J. Biomed. Inform. 2018, 88, 1–10.
Verduijn, M.; Peek, N.; Rosseel, P.M.J.; de Jonge, E.; de Mol, B.A.J.M. Prognostic bayesian networks: I: Rationale, learning procedure, and clinical use. J. Biomed. Inform. 2007, 40, 609–618.
Somnay, Y.R.; Craven, M.; McCoy, K.L.; Carty, S.E.; Wang, T.S.; Greenberg, C.C.; Schneider, D.F. Improving diagnostic recognition of primary hyperparathyroidism with machine learning. Surgery 2017, 161, 1113–1121.
Velikova, M.; de Carvalho Ferreira, N.; Lucas, P. Bayesian network decomposition for modeling breast cancer detection. In Proceedings of the Artificial Intelligence in Medicine: 11th Conference on Artificial Intelligence in Medicine, AIME 2007, Amsterdam, The Netherlands, 7–11 July 2007; Proceedings 11; Springer: Berlin/Heidelberg, Germany, 2007; pp. 346–350.
Cruz-Ramirez, N.; Acosta-Mesa, H.G.; Carrillo-Calvet, H.; Nava-Fernández, L.A.; Barrientos-Martinez, R.E. Diagnosis of breast cancer using Bayesian networks: A case study. Comput. Biol. Med. 2007, 37, 1553–1564. [Google Scholar] [CrossRef]
Elazmeh, W.; Matwin, W.; O’Sullivan, D.; Michalowski, W.; Farion, W. Insights from predicting pediatric asthma exacerbations from retrospective clinical data. In Evaluation Methods for Machine Learning II–Papers from 2007 AAAI Workshop; Morgan Kaufmann: Los Altos, CA, USA, 2007; pp. 10–15. [Google Scholar]
Yaneli, A.A.M.; Nicandro, C.R.; Efrén, M.M.; Enrique, M.D.C.M.; Nancy, P.C.; Gabriel, A.M.H. Assessment of bayesian network classifiers as tools for discriminating breast cancer pre-diagnosis based on three diagnostic methods. In Proceedings of the Advances in Artificial Intelligence: 11th Mexican International Conference on Artificial Intelligence, MICAI 2012, San Luis Potosí, Mexico, 27 October–4 November 2012; Revised Selected Papers, Part I 11; Springer: Berlin/Heidelberg, Germany, 2013; pp. 419–431. [Google Scholar]
Djebbar, A.; Merouani, H.F. Optimising retrieval phase in CBR through Pearl and JLO algorithms for medical diagnosis. Int. J. Adv. Intell. Paradig. 2013, 5, 161–181. [Google Scholar] [CrossRef]
Park, H.S.; Cho, S.B. Evolutionary attribute ordering in Bayesian networks for predicting the metabolic syndrome. Expert Syst. Appl. 2012, 39, 4240–4249. [Google Scholar] [CrossRef]
Sesen, M.B.; Nicholson, A.E.; Banares-Alcantara, R.; Kadir, T.; Brady, M. Bayesian networks for clinical decision support in lung cancer care. PLoS ONE 2013, 8, e82349. [Google Scholar] [CrossRef]
Wang, K.J.; Makond, B.; Wang, K.M. Modeling and predicting the occurrence of brain metastasis from lung cancer by Bayesian network: A case study of Taiwan. Comput. Biol. Med. 2014, 47, 147–160. [Google Scholar] [CrossRef]
Cai, Z.-Q.; Si, S.-B.; Chen, C.; Zhao, Y.; Ma, Y.-Y.; Wang, L.; Geng, Z.-M. Analysis of prognostic factors for survival after hepatectomy for hepatocellular carcinoma based on a bayesian network. PLoS ONE 2015, 10, e0120805. [Google Scholar]
Kaewprag, P.; Newton, C.; Vermillion, B.; Hyun, S.; Huang, K.; Machiraju, R. Predictive models for pressure ulcers from intensive care unit electronic health records using Bayesian networks. BMC Med. Inform. Decis. Mak. 2017, 17, 81–91. [Google Scholar] [CrossRef]
Takenaka, S.; Aono, H. Prediction of postoperative clinical recovery of drop foot attributable to lumbar degenerative diseases, via a Bayesian network. Clin. Orthop. Relat. Res. 2017, 475, 872–880. [Google Scholar] [CrossRef]
Spyroglou, I.I.; Spöck, G.; Rigas, A.G.; Paraskakis, E.N. Evaluation of Bayesian classifiers in asthma exacerbation prediction after medication discontinuation. BMC Res. Notes 2018, 11, 522. [Google Scholar] [CrossRef]
Agrahari, R.; Foroushani, A.; Docking, T.R.; Chang, L.; Duns, G.; Hudoba, M.; Karsan, A.; Zare, H. Applications of Bayesian network models in predicting types of hematological malignancies. Sci. Rep. 2018, 8, 6951. [Google Scholar] [CrossRef]
Cao, Y.; Raoof, M.; Szabo, E.; Ottosson, J.; Näslund, I. Using Bayesian Networks to Predict Long-Term Health-Related Quality of Life and Comorbidity after Bariatric Surgery: A Study Based on the Scandinavian Obesity Surgery Registry. J. Clin. Med. 2020, 9, 1895. [Google Scholar] [CrossRef]
Siga, M.M.; Ducher, M.; Florens, N.; Roth, H.; Mahloul, N.; Fouque, D.; Fauvel, J.P. Prediction of all-cause mortality in haemodialysis patients using a Bayesian network. Nephrol. Dial. Transplant. 2020, 35, 1420–1425. [Google Scholar] [CrossRef]
Wu, Y.; McLeod, C.; Blyth, C.; Bowen, A.; Martin, A.; Nicholson, A.; Mascaro, S.; Snelling, T. Predicting the causative pathogen among children with osteomyelitis using Bayesian networks–improving antibiotic selection in clinical practice. Artif. Intell. Med. 2020, 107, 101895. [Google Scholar] [CrossRef]
Procházka, V.K.; Matuštíková, Š.; Fürst, T.; Belada, D.; Janíková, A.; Benešová, K.; Mociková, H.; Ďuraš, J.; Pirnos, J.; Kopečková, K.; et al. Bayesian Network Modelling As a New Tool in Predicting of the Early Progression of Disease in Follicular Lymphoma Patients. Blood 2020, 136, 20–21. [Google Scholar] [CrossRef]
Derevitskii, I.V.; Mramorov, N.D.; Usoltsev, S.D.; Kovalchuk, S.V. Hybrid Bayesian network-based modeling: COVID-19-pneumonia case. J. Pers. Med. 2022, 12, 1325. [Google Scholar] [CrossRef]
Tian, S.; Bi, M.; Bi, Y.; Che, X.; Liu, Y. A bayesian network analysis of the probabilistic relationships between various obesity phenotypes and cardiovascular disease risk in Chinese adults: Chinese population-based observational study. JMIR Med. Inform. 2022, 10, e33026. [Google Scholar] [CrossRef] [PubMed]
Lee, I.; Luo, Y.; Carretta, H.; LeBlanc, G.; Sinha, D.; Rust, G. Latent pathway-based Bayesian models to identify intervenable factors of racial disparities in breast cancer stage at diagnosis. Cancer Causes Control 2023, 35, 253–263. [Google Scholar] [CrossRef] [PubMed]
Wu, Y.; Chao, J.; Bao, M.; Zhang, N.; Wang, L. Construction of predictive model for osteoporosis related factors among postmenopausal women on the basis of logistic regression and Bayesian network. Prev. Med. Rep. 2023, 35, 102378. [Google Scholar] [CrossRef] [PubMed]
Suzuki, T.; Shimoda, T.; Takahashi, N.; Tsutsumi, K.; Samukawa, M.; Yoshimura, S.; Ogasawara, K. Factors affecting bone mineral density among snowy region residents in Japan: Analysis using multiple linear regression and Bayesian network model. Interact. J. Med. Res. 2018, 7, e8555. [Google Scholar] [CrossRef] [PubMed]
Tricco, A.C.; Lillie, E.; Zarin, W.; O’Brien, K.K.; Colquhoun, H.; Levac, D.; Moher, D.; Peters, M.D.J.; Horsley, T.; Weeks, L.; et al. PRISMA Extension for Scoping Reviews (PRISMAScR): Checklist and Explanation. Ann. Intern. Med. 2018, 169, 467–473. [Google Scholar] [CrossRef]

Figure 1. Representation of the relationships between variables (directed acyclic graph, DAG).

Figure 2. Generic structure (DAG) of a BN for P with n random variables $X_{1}, \dots, X_{n}$.

Figure 3. An example of a naive Bayesian classifier.

Figure 4. PRISMA flow chart for the review process of BNs in the diagnosis and prognosis field.

Table 1. Example confusion matrix comparing a screening test against the gold standard. The matrix sorts the classification outcomes into four types: true positives (TP), where a positive case is correctly classified as positive; false positives (FP), where a negative case is incorrectly classified as positive; true negatives (TN), where a negative case is correctly classified as negative; and false negatives (FN), where a positive case is incorrectly classified as negative. N is the total number of analysed patients.

		Gold Standard
		Positive	Negative	Total
Screening test	Positive	True Positive (TP)	False Positive (FP)	$TP + FP$
Screening test	Negative	False Negative (FN)	True Negative (TN)	$FN + TN$
	Total	$TP + FN$	$FP + TN$	N
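
The four cells of a confusion matrix such as Table 1 are the inputs to the usual validation measures. The following is a minimal Python sketch of that arithmetic, assuming the standard textbook definitions of sensitivity, specificity, and accuracy. The class name `ConfusionMatrix` and the example counts are hypothetical and do not come from any of the reviewed studies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Cell counts from comparing a screening test with the gold standard."""
    tp: int  # test positive, gold standard positive
    fp: int  # test positive, gold standard negative
    fn: int  # test negative, gold standard positive
    tn: int  # test negative, gold standard negative

    @property
    def n(self) -> int:
        # Total number of analysed patients (N in Table 1).
        return self.tp + self.fp + self.fn + self.tn

    def sensitivity(self) -> float:
        # Share of gold-standard positives that the test detects.
        return self.tp / (self.tp + self.fn)

    def specificity(self) -> float:
        # Share of gold-standard negatives that the test rules out.
        return self.tn / (self.tn + self.fp)

    def accuracy(self) -> float:
        # Share of all patients classified correctly.
        return (self.tp + self.tn) / self.n


# Hypothetical counts, chosen only for illustration.
cm = ConfusionMatrix(tp=40, fp=10, fn=5, tn=45)
print(f"N={cm.n}, sensitivity={cm.sensitivity():.3f}, "
      f"specificity={cm.specificity():.3f}, accuracy={cm.accuracy():.3f}")
```

Note that sensitivity and specificity are computed down the gold-standard columns (TP + FN and FP + TN), whereas the row totals (TP + FP and FN + TN) count the positive and negative test results.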

Table 2. Summary of selected studies on BNs, including medical application, type of BN used, numerical level of performance, and most salient study findings. Two BN performance measures are reported: accuracy, ranging from 0 (inaccurate system) to 1 (highly accurate system), and AUC, ranging from 0 (inaccurate predictions) to 1 (highly accurate predictions).

Reference	Disease	BN Type	Performance	Main Conclusions
[65]	Comorbidity after surgery	Hybrid BN	0.7 (AUC)	Bayesian networks serve as effective instruments in forecasting long-term health-related quality of life (HRQoL) and comorbid conditions in patients following bariatric surgery, utilising data from their pre-surgical health and disease conditions.
[40]	Biotin metabolism	BN		This approach enables researchers to plan future experiments as new data can be seamlessly integrated.
[1]	Heart disease	BN	0.69 (Accuracy)	The final model has been developed into a web application that is currently undergoing validation by clinical specialists.
[56]	Liver disease	JLO-BN	0.87 (Accuracy)	The proposed model clearly holds significant promise.
[58]	Lung cancer	Tree-Augmented Naive Bayes	0.81 (Accuracy)	Causal intervention findings indicate that BN treatment recommendations aim to prescribe plans that maximise survival.
[59]	Lung cancer	BN	0.82 (Accuracy)	Sensitivity is crucial in identifying cancer patients, and the sensitivity of the proposed Bayesian network surpasses that of all benchmarked methods.
[55]	Breast cancer	MP-Bayes+Greedy	0.85 (Accuracy)	It appears that there is an overspecialisation in diagnosing patients with the disease, yet a notable lack of specialisation when the disease is absent.
[71]	Breast cancer	Latent pathway-based BN	0.64 (Accuracy)	The BN model, which utilises two latent pathways and four patient-specific mediators, clearly outperforms the CART model in predicting the stage at diagnosis.
[53]	Breast cancer	MP-Bayes+Greedy	0.93 (Accuracy)	Observers perceive various elements when examining samples under the microscope, a situation that considerably undermines the effectiveness of these classifiers in diagnosing such a disease.
[73]	Osteoporosis	BN	0.5 (Accuracy)	These methods are considered valuable indicators in developing healthcare policies in the region.
[72]	Osteoporosis	BN	0.67 (Accuracy)	BNs allow a more intuitive understanding of the complex network of risk factors and diseases.
[51]	Hyperparathyroidism	Hybrid BN	0.98 (AUC)	BNs are capable of accurately diagnosing primary hyperparathyroidism autonomously, even in cases of mild disease.
[44]	Dengue	Hybrid BN	0.83 (AUC)	BN models can aid physicians in diagnosing dengue, especially in regions where experienced medical professionals and laboratory diagnostic tests are scarce.
[43]	Dementia	Hybrid BN	0.85 (Accuracy)	The hybrid approach identifies the different stages of AD better than the methods it was compared with.
[42]	Dementia	BN	0.83 (AUC)	Enhanced diagnostic outcomes for dementia, Alzheimer’s disease (AD), and mild cognitive impairment (MCI) when compared to many other established classifiers.
[41]	Dementia	Hybrid BN	0.79 (Accuracy)	The hybrid model demonstrates superior effectiveness in diagnostic prediction.
[69]	COVID-19 pneumonia	Hybrid BN-based modelling	0.87 (Accuracy)	Support systems for decision-making to improve treatment of COVID-19 pneumonia.
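
The two performance measures reported in Table 2 can be illustrated with a short Python sketch. It assumes a binary outcome and a model that returns a predicted probability per patient. AUC is computed here through its rank interpretation (the probability that a randomly chosen positive case scores higher than a randomly chosen negative one), which is one common way to obtain it and not necessarily the procedure used in the reviewed studies. The function names, the 0.5 threshold, and the data are hypothetical.

```python
def auc(labels, scores):
    """Rank-based AUC: chance that a positive case outscores a negative one.

    labels are 0/1 outcomes; scores are predicted probabilities.
    Tied scores count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def accuracy(labels, scores, threshold=0.5):
    """Share of cases classified correctly at an assumed decision threshold."""
    predicted = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predicted, labels)) / len(labels)


# Hypothetical outcomes and predicted probabilities, for illustration only.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]
print(f"AUC={auc(labels, scores):.3f}, accuracy={accuracy(labels, scores):.3f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarises the ranking over all thresholds; an AUC of 0.5 corresponds to chance-level discrimination.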

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
