1. Introduction
Epidemics pose significant threats to public health, characterized by rapid spread, extensive environmental impact, and challenges in their prevention and control. The recent COVID-19 pandemic has underscored the importance of establishing robust public health surveillance and implementing the best practices [
1,
2]. The twenty-first century has witnessed the emergence of numerous new health challenges for humanity. This era has seen a heightened focus on public health, driven by rapid advancements in computer technology and an increased awareness of the pivotal role of spatial epidemiology [
3]. Spatial epidemiology, leveraging spatial statistics and map visualization techniques, plays a crucial role in characterizing and analysing the spatial distribution of human diseases, health conditions, and the associated environmental factors. By harnessing spatial information, this discipline conducts in-depth investigations into epidemic data [
4]. Through its application, spatial epidemiology contributes to a comprehensive understanding of the geographic distribution and incidence trends of diseases, providing valuable theoretical insights for disease surveillance and the development of preventive interventions [
4].
In recent years, Bayesian spatiotemporal models have emerged as potent tools for unravelling the intricacies of disease spread, seamlessly integrating both spatial and temporal dimensions [
5]. The challenge of tackling high-dimensional integral operations in Bayesian statistics, particularly when estimating numerous parameters, has been effectively addressed through rapid advancements in computer technology and the refinement of the Markov Chain Monte Carlo (MCMC) approach. This progress has propelled Bayesian statistics into the spotlight, garnering substantial attention [
6]. Parameter estimates, a key aspect of Bayesian spatiotemporal modelling, are acquired through a conventional statistical technique known as frequency inference, grounded in the likelihood function. Guided by the Bayesian statistical ideological framework, the spatiotemporal model discerns patterns of change inherent in spatiotemporal data [
7]. Through the specification of prior distributions and the incorporation of geographical and temporal dependencies, the Bayesian approach quantifies uncertainty in predictions or inferential estimates using probabilities. The primary distinctions between classical statistics and Bayesian statistics are twofold. Firstly, in classical statistics, all unknown parameters are treated as constants, while in Bayesian statistics, these parameters are regarded as random variables and characterized by probability distributions in the pursuit of statistical inference. Secondly, classical statistics relies on overall information and sample information for statistical inference, whereas Bayesian statistics incorporates prior information alongside overall and sample information. Bayesian statistics places a particular emphasis on the extraction, processing, and utilization of prior information [
8]. In the Bayesian framework, the foundation for parameter estimates, inference, and prediction lies in the posterior distribution [
9]. The posterior distribution is produced by integrating (i) the current sample using the likelihood function, and (ii) the previous distribution of the parameters based on historical or other types of external information [
10]. Through the incorporation of prior knowledge modelling, the Bayesian spatiotemporal model delineates the impacts, time effects, and interaction effects in the spatiotemporal domain, offering a comprehensive understanding of the dynamics of disease spread [
10].
While spatiotemporal analysis effectively captures features through a series of maps, Bayesian approaches prove instrumental in addressing prevalent modelling challenges in spatial epidemiology, such as non-normality, limited sample sizes, missing data, and clustered data structures. By identifying regions or times of heightened risk, Bayesian methods can uncover disease patterns that persist or evolve predictably over time and across diverse spatial units [
11]. The development of a Bayesian spatiotemporal model aims to accomplish key tasks, including the following:
To assess the anticipated value and uncertainty of a specific outcome variable at a defined spatial point throughout the observation period.
To forecast the expected value of an outcome variable at a specific location.
To identify evolving patterns of diseases that persist or undergo predictable changes over time and across various spatial units.
To analyse the influence of environmental factors on the spatiotemporal dynamics of disease.
In recent years, there have been several reviews on Bayesian spatiotemporal models, most of which followed the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) framework [
12,
13]. These reviews have covered various applications, including spatial and spatiotemporal methods for mapping malaria risk [
14], modelling dengue fever [
15], analysing COVID-19 epidemiology [
16], and exploring joint spatial and spatiotemporal methods in health research [
17]. However, it is noted that only a limited number of these reviews have delved specifically into spatiotemporal models or Bayesian inference methods. This suggests that while there is a growing body of research in the broader field of Bayesian spatiotemporal modelling, the detailed exploration of spatiotemporal models and Bayesian inference techniques remains a niche within these systematic reviews.
This review systematically focuses on key aspects of Bayesian spatiotemporal models, including their theoretical foundations, modelling techniques, and computational methods. It provides a detailed examination of the spatiotemporal distribution of data, Bayesian deduction principles, spatiotemporal evolution rules, and calculation methods employed in these models. This comprehensive review not only serves to elucidate the intricacies of Bayesian spatiotemporal models but also underscores their pivotal role in advancing the understanding and control of disease transmission. The integration of advanced statistical methods and computational tools is expected to further propel progress in spatial epidemiology, offering valuable insights into the spatial and temporal dimensions of health events.
2. Literature Review
Epidemiological data frequently exhibit discernible structures in both space and time. Analysing the evolution of diseases enables the identification of patterns and areas of heightened risk. Over the past three decades, significant research efforts have been dedicated to exploring this aspect through the application of spatiotemporal modelling techniques.
Bernardinelli et al. [
18] and Bernardinelli et al. [
11] employed a Poisson log-linear model by modelling the logarithm of rates as a linear function of time, incorporating spatiotemporal effects. Waller et al. [
19] utilized a spatiotemporal conditional autoregressive (CAR)-based model to explore how heterogeneity and spatial patterns evolve over time, considering both temporal effects and spatiotemporal interactions. Waller et al. [
20] introduced a random effect for each year, assuming temporal effects are independent of geographical effects. Sun et al. [
21] and Kim et al. [
22] assumed a linear shape for the temporal trend. Moreover, Kim and Oleson [
23] presented a modified technique incorporating an autoregressive process of order one to capture nonlinearity in the temporal trend.
Spatial epidemiology has also made use of the Bayesian nonparametric spatial modelling technique. A unique dynamic mixture model was developed by Böhning et al. [
24] to simultaneously discover space–time clusters. By adding spatial dependency and permitting a random number of mixture components in a spatiotemporal environment through the Dirichlet process prior, Kottas et al. [
25] expanded the mixture model. In contrast, Yan and Clayton [
26] expanded upon Gangnon and Clayton [
27]’s spatial cluster model to allow for the space–time clustering of disease counts data.
To concurrently investigate the spatiotemporal variations in disease risks, Richardson et al. [
28] undertook a combined modelling approach for two interconnected diseases. Disease risk is decomposed into two components: a common part and a unique part capturing the distinctions between the diseases. Building upon Richardson et al. [
28]’s strategy, Musio et al. [
29] extended the approach by integrating a model for zero-inflated data with semi-parametrically structured additive predictors. Given that time was treated as a categorical variable in the dataset, the interaction effects were modelled using a changing coefficient approach [
30,
31].
To account for variations in underlying rates across different seasons within a given year, Torabi and Rosychuk [
32] proposed a model incorporating both seasonal and spatiotemporal effects. This modelling technique finds applications in diverse fields, such as the study of malaria diseases [
33] and influenza [
34,
35]. Addressing the challenge of fitting Bayesian spatiotemporal models to large datasets, Chen et al. [
36] advocated for a two-stage spatiotemporal technique. In the initial stage, each year’s data were fitted using various geographic models. Subsequently, in the second stage, temporal trends in parameter estimates from each year were analysed, aiming to reduce the computational burden.
Abellan et al. [
37] assessed underlying spatial patterns using a two-component joint mixture model for space–time interactions. The first component addresses residual noise, while the second captures deviations from space and time’s main effects. Schrödle et al. [
38] compared parameter-driven and observation-driven models, applying Bayesian inference with integrated nested Laplace approximations for parameter-driven models and maximum likelihood for observation-driven models. Cameletti et al. [
39] introduced a spatiotemporal modelling approach for particulate matter concentration using the stochastic partial differential equation technique. Bukhari et al. [
40] employed correlation methods, seasonal and trend decomposition, hotspot analysis, and conditional autoregressive Bayesian models to assess spatiotemporal clustering and the impact of climatic and socio-economic covariates on disease occurrence in Mexico. In the spread of the COVID-19 pandemic, numerous studies have employed Bayesian spatiotemporal models to discern spatial and spatiotemporal variations [
16]. Ref. [
41] have devised a Bayesian spatiotemporal model featuring a non-separable, stationary Matern covariance function, which adeptly handles spatial and temporal dependencies by capturing the intricate space–time interactions among geographical locations. Sun et al. [
42] considered six spatiotemporal Bayesian hierarchical models based on two spatial conditional autoregressive priors.
Over the past three decades, significant advancements have been made in Bayesian spatiotemporal models in addressing spatial distribution, temporal trends, and the intricate interaction between space and time. These models have evolved to consider various factors, including seasonal changes, the challenges posed by large datasets, and the exploration of multiple model variations. Today, aided by robust computer technology, Bayesian spatiotemporal modelling methods have found widespread and impactful applications.
3. Bayesian Spatiotemporal Models
3.1. Probability Distribution in Spatial Epidemiology
There are two primary types in spatial epidemiology: (1) case event data, encompassing georeferenced instances of cases along with their respective times of diagnosis, registration, or onset, and (2) count data, representing the tally of disease cases within defined tracts over successive time intervals. The seamless extension of spatial data analysis into the temporal domain is readily achievable, as highlighted by Lawson [
9].
Figure 1 visually illustrates the progression of space–time dynamics in geographical epidemiology.
The Poisson likelihood has been a cornerstone in the toolkit of Bayesian spatiotemporal models for modelling diverse diseases [
43]. Typically, a Poisson distribution is favoured when low observed values are expected for each location and period, making it well-suited for situations where disease cases are relatively rare.
Conversely, binomial distribution becomes a valuable choice when describing the occurrence of non-rare or common disease processes, particularly in areas where the incidence of disease (or mortality) is high. This distribution provides an effective framework for situations where events are not rare and their outcomes involve binary states.
For scenarios characterized by rapidly rising disease cases and over-dispersed infectious outbreaks, such as the initial wave of COVID-19 in New England regions in 2020, the negative binomial likelihood, incorporating an additional dispersion parameter, proves more adaptable. Several studies show the efficacy of this model in capturing the heightened variability observed during such outbreaks [
1,
44,
45,
46]. In spatial epidemiology, over-dispersion is common due to the presence of often undisclosed factors influencing disease prevalence in each location.
In summary, the choice of likelihood in Bayesian spatiotemporal models is contingent upon the specific characteristics of the disease data, with the Poisson, binomial, and negative binomial distributions offering versatile tools tailored to different scenarios and levels of disease prevalence.
3.2. Models
3.2.1. General Effect Model
In a general form, a spatiotemporal model assumes a logarithmic connection between rates and time within a given area, with variations in the time trend across different regions [
11]. Under the framework of general spatiotemporal models, the connection between time and rates within an area is presumed to be log-linear, and time trends are considered to differ across regions [
11].
The linear predictor, a rescaled version of the risk designed to enhance the reliability of predictions, is essentially the logarithm or log-odds of the risk
(or, less frequently, another link function of the risk), depending on the chosen model for observation [
47]. Typically, the additive expression of the linear predictor is the summation of several elements or effects, each representing distinct and independent contributions to risk for a specific area and time period. Generalized linear mixed models (GLMM) over space and time are mostly used in the related research. This concept is exemplified in the following examples:
where the regression models include the following:
The pioneering example of modelling space–time relative risk was introduced by Bernardinelli et al. [
11]. In their methodology, they embraced a model for the log relative risk, formulated as
where
is an area (tract) random effect,
is a linear trend term, and
is an interaction random effect. Here,
,
,
.
Following this, Waller et al. [
20] presented a model incorporating both uncorrelated and correlated heterogeneity terms. Subsequently, Xia and Carlin [
48] further advanced and streamlined this model to be
where
is a linear time trend, and
is a variable measured within the tract unit. In this formulation,
,
,
.
Knorr-Held and Besag [
49] devised a model employing binomial likelihood for the number at risk
with a probability
for the counts. They utilized a logit link to the linear predictor, and the formulation is as follows:
where
is a time-based random intercept, is a th age group effect at time , is a gender × race effect for combination at time , is a fixed covariate effect term with an urbanization index , and encompass both correlated and uncorrelated heterogeneity terms, and notably, these terms are not contingent on time variations. In this formulation, , , .
Recent advancements in spatiotemporal modelling encompassed extensions of mixture models, as demonstrated by Böhning et al. [
24]. These models investigated temporal periods independently, without mutual interaction, and for the spatial random effects, they employed a variation of a full multivariate normal spatial prior distribution [
21]. Another notable extension involved the modification of the Knorr-Held and Besag model to incorporate diverse forms of random interaction terms, as proposed by Knorr-Held [
50].
In exploring the spatiotemporal distribution of disease, the loglinear relative risk can be considered and modelled as a GLMM accounting for spatiotemporal dependence and heterogeneity as follows:
Here,
,
,
,
are the spatially structured (spatial autocorrelation), spatially unstructured (spatial heterogeneity), temporally structured (serial autocorrelation), and temporally unstructured (temporal heterogeneity) random effects, respectively, and
is the interaction of a pair of the four random effects above. The random effects implicitly capture the determinants of the relative risk [
51].
3.2.2. Infectious Disease Model
In the context of the infectious disease mechanism, a susceptible–infected–removed (SIR) model was presumed, as proposed by Wormser and Pourbohloul [
52]. The specific variant of the SIR model being evaluated was introduced by Morton and Finkenstädt [
53], and was initially applied to analyse measles epidemic time series and later was expanded into the spatiotemporal domain by Lawson and Song [
54] for an investigation into influenza outbreaks. The size of the susceptible population at risk in region
and period
is denoted as
. The accounting formula for the SIR model is as follows:
The susceptible population for the current period is obtained by subtracting the previous disease counts, denoted as , and the removed population, denoted as , from the susceptible population of the previous period. This equation takes into account the removals from the population , which could be attributed to factors such as death, relocation, and other reasons.
The two-component endemic–epidemic model proposed by Held et al. [
55] can be applied to Bayesian spatiotemporal models. In this model, the authors separated the endemic and epidemic components of the log-linear mean model. Two dissemination patterns for infectious diseases were identified: the local epidemic effect and the lagged epidemic impact. The lagged epidemic effect posited that the pattern of disease counts from the prior period influences the current disease count. Consequently, in areas where disease counts were higher in the prior period, disease dissemination may be more intense.
3.2.3. Environmental Covariates
Bayesian spatiotemporal models offer the capability to integrate diverse data types, including demographic, environmental, and healthcare data. This integration of various data sources can significantly enhance the comprehension of the fundamental processes influencing the spread of diseases [
56,
57]. The types and quantity of covariates generally exhibit a wide range of variations. Six distinct categories of covariates were identified, as reported in the review by Aswi et al. [
15], including climatic, demographic, socioeconomic, entomological, geographic, and temporal covariates. This comprehensive classification was based on the insights derived from the aforementioned review, providing a framework for a more holistic understanding of the spatiotemporal dynamics of disease.
In certain scenarios where factors can influence the linear predictor, an additional component is introduced as follows:
Covariates can be either time-varying or time-invariant
accounting for uncertainty arising from measured confounders, where
is the environmental covariates’ matrix for county
and time
and
is a vector of regression parameters [
9,
58,
59].
3.2.4. Prior Distribution
Bayesian inference takes all the parameters as random variables with prior probability distributions. In Bayesian disease mapping, a spatially structured random effect is commonly addressed in a hierarchical model’s second level using random effects with CAR prior distributions, falling under the category of Markov random fields. Besag et al. [
60] introduced intrinsic and convolution CAR priors, but these lack clear differentiation between structured and unstructured random effects, causing identification issues [
61]. Leroux et al. [
62] proposed the Leroux prior to explicitly distinguish between structured and unstructured spatial variation, offering greater flexibility in representing various spatial correlation scenarios [
63]. Krisztin et al. [
64] suggested using spatial econometrics, applying the Spatial Autoregressive (SAR) model to quantify spatial spillover, also known as the spatial error model in spatial econometrics.
In the unstructured heterogeneity model, the spatial effects are assumed to be sampled from a normal distribution with mean 0 and precision
,
In the clustering model, the Gaussian CAR distribution was employed, enabling the mean of the structured effect to depend on its neighbouring values [
65]. The CAR prior model postulates that the risk of disease occurrence in a given spatial area is influenced by its nearby geographical neighbours, thereby exhibiting spatial autocorrelation. In essence, it assumes that spatial areas in close proximity share more similar disease occurrence risks. Formally, the joint distribution of
is denoted:
where
indicates that areas
and
are neighbors,
is the number of areas that share boundaries with the
th area, and
is the variance component. Spatial dependence in
is modelled with a CAR prior that extends the well-known Besag model [
66] with a Gaussian distribution. This implies that each
is conditional on its neighbour
with variance being dependent on the number of neighbouring counties
of county
.
Based on the existing knowledge of the primary risk variables, the choice between heterogeneity and clustering models can be made. A clustering model emerges when risk factors extend beyond the boundaries of one or more areas, leading to comparable risk values in neighbouring regions. On the other hand, a heterogeneity model arises when the scope of risk determinants is smaller than the size of a region.
Among the various suggestions proposed in the literature for implementing risk smoothing, the approach presented by Besag et al. [
60] has exerted the most significant influence. Referred to as the BYM specification, this method formed the basis for spatial modelling in the majority of the models examined in this evaluation [
67]. The integration of clustering effects and heterogeneity plays a pivotal role in determining the risk associated with a specific area.
The temporal components also include two random effects: the term
is the unstructured time effect, which is specified using an independent mean–zero normal prior to the unknown variance
:
The term
represents the organized temporal impact and is dynamically modelled using a neighbouring structure. In this instance, the structured temporal impact precedes the random walk dynamic model [
10], with its prior density
expressed as follows:
Similar to CAR, the random walk prior model posits that nearby time points (temporal correlation), represented as a smoothly fluctuating nonlinear curve, exert an influence on the temporal fluctuation of the risk of disease occurrence [
68].
For a temporal trend, a random walk prior of order one (RW1):
or order two (RW2):
is appropriate for the temporal trend
, with
as the variance hyperparameter of
.
For the unstructured temporal effects
, the exchangeable prior could be:
where
is the variance hyperparameter of
.
The spatially structured, spatially unstructured, temporally structured, and spatially unstructured effect with the distribution are schematically shown in
Figure 2.
For the four interaction components, the priors are the products of the priors of the corresponding components. The crucial element in spatiotemporal models is the spatiotemporal interaction, which can be particularly challenging given the array of options and the absence of a universally recognized benchmark that consistently performs well. Knorr-Held [
50] introduced a classification system comprising four potential types of interaction between spatial and temporal random effects [
47,
69], shown in
Table 2 and
Figure 3.
A Type I space–time interaction can be conceptualized as distinct, with unstructured unobserved factors for every possible combination of location and period. In this case, the assumption is that risk is independent in space but similar throughout time. A Type II interaction consists of temporally organized but geographically unstructured effects. Here, the assumption is that risk shows a temporal correlation but is independent across different locations. A Type III interaction assumes that risk exhibits geographic autocorrelation for each time period but is independent over time. This type comprises spatially structured but temporally unstructured effects. A Type IV space–time interaction assumes that global trends are both spatially and temporally connected, capturing both spatial and temporal structures’ risks.
In the Bayesian framework, a notable challenge arises when selecting appropriate prior distributions and determining the corresponding parameter values. The intricacies of hierarchical modelling are further compounded by the imperative to define prior distributions for both model parameters and variance parameters, commonly known as hyperparameters. Widely employed distributions in this context encompass logGamma for log-inverse variance, Half-Cauchy, Penalized Complexity, and Uniform distribution for hyperparameter standard deviation [
70].
3.3. Bayesian Inference
The Bayesian approach, named after Thomas Bayes, an English clergyman (1702–1761), revolved around a method of data analysis that emphasizes conditional probabilities linking observed and unknown quantities. This paradigm enables the assessment of the likelihood of a specific hypothesis based on the observed data.
Bayesian methods offer the advantage of incorporating prior information with observed data to estimate parameters and make predictions. Although frequentist statistical analysis still underpins the majority of practical statistical applications, Bayesian methods have emerged as the standard in numerous fields, particularly in the modelling of spatiotemporal epidemiology. Their utility is particularly evident in public health policy and clinical settings, where study results can directly influence decision making.
Bayes’ theorem, as articulated by Gelman [
71], is fundamental to Bayesian statistics and states that
which means Posterior
likelihood × prior, where
is the observed data and
is the unobservable vector quantities or population parameters of interest. As elucidated by Austin et al. [
72], the probability distribution of a parameter was derived by multiplying its prior probability distribution by the likelihood function and then dividing the result by the probability of the data, conditioned on the observed data. The resulting distribution is referred to as the posterior distribution.
This theorem provides a framework for updating beliefs (posterior probability) based on new evidence (likelihood) and prior knowledge (prior probability). The versatility of Bayesian methods makes them valuable tools in exploring complex relationships and uncertainties, offering a comprehensive approach to statistical modelling in various disciplines.
A tutorial on the Bayesian workflow for disease transmission modelling in Stan has been provided (
https://mc-stan.org/users/documentation/case-studies/boarding_school_case_study.html, accessed on 28 December 2023). The tutorial illustrates the processes of constructing, fitting, and assessing disease transmission models utilizing a fundamental susceptible–infected–recovered (SIR) model.
Figure 4 provides an overview of the Bayesian model’s functionality through a simplified process (adapted from Martin et al. [
73]).
3.4. Computational Algorithms
3.4.1. Markov Chain Monte Carlo
Markov chain Monte Carlo (MCMC) methods, which simulate direct draws from complex distributions of interest, are widely employed in Bayesian computation. MCMC algorithms generate a Markov chain by utilizing past sample values to randomly generate the next sample value. Typically, the Markov chain begins at any random point and gradually converges to the desired distribution. The Gibbs sampler [
74] and the Metropolis algorithm [
75,
76] are commonly used in a variety of Bayesian applications. For further details and variations on MCMC approaches, refer to Gilks et al. [
77].
3.4.2. Integrated Nested Laplace Approximation
Rue et al. [
78] have introduced the integrated nested Laplace approximation (INLA) as an alternative to sampling-based methods, addressing the slow convergence and high computational cost associated with MCMC methods. INLA is particularly useful for latent Gaussian field structured additive regression models, providing an approximation for Bayesian inference.
In each equation, the symbol can be employed to represent the vector of all Gaussian variables and the vector of hyperparameters, which are not necessarily Gaussian. Another approach to express the marginal posterior density of an image is as follows:
To approximate the first component
, three different approaches are possible: a Gaussian approximation, a full Laplace approximation, and a simplified Laplace approximation. Rue et al. [
78] introduced the simplified Laplace approximation, which was based on a series expansion of the full Laplace approximation. This method is less time consuming and proves to be highly competitive in many applications. Finally, an approximation of the posterior marginal density is given using
As computation with INLA has gained popularity, several comparisons between INLA methods and MCMC algorithms have been conducted. These investigations have consistently demonstrated that the two strategies perform similarly. According to Schrödle and Held [
69], the outcomes of MCMC simulations and the full Laplace approximation of INLA are nearly identical. Other studies, including posterior predictive checking by Held et al. [
79], and survival models by Martino et al. [
80], have reported very similar outcomes between INLA and MCMC. The INLA software package, recently developed in R (Integrated Nested Laplace Approximation), has become a commonly used approach for implementing Bayesian models [
16].
3.5. Model Fit Criteria
In disease mapping studies, the Deviance Information Criterion (DIC), introduced by Spiegelhalter et al. [
81], was widely employed as one of the goodness-of-fit statistics. DIC relies on comparing the average deviance and the deviance of the posterior parameter estimates. It is defined as
where D[.] is the deviance of the model, and θ is the parameter set of the model. In model comparison, a smaller DIC indicates a better goodness-of-fit.
Another goodness-of-fit statistic could be Watanabe–Akaike information criterion (WAIC) [
82].
The Mean Squared Error (MSE) for comparing data points and posterior mean estimates is given by Kim et al. [
1]:
where
is the number of MCMC samples,
is the total number of the area, and
is the total number of time periods. The MSE is computed as the sum of the squared differences between the actual data
and posterior estimates at each sampling period
and then averaged over the entire time, space, and sampling periods.
For short-term prediction evaluation, the one-step-ahead prediction from the posterior estimates of the last time period can be compared with real data, as suggested by Kim et al. [
1].
For each sampling period , is sampled from the estimated, while is calculated using the values from the previous time period .
In addition to the measures mentioned above, the predictive performance of models can be further evaluated by assigning numerical scores based on their predictive distributions. In accordance with this approach, given a set of spatiotemporal observations
, the logarithmic score (LS) [
83], and probability integral transform (PIT) [
84] are defined as follows:
and
where
represents the conditional predictive ordinate [
85]. It signifies the cross-validated predictive probability mass at the observed count
.
A smaller LS indicates a better prediction. The PIT histogram provides an indication of the model fit across all observations. The closer the PIT histogram aligns with the uniform distribution histogram, the better the overall fit of the model [
86].
Various model selection criteria used in studies include Bayesian cross-validation criterion (BCV), mean absolute percentage error (MAPE), Root Mean Squared Error (RMSE), Continuous Ranked Probability Score (CRPS), highest probability (HPM), mean absolute error (MAE), Pearson correlation coefficient (
r) [
87], which are appropriate statistics for prediction of performance evaluation, and for the best prediction (BPM) [
16]. Each criterion provides a different perspective on model performance, and the choice often depends on the specific goals and characteristics of the analysis.
5. Applications
The Bayesian spatiotemporal model serves the crucial purpose of estimating disease transmission risk, pinpointing areas with elevated spatial incidence, and aptly considering the susceptibility–infection–recovery model within the population. Simultaneously, this model is adept at assessing the influence of physical geographical and socioeconomic factors on disease incidence. Natural factors encompass elements like temperature, rainfall, and air pollution, while socioeconomic factors include aspects such as economic indicators, healthcare accessibility, and population demographics.
The applications of Bayesian spatiotemporal models are diverse. Here are the four key areas:
Disease surveillance: offering real-time risk estimates and assisting dynamic disease surveillance.
Epidemic forecasting: by modelling the spatiotemporal dynamics of diseases, these models enable the forecasting of future outbreaks, facilitating timely intervention and resource planning.
Identifying hotspots: effectively pinpointing spatial and temporal hotspots, these models highlight areas with consistently high disease risk or where risk is escalating over time.
Public health planning: the insights derived from Bayesian spatiotemporal models play a pivotal role in guiding public health planning and resource allocation, identifying priority areas and populations at high risk.
Bayesian spatiotemporal models serve as powerful tools for comprehensively analysing the distribution and trends of both non-communicable and infectious diseases across diverse regions and over varying time periods.
In the realm of non-communicable diseases (e.g., cardiovascular diseases, cancers, chronic respiratory diseases, and diabetes), these models play a pivotal role. For instance, in cardiovascular disease research, Bayesian spatiotemporal models can effectively map cardiovascular mortality, pinpoint regions with elevated prevalence [
93,
94], identify associated risk factors [
95], and evaluate the nuanced impacts of these factors. Likewise, in cancer epidemiology, these models facilitate the meticulous analysis of cancer incidence and mortality. They empower researchers to identify clusters of cancer cases, delve into potential environmental risk factors [
96], and gauge the effectiveness of interventions [
97]. Moreover, in the study of diseases like asthma or chronic obstructive pulmonary disease, Bayesian spatiotemporal models prove invaluable by helping identify areas with high prevalence and investigating potential environmental triggers [
98]. In the context of diabetes, these models are instrumental in the analysis of spatial and temporal patterns of diabetes prevalence. They contribute to predicting trends and assessing influential factors contributing to the global prevalence of diabetes in recent years [
99].
In the realm of infectious diseases, the significance of Bayesian spatiotemporal models is heightened, as they provide a nuanced understanding of disease dynamics and interactions. The detailed importance of these models is elucidated below.
5.1. Viral Infection
The virus typically spreads through respiratory droplets, direct contact, or vectors like mosquitoes. Consequently, in densely populated areas, viral infections often experience swift and extensive dissemination. Geographically, viral infections can have a global distribution, but prevalence rates vary based on factors such as the environment, population density, and socioeconomic conditions. Temporally, viral infections may display seasonal patterns distinct from other infection patterns. For instance, COVID-19 and influenza are more prevalent during colder months.
5.1.1. COVID-19
Bayesian spatiotemporal models have been widely utilized for COVID-19 analysis and forecasting globally, including in the United States [
100,
101,
102], England [
103,
104], Spain, Italy, Germany, Sweden [
87,
105,
106,
107], Africa [
108], and in some regional areas such as the West Java Province, Indonesia [
109], and the Greater Seoul Area, Korea [
110]. As an illustration, Nazia et al. [
111] applied a Bayesian hierarchical spatiotemporal model to assess the COVID-19 risk. Their model incorporated satellite-derived remote sensing data on land surface temperature and various socioeconomic covariates from January 2020 to October 2021 for 140 neighbourhoods in Toronto. The resulting spatial relative risk and spatiotemporal trends were visually represented, revealing heterogeneous risk patterns, particularly in western and southern Toronto (
Figure 5).
5.1.2. Influenza
A Bayesian spatial CAR model assessed the spatiotemporal distribution of H7N9 infection risk in Shanghai from February 19 to 14 April 2013, revealing high variations in the east and centre of the city [
112]. Another study focused on China, using a Bayesian hierarchical conditional autoregressive model with five covariates to predict seasonal relative risk for human H7N9 infection [
113]. Lemey et al. [
114] applied Bayesian inference to H1N1pdm genetic data for global epidemic spread considering meteorological and socioeconomic variables and accounting for spatial autocorrelation. [
115] used Bayesian phylogeography on H5N1 and H7N9 haemagglutinin sequences to identify migration patterns and investigate associated risk factors [
2].
5.1.3. Haemorrhagic Fever
Two Bayesian spatiotemporal CAR models were employed, one allowing for discontinuities in risk between neighbouring areas. Utilizing these models facilitates the identification of distinct groups of areas and assesses the impact of climatic covariates, offering valuable insights for informing policy decisions [
116]. Another study focused on HFRS incidence in mainland China from 2015 to 2018, employing a Bayesian hierarchical spatiotemporal distribution model. The analysis revealed significant positive associations between HFRS development and woodland and grassland areas, economic factors, and traffic levels, with traffic having the most substantial promoting effect [
117].
5.1.4. Ebola
Bayesian model inference was employed to assess the Ebola outbreak in Western Africa (2014–2015), integrating individual-level spatial information with epidemiological data. Superspreaders were identified as crucial for sustaining epidemic transmission [
118]. A Bayesian hierarchical Poisson model estimated Ebola virus disease risk, linking it to factors like households lacking radios, increasing rainfall, and urban land in districts [
119]. Utilizing fully Bayesian geostatistical analysis through stochastic partial differential equations revealed a positive association between violent events in affected areas and reported Ebola virus disease cases and deaths [
120].
5.1.5. Dengue
The Bayesian method suggests a spatial and temporal positive correlation between the spreading pattern of Aedes aegypti, as indicated by Dengue haemorrhagic fever incidence in East Java [
121]. The relationship between egg numbers and climatic/environmental variables was examined through Bayesian zero-inflated spatiotemporal models [
122]. Martínez-Bello et al. [
123] employed alternative models to assess dengue transmission risk. A systematic review by Aswi et al. [
15] summarized Bayesian spatial and spatiotemporal approaches for modelling dengue fever. To better understand the effects of niche variables in the epidemiological process, Poisson-lognormal and binomial models were explored using various Bayesian spatiotemporal methods [
124]. Different structure models, including autoregressive and random walk, were developed for dengue counts in Kendari, Indonesia [
125].
5.1.6. Rabies
The absence of human-to-human transmission means that the spread of rabies is not substantially influenced by human distribution or migration [
126]. Bayesian hierarchical frameworks were employed to assess spatial and spatiotemporal patterns of rabies infection among striped skunk cases in the North Central Plains of the US [
127]. In eastern Germany, a hierarchical Bayesian state–space model was applied to analyse three decades of fox rabies cases and oral rabies vaccination campaigns [
128]. Spatiotemporal Bayesian regression models were used to evaluate the impact of accessibility to rabies on post-exposure prophylaxis patients, generating a risk map for optimal new centre locations in Cambodia [
129]. In a One Health approach focusing on Thailand, a Bayesian spatial regression model, specifically a CAR Bayesian zero-inflated Poisson regression, quantified the location-based risk of dog-mediated rabies for both human and animal populations [
130]. The study further modelled ecological and socio-economic variables associated with dog rabies using univariate analyses with zero-inflated negative binomial regression and multivariable spatial analyses. In China, annual rabies incidence data at the province level from 2004 to 2019 were analysed using a Bayesian hierarchical spatiotemporal model to determine the impacts of environmental, economic, and demographic factors [
131]. Kanankege et al. [
130] also fitted a conditional autoregressive Bayesian model for human and animal rabies, emphasizing factors such as dog bites and ownership.
5.2. Bacterial Infection
Bacterial infections can spread through various avenues, such as person-to-person contact, contaminated food or water, and vectors. From a spatial standpoint, these infections often manifest in localized patterns, influenced by environmental factors and population dynamics. The prevalence can vary based on factors like sanitation and living conditions. Temporally, bacterial infections typically exhibit a slower spread compared to viruses. Predicting their temporal distribution is more challenging than that of viral infections. Some bacterial diseases endure year-round, while others experience intermittent outbreaks, adding complexity to their overall temporal dynamics.
5.2.1. Salmonellosis
A Bayesian hierarchical spatiotemporal model identified significant spatial and temporal influences on
Salmonella spp. isolation from litter, along with notable effects from broiler house size, total housing area per farm, broiler house type, and the number of litter recycles [
132]. Subsequently, Bayesian spatial regression, using a reparametrized Besag-York-Mollié Poisson model, was employed to quantify structured and unstructured spatial effects while considering potential risk factors for
Salmonella infection at the province level [
133].
5.2.2. Tuberculosis
Bayesian spatiotemporal models have been applied to study tuberculosis in various countries, including India [
134], China [
135,
136,
137], Kenya [
138], Indonesia [
139], Ethiopia [
140]. Srinivasan and Venkatesan [
134] examined disease rates among Indian states, assessing neighbourhood tuberculosis infections and their causes using Bayesian models with a CAR approach for spatial and temporal pattern changes in two survey periods. Cao et al. [
137] identified influencing factors for tuberculosis in certain cases. Li et al. [
135] employed a Bayesian spatiotemporal model to analyse the correlation of socioeconomic, healthcare, demographic, and meteorological factors with the population-level number of tuberculosis cases. Amsalu et al. [
141] used a Bayesian CAR model to explore the spatial–temporal pattern of tuberculosis in the geriatric population and its association with meteorological and sociodemographic factors. Liu et al. [
142] constructed Bayesian spatiotemporal models to analyse the association between ambient air pollutants and pulmonary tuberculosis incidence, adjusting for socioeconomic covariates. Wahyuni and Syam [
143] employed a Bayesian approach for spatiotemporal modelling of tuberculosis cases in Makassar City, comparing seven Bayesian Spatiotemporal CAR models using measures like DIC and WAIC. Madden et al. [
144] explored different spatiotemporal random-effects models, including the negative binomial Besag-York-Mollié model, using comprehensive data to examine the association between covariates and the number of bovine tuberculosis cases.
5.2.3. Brucellosis
Ahasan et al. [
145] used spatial analysis to estimate the true prevalence of brucellosis and assess test performances. In South Korea, Lim et al. [
146] conducted spatial analysis to examine the relationship between human and bovine brucellosis. Rahmanian et al. [
147] analysed temporal trends of human brucellosis in Yazd Province, Iran, between 2013 and 2018, predicting future incidence trends. Liang et al. [
148] estimated the spatiotemporal distribution of human brucellosis in Inner Mongolia, China, and identified influencing factors using Bayesian spatiotemporal models.
5.2.4. Anthrax
Bayesian spatiotemporal models for anthrax were applied in Vietnam [
149,
150] and Kenya [
151]. In Kenya, a Bayesian approach was employed to analyse a long-term dataset of livestock anthrax cases from 2006 to 2020. Spatial and spatiotemporal models were developed to investigate the distribution and socio-economic drivers of anthrax occurrence and incidence at both national and sub-county levels [
151].
5.3. Parasitic Infection
Parasitic infections typically propagate through vectors, such as mosquitoes transmitting malaria, or through the consumption of contaminated water and food. The life cycle of a parasite often spans multiple hosts, impacting both its spatial and temporal distribution. Geographically, the prevalence of parasitic infections is shaped by the habitat and behaviour of vectors or intermediate hosts participating in the transmission cycle. Temporally, parasitic infections can manifest with intricate life cycles; certain parasitic diseases may experience seasonal variations, while others may be characterized by more extended periods aligned with the parasite’s life cycle.
5.3.1. Malaria
Bayesian spatiotemporal models for malaria have been employed in various regions, including Burkina Faso [
152], Pará [
153], Nigeria [
154], Rwanda [
155], and Thailand [
156]. Alegana et al. [
157] utilized a Bayesian conditional–autoregressive model to assess spatial and temporal variations in malaria incidence, considering test positivity rates and health facility utilization. Haddawy et al. [
158] applied Bayesian networks to model malaria, creating village-level models with weekly temporal resolution in the Tha Song Yang district, Thailand, using cases and environmental covariates, including NDVI, LST, rainfall, slope, distance to nearest stream, stream density, and distance to border. Semakula et al. [
159] used spatiotemporal models to estimate excess probability for decision making for malaria incidence reduction, employing SIR and BYM models with routine health facility data from 2012 to 2018 in Rwanda. A systematic review on spatial and spatiotemporal methods for mapping malaria risk was conducted by Odhiambo et al. [
14]. In their review, approximately half (44%) of the studies employed variable selection techniques prior to mapping with rainfall and temperature selected in over 50% of the studies. Malaria incidence (47%) and prevalence (35%) were the most commonly mapped outcomes, with Bayesian geostatistical models often (31%) being the preferred approach to risk mapping.
5.3.2. Toxoplasmosis
Bayesian spatiotemporal models for toxoplasmosis have been utilized in various regions, including China [
160], Brazil [
161], and Africa [
162]. In Jiangsu province, China, Yang et al. [
163] employed Bayesian spatiotemporal models to analyse Schistosoma japonicum infection risk, determining relationships between key climatic factors and infection prevalence at the county level. Welton and Ades [
164] presented a Bayesian evidence synthesis model combining data on toxoplasmosis seroprevalence, seroconversion, and tests of recent infection to estimate its current incidence in the UK, incorporating a hierarchical structure over geographic regions, a random-walk model for temporal effects, and a fixed-age effect. Hu et al. [
160] identified high-risk counties for schistosomiasis in Anhui province, China, and described trends using a Bayesian hierarchical spatiotemporal model with annual parasitological and environmental data. Melo et al. [
165] assessed the temporal trend, spatial analysis, and spatiotemporal clusters of infant mortality associated with congenital toxoplasmosis in Brazil, with spatial analysis (including Global Moran Index, Global Geary’s Contiguity, and Getis-Ord General statistics) and spatiotemporal scan methods.
5.4. Other Infections
The assessment of Mediterranean spotted fever, rickettsiosis in Italy, involved the analysis of spatiotemporal distribution using hospitalization records, as explored in the records of Gomez-Barroso et al. [
166]. In Brazil, Baquero and Machado [
167] investigated the spatiotemporal dynamics and risk factors associated with Leptospirosis. Similarly, in Mahasarakham province, Thailand, Viroj et al. [
168] delved into the spatiotemporal aspects of Leptospirosis. Meanwhile, Dhewantara et al. [
169] studied Leptospirosis in China, contributing insights into the disease’s dynamics and associated risk factors. These studies collectively provide a comprehensive view of the spatiotemporal dynamics and risk factors associated with Leptospirosis across diverse geographic regions, emphasizing the importance of understanding the spatial and temporal dimensions of infectious diseases for their effective management and prevention strategies. Consequently, the predominant application of Bayesian spatiotemporal models in the context of infectious diseases is visually depicted in
Figure 6.
7. Conclusions
This study is designed to address the specific requirements of spatiotemporal modelling in spatial epidemiology. It takes into account the diverse and intricate nature of Bayesian spatiotemporal models, identifying those most suited to the task. The review encompasses a detailed examination of research on Bayesian spatiotemporal models within spatial epidemiology. The methodological intricacies, ranging from the spatiotemporal distribution of data to Bayesian deduction, spatiotemporal evolution rules, and calculation methods, are thoroughly discussed. Furthermore, the application fields, advantages, and necessary precautions associated with this method are critically reviewed. Bayesian spatiotemporal models naturally accommodate uncertainty in parameter estimates and facilitate the incorporation of both spatial and temporal dependencies, which are pivotal in advancing our understanding of the dynamics of diseases. The integration of diverse data sources, such as environmental variables and demographic information, into a unified framework, enhances the analysis of the complex interplay of factors influencing disease dynamics. Despite facing challenges, ongoing research endeavours are anticipated to refine these models, enhancing their accessibility, interpretability, and applicability to a broader spectrum of spatial epidemiological inquiries. The fusion of advanced statistical methodologies and computational tools is poised to continue propelling advancements in this field. This study serves to elucidate the Bayesian model and its practical utilization in spatial epidemiology including both non-communicable and infectious diseases, and the understanding of control factors of these diseases. Such advancements are of profound significance in fostering better comprehension and control of epidemic transmission.