Next Article in Journal
Limited Knowledge of Chronic Kidney Disease among Type 2 Diabetes Mellitus Patients in India
Previous Article in Journal
Drivers and Ideal Types towards Energy Transition: Anticipating the Futures Scenarios of OECD Countries
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Addendum

Addendum: Wang et al. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health, 2018, 15(8):1740; doi:10.3390/ijerph15081740

1
School of Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
2
School of Public Health, Center of Statistical Science, Peking University, Beijing 100871, China
3
School of Mathematical Sciences, Center of Statistical Science, Peking University, Beijing 100871, China
4
China National Center for Food Safety Risk Assessment, Beijing 100022, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2019, 16(8), 1442; https://doi.org/10.3390/ijerph16081442
Submission received: 16 April 2019 / Accepted: 16 April 2019 / Published: 23 April 2019
(This article belongs to the Section Health Behavior, Chronic Disease and Health Promotion)
The authors wish to update the Abstract and Section 3 in their paper published in the International Journal of Environmental Research and Public Health (IJERPH) [1].
They would like to rewrite the abstract as follows:
Abstract: Foodborne diseases have a big impact on public health and are often underreported. This is because a lot of patients delay treatment when they suffer from foodborne diseases. In Hunan Province (China), a total of 21,226 confirmed foodborne disease cases were reported from 1 March 2015 to 28 February 2016 by the Foodborne Surveillance Database (FSD) of the China National Centre for Food Safety Risk Assessment (CFSA). The purpose of this study was to make use of the daily number of visiting patients to forecast the daily true number of patients. Our main contribution is that we take the reporting delays into consideration and apply a Bayesian hierarchical model for this forecast problem. The data shows that there were 21,226 confirmed cases reported among 21,866 visiting patients, a proportion as high as 97%. Given this observation, the Bayesian hierarchical model was established to predict the daily true number of patients using the number of visiting patients. We use several scoring rules to assess the performance of different nowcasting procedures. We conclude that Bayesian nowcasting with consideration of right truncation of the reporting delays has a good performance for short-term forecasting and could effectively predict the epidemic trends of foodborne diseases. Meanwhile, this approach could provide a methodological basis for future foodborne disease monitoring and control strategies, which are crucial for public health.
In the end of the first paragraph in Section 3, the authors would like to update the last two sentences of the paragraph and add some citations. The revised sentences are as follows:
In this paper, we apply a Bayesian nowcasting model proposed by Höhle and an der Heiden [2] to forecast the daily total number of cases. Thanks to Salmon et al. [3], who provided a convenient R package “surveillance”, the inference for the model could be easily performed. The R package surveillance also contains a few other nowcasting methods that we also tried and did comparisons with using the scoring rules implemented in the package. The results are shown in Section 4. Below we review the model.
The authors revised Section 3.2 to describe the inference approach proposed by Höhle and an der Heiden (in Section 3.2 of [2]) in some greater detail. This part is now as follows:
For the convenience of the reader, we describe the inference approach proposed by Höhle and an der Heiden (in Section 3.2 of [2]) in some greater detail.
Define p d as the (time-homogeneous) probability that a case will have a reporting delay of d days. The p d ’s satisfy the following equation: d = 0 D p d = 1 . Following Kalbfleisch and Lawless’s previous work [4] and Zeger et al.’s previous work [5], we assume that the occurrence time of cases follows an underlying inhomogeneous Poisson process. A reasonable data generating process for the daily number of cases is thus as follows:
N t | λ t Po ( λ t ) ( n t , D , n t , D 1 , , n t , 0 ) | N t , p MN ( N t , ( p D , p D 1 , , p 0 ) )
where Po ( λ ) denotes the Poisson distribution with expectation λ > 0 and MN ( N , p ) denotes the multinomial distribution with size parameter N and probability vector p . Nowcasting for a given time T can thus be divided into steps of determining the λ t ’s, estimating the unknown delay distribution (i.e., the p d ’s), and finally predicting the unobserved n t , d ’s in order to compute the total N t . As T increases, and if the assumption about a time-homogeneous delay distribution is acceptable, the available data make it possible to estimate the delay distribution better and better, and hence the quality of the predictions near T improves with time.
Consider a fixed time T and define p T = ( p T , D , p T , D 1 , , p T , 1 ) as the probability vector denoting that a case is reported with a delay of d days given the observed incomplete information at time T , i.e., the set of n t , d , where t + d T . We choose as prior distribution the generalised Dirichlet distribution G D ( α , β ) with fixed constants α = ( α 0 , α 1 , , α D ) and β = ( β 0 , β 1 , , β D ) . Now we use Property 3 in the Web appendix of [2] that shows that the posterior of p under right-truncated multinomial sampling is again a GD distribution with parameters α T * , β T * given by
α T , i * = α i + τ = 0 T D + i n τ , D i β T , i * = β i + τ = 0 T D + i ( N τ , τ + D i n τ , D i ) ,    i { 0 , , D 1 } .
hence, for a given T we can assume the following model hierarchy for the time points t { T D , , T } :
p T G D ( α T * , β T * ) λ t G a ( a λ , b λ ) N t | λ t P o ( λ t ) N t , T | N t , p T Bin ( N t , q T , T t ) ,
where q T , d = δ = 0 d p T , δ is the proportion reported within a delay of d days. We denote by G a ( a λ , b λ ) the gamma distribution with parameters a λ > 0 , b λ > 0 . For this hierarchical model, the marginal distribution of N t is a negative binomial distribution with the following mean and variance:
E ( N t ) = μ λ = a λ b λ Var ( N t ) = μ λ + μ λ 2 a λ .
To estimate N t given the observed counts n t , d at time T , we have to perform two steps: (1) update the delay distribution q T and (2) update the prediction for N t :
(1)
For the given T we compute α T * , β T * as stated above. We then draw for k = 1 , , K random vectors p T ( k ) G D ( α * T , β T * ) by the algorithm of Wong [6] and calculate
q T , d ( k ) = δ = 0 d p T , δ ( k ) .
(2)
Given the updated delay distribution q T ( k ) and the observed counts n t , d , we can now update the prediction of N t , t = T D , , T .
For n { 0 , 1 , 2 , } we approximate by Monte Carlo sampling
f ( N t = n | N t , T ) 1 K k = 1 K f n , t ( k ) ,    t { T D , , T }
An application of Bayes theorem provides f n , t ( k ) = f ̃ n , t ( k ) / c t ( k ) , where c t ( k ) = n = 0 f ̃ n , t ( k ) is the normalization constant and
f ̃ n , t ( k ) = f ( N t , T | N t = n , q T , T t ( k ) ) f ( N t = n | λ t ) f ( λ t )
for all t { T D , , T } . The factors of the last equation can be evaluated using the distributional assumptions of the model hierarchy. For numerical convenience we do not sum over the entire support { 0 , 1 , 2 , } to get the normalization, but instead approximate
c t ( k ) n = 0 N max f ̃ n , t ( k ) ,
where N max is chosen sufficiently large.
Finally, the authors would like to update “proposed in this paper” to “described in this paper”.
The changes do not affect the results. The manuscript will be updated and the original will remain online on the article webpage, with a reference to this addendum.

References

  1. Wang, X.; Zhou, M.; Jia, J.; Geng, Z.; Xiao, G. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health 2018, 15, 1740. [Google Scholar] [CrossRef] [PubMed]
  2. Höhle, M.; An der Heiden, M. Bayesian nowcasting during the STEC O104:H4 outbreak in Germany, 2011. Biometrics 2014, 70, 993–1002. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Salmon, M.; Schumacher, D.; Höhle, M. Monitoring Count Time Series in R: Aberration Detection in Public Health Surveillance. J. Stat. Softw. 2016, 70, 1–35. [Google Scholar] [CrossRef]
  4. Kalbfleisch, J.D.; Lawless, J.F. Inference Based on Retrospective Ascertainment: An Analysis of the Data on Transfusion-Related AIDS. J. Am. Stat. Assoc. 1989, 84, 360–372. [Google Scholar] [CrossRef]
  5. Zeger, S.L.; See, L.C.; Diggle, P.J. Statistical methods for monitoring the AIDS epidemic. Stat. Med. 1989, 8, 3–21. [Google Scholar] [CrossRef] [PubMed]
  6. Wong, T.T. Generalized Dirichlet distribution in Bayesian analysis. Appl. Math. Comput. 1998, 97, 165–181. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Wang, X.; Zhou, M.; Jia, J.; Geng, Z.; Xiao, G. Addendum: Wang et al. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health, 2018, 15(8):1740; doi:10.3390/ijerph15081740. Int. J. Environ. Res. Public Health 2019, 16, 1442. https://doi.org/10.3390/ijerph16081442

AMA Style

Wang X, Zhou M, Jia J, Geng Z, Xiao G. Addendum: Wang et al. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health, 2018, 15(8):1740; doi:10.3390/ijerph15081740. International Journal of Environmental Research and Public Health. 2019; 16(8):1442. https://doi.org/10.3390/ijerph16081442

Chicago/Turabian Style

Wang, Xueli, Moqin Zhou, Jinzhu Jia, Zhi Geng, and Gexin Xiao. 2019. "Addendum: Wang et al. A Bayesian Approach to Real-Time Monitoring and Forecasting of Chinese Foodborne Diseases. Int. J. Environ. Res. Public Health, 2018, 15(8):1740; doi:10.3390/ijerph15081740" International Journal of Environmental Research and Public Health 16, no. 8: 1442. https://doi.org/10.3390/ijerph16081442

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop