Prediction of Safety Risk Levels of Veterinary Drug Residues in Freshwater Products in China Based on Transformer

Jiang, Tongqiang; Liu, Tianqi; Dong, Wei; Liu, Yingjie; Hao, Cheng; Zhang, Qingchuan

doi:10.3390/foods11121690

Open AccessArticle

Prediction of Safety Risk Levels of Veterinary Drug Residues in Freshwater Products in China Based on Transformer

by

Tongqiang Jiang

^1,2

,

Tianqi Liu

^1,2,

Wei Dong

^1,2,*

,

Yingjie Liu

^1,2,

Cheng Hao

^1,2 and

Qingchuan Zhang

^1,2

¹

National Engineering Research Centre for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China

²

School of E-Business and Logistics, Beijing Technology and Business University, Beijing 100048, China

^*

Author to whom correspondence should be addressed.

Foods 2022, 11(12), 1690; https://doi.org/10.3390/foods11121690

Submission received: 25 May 2022 / Revised: 4 June 2022 / Accepted: 7 June 2022 / Published: 9 June 2022

(This article belongs to the Special Issue Food Risk Analysis: Current Status of Research and Future Perspectives)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Early warning and focused regulation of veterinary drug residues in freshwater products can protect human health and stabilize social development. To improve the prediction accuracy, this paper constructs a Transformer-based model for predicting the safety risk level of veterinary drug residues in freshwater products in China to conduct a comprehensive assessment and prediction of the three veterinary drug residues with the maximum detection rate in freshwater products, including florfenicol, enrofloxacin and sulfonamides. Using the national sampling data and consumption data of freshwater products from 2019 to 2021, this paper constructs a self-built dataset, combined with the k-means algorithm, to establish the risk-level space. Finally, based on a Transformer neural network model, the safety risk assessment index is predicted on a self-built dataset, with the corresponding risk level for prediction. In this paper, comparison experiments are conducted on the self-built dataset. The experimental results show that the prediction model proposed in this paper achieves a recall rate of 94.14%, which is significantly better than other neural network models. The model proposed in this paper provides a scientific basis for the government to implement focused regulation, and it also provides technical support for the government’s intervention regulation.

Keywords:

risk assessment; veterinary drug residues; freshwater products; safety risk-level prediction; transformer

1. Introduction

With the improvement of people’s living standards, freshwater products are rich in protein, calcium and other nutrients, and they have gradually become an essential food on people’s tables. Moreover, as a large country in the freshwater industry, the export of freshwater products plays an important role in China’s foreign trade, so the quality and safety of freshwater products is not only related to people’s lives and health, but also related to the influence and status of China in international foreign trade. Therefore, the quality and safety supervision of freshwater products is very important [1,2]. However, as the scale of aquaculture continues to expand, the density of aquaculture is also increasing, and the outbreak of various aquatic diseases is becoming more frequent, resulting in the production process being prone to excessive drug use, not in accordance with the provisions of the drug and the use of prohibited drugs [3]. Drug abuse can cause drug residues in the bodies of aquaculture animals, and drug residues on humans and the environment are mainly chronic, long-term and cause cumulative harm. Research reports that studies have shown that some drug residues can produce carcinogenic effects, mutagenic effects, teratogenic effects, developmental toxicity, accumulation in the body, immunosuppression, sensitization and the induction of drug-resistant strains of bacteria [4].

The frequent occurrence of major food safety incidents in the country has had a bad impact on the international arena, not only posing a serious threat to the physical and mental health of consumers, but also causing an incalculable impact on China’s foreign trade in food [5], so it is crucial to monitor the quality and safety risks of domestic aquatic products, which is related to the safety of domestic people’s livelihood, as well as China’s international economic status. Through freshwater product safety risk assessment and early warning analysis, we can provide scientific means for domestic market supervision, on the one hand, and provide safety assurance for healthy freshwater fish farming and consumers’ peace of mind, on the other hand.

Zhang et al. [6] described the rationale and role of risk assessment, summarized the process of veterinary drug residue risk assessment, outlined the qualitative and quantitative risk assessment methods used in the field and proposed the establishment of a new regulatory model for meat safety to improve the existing regulatory system for meat safety. Alan et al. [7] pointed out that JECFA comprehensively addressed both acute and chronic risks through corresponding estimates of acute and chronic exposures and appropriate correction of the Gallo–Torres model for limited bioavailability of bound residues. Wei et al. [8] ranked the risk matrix for veterinary drug residues using the Council on Veterinary Drug Residues and discussed the types of high risk for veterinary drug residues. Demetra et al. [9] studied the dietary exposure assessment of veterinary antibiotics in pork consumed by children and adolescents in Cyprus and found that antibiotic residues in pork were below the allowable daily intake (ADI), and the risk to human health from antibiotic exposure was low based on the estimated daily intake (EDI) of veterinary antibiotics.

The existing literature mainly uses dietary exposure, the food safety index or the target hazard quotients (THQ) method and other individual indicators for risk assessment of veterinary drug residues in edible products, without comprehensive use of existing consumption data, safety intake data, sampling data and other comprehensive assessments and gradings of veterinary drug residues in freshwater products in multiple provinces across the country.

In recent years, due to its ability to analyze historical data of dynamic systems and predict future operating patterns [10], a time series analysis has been commonly used in weather forecasting [11], earthquake precursor forecasting [12], and crop pest and disease hazard forecasting [13], a feature that also meets the requirements of food safety risk prediction. Jiang et al. [14] used hyperspectral discrete wavelet transformation and deep learning to detect and identify veterinary drug residues in beef. Wang et al. [15] predicted the risk hazard of heavy metals in processed grain products using a voting integrated deep learning approach. Jiang et al. [16] used deep learning to grade and predict the safety risk level of carbofuran pesticide residues in Chinese vegetables.

After the statistics of the State Administration for Market Regulation Statistics sampling data, it was found that in freshwater products, 7 out of 16 drugs are detected, among which florfenicol, enrofloxacin and sulfonamides have the highest detection rates and are much higher than the residues of other veterinary drugs. The detection method used in this study is liquid chromatography-tandem mass spectrometry.

In summary, this paper investigates the national sampling data of veterinary drug residues in freshwater products from 2019 to 2021, and it selects the three veterinary drug residues with the maximum detection rate in freshwater products for assessment and prediction, including florfenicol, enrofloxacin and sulfonamides. In this paper, using the national sampling data of veterinary drug residues in freshwater products from 2019 to 2021 and the weekly consumption data of freshwater products in each province, a safety risk assessment model for freshwater products is constructed, and the indicators in the safety risk assessment model are calculated so as to complete the construction of the self-built dataset, combined with the k-means algorithm to classify the weekly freshwater products in each province into risk levels and establish the risk-level space. Finally, based on the Transformer neural network model, the safety risk assessment indicators of freshwater products in each province are predicted based on the self-built dataset, and the freshwater products in that province for that week are classified into the corresponding risk-level space. The model proposed in this paper not only provides a systematic risk measure for the government, but it also provides a scientific reference basis for confirming the priority regulatory order in regulation, and it provides technical support for the government’s intervention in regulation.

2. Materials and Methods

2.1. Materials

2.1.1. Data Sources

The data of freshwater products in this study were obtained from the sampling data of the National Food Safety Administration from 2019 to 2021, covering 20 provinces, where freshwater products included freshwater fish, shrimps and crabs, with a total of 32,735 samples, including 11,164 samples in 2019, 11,439 samples in 2020 and 10,132 samples in 2021. The national food safety standard maximum residue limits for veterinary drugs in foods (hereinafter referred to as the standards) specify the limit indicators for veterinary drug residues in freshwater products, of which the limit for florfenicol is 1000 μg/kg, the limit for enrofloxacin is 100 μg/kg and the limit for of sulfonamides is 100 μg/kg. In addition, the standards also specify the allowable daily intake of veterinary drugs in freshwater products, including florfenicol for 3

μ g / (kg \cdot d)

, enrofloxacin for 6.2

μ g / (kg \cdot d)

and sulfonamides for 50

μ g / (kg \cdot d)

.

Data on the consumption of freshwater products by the population were obtained from the Fifth China Total Diet Study [17], which conducted a dietary questionnaire survey on the main food items consumed by residents of 20 Chinese provinces and estimated the consumption data using stratified multistage population-proportional whole-group random sampling.

2.1.2. Data Preprocessing

Referring to the principles of credible assessment of low-level contaminants in food proposed by GEMS/FOOD, when the number of non-detected samples was less than 60% of the overall sample size, all non-detected data were replaced by 1/2 of the limit of detection (LOD); when the number of non-detected samples was higher than 60% of the overall number of samples, all non-detected data were replaced by the LOD [18]. Since the sample data of non-detected veterinary drug residues in this study were much less than 60%, all non-detected data were assigned the 1/2 LOD value for statistical calculation in this paper.

2.2. Safety Risk Assessment Model for Freshwater Products

Considering that the object of evaluation in this study was a single food and multiple contaminants, according to the risk assessment method and the set model use, based on the main influencing factors of health risk caused by food contaminants, this paper selected the Nemerow Integrated Pollution Index (NIPI), Index of Food Safety (IFS) and Hazard Risk Factor (R) as the three evaluation indicators of the freshwater product safety risk assessment model.

2.2.1. Nemerow Integrated Pollution Index

The NIPI can reflect the characteristics of food contamination, taking into account the mean and maximum values of the single-factor contamination index, and it can highlight the role of the more contaminated pollutants, and it is often used to assess air [19,20], water environmental quality [21,22,23], heavy metal contamination in soil [24,25] and vegetables [26,27,28]. In this paper, the NIPI is used to calculate the integrated contamination index of veterinary drug residues in the sampled samples based on the sampling data of freshwater products in each province. The expression of the single factor contamination index is as follows:

P_{i, j} = \frac{C_{i, j}}{S_{j}}

(1)

where

P_{i, j}

is the single factor pollution index of

j

types of veterinary drug residues in freshwater products in province

i

;

C_{i, j}

is the detection value of

j

types of veterinary drug residues in freshwater products in province

i

(

μ g / kg

);

S_{j}

is the national limit standard for residues of

j

types of veterinary drug in freshwater products (

μ g / kg

).

P I_{i} = \sqrt{\frac{P_{m a x (i)}^{2} + P_{a v e (i)}^{2}}{2}}

(2)

where

P I_{i}

is the comprehensive pollution index of freshwater products in province

i

;

P_{m a x (i)}

denotes the maximum value of the single pollution index in province

i

;

P_{a v e (i)}

denotes the average value of the single pollution index in province

i

.

2.2.2. Index of Food Safety

The IFS is a method constructed by the International Codex Alimentarius Commission (CAC) and the World Health Organization (WHO) as an evaluation of the risk of exposure to food safety hazards and is often used to assess the risk of pesticide residues [29,30,31,32] and veterinary drug residues in meat, vegetables, fruits and edible mushrooms [33,34,35,36]. In this study, the IFS was used to assess the risk of exposure to hazards for veterinary drug residues in freshwater products. In addition, this paper used the point assessment model in the FAO/WHO Principles and Methods for Risk Assessment of Chemicals in Foods for dietary exposure assessment, with the following expressions:

E D I_{i, j}^{50} = \frac{F_{i}^{50} \times C_{a v g (i, j)}}{W}

(3)

where

E D I_{i, j}^{50}

is the average daily intake of

j

types of veterinary drug residues per kilogram of body weight of the population in province

i

through freshwater products (

μ g / kg \cdot bw \cdot d

);

F_{i}^{50}

is the average consumption of freshwater products in province

i

(

kg / d

);

C_{a v g (i, j)}

is the average detection value of

j

types of veterinary drugs residues in freshwater products in province

i

(

μ g / kg

);

W

is the target intake population average weight (

kg

), taken as 60 kg.

I F S_{i} = \sum_{j} \frac{E D I_{i, j}^{50} \times f}{S I_{j}}

(4)

where

I F S_{i}

is the food safety index of province

i

;

f

is the correction factor for safe intake, and the value of 1 is taken from the relevant literature [37];

S I_{j}

is the safe daily intake (

μ g / kg \cdot bw \cdot d

) of

j

types of veterinary drug residues in freshwater products, using ADI.

2.2.3. Hazard Risk Factor

The hazard risk factor takes into account the influence of the exceedance rate or positive detection rate of the hazard, the frequency of administration and its own sensitivity, and it provides an intuitive and comprehensive reflection of the risk level of the hazard over time, and it is used by researchers to assess the risk system of pesticide and veterinary drug residues in vegetables and other foods [38]. In this paper, the hazard risk factor is used to assess the hazard risk coefficient of veterinary drug residues in freshwater products with the following expression:

R_{i} = \sum_{j} (a H_{i, j} + \frac{b}{F_{i, j}} + S_{j})

(5)

where

R_{i}

is the hazard risk factor of freshwater products in province

i

, and

H_{i, j}

is the exceedance rate of

j

types of veterinary drug residues in freshwater products in province

i

.

F_{i, j}

is the frequency of the administration of

j

types of veterinary drug residues in freshwater products in province

i

. This target contaminant was a mandatory item; therefore,

F_{i, j}

was taken as 1;

S_{j}

is the sensitivity factor of the contaminant, which can be adjusted appropriately according to the sensitivity and importance of the current concern of the hazard in the food safety domain. In this study, the target contaminants were all normally administered hazards, and

S_{j}

was taken as 1;

a

and

b

are the corresponding weight coefficients, respectively, and in order to make the risk coefficient

R_{i}

reflect the effects of

H_{i, j}

and

F_{i, j}

in an accurate and balanced way,

a

is usually taken as 100 in practical applications, while

b

is taken as 0.1 [37].

2.3. Freshwater Product Safety Risk Classification Based on K-Means

To reduce the influence of a single factor in food safety risk assessment, this paper integrates NIPI, IFS and R indicators to construct a risk assessment model of veterinary drug residues in freshwater products. To eliminate the influence of subjective factors in risk grading, this paper uses a clustering algorithm to grade the safety risk of freshwater products in different provinces at different time periods, and it constructs the safety risk grading space based on the risk assessment model. Since the amount of data in this subject is small and there is no dirty data, the k-means [39] algorithm is fast and efficient and can achieve a good clustering performance on sample spaces of arbitrary shapes, which was suitable for analyzing the model data of this study, so the k-means algorithm was selected for food safety risk grading in this paper. The specific process of the algorithm is as follows:

(1): First, any k values are selected in the data set as the initial center of mass.
(2): Calculate the distances of each of the remaining points to these k centers of mass in turn, and divide each point into clusters with the center of mass that is closest to it.
(3): Obtain k clusters in this way, and then calculate the mean value of these k clusters as the new center of mass.
(4): Repeat the steps (2) and (3) until the cluster centers no longer change or the number of iterations is reached; then, the safety risk grading space construction is completed and the algorithm converges.

2.4. Transformer-Based Model for Predicting the Safety Risk Level of Freshwater Products

2.4.1. Freshwater Product Safety Risk-Level Prediction Process

In this study, after data cleaning, integration, transformation and normalization, there were 3180 sample points in the sample space, which is a small sample dataset. Deep learning algorithms, such as Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU) algorithms, are widely used in various industries for prediction and analysis, but a large number of datasets are required for training when building models. Therefore, in order to improve the accuracy of safety risk-level prediction, this paper constructed a Transformer-based freshwater product safety risk-level prediction model. The Transformer neural network model was improved to suit the application scenario in this paper, and the details are described in Section 2.4.2.

The Transformer-based freshwater product safety risk-level prediction model proposed in this paper is shown in Figure 1, and the model is divided into three layers, which are the data layer, Transformer prediction layer and risk-level prediction layer.

First, at the data layer, the sampling data and consumption data of veterinary drug residues of freshwater products in each province are used to construct freshwater product safety risk assessment indicators, including NIPI, IFS and R, and based on the above safety risk assessment indicators, the weekly freshwater products in each province are classified into a risk level by the k-means algorithm to construct the risk-level space and complete the experimental data set construction. We put each risk indicator at the moment of T into the Transformer prediction layer to wait for predicting each risk indicator at the moment of T + 1.

Second, in the Transformer prediction layer, this paper uses the Transformer [40,41,42,43] algorithm to predict each safety risk assessment indicator of veterinary drug residues in freshwater products in each province, using the multi-layer encoder–decoder mechanism in the Transformer, combined with multi-headed attention to improve the accuracy of prediction.

Finally, in the risk-level prediction layer, the predicted value of each safety risk assessment indicator at the moment of T + 1 is output from the full linkage layer, and the distance between this safety risk assessment indicator and the clustering center that has been divided is measured, and the freshwater product risk level of the province for that week is categorized into the cluster with the closest distance.

2.4.2. Transformer-Based Predictive Model for Freshwater Product Safety Risk Assessment Indicators

In this study, the food safety risk assessment indicators are the contamination indicators of freshwater products in each province over a period of time, which are time-series sequences, so the model needs to have the ability to model long-term memory. Therefore, this paper improves the Transformer structure according to the food safety risk assessment indicator prediction application scenario, as shown in Figure 2.

The core of the Transformer-based prediction model for food safety risk assessment metrics lies in its encoder and decoder structures, both of which consist of six identical layers stacked on top of each other, where each layer of the encoder contains two sub-layers of a multi-headed self-attention mechanism and feedforward neural network, and each layer of the decoder contains three sub-layers of a masked multi-headed self-attention mechanism, encoder–decoder multi-headed attention mechanism and feedforward neural network.

First, we construct the input food safety risk assessment indicator matrix, letting the time window be t and the current moment be T, and need to predict the safety risk assessment indicator at the moment of T + 1; then, we put the indicator matrix

X = [X_{T - t}, \dots, X_{T - 1}, X_{T}]

into the encoder. The Transformer benefits from a number of advantages thanks to its purely attentional mechanism construct, but this deprives it of the ability to learn sequence position information. In the food safety risk assessment indicators prediction scenario, the position information of the vectors in the matrix represents the moment information, which plays a crucial role in the assessment indicator prediction. To address this issue, a position-encoding operation was added to the input matrix of the encoder and decoder to integrate the position information into the input sequence, as shown in Equations (6) and (7).

P_{(p, 2 i)} = s i n (\frac{p}{10, 000^{\frac{2 i}{d_{m o d e l}}}})

(6)

P_{(p, 2 i + 1)} = \cos (\frac{p}{10, 000^{\frac{2 i}{d_{m o d e l}}}})

(7)

where

p

denotes the position of the indicator vector,

d_{m o d e l}

denotes the dimension of the indicator vector and the position of each indicator vector is encoded by the cosine and sine function of different frequencies.

The encoder is responsible for encoding the input evaluation indicator matrix and mapping it into an intermediate vector containing the input information, the core principle of which is the self-attention mechanism. The self-attention mechanism is a variation of the attention mechanism, which reduces the reliance on external information and is better at capturing the internal relevance of the data. Its purpose is to filter out a small amount of important information from the input evaluation indicator matrix and use weights to represent the importance of the information so that the model focuses on the more important information. The self-attentive mechanism uses scaled dot product attention to calculate the attention value of the indicator matrix by first performing dot product and SoftMax normalization on the query matrix and key matrix to calculate the weight coefficients and then weighting and summing the value matrix according to the weight coefficients, as shown in Equations (8)–(11).

A t t e n t i o n (Q, K, V) = s o f t m a x (\frac{Q K^{T}}{\sqrt{d}}) V

(8)

Q = X W^{Q}

(9)

K = X W^{K}

(10)

V = X W^{V}

(11)

where

Q

is the query matrix,

K

is the key matrix and

V

is the value matrix. These three matrices are obtained by multiplying the input indicators matrix

X

with the corresponding weight matrices

W^{Q}

,

W^{K}

and

W^{V}

, respectively, and

d

is the dimensionality of

Q

,

K

and

V

.

In order to synthesize the information contained in the input matrix, the self-attentive mechanism uses multi-headed self-attentive mechanisms to jointly focus on the information from different manifestation subspaces at different locations. The multi-headed self-attentive mechanism splices multiple self-attentive mechanisms and uses multiple self-attentive heads to learn the information from different performance subspaces separately, and then, it splices and linearly transforms multiple attention values to obtain the final attention values to achieve the modeling expression of different constraints, as shown in Equations (12) and (13).

h_{i} = A t t e n t i o n (X W_{i}^{Q}, X W_{i}^{K}, X W_{i}^{V})

(12)

M u l t i H e a d (Q, K, V) = C o n c a t (h_{1}, \dots, h_{m}) \cdot W

(13)

where

W_{i}^{Q}

,

W_{i}^{K}

and

W_{i}^{V}

are the weight matrices of the ith attention head

Q

,

K

and

V

.

W

is the multi-head attention weight matrix,

m

is the number of attention heads, and the Concat function is used to splice the output values calculated by each attention head.

The decoder is responsible for decoding the intermediate vector output from the encoder into the output sequence, and its core principles are the encoding–decoding multi-headed attention mechanism and the masking multi-headed self-attentiveness mechanism. In order to improve the accuracy of evaluation indicator prediction, in addition to learning the dependency between input feature sequences in the multi-headed self-attention mechanism, the dependency between input feature sequences and shift sequences should also be considered, so the encoding–decoding multi-headed attention mechanism is used in the decoder. The input of the blocking multi-headed self-attention mechanism module is the shift sequence, and its purpose is to use the multi-headed self-attention mechanism to learn the dependencies between the shift sequences and input the dependencies into the encoding-decoding multi-headed attention mechanism module, so that the whole Transformer-based food safety risk assessment index prediction model can comprehensively learn the dependencies between the input feature vectors and their mutual dependencies.

In order to solve the problem that increasing the depth of the network instead affects the prediction accuracy of food safety risk assessment indicators, a residual connection operation was added between each sub-layer of the encoder and decoder to focus on the change of the difference part before and after training and to improve the training effect. At the same time, in order to accelerate the network convergence and improve the network generalization ability, each sub-layer also adopts the layer normalization operation at the same time, as shown in Equation (14).

o = L a y e r N o r m (x + S u b l a y e r (x))

(14)

where

S u b l a y e r

is the individual attention mechanism layer processing function and the fully connected feedforward neural network processing function.

L a y e r N o r m

is the layer normalization processing function.

Finally, a layer of the fully connected network is added to the output to output the predicted indicators of the safety risk level at the next moment and participate in the construction of the input indicator matrix for the next time.

3. Results

3.1. Data Set and Experimental Environment

3.1.1. Data Set

In this paper, three food safety risk assessment indicators for freshwater products in each province will be predicted separately, and the total length of the time series of freshwater products in each province was 159 weeks in the experiment. The pre-processed data set will be divided into the training set and test set, and the ratio will be 6:4 according to the number of data entries.

3.1.2. Experimental Environment

In this paper, the open source PyTorch [44] deep learning framework was used as the experimental platform, and the specific parameters of the experimental environment are shown in Table 1.

3.2. Model Evaluation Indexes

The safety risk level of freshwater products is determined by the above three indicators together, so this paper evaluated the single performance of each of the three indicators and tested the accuracy of the predicted safety risk level.

3.2.1. Performance Evaluation Indexes for Single Indicator Prediction

In this paper, the Mean Absolute Percentage Error (MAPE) and Mean Squared Error (MSE) are used to evaluate the forecasting efficacy of NIDI, IFS and R in the proposed model. These two indexes are calculated as follows:

MAPE = \frac{1}{n} \sum_{i = 1}^{n} | (y_{i} - \hat{y_{i}}) / y_{i} |

(15)

MSE = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}

(16)

where

y_{i}

is the actual value of a single assessment indicator in week

i

, and

\hat{y_{i}}

is the predicted value of a single assessment indicator in week

i

.

3.2.2. Accuracy Assessment Indexes for Risk-Level Prediction

In this paper, three assessment indexes, precision rate, recall rate and F-measure value, are used to test the accuracy of risk-level prediction.

P = \frac{T P}{T P + F P}

(17)

In the precision rate Equation (17), TP represents the number of samples for which the model correctly predicts the risk level, and FP represents the number of samples for which the model predicts not that risk level as that risk level.

R = \frac{T P}{T P + F N}

(18)

In the recall Equation (18), TP represents the number of samples for which the model correctly predicts the risk level, and FN represents the number of samples for which the model predicts that risk level as other risk levels.

F - M e a s u r e = \frac{2 * P * R}{P + R}

(19)

To better evaluate the performance of the prediction model, this paper uses the F-Measure score as an evaluation criterion to measure the comprehensive performance of the model, as shown in Equation (19).

3.3. Freshwater Product Safety Risk Assessment and Classification

3.3.1. Freshwater Product Safety Risk Assessment Indicators

In order to comprehensively assess the hazards of veterinary drug residues in freshwater products, the weekly NIPI, IFS and R values in freshwater products in each province from 2019 to 2021 were calculated and obtained based on the hierarchical analysis, and the set of three food safety risk assessment indicators for three years in 20 provinces is shown in Figure 3, Figure 4 and Figure 5.

3.3.2. Risk Classification

According to Figure 3, Figure 4 and Figure 5, it can be seen that different safety risk assessment indicators of freshwater product refer to a large difference in the order of magnitude, in order to avoid the disparity in the number of indicators affecting the assessment effect; therefore, this paper used Z-Score data standardization for the three indicators, as shown in Equation (20). Where

I n d e x_{s c a l e}

denotes the standardized indicator,

I n d e x_{o r g}

denotes the original indicator,

m e a n_{o r g}

denotes the mean of the indicator and

s t d_{o r g}

denotes the standard deviation of the indicator.

I n d e x_{s c a l e} = \frac{I n d e x_{o r g} - m e a n_{o r g}}{s t d_{o r g}}

(20)

In this paper, the k-means algorithm is used for clustering, and NIPI, IFS and R are selected as clustering features. Figure 6 shows the line graph of the silhouette coefficient of the number of clusters from 2–7. It can be seen from the figure that the silhouette coefficient is the largest when the clustering result is three clusters and is much larger than the other clusters, indicating that the instances within the clusters are more compact and that the inter-cluster distance is larger when the clusters are three clusters. Therefore, this paper divides the risk level into three levels, and the normalized clustering center, risk classification and the number of samples in each level are shown in Table 2. The distance of the cluster center from the origin is calculated according to the normalized index, and the risk levels of categories 1–3 are defined as low–medium–high, respectively. From Table 2, it can be seen that the indicators of clustering centers increase sequentially with the increase of risk grades.

3.3.3. Analysis of Risk Grading Results

It can be analyzed from Figure 7, Figure 8 and Figure 9 that the distribution of actual indicators for different safety risk levels of freshwater products is as follows:

(1): Characteristics of category 1: NIPI intervals are relatively small, with intervals concentrated in 0; IFS concentrates in 0; R concentrates in 3.3.
(2): Characteristics of category 2: NIPI interval is relatively large, with interval distribution in 0–0.5; IFS concentrates in 0–1; R is distributed in 3.8–4.5.
(3): Characteristics of category 3: NIPI interval is the largest, with interval distribution in 1–4; IFS concentrates in 5–10; R is distributed in 3–5.
(4): Comparative analysis: the NIPI interval of category 1 is short, and the values are generally small; the IFS values are distributed in a range with a small average value and a small distribution, and the R values are small and stable, but the distribution density values of the indicators are large, which is a class of freshwater products with a small-risk level. The NIPI, IFS and R values of category 2 are at a medium level, which is a type of freshwater product at a medium-risk level. Category 3 has a large interval and relatively large value of NIPI, IFS is at a high level and the average value of R value is also large, and the distribution density of indicators is small, which is a freshwater product at a high-risk level and deserves key attention.

In the clustering results, we have found that the provinces with higher risk levels are mainly Guangdong, Guangxi, Hebei, Henan, Hubei, Hunan, Jiangxi, Shanghai, Sichuan and Zhejiang, and the seasons with higher risk levels are mainly concentrated in spring and summer. Because spring is a season of continuous warming, after a winter of frozen water, fish and other biological metabolites accumulate in the bottom of the pond water; at the same time, a large number of pollutants in the air with the snow are deposited on the ice of the fish pond and then into the water, resulting in a variety of toxic and harmful substances that may exist in the water, when the fish are poor, and it may be accompanied by a variety of diseases; the summer temperature is higher, but the water temperature is relatively suitable, and it is the peak season of fish growth, and the water is easy to breed and multiply various pathogens; it is the fish susceptible to disease season, but it is also the most difficult period of medication and management. At the same time, the data show that veterinary drug residues that exceed the standard occur mostly in the provinces with richer water resources but also in the large breeding provinces. Accordingly, it is necessary to impose supervision on key provinces and key months.

3.4. Prediction Results of Freshwater Product Safety Risk Level Based on a Transformer

In order to demonstrate the effectiveness of a Transformer-based model for predicting the safety risk level of freshwater products, RNN, LSTM and GRU prediction models, which are commonly used today, were selected and compared with the model proposed in this paper. Firstly, the weekly NIPI, IFS and R indicators of freshwater products in each province were predicted separately. The risk level of the province for that week was classified according to the predicted food safety risk assessment indicators. Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 show the three assessment indicators for each of the 20 provinces predicted by the model proposed in this paper, with a time step of 7. Since the effectiveness of risk-level prediction is directly determined by the results of indicator prediction, the following statistical analysis of the prediction results of three risk assessment indicators was conducted using MAPE and MSE evaluation indicators.

In Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14, the pink line indicates the actual set of values for the indicators, the blue line indicates the set of values that are used to train the model and the purple line indicates the results predicted by the indicators. Weeks 0–137 were the training set of the model, and weeks 138–159 were the testing set of the model. From the experimental results, it can be seen that most of the predicted curves matched with the actual curves, and very few values did not match with the actual predictions. It was found that there are two main reasons for this discrepancy between predictions and actuals; on the one hand, it is due to the change of the supply chain caused by unexpected events, and on the other hand, it is due to the strengthening of government regulation, which can reduce the pollution index of freshwater products.

Figure 15 and Figure 16 show the MAPE and MSE values of the three food safety risk assessment indicators of freshwater products predicted by the four models, respectively. From the experimental result plots, it can be seen that the prediction models proposed in this paper predicted the three indicators with the smallest MAPE and MSE values, which performed better than the other models. It is generally believed that the prediction accuracy is higher when MAPE is less than 10. The MAPE of the three indicators predicted by the Transformer proposed in this paper was less than 10, and the prediction effect was good, among which the MAPE of the R indicator was the smallest, at only 0.0156, while the MAPE of the NIPI indicator was the largest. Meanwhile, it could be seen that among the four models, the effect of RNN prediction of each indicator deviated more from the correct value and fluctuated more. LSTM and GRU also had a better prediction effect on IFS and R indicators, but the prediction deviated more from the correct value on the NIPI indicator.

After predicting the NIPI, IF, and R values for a single week in each province by the above four neural network models, the distance between these integrated assessment indicators and the three clustering centers was measured, and the risk level of that province for that week was classified into the nearest cluster to determine that risk level. The precision, recall and F-measure of the risk level predicted by the four models were tallied, as shown in Table 3.

The experimental results show that the Transformer-based prediction model was significantly better than the other three models in terms of recall rate. Since the recall rate was as complete as possible to predict all possible risks, this model can provide a good data basis for the government’s comprehensive attention and regulation. At the same time, the model also performed well in terms of precision, which can provide clues for the government to grasp the areas and foods that may have hidden risks. In addition, the F-Measure statistic shows that the model proposed in this paper had a good balance between recall and precision, which provides an effective tool for the government to focus on hierarchical regulation.

4. Discussion

Excessive veterinary drug residues have a major impact on food safety, not only endangering human health, but also affecting social development. China’s vast land resources and abundant products make it a waste of resources if all products are regulated universally throughout the year, and when food safety incidents break out and are then dealt with, they can cause serious public opinions and consequences. Therefore, it is important to focus on the supervision of freshwater products containing veterinary drug residues and early warning. This paper constructs a Transformer-based model for predicting the risk level of veterinary drug residues in freshwater products in China, which provides a systematic risk measurement method for the government and a scientific reference basis for confirming the priority of regulation in supervision, as well as technical support for government intervention. The experimental results show that the prediction model proposed in this paper predicts a recall rate of 94.14%, which can meet the needs of food regulators and reassure consumers while allowing producers to benefit from ensuring food safety and improving quality. The proposed approaches in the paper can combine other parameter estimation algorithms [45,46,47] to study the temporal prediction problems of nonlinear systems with different disturbances [48,49,50,51], and can be applied to other fields such as visual processing and engineering application systems [52,53,54].

Author Contributions

Conceptualization, T.J. and W.D.; methodology, Q.Z. and W.D.; software, T.L. and C.H.; validation, W.D. and T.J.; formal analysis, T.L. and W.D.; investigation, T.J. and T.L.; resources, Y.L.; data curation, C.H.; writing—original draft preparation, W.D.; writing—review and editing, T.L. and T.J.; visualization, T.L.; supervision, Q.Z.; project administration, T.J. and Q.Z.; funding acquisition, T.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Technology R&D Program of China, grant number 2019YFC1606401; the Beijing Natural Science Foundation, grant number 4202014; the Natural Science Foundation of China, grant number 61873027 and 62006008; the Humanity and Social Science Youth Foundation of Ministry of Education of China, grant number 20YJCZH229 and the Social Science Research Common Program of Beijing Municipal Commission of Education, grant number SM202010011013.

Data Availability Statement

Restrictions apply to the availability of these data. Data were obtained from the State Administration for Market Regulation Statistics and are available at [55] with the permission of the State Administration for Market Regulation Statistics.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhang, Z.; Hao, Q.; Zhou, X.Q.; Zhou, Z.G. Recent Reasech Progresses of Nutrition and Feed Science of Freshwater Fish in China. Chin. J. Anim. Nutr. 2020, 32, 4743–4764. [Google Scholar]
Tian, T.; Wen, J.H.; Zeng, X.L.; Huang, W.J. Research status and prospect of quality and safety risk monitoring and assessment of fresh aquatic products. J. Food Saf. Qual. 2019, 10, 8524–8530. [Google Scholar]
Zhu, Y.F. The Investigation of Fishery Drugs Usage in Fujian. Master’s Thesis, Jimei University, Xiamen, China, 21 June 2013. [Google Scholar]
Luo, Q.; Zhong, M.S.; Zhu, Q.L.; Fang, L.; Liu, W.J. Veterinary drug residues and edible health risk assessment of 3 kinds of cultured freshwater fish. J. Food Saf. Qual. 2020, 11, 8253–8259. [Google Scholar]
Yang, W. The influence of food safety on China’s foreign trade and Its Legal Countermeasures. Leg. Syst. Soc. 2015, 14, 77–78. [Google Scholar]
Zhang, H.; Chen, Q.; Niu, B. Risk Assessment of Veterinary Drug Residues in Meat Products. Curr. Drug Metab. 2020, 21, 779–789. [Google Scholar] [CrossRef] [PubMed]
Boobis, A.; Cerniglia, C.; Chicoine, A.; Fattori, V.; Lipp, M.; Reuss, R.; Verger, P.; Tritscher, A. Characterizing chronic and acute health risks of residues of veterinary drugs in food: Latest methodological developments by the joint FAO/WHO expert committee on food additives. Crit. Rev. Toxicol. 2017, 47, 885–899. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wei, K.L.; Zhou, X.L.; Yan, Q.L.; Tao, Y.X.; Wang, C. Risk assessment of pesticide residues in muskmelon in Xinjiang. Food Mach. 2019, 35, 90–95. [Google Scholar]
Kyriakides, D.; Lazaris, C.; Arsenoglou, K.; Emmanouil, M.; Kyriakides, O.; Kavantzas, N.; Panderi, L. Dietary Exposure Assessment of Veterinary Antibiotics in Pork Meat on Children and Adolescents in Cyprus. Foods 2020, 9, 1479. [Google Scholar] [CrossRef]
Wang, R.; Peng, C.; Gao, J.; Gao, Z.; Jiang, H. A dilated convolution network-based LSTM model for multi-step prediction of chaotic time series. Comput. Appl. Math. 2019, 39, 30. [Google Scholar] [CrossRef]
Madhukumar, N.; Eric, W.; Yifan, Z.; Xiang, W. Consensus Forecast of Rainfall Using Hybrid Climate Learning Model. IEEE Internet Things J. 2021, 8, 7270–7278. [Google Scholar] [CrossRef]
Bao, Z.Y.; Zhao, J.Y.; Huang, P.; Yong, S.S.; Wang, X.A. A Deep Learning-Based Electromagnetic Signal for Earthquake Magnitude Prediction. Sensors 2021, 21, 4434. [Google Scholar] [CrossRef] [PubMed]
Grünig, M.; Razavi, E.; Calanca, P.; Mazzi, D.; Wegner, J.D.; Pellissier, L. Applying deep neural networks to predict incidence and phenology of plant pests and diseases. Emerg. Technol. 2021, 12, e03791. [Google Scholar] [CrossRef]
Jiang, R.; Shen, J.; Li, X.; Gao, R.; Zhao, Q.; Su, Z. Detection and recognition of veterinary drug residues in beef using hyperspectral discrete wavelet transform and deep learning. Int. J. Agric. Biol. Eng. 2022, 15, 224–232. [Google Scholar] [CrossRef]
Wang, Z.Z.; Wu, Z.X.; Zou, M.K.; Wen, X.; Wang, Z.; Li, Y.Z.; Zhang, Q.C. A Voting-Based Ensemble Deep Learning Method Focused on Multi-Step Prediction of Food Safety Risk Levels: Applications in Hazard Analysis of Heavy Metals in Grain Processing Products. Foods 2022, 11, 823. [Google Scholar] [CrossRef]
Jiang, T.Q.; Liu, T.Q.; Dong, W.; Liu, Y.J.; Zhang, Q.C. Security Risk Level Prediction of Carbofuran Pesticide Residues in Chinese Vegetables Based on Deep Learning. Foods 2022, 11, 1061. [Google Scholar] [CrossRef]
Wu, Y.; Zhao, Y.; Li, J. Chapter Two: Food consumption data. In The Fifth China Total Diet Study; Luo, J., Yue, M., Eds.; Science Press: Beijing, China, 2018; pp. 66–69. [Google Scholar]
Wang, X.Q.; Wu, Y.N.; Cheng, J.S. Low level data processing in food contamination monitoring. Chin. J. Prev. Med. 2002, 4, 63–64. [Google Scholar]
Bekhet, H.A.; Yasmin, T. Exploring EKC, trends of growth patterns and air pollutants concentration level in Malaysia: A Nemerow Index Approach. In Proceedings of the 4th International Conference on Energy and Environment 2013, Putrajaya, Malaysia, 5–6 March 2013. [Google Scholar]
Wang, J.; Zhang, X.; Yang, Q.; Zhang, K.; Zheng, Y.; Zhou, G. Pollution characteristics of atmospheric dustfall and heavy metals in a typical inland heavy industry city in China. J. Environ. Sci. 2018, 71, 283–291. [Google Scholar] [CrossRef]
Miao, Q.; Gao, Y.; Liu, Z.; Tan, X. Application of Comprehensive Water Quality Identification Index in Water Quality Assessment of River. In Proceedings of the 2009 WRI Global Congress on Intelligent Systems, Xiamen, China, 19–21 May 2009. [Google Scholar]
Łukasik, M.; Dąbrowska, D. Groundwater quality testing in the area of municipal waste landfill sites in Dąbrowa Górnicza (southern Poland). Environ. Socio-Econ. Stud. 2022, 10, 13–21. [Google Scholar] [CrossRef]
Kong, M.; Zhong, H.; Wu, Y.; Liu, G.; Xu, Y.; Wang, G. Developing and validating intrinsic groundwater vulnerability maps in regions with limited data: A case study from Datong City in China using DRASTIC and Nemerow pollution indices. Environ. Earth Sci. 2019, 78, 262. [Google Scholar] [CrossRef]
Mazurek, R.; Kowalska, J.; Gąsiorek, M.; Zadrożny, P.; Józefowska, A.; Zaleski, T.; Kępka, W.; Tymczuk, M.; Orłowska, K. Assessment of heavy metals contamination in surface layers of Roztocze National Park forest soils(SE Poland)by indices of pollution. Chemosphere 2017, 168, 839–850. [Google Scholar] [CrossRef]
Han, W.; Gao, G.; Geng, J.; Li, Y.; Wang, Y. Ecological and health risks assessment and spatial distribution of residual heavy metals in the soil of an e-waste circular economy park in Tianjin, China. Chemosphere 2018, 197, 325–335. [Google Scholar] [CrossRef] [PubMed]
Li, R.Z.; Pan, C.R.; Xu, J.J.; Chen, J.; Jiang, Y.M. Contamination and Health Risk for Heavy Metals via Consumption of Vegetables Grown in Fragmentary Vegetable Plots from a Typical Nonferrous Metals Mine City. Environ. Sci. 2013, 34, 1076–1085. [Google Scholar]
Zhang, S.B.; Hu, B.X. Heavy Metal Element Pollution of Cultivated Vegetables in Leather Industrial Zone by ICP-AES with Nimerlo Composite Index. Food Sci. 2015, 36, 221–225. [Google Scholar]
Sawut, R.; Kasim, N.; Maihemuti, B.; Hu, L.; Abliz, A.; Abdujappar, A.; Kurban, M. Pollution characteristics and health risk assessment of heavy metals in the vegetable bases of northwest China. Sci. Total Environ. 2018, 642, 864–878. [Google Scholar] [CrossRef]
Hua, P.; Wang, B.H.; Li, C.; Li, Y.; Ha, X.J.; Jia, W.S.; Li, B.R.; Ma, Z.H. Potential health risk of pesticide residues in greenhouse vegetables under modern urban agriculture: A case study in Beijing, China. J. Food Compos. Anal. 2022, 105, 104222. [Google Scholar]
Wang, Z.T.; He, H.L.; Liu, S.Y.; Fan, J.C.; Ren, R. Risk assessment of pesticide residues in strawberry in Hangzhou region based on food safety indexes. Chin. J. Health Lab. 2021, 31, 2917–2920. [Google Scholar]
Luo, J.X.; Zhang, G.; Li, Y.Z.; Shen, Z.B.; Zhao, J.B.; Yang, H.; Duan, L.M.; Liu, Q.; Song, X.S.; Zhao, L.M.; et al. Pollution Characteristics and Dietary Exposure Risk Assessment of Grape Pesticides in Zhengzhou City. Asian J. Ecotoxicol. 2022, 5, 264–272. [Google Scholar]
Lan, S.S.; Lin, T.; Lin, X.; Yang, F.; Du, L.J.; Liu, H.C. Risk assessment of pesticide residues in edible mushrooms in Southwest China based on food safety indexes. Jiangsu J. Agric. Sci. 2014, 30, 199–204. [Google Scholar]
Guo, H.X.; Zhang, X.Q.; Wang, M.L.; Yu, C.K.; Wei, W.; Jiang, X.K.; Xiao, G.Y. Risk estimate of pork based on food safety indexes, Shandong. Mod. Prev. Med. 2019, 46, 1194–1198. [Google Scholar]
Dong, F.G.; Xu, J.J.; Wang, Z.X.; Gong, C.B.; Xing, Y.F.; Sun, Y.L. Dietary exposure risk assessment of forbidden drug and veterinary drug residues in animal-derived food, Yantai. Mod. Prev. Med. 2019, 46, 433–436. [Google Scholar]
Ma, Q.Q.; Lu, S.G.; Liu, H.L.; Zhang, J.; Zhang, E.P.; Zhai, Z.L.; Su, Y.H. Residues and dietary exposure risk assessment of 28 veterinary drugs in eggs in Henan province in 2021. Henan J. Prev. Med. 2022, 33, 108–111. [Google Scholar]
Pan, Z.H.; Qin, P. Risk evaluation of veterinary drug residues in aquatic products of Guangxi by food safety index method. J. Food Saf. Qual. 2020, 11, 4926–4932. [Google Scholar]
Jin, Z.Y.; Xu, C.L.; Xie, Z.J. Introduction to Food Safety, 1st ed.; Chemical Industry Press: Beijing, China, 2005; pp. 255–257. [Google Scholar]
Qin, G.; Chen, Y.; He, F.; Yang, B.; Zou, K.; Shen, N.; Zuo, B.; Liu, R.; Zhang, W.; Li, Y. Risk assessment of fungicide pesticide residues in vegetables and fruits in the mid-western region of China. J. Food Compos. Anal. 2021, 95, 103663. [Google Scholar] [CrossRef]
Hartigan, J.A.; Wong, M.A. A K-Means Clustering Algorithm. J. R. Stat. Soc. Ser. C 1979, 28, 100–108. [Google Scholar]
Li, S.Y.; Jin, X.Y.; Xuan, Y.; Zhou, X.Y.; Chen, W.H.; Wang, Y.X.; Yan, X.F. Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 14 August 2019. [Google Scholar]
Li, P.; Zhong, P.; Zhang, J.; Mao, K. Convolutional Transformer with Sentiment-aware Attention for Sentiment Analysis. In Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK, 19–24 July 2020. [Google Scholar]
Yang, B.; Wang, L.; Wong, D.; Chao, L.S.; Tu, Z. Convolutional self-attention networks. Hum. Lang. Technol. 2019, 1, 4040–4045. [Google Scholar]
Ren, H.; Dai, H.; Dai, Z.; Yang, M.; Leskovec, J.; Schuurmans, D.; Dai, B. Combiner: Full attention transformer with sparse computation cost. Adv. Neural Inf. Process. Syst. 2021, 34, 1049–5258. [Google Scholar]
PyTorch. Available online: https://pytorch.org/ (accessed on 2 May 2022).
Kong, J.L.; Yang, C.C.; Wang, J.L.; Wang, X.; Zuo, M.; Jin, X.; Lin, S. Deep-stacking network approach by multisource data mining for hazardous risk identification in IoT-based intelligent food management systems. Comput. Intell. Neurosci. 2021, 1194565. [Google Scholar] [CrossRef]
Zheng, Y.Y.; Kong, J.L.; Jin, X.B.; Wang, X.Y.; Su, T.L.; Zuo, M. Crop Deep: The crop vision dataset for deep-learning-based classification and detection in precision agriculture. Sensors 2019, 19, 1058. [Google Scholar] [CrossRef] [Green Version]
Jin, X.B.; Zheng, W.Z.; Kong, J.L.; Wang, X.Y.; Bai, Y.T.; Su, T.L.; Lin, S. Deep-Learning Forecasting Method for Electric Power Load via Attention-Based Encoder-Decoder with Bayesian Optimization. Energies 2021, 14, 1596. [Google Scholar] [CrossRef]
Jin, X.B.; Zheng, W.Z.; Kong, J.L.; Wang, X.Y.; Zuo, M.; Zhang, Q.C.; Lin, S. Deep-Learning Temporal Predictor via Bidirectional Self-Attentive Encoder–Decoder Framework for IOT-Based Environmental Sensing in Intelligent Greenhouse. Agriculture 2021, 11, 802. [Google Scholar] [CrossRef]
Jin, X.B.; Gong, W.T.; Kong, J.L.; Bai, Y.T.; Su, T.L. PFVAE: A planar flow-based variational auto-encoder prediction model for time series data. Mathematics 2022, 10, 610. [Google Scholar] [CrossRef]
Jin, X.B.; Gong, W.T.; Kong, J.L.; Bai, Y.T.; Su, T.L. A variational Bayesian deep network with data self-screening layer for massive time-series data forecasting. Entropy 2022, 24, 355. [Google Scholar] [CrossRef] [PubMed]
Jin, X.B.; Zhang, J.S.; Kong, J.L.; Su, T.L.; Bai, Y.T. A reversible automatic selection normalization (RASN) deep network for predicting in the smart agriculture system. Agronomy 2022, 12, 591. [Google Scholar] [CrossRef]
Kong, J.; Yang, C.; Xiao, Y.; Lin, S.; Ma, K.; Zhu, Q. A Graph-Related High-Order Neural Network Architecture via Feature Aggregation Enhancement for Identification Application of Diseases and Pests. Comput. Intell. Neurosci. 2022, 4391491. [Google Scholar] [CrossRef]
Kong, J.; Wang, H.; Yang, C.; Jin, X.; Zuo, M.; Zhang, X. A Spatial Feature-Enhanced Attention Neural Network with High-Order Pooling Representation for Application in Pest and Disease Recognition. Agriculture 2022, 12, 500. [Google Scholar] [CrossRef]
Kong, J.; Wang, H.; Wang, X.; Jin, X.; Fang, X.; Lin, S. Multi-stream hybrid architecture based on cross-level fusion strategy for fine-grained crop species recognition in precision agriculture. Comput. Electron. Agric. 2021, 185, 106134. [Google Scholar] [CrossRef]
State Administration of Market Reguiation. Available online: http://spcj.gsxt.gov.cn (accessed on 21 October 2021).

Figure 1. Transformer-based model for predicting the safety risk level of freshwater products.

Figure 2. Transformer-based predictive model for freshwater product safety risk assessment indicators.

Figure 3. Collection of weekly NIPI indicators for freshwater products in 20 provinces from 2019 to 2021.

Figure 4. Collection of weekly IFS indicators for freshwater products in 20 provinces from 2019 to 2021.

Figure 5. Collection of weekly R indicators for freshwater products in 20 provinces from 2019 to 2021.

Figure 6. Silhouette coefficients of five types of clustering category.

Figure 7. Probability density function plot for category 1.

Figure 8. Probability density function plot for category 2.

Figure 9. Probability density function plot for category 3.

Figure 10. Predicted results of NIPI, IFS and R indicators in Beijing, Fujian, Guangdong and Guangxi.

Figure 11. Predicted results of NIPI, IFS and R indicators in Hebei, Henan, Heilongjiang and Hubei.

Figure 12. Predicted results of NIPI, IFS and R indicators in Hunan, Jilin, Jiangsu and Jiangxi.

Figure 13. Predicted results of NIPI, IFS and R indicators in Liaoning, Inner Mongolia, Ningxia and Qinghai.

Figure 14. Predicted results of NIPI, IFS and R indicators in Shaanxi, Shanghai, Sichuan and Zhejiang.

Figure 15. MAPE for NIPI, IFS and R indicators.

Figure 16. MSE for NIPI, IFS and R indicators.

Table 1. Experimental environment parameters.

Operating System	Windows 10	64-bit
Hardware information	CPU	Intel CORE [email protected] eight-core
	GPU	Nvidia GeForce RTX3060
	RAM	16 GB
Software tool	Python 3.7.11	Numpy 1.18.5
		Pandas 1.2.2
		Torch 1.11.0
		Matplotlib 3.3.3

Table 2. Clustering centers and ranking of the three clusters.

Category	NIPI	IFS	R	Sample Size	Risk Level
1	−0.139858	−0.129344	−0.193611	2997	Low
2	0.990525	0.612015	2.905360	133	Medium
3	5.459207	5.677968	3.782895	50	High

Table 3. Statistical results of the accuracy of risk-level prediction.

Model	Index-Data
Model	P%	R%	F-Measure%
RNN	80.16	80.29	80.22
LSTM	82.46	82.77	82.61
GRU	87.94	88.31	88.12
Transformer	93.73	94.14	93.93

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jiang, T.; Liu, T.; Dong, W.; Liu, Y.; Hao, C.; Zhang, Q. Prediction of Safety Risk Levels of Veterinary Drug Residues in Freshwater Products in China Based on Transformer. Foods 2022, 11, 1690. https://doi.org/10.3390/foods11121690

AMA Style

Jiang T, Liu T, Dong W, Liu Y, Hao C, Zhang Q. Prediction of Safety Risk Levels of Veterinary Drug Residues in Freshwater Products in China Based on Transformer. Foods. 2022; 11(12):1690. https://doi.org/10.3390/foods11121690

Chicago/Turabian Style

Jiang, Tongqiang, Tianqi Liu, Wei Dong, Yingjie Liu, Cheng Hao, and Qingchuan Zhang. 2022. "Prediction of Safety Risk Levels of Veterinary Drug Residues in Freshwater Products in China Based on Transformer" Foods 11, no. 12: 1690. https://doi.org/10.3390/foods11121690

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Prediction of Safety Risk Levels of Veterinary Drug Residues in Freshwater Products in China Based on Transformer

Abstract

1. Introduction

2. Materials and Methods

2.1. Materials

2.1.1. Data Sources

2.1.2. Data Preprocessing

2.2. Safety Risk Assessment Model for Freshwater Products

2.2.1. Nemerow Integrated Pollution Index

2.2.2. Index of Food Safety

2.2.3. Hazard Risk Factor

2.3. Freshwater Product Safety Risk Classification Based on K-Means

2.4. Transformer-Based Model for Predicting the Safety Risk Level of Freshwater Products

2.4.1. Freshwater Product Safety Risk-Level Prediction Process

2.4.2. Transformer-Based Predictive Model for Freshwater Product Safety Risk Assessment Indicators

3. Results

3.1. Data Set and Experimental Environment

3.1.1. Data Set

3.1.2. Experimental Environment

3.2. Model Evaluation Indexes

3.2.1. Performance Evaluation Indexes for Single Indicator Prediction

3.2.2. Accuracy Assessment Indexes for Risk-Level Prediction

3.3. Freshwater Product Safety Risk Assessment and Classification

3.3.1. Freshwater Product Safety Risk Assessment Indicators

3.3.2. Risk Classification

3.3.3. Analysis of Risk Grading Results

3.4. Prediction Results of Freshwater Product Safety Risk Level Based on a Transformer

4. Discussion

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI