1. Introduction
Litchi (
Litchi chinensis Sonn) is an evergreen tree belonging to the Sapindaceae family. China is the country with the widest cultivation area and the largest yield of
Litchi in the world.
Litchi is rich in nutrients and has a delicious taste. As a cash crop tree species, it is widely planted in South China. In China’s mainland, the total output value of
Litchi was 33.389 billion yuan in 2022 according to the acquisition price [
1]. Therefore, promoting the development of the
Litchi planting industry is beneficial to boosting the economic benefits of farmers and meeting the market’s demands for scaled and high-quality
Litchis [
2,
3]. The climate in southern China is usually hot and humid, making
Litchi susceptible to various pest and disease problems.
Litchi leaf disease and pest problems can result in a decrease in
Litchi production and a decline in fruit quality. These issues significantly obstruct the steady development of the
Litchi industry [
4]. The key to solving this problem is to detect, identify, and provide feedback on the disease and insect pests at the early stages of their occurrence. Therefore, timely and efficient detection of
Litchi leaf diseases and insect pests is an important measure to ensure the yield of
Litchi. Traditional artificial recognition is greatly influenced by subjective factors and may not be able to successfully diagnose specific diseases, leading to erroneous conclusions and treatment [
5,
6]. With the rapid popularization of computer technology, it is possible to quickly identify and detect pests and diseases, greatly promoting the development of intelligent agriculture [
7].
In early agricultural pest detection research, traditional image processing methods using support vector machine (SVM) [
8], K-nearest neighbor (KNN) [
9], and other algorithms to detect target pests [
10,
11,
12] were used for agricultural pest detection. Liu et al. [
13] used the histogram of oriented gradients (HOG) feature extraction method to extract features from wheat aphid images in the wheat field, and then used SVM classifiers to identify and locate wheat aphids. Sabro et al. [
14] used the Otsu threshold method to segment tomato leaf diseases and insect pests, manually extracting the shape, color, texture, and other characteristics of the disease, and then using a decision tree to classify tomato leaf diseases and insect pests. Pydipati et al. [
15] used the color co-occurrence method (CCM) to determine whether texture-based hue, saturation, and intensity (HSI) color features in conjunction with statistical classification algorithms could be used to identify diseased and normal citrus leaves under laboratory conditions. Traditional pest detection methods have achieved some results, but there are still problems such as low efficiency, poor segmentation results, and low recognition rates in complex backgrounds. Therefore, they cannot meet the need for the real-time and accurate detection of
Litchi leaf diseases and pests in complex field environments [
16].
In recent years, significant progress has been made in image recognition technology based on convolutional neural networks [
17,
18]. Target detection is a branch of image recognition based on convolutional neural network algorithms. A large number of researchers have applied target detection technology to research crop disease and pest detection, making significant breakthroughs [
19,
20,
21]. Zhang et al. [
22] proposed an improved YOLOv5 network that combines DenseNet, attention mechanism, and Bi-FPN to accurately detect unopened cotton bolls in the field at a lower cost. Liu et al. [
23] proposed an improved convolutional neural network and a PestNet algorithm with a modular channel attention mechanism to identify 16 pests. Experiments showed that the average accuracy rate reached 75.46%. Dai et al. [
24] proposed an improved YOLOv5m method, introducing the Swin Transformer mechanism into the YOLOv5m network to capture more global features and increase receptive fields, enabling more accurate detection of different pests from the dataset. Zhang et al. [
25] used the attention method of dynamic mechanism fusion to improve DenseNet and proposed the Dense Channel and Position Self-Attention Fusion Network model, with a high recognition accuracy of 96.90% for six types of navel orange diseases and pests. Liu et al. [
26] developed a tomato pest recognition algorithm based on the improved YOLOv4 fusion triple attention mechanism (YOLOv4-TAM), which solves the problem of uneven positive and negative sample numbers in the recognition system and improves the accuracy in identifying tomato pests and diseases.
Research on the intelligent recognition of disease and insect images based on convolutional neural networks has made steady progress in recent years. However, due to the wide variety of diseases and insect pests in
Litchi and the complex image background, two types of diseases and insect pests,
Mayetiola sp. and
Litchi algal spot, are characterized by small and dense pest features, which are likely to cause false detection and missed detection of disease and insect targets, making detection difficult [
27,
28]. The existing neural network algorithms have high computational complexity, large parameter quantities, and low detection accuracy for small target diseases and insect pests, such as
Mayetiola sp. and
Litchi algal spot. There is still room for improvement in real-time detection and small target disease and insect pest identification. To address the issues of low accuracy and large model parameters in existing methods for detecting
Litchi leaf diseases and pests, this paper proposes an FCOS-FL model for detecting
Litchi leaf diseases and pests based on the images of five types of
Litchi leaf diseases and pests collected from orchards. The proposed model achieves efficient and accurate recognition of the five common types of
Litchi leaf diseases and pests, with a focus on solving the low detection accuracy of
Mayetiola sp. and
Litchi algal spot. This work provides a reference for the real-time and accurate detection of
Litchi leaf diseases and pests and offers an effective technical support for the prevention and control of
Litchi leaf diseases and pests. The innovation points of this study include:
- (1)
Achieving a lightweight model by replacing the backbone network.
- (2)
Enhancing the features of lychee pests and diseases by adding attention mechanisms.
- (3)
Improving the generalization of the model by improving the central sampling and central measurement of the model.
- (4)
Improving the positioning accuracy of the model by improving the loss function.
4. Discussion
Litchi diseases and pests can reduce yield, affect the quality of
Litchi, and reduce the benefits of farmers. However, there are many types of
Litchi diseases and pests, and their characteristics are small, making detection difficult. Currently, the main method for detecting
Litchi diseases and pests in orchards is still visual identification, but deep learning target detection methods have been widely applied to
Litchi detection [
40,
41,
42,
43]. However, research on the application of deep learning target detection methods to
Litchi diseases’ and pests’ detection is relatively scarce. In view of these problems, this paper further optimized the model structure based on the advantages of existing research at the network design level, and proposed a strong targeted model optimization method for the characteristics of the five
Litchi leaf diseases and pests. The FCOS-FL model was experimentally proven to be superior in terms of accuracy and lightweight performance compared to the pre-improved model on the self-built
Litchi leaf diseases and pests dataset. This method can meet the real-time detection needs of
Litchi leaf diseases and pests, and the model can be deployed on embedded resource-limited devices such as mobile terminals to achieve real-time recognition of
Litchi leaf diseases and pests, providing a new solution for the prevention and control of
Litchi diseases and pests. The overall experimental conclusions are as follows:
- (1)
Regarding the lack of research on deep learning in Litchi diseases and pests and the low detection accuracy of the existing FCOS model in detecting Litchi leaf diseases and pests, the FCOS-FL model proposed in this paper adds an attention mechanism and redesigns the network structure to achieve an average detection accuracy of 91.3% on the detection of five types of Litchi leaf diseases and pests under different natural environments on the AP50, which is 4.2% higher than the detection accuracy of the original FCOS model. Among them, the detection accuracy of the small target pests with high detection difficulty, Mayetiola sp. and Litchi algal spot, reached 93.2% and 92.0%, respectively. It proves that the detection accuracy of the proposed FCOS-FL model is sufficient for detecting five types of Litchi leaf diseases and pests.
- (2)
To solve the problem of the large parameter size of the original FCOS model, G-GhostNet-3.2 was used as the backbone network to achieve a model that was lightweight. The parameter size of the FCOS-FL model proposed in this paper is only 17.65 M, which is 45% smaller than the parameter size of the original FCOS model. The single-image detection speed is 62.0 ms, which proves that the proposed model is suitable for deployment on embedded resource-limited devices such as mobile terminals for real-time and fast recognition and detection of Litchi diseases and pests.
- (3)
The proposed FCOS-FL model outperforms other commonly used models, including Faster R-CNN, RetinaNet, VFNet, and the original FCOS model, in terms of detection accuracy and model parameter size. These results show that the FCOS-FL model has higher accuracy and smaller model parameters, providing more reliable support for real-time and accurate detection of the five types of Litchi diseases and pests.
- (4)
In summary, the Litchi diseases and pests detection model FCOS-FL proposed in this paper can meet the real-time recognition of Litchi diseases and pests in practical applications. The recognition effect and model lightweight degree perform well on the self-built Litchi diseases and pests dataset, which can provide a reference for the prevention and control of Litchi diseases and pests. Compared with other Litchi diseases and pests detection studies, the model and optimization methods used in this study are more in line with the disease characteristics, and the constructed dataset has more complete types of diseases. However, there are still shortcomings in this study. The dataset collected in this study still has room for improvement, and future research should collect more varieties and full-cycle Litchi images to supplement the dataset. There are still a small number of missed detections during the detection process, and further optimization of the detection model should be performed to improve its accuracy. Future work should consider deploying the model to edge computing devices for field applications.