1. Introduction
The cultivation of Chinese herbal medicine is the cornerstone of the industrialization of traditional Chinese medicine [
1], and the authenticity of herbs is a crucial indicator of the level of development of traditional medicine. “Authentic” herbs have been clinically used in Chinese medicine for an extended period and are cultivated in a specific region. These herbs are known for their superior quality and efficacy, and they exhibit consistent quality compared to those produced in other regions. Recently, the environmental impact and conservation and development of authentic herbs have emerged as global concerns [
2]. The geographical variation of authentic herbs and the increasing deterioration of the environment have resulted in a significant decline in authentic herbs. As a result, there has been a sharp decrease in the production and quality of authentic herbs each year. These factors pose a significant obstacle to the sustainable development of authentic herbs [
3].
Saposhnikovia divaricata is a local herb indigenous to Northeast China [
4]. The dried root of the plant, which belongs to the family Umbelliferae, is used to treat various external symptoms, including wind rash and itching, rheumatic paralysis, and pain [
5,
6,
7]. However, identifying traits in the complex variety of
Saposhnikovia divaricata is difficult, resulting in many confused products and inaccurate cultivation practices [
8,
9,
10]. This lack of precision in cultivation leads to the uneven quality of the herbs, confusing both production and medicinal use [
11]. Currently, the selection of herbs is still made through manual observation. For minority herbs like
Saposhnikovia divaricata, the identification process still heavily relies on manual observation compared to more commonly known herbs. This traditional method is time-consuming and subject to the subjective judgment of professionals, resulting in a relatively high margin of error.
Deep learning is a branch of artificial intelligence that utilizes the properties of the human brain to analyze and recognize objects through a multilayer neural network. This approach enables computers to learn and identify various features of objects, including their appearance, sound, and other characteristics. The application of deep learning in the classification and recognition of plants has already yielded many promising results. Krizhevsky et al. [
12] developed the AlexNet network for fine-grained image recognition in a large visualization database, known as the ImageNet project, which has been widely used in research related to visual object recognition software Over the last decade, the field of deep learning has seen an influx of talented scholars who have proposed classical convolutional neural networks, including VGGNet [
13], GoogLeNet [
14], ResNet [
15], and DenseNet [
16]. These networks have been widely used in image classification tasks, including plant classification [
17,
18]. For instance, Liu et al. [
19] utilized GoogLeNet to classify complex background images of 50 Chinese herbal medicines under natural conditions. This model has significant application value as it can be effectively used for fine-grained recognition of Chinese herbs, leading to improved accuracy and stability of recognition. Gui Yue [
20] achieved an impressive accuracy of 95.62% by utilizing ResNet-50 in combination with an attention mechanism and cross-layer bilinear pooling algorithm to classify and recognize 12 plant images. Pei et al. [
21] achieved a remarkable accuracy of 99.26% by utilizing ResNet18 as the base model, replacing the fully-connected layer with a convolutional layer, and incorporating a mixed-domain attention mechanism to classify through the Softmax layer. This model was utilized to classify 1360 images into 17 categories of flowers by building a multilayer convolutional neural network and using data enhancement and stochastic gradient descent techniques. Yanjiao Liu [
22] achieved an impressive correct classification rate of 94.7% and 86.7% for ten succulent images and nine Saxifrage images, respectively. This was accomplished by leveraging the AlexNet deep model and migration learning technology, utilizing 15,336 images. The trained models could classify succulent and lithophytic images with an accuracy rate of 94.7% and 86.7%, respectively, with an average recognition time of approximately 6 s per image. Yi Zhao [
23] and the team were able to classify and recognize 12 different weeds with an impressive accuracy rate of 95.47% by utilizing the VGG16 convolutional neural network model.
Ting Zhou [
24] proposed an innovative ResNet-based ethnomedicinal plant image recognition method to classify eight common Tibetan medicines. The method involved an improvement strategy in network structure and loss function, such as replacing the first layer of a convolutional kernel of ResNet34 with a cascaded small convolutional kernel, adding a nonlinear activation layer, utilizing self-adaptive normalization method, Dropout, and optimizing Focal loss function instead of the cross-entropy loss function. Ultimately, this method achieved an accuracy rate of 91.18%.
This paper presents a new convolutional neural network model for the origin identification of authentic herbs by improving and analyzing the capabilities of the IResNet network proposed by C. Duta [
25] and others. The new model incorporates the hierarchical residual connection block to enhance the feature extraction and network representation abilities of the IResNet. By leveraging a small number of herbal samples and embedding the hierarchical residual connection block, the model expands and strengthens the neural network’s perception field to extract scale features. Additionally, the model incorporates deep separable convolution operations in the later stages of the network to reduce time costs and further enhance feature extraction capabilities. The research not only focuses on the classification of
Saposhnikovia divaricata but also deepens existing studies’ theoretical and practical foundations, providing a reference for future applications in mobile fields.
5. Conclusions
This study aims to enhance the original model of the IResNet network by incorporating the concept of lightweight through the use of deep separable convolution operation (DS conv) instead of the traditional convolution operation. This approach reduces the number of parameters and inference time of the model while enhancing the network’s refinement extraction ability and overall recognition capacity. Additionally, the study embeds the layered residual connection module (HRC block) to enhance the multi-scale feature extraction ability of the network. This integration further expands the scale feature extraction of the network model and strengthens the network’s expression capacity. Combining the features of these two distinct modules and conducting experiments on a dataset from five production areas demonstrate that the proposed method’s accuracy is as high as 93.72%. Compared with the original network model, the proposed model in this paper’s accuracy rate is improved. In contrast, the arithmetic power, and the number of parameters of the model are significantly reduced, the number of transmitted frames is increased, and the model’s performance is greatly improved. By comparing five network models often used for image classification, ResNet50 Resnest50, Res2net50, RepVggNet_B0 and ConvNext_T, the network model proposed in this paper is significantly better than the other network models in terms of performance, and the model complexity is lower, and the requirement for equipment is lower.
To revolutionize the conventional manual collection method for identifying Saposhnikovia divaricata, future research aims to introduce a lightweight herb identification model that can be integrated with mobile devices, enabling real-time classification of Chinese herbs using intelligent devices. This will enhance the accuracy and portability of Chinese medicine collection and identification and contribute to geographical analysis in Chinese medicine.