CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment

Chen, Huizhen; Liu, Shuning; Liu, Rongkai; Shi, Heyuan; Hu, Chao; Shi, Ronghua

doi:10.3390/app14135393

Open AccessArticle

CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment

by

Huizhen Chen

^1,2,†,

Shuning Liu

^2,†,

Rongkai Liu

²,

Heyuan Shi

¹

,

Chao Hu

^1,* and

Ronghua Shi

¹

School of Electronic Information, Central South University, Changsha 410083, China

²

School of Computer Science and Engineering, Central South University, Changsha 410083, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Appl. Sci. 2024, 14(13), 5393; https://doi.org/10.3390/app14135393

Submission received: 27 April 2024 / Revised: 7 June 2024 / Accepted: 14 June 2024 / Published: 21 June 2024

(This article belongs to the Section Ecology Science and Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

The carrier is one of the key components used in wastewater treatment, which can enrich microorganisms at the surface to improve the amount of biomass in the reactor. Monitoring and adjusting the number of carriers is a key component for the processing efficiency of the ecosystem, which directly impacts wastewater treatment effectiveness. Therefore, carrier detection in wastewater microscopic images is important in the urban domestic wastewater treatment process. The current process to detect carriers is operator-dependent, which is time-consuming and expensive. Though a number of general object or cell detection approaches are used for this task, their effectiveness is limited because the carrier and background are similar and there are defective carriers as background noise. In this paper, we propose CarrDet, the first deep learning-based carrier detection framework for wastewater treatment. CarrDet uses a carrier feature block attention module and a symmetry-based defective carrier detection module to detect carriers with shallow edges and reduce false positives caused by defective carriers, respectively. To evaluate CarrDet, we propose a carrier dataset of 600 wastewater microscopic images, manually annotated by experts. Compared with state-of-the-art object detection methods, CarrDet shows superior performance in terms of both accuracy and speed, achieving a mAP of 94.32 and an IPS of 4.93. We employed CarrDet to confirm the detection results of 621 wastewater microscopic images, which were detected by inexperienced engineers who are new to the field. CarrDet added 398 unrecognized carriers with shallow edges and corrected 273 incorrect manual annotations in 5 min, which emphasizes the efficiency and practicality of CarrDet for practical business scenarios.

Keywords:

carrier detection; microscopic images; wastewater treatment; attention mechanism; deep learning

1. Introduction

A carrier is typically an attachable growth structure that can enrich microorganisms, such as cells and bacteria, at the surface to increase the amount of biomass in a bioreactor [1]. Adjusting and controlling the number of carriers can maintain an appropriate and effective microbial living environment [2]. The microbial system plays a vital role in wastewater treatment (WWT), as it degrades, transforms, and eliminates organic pollutants from wastewater. The accurate detection and quantification of carriers in wastewater maximizes the processing efficiency of the microbial system [3] in WWT and ensures that the effluent water quality complies with the set standards [2]. Carrier detection in microscopic images is a significant step in urban domestic WWT [4]. Manual detection methods are known to be time-consuming and subjective [5]. As a result, there has been considerable interest in deep learning-based approaches for carrier detection in wastewater microscopic images.

Though there is no method proposed for carrier detection specifically, this task can be approached by general object detection and cell detection methods. General object detection methods, e.g., YOLO [6], SSD [7], and RCNN [8], can be fine-tuned for carrier detection, using the knowledge gained from general object detection tasks. Cell detection methods [9,10] have refined the baseline, such as ResNet [11], with domain-specific knowledge from cell microscopic images.

However, because of the morphological features of carrier microscopic images, the performance of these methods is limited when they are applied to detect carriers in wastewater microscopic images directly. Carriers closely resemble the background in color, which leads to unrecognized carriers with shallow edges (Figure 1a). Moreover, there is a significant presence of defective carriers acting as background noise, which leads to false positives caused by defective carriers (Figure 1b). Inaccurate carrier detection results can lead to incorrect assessments of carrier quantity by operators, which in turn causes erroneous judgments about the current wastewater conditions. This subsequently leads to inappropriate adjustments in operational procedures, ultimately affecting the effectiveness of water purification. As a result, there is an urgent need for a dedicated approach that can effectively address these challenges in order to enhance the accuracy of carrier detection in microscopic images. This will assist operators in making correct adjustments to operations, ultimately leading to the improvement of the WWT process.

In order to tackle these challenges, we propose Attention-Symmetry-based Microscopic Carrier Detection (CarrDet), the first carrier detector of wastewater microscopic images for WWT. CarrDet mainly contains two modules: a carrier feature block attention module (CFBA) and a symmetry-based defective carrier detection module (SDCD). Specifically, we first employ the feature extractor to extract the carrier feature from the microscopic images. Subsequently, CFBA enhances the global perception of the feature through the integration of channel attention and spatial attention mechanisms. This module enlarges the contrast between the background and carrier edges and emphasizes edge features. Consequently, it effectively distinguishes unrecognized carriers with shallow edges from the background, thereby facilitating detection of them. Lastly, SDCD capitalizes on the symmetry of circular carriers to optimize the candidate frames produced by the detection head. By flipping the detection results and evaluating the pairwise similarity of corresponding positions, this approach checks if the results are symmetrical and circular. Thus, this approach reduces false positives caused by defective carriers, effectively filtering out background noise present in the images.

The main contributions of this paper are summarized as follows:

We are the first to propose a carrier detection method specifically for wastewater microscopic images.
We design a carrier feature block attention module and a symmetry-based defective carrier detection module to effectively reduce unrecognized results and false positives of carrier detection in wastewater microscopic images.
We propose a carrier dataset called Carrier600 with high-accuracy annotations for evaluation. The experimental results demonstrate the effectiveness of our proposed framework and its practical applicability in real industry scenarios.

The rest of this paper is organized as follows: Section 2 provides background knowledge of carriers for wastewater treatment and related work of object detection in microscopic images. Section 3 presents the framework of CarrDet and details each component. In Section 4, we introduce the dataset, experimental settings, compared SOTA methods, and experimental results. Section 5 presents the ablation study and discusses the impact of carriers similarity threshold

θ

. We conducted CarrDet in industry practice and show the effectiveness in Section 6. Finally, we draw conclusions in Section 7.

2. Related Work

2.1. Carrier for Wastewater Treatment

The carrier is typically a material with a large specific surface area and favorable bioadhesive properties [12], such as high concentration powder carriers. These carriers can enrich microorganisms at the surface to improve the amount of biomass in the bioreactor [1]. Adding carriers into wastewater leads to an increased concentration of mixed liquor suspended solids [13] and establishes a symbiotic microbial “double sludge” ecosystem comprising suspended growth and attached growth [14]. This ecosystem not only enhances the shock resistance of the WWT system but also improves the loading capacity of pollutants and settling performance of sludge [13]. This system plays a vital role in WWT, as it transforms and degrades organic pollutants from wastewater to meet effluent standards. Moreover, the carrier contains specialized organic alternative carbon sources, like polyhydroxyalkanoates (PHA), which can enhance the biological denitrification process and can substantially improve denitrification efficiency, thereby achieving biological purification of the wastewater [15]. By regulating carrier quantity, an optimal microbial living environment can be sustained, maximizing microbial system efficiency in WWT and ensuring effluent water quality adherence to established standards [2]. Hence, in the urban domestic WWT process, the recognition and quantification of carriers are of paramount importance. While image analysis methods have been used by wastewater treatment plants for the last many years [16], there is still a lack of specialized carrier detection methods in wastewater microscopic images.

2.2. Object Detection in Microscopic Images

Object detection in microscopic images is a common task in various biological and medical applications. Common methods include two categories, fine-tuned general object detection and domain-specific cell detection.

General object detection methods can be broadly categorized into one-stage and two-stage detectors, each focusing on different aspects. One-stage detectors such as single-shot detection (SSD) [7] and you only look once (YOLO) prioritize inference speed, while two-stage detectors such as the regional convolutional neural network (R-CNN) family, including R-CNN [8], Fast R-CNN [17], Faster R-CNN [18], and Mask R-CNN [19], prioritize detection accuracy.

Detection of certain types of cells in microscopy images is of significant interest to a wide range of biology and clinical practices [20]. To effectively detect cells in microscopic images, a range of detection methods have been developed, each suited to different cells. Prangemeier et al. [21] presented an attention-based cell detection transformer for yeast cell detection. Jiang et al. [22] verified key points and centroid points using the unique geometric sequence of cells for mammalian cell detection. Mao et al. [23] presented an optimized training algorithm for circulating tumor cell detection. Ji et al. [10] integrated upsampled multiscale features from FPN into a single layer for cell detection on IHC-stained images. Sun et al. [24] incorporated the embedding layer into the RPN to learn discriminative features based on similarity for histological cell detection. Abousamraet et al. [25] applied representation learning and deep clustering techniques to learn the morphological features of cells and representation of spatial context for breast cancer, colorectal cancer, and lung cancer cell detection. Shakarami et al. [26] used depthwise separable convolution to reduce detector parameters and distance intersection over union for blood cell detection. Liu et al. [9] refined the detection results using shape constraints of images for CD56 detection.

However, the direct application of these methods to wastewater microscopic images may lead to false positives caused by defective carriers and unrecognized carriers with shallow edges, due to the morphological specificity of carriers. Therefore, it is imperative to develop an object detection method specifically designed for carriers in wastewater microscopic images.

3. Framework

The overall network architecture of CarrDet, as depicted in Figure 2, consists of four main steps: the feature extractor, carrier feature block attention module, detection head, and symmetry-based defective carrier detection module. From the input image, the feature extractor learns feature representation. Then, we strengthen the attention on the edge of the carrier through the channel attention and spatial attention mechanism of the carrier feature block attention module. With these features as input, the detection head generates a series of carrier candidate frames. Following that, the symmetry-based defective carrier detection module eliminates defective carrier candidate frames by pairwise similarity evaluation of corresponding positions. Finally, the refined candidate frames are concentrated to form the final output. The detailed implementations of each module are illustrated as follows.

3.1. Feature Extractor

We construct a feature extractor using ResNet50 [11] and FPN [27] to extract features f from the initial input image x. The process of feature generation is as follows:

f = F (x)

(1)

Here,

f \in R^{H \times W \times C}

, and

(H, W, C)

represents the height, the width, and the number of channels for the extracted feature f. Specifically, we utilize the features extracted from the top three stages of ResNet50 and subsequently feed these features into FPN to achieve effective multiscale feature fusion.

3.2. Carrier Feature Block Attention

Due to subtle distinctions between the carrier edge and the background, the accurate recognition of carrier positions becomes a challenging task. To address this issue, we introduce the carrier feature block attention module, which comprises two essential components: channel attention and sparse attention. The channel attention emphasizes the “what” aspect, and the sparse attention prioritizes the “where” aspect [28]. This enhancement aims to focus on information related to carriers with shallow edges. By incorporating these attention mechanisms, the network can more accurately detect carriers in challenging scenarios involving color similarity. The following describes the details of each attention module.

3.2.1. Channel Attention

We flatten the feature f along the channel axis by using average pooling operations in order to aggregate spatial information, thereby efficiently computing the channel attention. The process can be summarized as

F_{c} = A v g P o o l (f)

(2)

where

F_{c} \in R^{C \times 1 \times 1}

denotes the channel compression feature. Then the similarity measures are performed to evaluate the cosine similarity between any pair of channels, which leads to the derivation of the similarity matrix

S_{c}

through cosine similarity calculations:

S_{c} = \frac{F_{c}^{i} \cdot F_{c}^{j}}{∥F_{c}^{i}∥ \times ∥F_{c}^{j}∥}

(3)

where

F_{c}^{i}

,

F_{c}^{j}

are the compression feature of the i-th and j-th channel of f, with

i, j \in [1, C]

and

S_{c} \in R^{C \times C}

.

To determine how much attention should be given to each channel during further processing, combine the compression features of the channels

F_{c}

and the similarity matrix

S_{c}

in a matrix multiplication operation, resulting in a channel attention vector

A_{c} \in R^{C \times 1 \times 1}

.

A_{c}

represents the importance or relevance of each channel in the feature maps. Finally, feature f along the channel axis is multiplied with the channel attention vector

A_{c}

, generating image feature

F_{c h a n n e l} \in R^{H \times W \times C}

.

This operation enhances the contrast between carrier edge channel features and background channel features, emphasizing the former while suppressing the influence of the latter.

3.2.2. Sparse Attention

In comparison to channel attention, our proposed sparse attention module emphasizes the informative spatial locations “where” in a complementary manner.

We perform spatial segmentation on

F_{c h a n n e l}

by merging the first and second dimensions of the image features together, which effectively reduces the dimension, and obtain a feature vector

F_{p} \in R^{H W \times C}

. Subsequently, we employ cosine similarity to assess the similarity between different spatial positions within the image, yielding a similarity matrix

S_{s}

.

S_{s}

represents the pairwise similarities between various spatial positions. The calculation formula is presented below:

S_{s} = \frac{F_{p}^{i} \cdot F_{p}^{j}}{∥F_{p}^{i}∥ \times ∥F_{p}^{j}∥}

(4)

where

F_{p}^{i}

,

F_{p}^{j}

, respectively, denote the feature vectors corresponding to the i-th and j-th spatial position within the image, with

i, j \in [1, H W]

, and

S_{s} \in R^{H W \times H W}

. Then, the feature

F_{p}

is subjected to matrix multiplication with the similarity matrix

S_{s}

, resulting in the spatial attention vectors

A_{s} \in R^{H W \times C}

. We obtain

F_{s p a r s e}

by multiplying

F_{c h a n n e l}

and

A_{s}

,

F_{s p a r s e} \in R^{H \times W \times C}

.

This module aims to enhance the attention given to various spatial positions of carriers in the image, which leads to improved recognition of the edge’s spatial features, while potentially minimizing the impact of background.

3.3. Detection Head

Given image features

F_{s p a r s e}

as input, the detection head predicts the carrier’s position information within the image, represented by

(x, y, h, w)

. Here,

(x, y)

denotes the coordinates of the carrier’s center point, while

(h, w)

represents the height and width of the carrier. By leveraging the acquired carrier candidate frames, the image can be effectively partitioned, allowing for the extraction of localized carrier image features

F_{c a r r i e r} \in R^{h \times w \times c}

.

3.4. Symmetry-based Defective Carrier Detection

To address the challenge of the false positives caused by defective carriers, it is essential to consider the morphological differences between defective and normal carriers. Thus, we combine the carrier morphological features into our method. The intuitive idea is two-fold: (a) The carrier is circular in wastewater microscopic images. (b) The normal carrier is symmetrical along both its horizontal and its vertical midline. Based on these two observations, we develop a novel module suitable for reducing the false positives caused by defective carriers by evaluating the similarity between the corresponding positions of carriers, described below.

Since the carriers are all circular in shape, we can equally divide each of them predicted by the detection head into n blocks, thereby obtaining localized carrier image block features

F_{b} \in R^{\frac{h}{n} \times \frac{w}{n} \times c}

.

Then we evaluate the similarity of symmetrically paired image blocks, respectively. This process generates similarity matrix

S_{s y m m e t r y} \in R^{n \times n}

, which can be mathematically expressed as follows:

S_{s y m m e t r y} = \frac{F_{b}^{i} \cdot F_{b}^{i s}}{∥F_{b}^{i}∥ \cdot ∥F_{b}^{i s}∥}

(5)

where

F_{b}^{i}

represents the feature vectors of the i-th image block, with

i \in [1, n]

. Furthermore,

F_{b}^{i s}

represents the symmetrically positioned image block of

F_{b}^{i}

after undergoing flipping.

Subsequently, in order to intuitively and conveniently determine whether the image block is symmetrical, we construct a matrix I for recognizing based on

S_{S y m m e t r y}

. The specific calculation formula is as follows:

I = \{\begin{matrix} 1, i f S_{s y m m e t r y}^{i j} > θ \\ 0, i f S_{s y m m e t r y}^{i j} < θ \end{matrix}

(6)

where

S_{s y m m e t r y}^{i j}

represents the element in the row i and column j of

S_{s y m m e t r y}

, and

θ

corresponds to the preset carrier image blocks similarity threshold. If all elements in the matrix I approach 1 very closely, it indicates that the carrier is normal; therefore, we should retain its position information

(x, y, h, w)

. Conversely, the presence of any element in matrix I that significantly approaches 0 indicates that the observed carrier is not a symmetrical circle, which is a false positive caused by a defective carrier. By removing candidate frames containing such defective carriers, we can effectively refine the detection results and enhance the overall recognition accuracy.

4. Experiment

4.1. Dataset

Within the WWT process, carrier detection was performed in three stages: detection of pure particulate carriers, carrier detection in microscopic images of sewage mixtures, and carrier detection in microscopic images of sludge. At present, we have only conducted the first stage of the relevant research. We collected 600 images, each with a size of 5440 × 3648 pixels, using an OLYMPUS CX43 biological microscope at 400× optical magnification to build our own dataset, Carrier600. It’s important to note that the dataset was provided by Sanyou Environmental Protection Technology Co., Ltd., located in Changsha, China, at wastewater treatment plants. And Carrier600 remains confidential due to its proprietary nature. The dataset was manually annotated by two experts for two weeks, and a total of 15,614 carriers were annotated. The distribution of the number of carriers in each image is shown in Figure 3. We divided the dataset according to the ratio of 8:1:1 for model training, verification, and testing.

4.2. Experimental Settings

Our experimental environment is a 32G NVIDIA TESLA V100 GPU. To ensure equity and consistency, all our experiments are carried out under the PaddleDetection platform [29]. All models are trained for 100 epochs, and the batch size is set to 8. We use Adamw [30] as the optimizer to optimize the model, and the learning rate is set to 0.0001. During the evaluation phase, the widely accepted mean average precision (mAP) metric is employed, using an intersection over union (IoU) threshold of 0.5 as the standard criterion for assessing object detection accuracy. Additionally, the images per second (IPS) metric is employed to evaluate the efficiency of the detection method.

4.3. Compared State-of-the-Art Methods

Detection methods designed for certain cell types in microscopy images are specific to proprietary datasets and consequently cannot be directly applied to Carrier600. Therefore, to verify the effectiveness of our method, we select several state-of-the-art object detection models to compare, including Sparse R-CNN [31], ConvNeXt [32], Deformable DETR [33], DINO [34], TOOD [35], and ViT [36]. Specifically, these models are initially trained on Carrier600. Sparse-RCNN provides a fixed sparse set of learned object proposals to the object recognition head for performing classification and location tasks. ConvNeXt introduces advancements in the architecture of ViT, enabling the construction of a hierarchical model. Deformable DETR utilizes attention modules that selectively focus on a small set of key sampling points surrounding a reference. DINO employs a contrastive training approach for denoising, a mixed query selection method for anchor initialization, and a look-forward-twice scheme for box prediction. TOOD addresses the challenge of aligning object classification and localization through a learning-based approach. ViT segments an image into blocks and provides a sequence of linear embeddings of these blocks as input to the transformer.

4.4. Experimental Results

Table 1 shows the results of these methods on Carrier600. It is evident that our proposed framework CarrDet achieves the highest mAP value of 94.32%, which is 10.73% higher than Sparse R-CNN, 8.85% higher than ConvNeXt, 7.9% higher than Deformable DETR, 6.96% higher than DINO, 3.42% higher than TOOD, and 3.41% higher than ViT. Moreover, our method also performs similarly well at processing speed. CarrDet achieves the highest IPS value of 4.93, which means processing 4.93 images per second. Notably, it shows enhancements compared to other models, surpassing Sparse R-CNN by 1.25 IPS, ConvNeXt by 3.48 IPS, Deformable DETR by 1.85 IPS, DINO by 2.41 IPS, TOOD by 0.85 IPS, and ViT by 1.46 IPS.

The superiority of our proposed method CarrDet over general object detection methods can be attributed to these key factors. Firstly, general object detection methods tend to overlook carriers with shallow edges, as depicted in the red bounding boxes in Figure 4b. In contrast, CarrDet uses the CFBA module to specifically focus on carriers’ edge features, effectively distinguishing carriers with shallow edges from the background and reducing the unrecognized rate. Secondly, as shown in the blue bounding boxes in Figure 4c, general object detection methods may erroneously recognize defective carriers. To address this, CarrDet implements the SDCD module to judge whether the shape of the carrier is a complete circle. This process efficiently filters out defective carriers, which leads to a noteworthy enhancement in detection accuracy. Finally, CarrDet uses components with a relatively simple architecture as the feature extractor and detection head, resulting in a faster image processing speed.

5. Discussion

5.1. Ablation Study

We conduct ablation experiments to evaluate the effectiveness of two modules, CFBA and SDCD, and assessed the impact of carriers similarity threshold

θ

. In all the runs, we use the same experimental setting as above.

5.1.1. Impact of CFBA

In this part, we examine the impact of CFBA, which comprises channel attention and sparse attention, on the performance of carrier detection in microscopic images. To conduct a thorough evaluation, we perform three sets of comparative experiments on Carrier600 and present the results in Table 2. The experiments involve the following: (a) only utilizing channel attention for carrier detection, (b) only utilizing sparse attention for carrier detection, and (c) employing both channel attention and sparse attention for carrier detection.

The results demonstrate that using channel attention alone improves mAP by 4.18% compared to not using it. Similarly, only using sparse attention improves the mAP by 4.65% compared to not using it. Furthermore, when both channel attention and sparse attention are combined, it outperforms the two individual experiments, and its mAP reached 92.93%.

In the absence of CFBA, general object detection methods may encounter challenges in accurately distinguishing carriers with shallow edges from the background, as their colors often bear a resemblance. Consequently, the detection of such carriers may be missed. However, by leveraging both channel attention and sparse attention, CFBA achieves a more precise focus on the channel information at each pixel and the spatial locations of carriers’ edges. These complimentary attention mechanisms contribute to effective background suppression while highlighting carriers’ edge features. As a result, CFBA significantly amplifies the contrast between carriers and the background, thereby reducing unrecognized carriers with shallow edges.

5.1.2. Impact of SDCD

In a manner analogous to the experiments conducted to assess the impact of CFBA, we conduct experiments specifically to evaluate the individual impact of SDCD on carrier detection. For this purpose, we design the following experiments for the SDCD module: (c) utilizing the SDCD module alone without CFBA, and (d) incorporating both CFBA and SDCD simultaneously. The results presented in Table 2 indicate that using the SDCD module alone improves mAP by 4.32% compared to the baseline. Moreover, employing both CFBA and SDCD outperforms the experiment (c) with an improvement of 1.39%.

The poor performance of general object detection methods can be partly attributed to the presence of defective carriers, which act as background noise. However, CarrDet uses the cosine similarity of the symmetric positions of the candidate boxes to perform secondary verification on the results of the detection head. Through this process, defective carriers are accurately filtered out, which leads to a substantial reduction in false detection results.

5.2. Impact of Carriers Similarity Threshold $θ$

In formula (6), the threshold

θ

plays a crucial role in assessing the level of similarity between pairwise image blocks. If

θ

is set too low, the defective carriers are regarded as normal carriers, while if

θ

is set too high, the normal carriers are regarded as defective carriers. Therefore, it is very important to choose an appropriate threshold

θ

. The importance of threshold selection has been emphasized in various studies, highlighting its significant impact on the performance of image processing and pattern recognition tasks [37,38] . To thoroughly investigate its impact, we conduct four sets of comparative experiments on Carrier600. In each set, the cosine distance threshold

θ

is set to 0.6, 0.7, 0.8, and 0.9, respectively. These experiments aim to explore its influence on carrier detection performance.

As shown in Figure 5, when the cosine distance threshold

θ

is 0.8, the highest mAP value of 94.32% is obtained, which is 2.69%, 1.83%, and 0.53% higher than the thresholds of 0.6, 0.7, and 0.9, respectively. Based on the trend of the mAP value as the cosine distance threshold

θ

changes from 0.6 to 0.9, we observe that CarrDet performs best when the cosine distance threshold

θ

is around 0.8. The experimental results demonstrate that the performance of our proposed framework is strongly influenced by the cosine distance threshold

θ

. This underscores the potential constraints of our framework that should be considered during practical use.

Future research could explore automated methods for threshold selection, such as using machine learning algorithms to dynamically adjust

θ

based on the dataset characteristics and real-time feedback. This could eliminate the need for manual threshold setting and improve detection accuracy across varied datasets.

6. Industry Practice

During the practical domestic WWT process, approximately 6000 wastewater microscopic images are automatically generated on a daily basis. Subsequently, manual carrier detection is conducted at Environmental Protection Technology Co., Ltd. to recognize and quantify carriers, which establishes the basis for subsequent steps in WWT.

However, due to constraints in terms of manual work and time costs, it is impractical to have a sufficient number of experts for the accurate recognition task. This could potentially compromise the efficiency and effectiveness of the subsequent WWT process. Therefore, the necessity of an efficient and continuously integrated automated carrier detection method becomes crucial.

To validate the applicability of our method in a practical scenario, we collected 621 wastewater microscopic images that were detected by inexperienced engineers from the wastewater treatment plant. After being examined by two experts over a week, the corrected results of CarrDet show that CarrDet successfully rectified a total of 231 images, 166 of which were accurate, while 65 images were found to have incorrect assignments. Specifically, out of the 166 accurately corrected images, 398 unrecognized carriers with shallow edges were correctly added by our method, and 273 false positives caused by defective carriers were rectified. On average, this resulted in 2.4 accurate additions and 1.6 correct rectifications per image. The results show that manual carrier detection in wastewater microscopic images has limitations, while CarrDet’s real-world performance is excellent in assisting manual detection.

7. Conclusions

We propose CarrDet, the first deep learning-based framework that is specifically designed for carrier detection in wastewater microscopic images for the WWT process. By leveraging a carrier feature block attention module and a symmetry-based defective carrier detection module, CarrDet addresses the detection challenge of carriers with shallow edges and false positives caused by defective carriers, respectively. We construct a carrier detection dataset of wastewater microscopic images, which greatly facilitates the development of carrier detection. Experimental results show the superior performance of CarrDet, achieving a mAP of 94.32 and an IPS of 4.93, to accurately recognize carriers and improve water purification processes. Our industry practice proves the effectiveness of CarrDet in practical business WWT applications. In future work, we will try to optimize the performance of carrier detection for microscopic images that are derived from different stages of WWT, such as sewage mixtures and sludge.

Author Contributions

Formal analysis and data curation, S.L. and H.C.; software, R.L. and S.L.; methodology: S.L., H.C., R.L. and H.S.; validation, H.C., H.S. and C.H.; writing—original draft preparation, S.L. and H.C.; writing—review and editing, R.L., H.S. and R.S.; supervision, C.H. and R.S.; project administration, H.S. and R.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by Hunan 14th Five-Year Plan Educational Science Research Project (No. XJK23AJD022 and XJK23AJD021), Changsha Science and Technology Key Project (No. kh2401027), Ministry of Education Industry-University Cooperation Collaborative Education Project (No. 220500643274437), National Natural Science Foundation of China (No. 62177046), Hunan Natural Science Foundation (No. 2023JJ40772), Hunan Social Science Foundation (No. 22YBA012), and the High Performance Computing Center of Central South University.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Due to confidentiality agreements with the company providing the data, they cannot be publicly disclosed. The dataset contains proprietary commercial information, and sharing it would compromise competitive advantages. Therefore, access is restricted to authorized personnel, and findings will be presented in aggregated or anonymized form.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Dong, Y.; Fan, S.Q.; Shen, Y.; Yang, J.X.; Yan, P.; Chen, Y.P.; Li, J.; Guo, J.S.; Duan, X.M.; Fang, F.; et al. A novel bio-carrier fabricated using 3D printing technique for wastewater treatment. Sci. Rep. 2015, 5, 12400. [Google Scholar] [CrossRef] [PubMed]
Shi, J.; Liang, Z.; Dai, X. Enhanced biological phosphorus and nitrogen removal by high-concentration powder carriers: Extracellular polymeric substance, microbial communities, and metabolic pathways. Environ. Sci. Pollut. Res. 2023, 30, 4010–4022. [Google Scholar] [CrossRef] [PubMed]
Zhao, Y.; Liu, D.; Huang, W.; Yang, Y.; Ji, M.; Nghiem, L.D.; Trinh, Q.T.; Tran, N.H. Insights into biofilm carriers for biological wastewater treatment processes: Current state-of-the-art, challenges, and opportunities. Bioresour. Technol. 2019, 288, 121619. [Google Scholar] [CrossRef] [PubMed]
Liang, Z.; Yi, J.; Cao, D.; Shi, J.; Yang, D.; Dai, L.; Dai, X. High concentration powder carrier bio-fluidized bed process: A new perspective for domestic wastewater treatment. Bioresour. Technol. 2022, 351, 127015. [Google Scholar] [CrossRef] [PubMed]
Dowsett, M.; Nielsen, T.O.; A’Hern, R.; Bartlett, J.; Coombes, R.C.; Cuzick, J.; Ellis, M.; Henry, N.L.; Hugh, J.C.; Lively, T.; et al. Assessment of Ki67 in Breast Cancer: Recommendations from the International Ki67 in Breast Cancer Working Group. J. Natl. Cancer Inst. 2011, 103, 1656–1664. [Google Scholar] [CrossRef] [PubMed]
Redmon, J.; Farhadi, A. YOLOv3: An Incremental Improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.E.; Fu, C.; Berg, A.C. SSD: Single Shot MultiBox Detector. In Proceedings of the ECCV, Amsterdam, The Netherlands, 11–14 October 2016; Lecture Notes in Computer Science Series. Springer: Berlin/Heidelberg, Germany, 2016; Volume 9905, pp. 21–37. [Google Scholar]
Girshick, R.B.; Donahue, J.; Darrell, T.; Malik, J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
Liu, R.; Ao, B.; Wen, Q.; Wu, X.; Yin, J.; Li, K. Combining ExtremeNet with Shape Constraints and Re-Discrimination to Detect Cells from CD56 Images. In Proceedings of the 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 21–25 August 2022; pp. 4587–4593. [Google Scholar] [CrossRef]
Ji, W.; Tian, W.; Pan, W.; Sun, S.; Jiang, H.; Feng, R.; Zhang, Y.; Zhao, R.; Jin, G. CellDet: Dual-Task Cell Detection Network for IHC-Stained Image Analysis. In Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Houston, TX, USA, 9–12 December 2021; pp. 1343–1346. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 26 June–1 July 2016; pp. 770–778. [Google Scholar]
Jeirani, Z.; Niu, C.H.; Soltan, J. Adsorption of emerging pollutants on activated carbon. Rev. Chem. Eng. 2017, 33, 491–522. [Google Scholar] [CrossRef]
Xu, X.; Liu, G.; Li, Q.; Wang, H.; Sun, X.; Shao, Y.; Zhang, J.; Liu, S.; Luo, F.; Wei, Q.; et al. Optimization nutrient removal at different volume ratio of anoxic-to-aerobic zone in integrated fixed-film activated sludge (IFAS) system. Sci. Total. Environ. 2021, 795, 148824. [Google Scholar] [CrossRef] [PubMed]
Ahmed, W.; Delatolla, R. Biofilm and microbiome response of attached growth nitrification systems across incremental decreases to low temperatures. J. Water Process. Eng. 2021, 39, 101730. [Google Scholar] [CrossRef]
Bengtsson, S.; Karlsson, A.; Alexandersson, T.; Quadri, L.; Hjort, M.; Johansson, P.; Morgan-Sagastume, F.; Anterrieu, S.; Arcos-Hernandez, M.; Karabegovic, L.; et al. A process for polyhydroxyalkanoate (PHA) production from municipal wastewater treatment with biological carbon and nitrogen removal demonstrated at pilot-scale. New Biotechnol. 2017, 35, 42–53. [Google Scholar] [CrossRef] [PubMed]
Shanono, I.H.; Sapiee, M.R.M.; Aziz, K.A.; Suleiman, N.H.Z.; Gomes, A.; Gomes, C. Image processing techniques applicable to wastewater quality detection: Towards a hygienic environment. J. Mater. Environ. Sci. 2018, 9, 2288–2303. [Google Scholar]
Girshick, R.B. Fast R-CNN. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar]
Ren, S.; He, K.; Girshick, R.B.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
He, K.; Gkioxari, G.; Dollár, P.; Girshick, R.B. Mask R-CNN. In Proceedings of the International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2980–2988. [Google Scholar]
Xue, Y.; Ray, N. Cell detection in microscopy images with deep convolutional neural network and compressed sensing. arXiv 2017, arXiv:1708.03307. [Google Scholar]
Prangemeier, T.; Reich, C.; Koeppl, H. Attention-Based Transformers for Instance Segmentation of Cells in Microstructures. In Proceedings of the 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Seoul, Republic of Korea, 16–19 December 2020. [Google Scholar]
Jiang, H.; Li, S.; Liu, W.; Zheng, H.; Liu, J.; Zhang, Y. Geometry-Aware Cell Detection with Deep Learning. mSystems 2020, 5, 10–1128. [Google Scholar] [CrossRef] [PubMed]
Mao, Y.; Yin, Z.; Schober, J.M. Iteratively training classifiers for circulating tumor cell detection. In Proceedings of the 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI), New York, NY, USA, 16–19 April 2015; pp. 190–194. [Google Scholar] [CrossRef]
Sun, Y.; Huang, X.; Zhou, H.; Zhang, Q. SRPN: Similarity-based region proposal networks for nuclei and cells detection in histology images. Med. Image Anal. 2021, 72, 102142. [Google Scholar] [CrossRef] [PubMed]
Abousamra, S.; Belinsky, D.; Arnam, J.S.V.; Allard, F.; Yee, E.; Gupta, R.; Kurç, T.M.; Samaras, D.; Saltz, J.H.; Chen, C. Multi-Class Cell Detection Using Spatial Context Representation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11–17 October 2021; pp. 3985–3994. [Google Scholar]
Shakarami, A.; Menhaj, M.B.; Mahdavi-Hormat, A.; Tarrah, H. A fast and yet efficient YOLOv3 for blood cell detection. Biomed. Signal Process. Control. 2021, 66, 102495. [Google Scholar] [CrossRef]
Lin, T.; Dollár, P.; Girshick, R.B.; He, K.; Hariharan, B.; Belongie, S.J. Feature Pyramid Networks for Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 936–944. [Google Scholar]
Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
Ma, Y.; Yu, D.; Wu, T.; Wang, H. PaddleDetection, Object Detection and Instance Segmentation Toolkit Based on PaddlePaddle. Github 2019. Available online: https://github.com/PaddlePaddle/PaddleDetection.git (accessed on 1 January 2024).
Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
Sun, P.; Zhang, R.; Jiang, Y.; Kong, T.; Xu, C.; Zhan, W.; Tomizuka, M.; Li, L.; Yuan, Z.; Wang, C.; et al. Sparse R-CNN: End-to-End Object Detection with Learnable Proposals. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19–25 June 2021; pp. 14454–14463. [Google Scholar]
Liu, Z.; Mao, H.; Wu, C.Y.; Feichtenhofer, C.; Darrell, T.; Xie, S. A ConvNet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 19–24 June 2022; pp. 11976–11986. [Google Scholar]
Zhu, X.; Su, W.; Lu, L.; Li, B.; Wang, X.; Dai, J. Deformable DETR: Deformable Transformers for End-to-End Object Detection. In Proceedings of the 9th International Conference on Learning Representations (ICLR), Virtual, 3–7 May 2021. [Google Scholar]
Zhang, H.; Li, F.; Liu, S.; Zhang, L.; Su, H.; Zhu, J.; Ni, L.M.; Shum, H. DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection. arXiv 2022, arXiv:2203.03605. [Google Scholar]
Feng, C.; Zhong, Y.; Gao, Y.; Scott, M.R.; Huang, W. TOOD: Task-aligned One-stage Object Detection. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 3490–3499. [Google Scholar]
Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In Proceedings of the 9th International Conference on Learning Representations (ICLR), Virtual, 3–7 May 2021. [Google Scholar]
Bhargavi, K.; Jyothi, S. A survey on threshold based segmentation technique in image processing. Int. J. Innov. Res. Dev. 2014, 3, 234–239. [Google Scholar]
Weszka, J.S. A survey of threshold selection techniques. Comput. Graph. Image Process. 1978, 7, 259–265. [Google Scholar] [CrossRef]

Figure 1. Examples of normal detection results of existing detection methods (green bounding boxes) in wastewater images. There are unrecognized carriers with shallow edges (red bounding boxes in (a)) and false positives caused by defective carriers (blue bounding boxes in b)).

Figure 2. Overview of CarrDet, which consists of four steps: (1) feature extractor, which inputs images and outputs feature representations; (2) carrier feature block attention module (CFBA), which inputs feature representations and outputs enhanced feature representations; (3) detection head, which inputs enhanced features and outputs candidate frames; and (4) symmetry-based defective carrier detection module (SDCD), which inputs candidate frames and outputs refined candidate frames.

Figure 3. The distribution of carrier number on Carrier600.

Figure 4. (a,c) are the detection results of the general object detection methods on the Carrier600 dataset, and (b,d) are the detection results of CarrDet. The red bounding boxes denote carriers with shallow edges, and the green bounding boxes denote normal carriers.

Figure 5. The mAP comparison with different carrier image blocks similarity threshold

θ

on Carrier600.

Figure 5. The mAP comparison with different carrier image blocks similarity threshold

θ

on Carrier600.

Table 1. Detection results on Carrier600. “Method” refers to specific algorithmic implementations, whereas “Backbone” refers to the core network architecture in deep learning models.

Method	Backbone	mAP(%)	IPS
Sparse R-CNN [31]	ResNet50	83.59	3.68
ConvNeXt [32]	ResNet50	85.47	1.45
Deformable DETR [33]	ResNet50	86.42	3.08
DINO [34]	ResNet50	87.36	2.52
TOOD [35]	ResNet50	90.90	4.08
ViT [36]	VisionTransformer	90.91	3.47
CarrDet	ResNet50	94.32	4.93

Table 2. Detection results on Carrier600.

CFBA (Channel Attention)	CFBA (Channel Attention)	SDCD	mAP(%)
			84.72
✓			88.90
	✓		89.37
✓	✓		92.93
		✓	89.04
✓	✓	✓	94.32

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, H.; Liu, S.; Liu, R.; Shi, H.; Hu, C.; Shi, R. CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment. Appl. Sci. 2024, 14, 5393. https://doi.org/10.3390/app14135393

AMA Style

Chen H, Liu S, Liu R, Shi H, Hu C, Shi R. CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment. Applied Sciences. 2024; 14(13):5393. https://doi.org/10.3390/app14135393

Chicago/Turabian Style

Chen, Huizhen, Shuning Liu, Rongkai Liu, Heyuan Shi, Chao Hu, and Ronghua Shi. 2024. "CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment" Applied Sciences 14, no. 13: 5393. https://doi.org/10.3390/app14135393

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment

Abstract

1. Introduction

2. Related Work

2.1. Carrier for Wastewater Treatment

2.2. Object Detection in Microscopic Images

3. Framework

3.1. Feature Extractor

3.2. Carrier Feature Block Attention

3.2.1. Channel Attention

3.2.2. Sparse Attention

3.3. Detection Head

3.4. Symmetry-based Defective Carrier Detection

4. Experiment

4.1. Dataset

4.2. Experimental Settings

4.3. Compared State-of-the-Art Methods

4.4. Experimental Results

5. Discussion

5.1. Ablation Study

5.1.1. Impact of CFBA

5.1.2. Impact of SDCD

5.2. Impact of Carriers Similarity Threshold $θ$

6. Industry Practice

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

CarrDet: Attention–Symmetry-Based Microscopic Carrier Detection for Wastewater Treatment

Abstract

1. Introduction

2. Related Work

2.1. Carrier for Wastewater Treatment

2.2. Object Detection in Microscopic Images

3. Framework

3.1. Feature Extractor

3.2. Carrier Feature Block Attention

3.2.1. Channel Attention

3.2.2. Sparse Attention

3.3. Detection Head

3.4. Symmetry-based Defective Carrier Detection

4. Experiment

4.1. Dataset

4.2. Experimental Settings

4.3. Compared State-of-the-Art Methods

4.4. Experimental Results

5. Discussion

5.1. Ablation Study

5.1.1. Impact of CFBA

5.1.2. Impact of SDCD

5.2. Impact of Carriers Similarity Threshold θ

6. Industry Practice

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

5.2. Impact of Carriers Similarity Threshold $θ$