Article

Real-Time Detection of Microplastics Using an AI Camera

by Md Abdul Baset Sarker 1, Masudul H. Imtiaz 1, Thomas M. Holsen 2 and Abul B. M. Baki 2,*
1 Electrical and Computer Engineering, Clarkson University, Potsdam, NY 13699, USA
2 Civil and Environmental Engineering, Clarkson University, Potsdam, NY 13699, USA
* Author to whom correspondence should be addressed.
Sensors 2024, 24(13), 4394; https://doi.org/10.3390/s24134394
Submission received: 29 May 2024 / Revised: 25 June 2024 / Accepted: 3 July 2024 / Published: 6 July 2024
(This article belongs to the Section Sensing and Imaging)

Abstract

Microplastics (MPs, size ≤ 5 mm) have emerged as a significant worldwide concern, threatening marine and freshwater ecosystems, and technologies for detecting them are lacking. The main goal of this research is the development of a camera sensor for detecting MPs and measuring their size and velocity while in motion. This study introduces a novel methodology involving computer vision and artificial intelligence (AI) for the detection of MPs. Three different camera systems, including fixed-focus 2D and autofocus (2D and 3D), were implemented and compared. A YOLOv5-based object detection model was used to detect MPs in the captured images. DeepSORT was then implemented to track MPs through consecutive images. In real-time testing in a laboratory flume setting, the precision in MP counting was 97%, and during field testing in a local river, the precision was 96%. This study provides foundational insights into utilizing AI for detecting MPs in different environmental settings, contributing to more effective efforts and strategies for managing and mitigating MP pollution.

1. Introduction

Microplastics (MPs), small plastic particles ≤ 5 mm in size [1], are identified as emerging contaminants, given their widespread presence and potential harm to ecosystems and human health. MPs persist in terrestrial, aquatic, and marine environments and carry contaminants into many parts of trophic food webs [2]. MPs are recognized as one of the major environmental concerns by the United Nations [3]. Projections by the World Economic Forum suggest that without decisive action, MP contamination levels will double by 2030, and the weight of plastics in the ocean will surpass that of fish by 2050 [4]. According to recent toxicological research [5,6], MPs may have harmful health consequences, including tissue inflammation, feeding disturbance, poor growth, developmental defects, and alterations in gene expression. Hence, it is imperative to develop a comprehensive understanding and appropriate management strategies to safeguard aquatic environments and the organisms that depend on them. An important step towards mitigation is accurately estimating MP concentrations in water bodies and evaluating their dynamics.
Despite MPs’ ubiquity, there is a lack of technologies for rapidly and accurately identifying and quantifying MPs in aquatic environments [7]. Developing such technology is a significant scientific challenge because of the small size of MPs. Naked-eye, microscopy, spectroscopy, and thermal analysis-based detection techniques have low accuracy and are time-consuming. Optical, laser-based, and scanning electron microscopy techniques are significant advancements in MP detection; however, they are obtrusive, bulky, and contain expensive subsystems [8,9]. In addition, these methods are laboratory-based and require water samples to be carried from the water body to a controlled laboratory. Therefore, these methods are not suitable for in situ detection of MPs in water. An in situ, real-time, non-destructive detection method with high speed, convenience, and acceptable accuracy, potentially utilizing artificial intelligence (AI) [10,11] to quantify and track MPs, is desired. An AI-based approach (e.g., machine/deep learning) could help with the detection of various subgroups of MPs in environmental settings, yet its implementation in MP identification is still limited [12].
Traditional methods for detecting aquatic MPs are labor-intensive, time-consuming, and costly, and they do not provide in situ, real-time monitoring data. We aim to overcome this gap by harnessing recent strides in AI computing and advanced image processing technology. The proposed system continuously captures real-time water sample images while the AI algorithm processes the visual data with precision, enabling the identification, quantification, and tracking of MPs. Combining computer vision with other technologies has recently been demonstrated [13,14], and vision-based underwater monitoring has been implemented in many research studies. However, vision-based underwater MP detection has not yet been demonstrated except for our recent work [15], where we detected and counted MPs from real-time video using a commercially available camera housed in a waterproof clear enclosure, submerged in a lab flume, and interfaced with a laptop computer.
To automatically detect MPs in an underwater environment using a computerized method, an object detection model is required. One of the most popular object detection models is You Only Look Once (YOLO), which has been widely used since 2015 [16,17,18,19,20]. YOLOv5 is one of the most popular YOLO models and has been used for underwater target detection [21], maritime object detection [22], underwater scallop recognition [23], and sea cucumber identification [24]. While such object detection is required for real-time MP detection, tracking algorithms are generally used for trajectory and velocity calculation. Deep Simple Online and Realtime Tracking (DeepSORT) is a multi-object tracking (MOT) algorithm that has been used in underwater object tracking [25,26,27,28], vehicle tracking [29], unmanned aerial vehicle (UAV) tracking [30], athletics [31], robotics [32], people tracking [33], and cotton seedling counting [34]. In our current study, we leveraged the YOLOv5 and DeepSORT architectures together.
Therefore, for the first time, this study aims to demonstrate the following:
(1)
A novel AI-vision implementation for the accurate detection and tracking of underwater MPs.
(2)
A detailed methodology for MP counting, velocity calculation, size measurement, and path detection over a wide range of water velocities and lighting conditions with different camera setups.
(3)
A validation of this lab-developed sensor system both in the controlled flume and in a river system.
(4)
An annotated dataset of underwater MPs to facilitate further research.

2. Methodology and Materials

The proposed AI-vision system is configured with an advanced optical camera (or cameras) interfaced with a laptop computer and light-emitting diode (LED) lights. It captures images at a rate of 60 frames per second with a resolution of 1920 × 1080 pixels. During system validation, MPs were released at the water surface and allowed to transport/settle with the flowing water. The trajectories of the MPs were then captured by the submerged camera. A Python-based script was developed (flowchart in Figure 1) such that each captured image was passed through the YOLOv5 object detection model to count the number of MPs in the image. If any MP was detected, the pixel coordinates of the detected MP (center coordinates, width, and height of the object in normalized pixels) were sent to DeepSORT to track the movement of the MP. DeepSORT assigned a unique identity to each detected MP and followed its movement across frames. From two consecutive images, the instantaneous velocity was computed using the measured travel distance of the MP and its travel duration.
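The loop below is a minimal sketch of this capture-detect-track pipeline, assuming a custom-trained YOLOv5 weight file (the path "best.pt" is a placeholder) loaded through torch.hub and the deep_sort_realtime wrapper for DeepSORT; it is illustrative and not the authors' exact firmware.

```python
# A minimal sketch of the capture -> detect -> track loop (illustrative only).
import cv2
import torch
from deep_sort_realtime.deepsort_tracker import DeepSort

model = torch.hub.load("ultralytics/yolov5", "custom", path="best.pt")  # custom MP weights (placeholder)
model.conf = 0.5                       # detection confidence threshold
tracker = DeepSort(max_age=30)         # keeps an identity per MP across frames

cap = cv2.VideoCapture(0)              # USB camera
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    results = model(frame)
    # results.xyxy[0]: one row per detection -> [x1, y1, x2, y2, conf, class]
    detections = []
    for x1, y1, x2, y2, conf, cls in results.xyxy[0].tolist():
        detections.append(([x1, y1, x2 - x1, y2 - y1], conf, int(cls)))
    tracks = tracker.update_tracks(detections, frame=frame)
    for trk in tracks:
        if not trk.is_confirmed():
            continue
        x1, y1, x2, y2 = trk.to_ltrb()
        cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0   # centre coordinates feed Section 2.6
cap.release()
```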
The velocity of the MPs was also compared with the water velocity, which was measured using a Vectrino Plus (Nortek) acoustic Doppler velocimeter (ADV). We placed an imaginary vertical reference line at one-quarter of the frame width. When a detected MP passed this reference line, we incremented the count with respect to its DeepSORT-assigned identity, thus reducing counting errors arising from DeepSORT identification errors. In a controlled experimental setup, various features of the sensor (e.g., camera focus, focal length, and lighting) and water body (e.g., water velocity, depth, etc.) were tested to optimize the detection and tracking efficiency. The developed Python script allows us to configure important parameters, such as the trained model weights and the confidence level of the object detection model. Model weights are parameters that the algorithm learns from the training data. The confidence level of a model refers to the level of certainty or trustworthiness assigned to the predictions made by the model. Typically, lowering the confidence level increases false positives, while raising it decreases them.
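As a concrete illustration of the reference-line counting rule, the sketch below increments the count only once per DeepSORT identity, when the track's centre first crosses the vertical line at one-quarter of the frame width; variable names are illustrative, not taken from the authors' code.

```python
# Sketch of the reference-line counting rule (names are illustrative).
FRAME_WIDTH_PX = 1280
REF_LINE_X = FRAME_WIDTH_PX // 4   # virtual vertical line at one-quarter width

counted_ids = set()   # DeepSORT identities already counted
prev_cx = {}          # last known centre x-coordinate per identity
mp_count = 0

def update_count(track_id, cx):
    """Increment the MP count once per identity when its centre crosses the line."""
    global mp_count
    last = prev_cx.get(track_id)
    prev_cx[track_id] = cx
    if track_id in counted_ids or last is None:
        return
    if last < REF_LINE_X <= cx:    # crossed the line in the flow direction
        counted_ids.add(track_id)
        mp_count += 1
```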

2.1. Experimental Setup

The experiment was conducted in the recirculating open channel flume in the Water Resources Laboratory at Clarkson University, USA, measuring 12 m in length, 0.45 m in width, and 0.77 m in height. The side walls of the flume are transparent, and the bed slope was adjusted to S0 = 0.5%. The flume’s spatial dimensions were defined along the longitudinal (x), transverse (y), and vertical (z) axes. An observation area measuring 2.00 × 0.46 m was designated 5.0 m downstream from the flume inlet, chosen to ensure the establishment of fully developed turbulent flow conditions (shown in Figure 2). Flow rate measurements were obtained through pressure gauges interfaced with a computer system. To control flow depth, a point gauge and tailgate were employed at the terminal end of the flume.
A Vectrino Plus (Nortek) ADV was utilized to measure the time-averaged water velocity in the longitudinal direction. A funnel with a 10 mm opening was affixed to the point gauge; the funnel channels the MP towards the designated location with minimal distortion. The lower section of the funnel remained consistently positioned 2 cm below the water surface to mitigate interference from surface effects and turbulence. A thin, white, waterproof sticker paper was used as a backdrop on the flume wall. After release, the MPs were captured using a fine mesh with a 0.5 mm aperture size located at the terminus of the flume. We conducted the experiment with three flow scenarios, with time-averaged water velocities (U) of 15, 25, and 36 cm/s, for offline video analysis and data collection. For real-time analysis, we utilized a total of four turbulent flow scenarios with time-averaged water velocities of 15, 25, 35, and 46 cm/s. For these flow scenarios, the Reynolds number (Re = ρUH/μ, where H is the average water depth (0.31 m), ρ is the water density, and μ is the dynamic viscosity of water) was in the range of 46,500 < Re < 142,600, indicating fully turbulent flow (Re > 10⁴) in all scenarios. To make the camera system waterproof, we placed the cameras in a waterproof IP67 polycarbonate submersible, see-through, lift-off enclosure [35]. A waterproof LED light (SOLA 2500, Light and Motion [36]) with three brightness settings, high (2500 lumens), medium (1000 lumens), and low (500 lumens), was used. The coverage angle of each light was 60 degrees. For data processing, we used a Lenovo Legion 7 laptop with a Core i7-10750H CPU @ 2.60 GHz, an NVIDIA GeForce RTX 2070 GPU, and a 1 TB SSD, running Ubuntu 22.04. The Python version was 3.10.
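For reference, the short script below reproduces the quoted Reynolds number range from the stated depth and velocities; the density and viscosity values are nominal assumptions for water and are not given in the text.

```python
# Reproduces the quoted Reynolds number range (Re = rho * U * H / mu).
RHO = 1000.0   # kg/m^3, density of water (nominal, assumed)
MU = 1.0e-3    # Pa*s, dynamic viscosity of water at ~20 degC (nominal, assumed)
H = 0.31       # m, average water depth from the text

for u_cm_s in (15, 25, 35, 46):
    u = u_cm_s / 100.0                # convert to m/s
    re = RHO * u * H / MU
    print(f"U = {u_cm_s} cm/s -> Re = {re:,.0f}")
# 15 cm/s gives ~46,500 and 46 cm/s gives ~142,600, matching the stated range.
```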

2.2. Camera Setup

Initially, a single-camera system was used; three different cameras were tested (Figure 3 and Table 1). Each camera setup (Table 1) consisted of an enclosed camera and two waterproof LED lights. The sizes of the enclosures were 3 × 3 × 2, 4 × 2 × 1, and 4 × 2 × 1 inches. All cameras connected to the computer via USB. For offline video analyses, we used a 1920 × 1080 resolution, and for real-time analysis, we employed Camera 1 at a 1280 × 720 resolution (as explained in the next section).

2.3. Microplastics

We used different sizes, shapes, colors, materials, and densities of MPs (Table 2). The MPs were submerged in the experimental water for three hours before use to minimize static surface charges.

2.4. Dataset and Training

We collected 484 videos from the lab and the field for the training process. These videos cover eight different distances from the camera (190, 210, 230, 250, 270, 290, 310, and 330 mm) and four sizes (2, 3, 4, and 5 mm) of MPs. From the field, we collected videos at a distance of 230 mm. We excluded videos in which MPs were not visible throughout the entire duration. Frames were extracted from the videos and resized to 1280 × 720 pixels. Next, we manually annotated 17,069 images and randomly divided them into 80% for training, 10% for validation, and 10% for testing. The training process involved 300 epochs with a learning rate of 0.01. We used the state-of-the-art YOLOv5n as the base model for transfer learning, as it is the smallest and therefore fastest of the YOLOv5 models.
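A minimal sketch of the random 80/10/10 split is shown below; the directory layout, file names, and the training command in the trailing comment are assumptions based on the standard YOLOv5 workflow, not the authors' exact scripts.

```python
# Sketch of the random 80/10/10 image split (layout and names are assumed).
import random
from pathlib import Path

random.seed(0)
images = sorted(Path("dataset/images").glob("*.jpg"))   # the 17,069 annotated frames
random.shuffle(images)

n = len(images)
n_train, n_val = int(0.8 * n), int(0.1 * n)
splits = {
    "train": images[:n_train],
    "val":   images[n_train:n_train + n_val],
    "test":  images[n_train + n_val:],
}
for name, files in splits.items():
    Path(f"{name}.txt").write_text("\n".join(str(p) for p in files))

# Transfer learning then follows the standard YOLOv5 entry point, roughly:
#   python train.py --data mp.yaml --weights yolov5n.pt --epochs 300 --img 1280
# (flags shown as an assumption based on the public YOLOv5 repository)
```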

2.5. Validation Study in Lab and Field

We performed offline and real-time video analysis in the lab; in the field, we performed offline video analysis. The camera was positioned outside the flume in the lab and submerged in the field, and the focus was calibrated for each distance in both scenarios.
Lab study (offline). For offline analysis, MPs were released at varying distances from the camera (190, 210, 230, 250, 270, 290, 310, and 330 mm) at water velocities of 15, 25, and 36 cm/s. We captured videos at a 1920 × 1080 resolution and 60 fps using Cameras 1, 2, and 3. Following the video recording, we assessed the detection accuracy of our model by comparing its output with manual counts of the released MPs.
Lab study (real-time). We captured real-time video at 1280 × 720 resolution and a maximum frame rate of 28 fps. Individual MPs were released at water velocities of 15, 25, 36, and 46 cm/s, and the system performed real-time counting, size calculation, and velocity measurements. MPs were released at a fixed distance of 230 mm from the camera, and only Camera 1 was used.
Field study (offline). The field test was conducted in the Raquette River, Potsdam, New York (coordinates approximately 44.672820 latitude and −74.995604 longitude) using a custom deployable structure, 914 mm in height and 469 mm in width, that accommodated all electronic and mechanical components and replicated the controlled flume environment within the flowing river (shown in Figure 4). It contained an adjustable vertical bar, allowing the Camera 1 setup to be positioned based on the river’s water depth.
Two horizontal wooden supports securely held a funnel resting on a separate adjustable aluminum frame, enabling both horizontal and vertical positioning control so that MPs could be dispersed at any water depth. A white plastic board was placed as a backdrop to provide a uniform background. We conducted the experiment by releasing a single MP at a time upstream using the funnel, located 2 cm below the water surface and 230 mm from the camera, and repeated the process with four different MP sizes. The water depth was 44 cm. For each MP size, we released five particles at the same location. We also measured the water velocity using a handheld ADV (SonTek FlowTracker2) [40].

2.6. Dynamic Coverage Computation

We released MPs from N (N = 8) different points on the water surface, i.e., at varying distances from the camera (across the flume width). To accurately calculate the velocity and size of the MPs at each distance (d), measurements of the width and height of the camera coverage at that distance are required. Figure 5 depicts a scenario where O represents the camera position, with the camera coverage width and height denoted as W and H, respectively; note that OP and OQ are perpendicular to AB and BC, respectively.
EF and FG are the width (W) and height (H) of the frame coverage at the MP's distance, respectively. As OPB and OMF are right triangles, the ratios of corresponding sides in the similar triangles OPB and OMF give the following:
$$MF = \frac{OM}{OP} \times PB \tag{1}$$
$$W = 2\,MF \tag{2}$$
Here, W is the width of the frame at a distance of d.
Similarly, OBQ and OFN are right triangles, and the ratios of corresponding sides in the similar triangles OBQ and OFN give the following:
$$NF = \frac{ON}{OQ} \times BQ \tag{3}$$
$$H = 2\,NF \tag{4}$$
Here, H is the height of the frame at a distance of d.
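In code, the similar-triangle relations above reduce to a linear scaling of a calibrated coverage width and height with distance. The sketch below assumes the half-width and half-height have been measured once at a reference distance; the numbers in the example are placeholders, not the paper's calibration values.

```python
# Linear scaling of the calibrated coverage with distance (cf. Equations (1)-(4)).
def coverage(d_mm, d_ref_mm, half_width_ref_mm, half_height_ref_mm):
    """Return (W, H), the frame coverage width and height at distance d_mm,
    given the half-width and half-height measured at reference distance d_ref_mm."""
    scale = d_mm / d_ref_mm                  # the OM/OP (and ON/OQ) ratio
    W = 2.0 * scale * half_width_ref_mm      # W = 2 * MF
    H = 2.0 * scale * half_height_ref_mm     # H = 2 * NF
    return W, H

# Placeholder calibration: half-width 83 mm and half-height 47 mm at 230 mm
# give a coverage of roughly 238 x 135 mm at 330 mm.
print(coverage(330, 230, 83, 47))
```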
MP Velocity Computation. To compute the velocity of MPs, we first computed the horizontal ($v_x$) and vertical ($v_z$) components of the velocity and then the resultant velocity ($v_T$) in cm/s:
$$v_x = \frac{W \times \Delta x_p}{W_{xp} \times \Delta t} \tag{5}$$
where $v_x$ is the velocity component of the MP in the horizontal (x) direction, $W$ is the camera coverage width at a distance $d$ from the camera computed from Equation (2), $\Delta x_p$ is the displacement of the MP in pixels relative to its position in the previous frame, $W_{xp}$ is the width of the frame in pixels, and $\Delta t$ is the elapsed time in seconds between the two frames.
$$v_z = \frac{H \times \Delta z_p}{H_{zp} \times \Delta t} \tag{6}$$
where $v_z$ is the velocity component of the MP in the vertical (z) direction, $H$ is the camera coverage height at a distance $d$ from the camera computed from Equation (4), $\Delta z_p$ is the vertical displacement of the MP in pixels, and $H_{zp}$ is the height of the frame in pixels.
After computing the horizontal velocity component $v_x$, a sliding window average (SWA) was employed over a specified window size $k$; in our case, $k = 5$. Denoting the input sequence as $v_{x_1}, v_{x_2}, \ldots$, the SWA at each position $i$ is given by the following:
$$v_{x_i} = \frac{1}{k} \sum_{j=i-k+1}^{i} v_{x_j} \tag{7}$$
where $v_{x_i}$ is the sliding window average of the horizontal velocity at position $i$. Similarly, for the vertical velocity,
$$v_{z_i} = \frac{1}{k} \sum_{j=i-k+1}^{i} v_{z_j} \tag{8}$$
where $v_{z_i}$ is the sliding window average of the vertical velocity at position $i$.
The resultant instantaneous velocity $v_T$ can be obtained from the following equation:
$$v_T = \sqrt{v_{x_i}^2 + v_{z_i}^2} \tag{9}$$
The average velocity $v_{T(avg)}$ for $N$ data points can be obtained using the following:
$$v_{T(avg)} = \frac{1}{N} \sum_{i=1}^{N} v_{T_i} \tag{10}$$
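The helper functions below sketch this velocity pipeline (Equations (5)-(10)): pixel displacements between consecutive frames are converted to cm/s using the coverage dimensions, each component is smoothed with a k = 5 sliding window, and the resultant speed is taken. Names and structure are illustrative only.

```python
# Sketch of the velocity computation and smoothing (cf. Equations (5)-(10)).
from collections import deque
from math import hypot

K = 5                                    # sliding window size used in the paper
vx_hist, vz_hist = deque(maxlen=K), deque(maxlen=K)

def instantaneous_velocity(dx_px, dz_px, dt_s, W_cm, H_cm, frame_w_px, frame_h_px):
    """Convert a per-frame pixel displacement into cm/s components."""
    vx = W_cm * dx_px / (frame_w_px * dt_s)   # Equation (5)
    vz = H_cm * dz_px / (frame_h_px * dt_s)   # Equation (6)
    return vx, vz

def smoothed_speed(vx, vz):
    """Sliding-window average of each component, then the resultant speed."""
    vx_hist.append(vx)
    vz_hist.append(vz)
    vx_bar = sum(vx_hist) / len(vx_hist)      # Equation (7)
    vz_bar = sum(vz_hist) / len(vz_hist)      # Equation (8)
    return hypot(vx_bar, vz_bar)              # Equation (9)

# Averaging the returned speeds over a run gives the mean velocity of Equation (10).
```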
Size computation. The equation below was used to calculate the size of MPs.
$$S = \frac{W \times D_{xmp}}{W_p} \tag{11}$$
where $D_{xmp}$ is the diameter of the MP in pixels, $W_p$ is the width of the camera frame in pixels, and $W$ is the coverage width of the frame at distance $d$.
After obtaining the size of the MP at each frame position, a sliding window average was applied over a specified window size $k$. Denoting the input sequence as $s_1, s_2, \ldots$, the SWA at each position $i$ is given by the following:
$$S_i = \frac{1}{k} \sum_{j=i-k+1}^{i} s_j \tag{12}$$
where $S_i$ is the sliding window average at position $i$ and $k$ is the size of the sliding window; in our case, $k = 5$. The window includes the current element $s_i$ and the $k - 1$ preceding elements.
Finally, we obtained the average size of the object:
$$S_{avg} = \frac{1}{N} \sum_{i=1}^{N} S_i \tag{13}$$
where $N$ is the total number of data points and $S_i$ is the smoothed size at position $i$.
Precision calculation. Precision is the ratio of correctly predicted positive observations to the total predicted positives. It was calculated using the following formula:
$$\mathrm{Precision} = \frac{\mathrm{True\ Positives\ (TP)}}{\mathrm{True\ Positives\ (TP)} + \mathrm{False\ Positives\ (FP)}} \tag{14}$$
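The sketch below ties the size and precision calculations together: the bounding-box width in pixels is scaled by the physical coverage width, smoothed with the same k = 5 window, and precision is TP/(TP + FP). Function names are illustrative.

```python
# Sketch of the size and precision calculations (cf. Equations (11)-(14)).
def mp_size_mm(box_width_px, coverage_width_mm, frame_width_px):
    """Physical MP diameter from its bounding-box width, Equation (11)."""
    return coverage_width_mm * box_width_px / frame_width_px

def sliding_average(values, k=5):
    """Average over the current value and up to k-1 preceding ones, Equation (12)."""
    out = []
    for i in range(len(values)):
        window = values[max(0, i - k + 1):i + 1]
        out.append(sum(window) / len(window))
    return out

def precision(tp, fp):
    """Precision = TP / (TP + FP), Equation (14)."""
    return tp / (tp + fp)

# Example matching the 4 mm row of Table 3: 5 true positives, 1 false positive.
print(round(precision(5, 1), 2))   # -> 0.83
```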

3. Results

3.1. Offline Video Analysis Results

The average precision of Cameras 1, 2, and 3 was 91%, 89%, and 87%, respectively, in offline testing (Figure 6). While a higher megapixel count proved helpful, we also found that the quality of the camera sensor played a crucial role. Surprisingly, Camera 2, at only 2 megapixels, performed similarly to the 16-megapixel Camera 3, emphasizing the importance of sensor quality over pixel count. All camera setups featured auto-brightness control, but only Camera 2 and Camera 3 had autofocus capability. The intricate interplay between camera quality, including megapixel count and sensor quality, and environmental factors, such as light angle and brightness, was evident.
During testing, both offline and real-time analysis revealed that the confidence level of the object detection model plays a crucial role in controlling the percentage of false positives. When the confidence level is excessively high (approaching 100%), false positives decrease, but the tracking performance of the DeepSORT model diminishes. Conversely, when the confidence level is set too low, DeepSORT tracking performs well, but false positives increase. Therefore, it is essential to set the appropriate confidence level based on the number of objects present in the water. In our study, we found that our model performs best within the 40% to 70% range of object detection confidence levels.
Particles closer to the camera were detected more accurately. However, this benefit comes at a cost: the closer the MP is to the camera, the shorter the time it remains visible while moving. Distances closer than 190 mm from the camera or water velocities higher than 46 cm/s reduced counting accuracy. As expected, distances greater than 330 mm from the camera also reduced detection capability. In our camera setup, MP detection worked best at distances from 190 mm to 230 mm.
There was a close correspondence between the sizes calculated by the system and the actual sizes of the MPs (average error of 5.5%) (Figure 7). Note that the MP size calculation was based on one fixed distance from the camera. However, the actual measurement points along the travel path varied, and this variation increased with increasing Reynolds number due to greater turbulence (Re varied from 46,500 to 111,600).
The velocity variances for each type of MP for the three camera setups and eight lateral distances of the MP dropping points are shown in Figure 8. At each distance, the variance was calculated using the mean velocity measured by the three cameras, assuming a constant MP distance from the camera. The average variance in velocity at 15 cm/s was 0.62 ± 0.32 cm/s (avg ± std), at 25 cm/s it was 1.6 ± 0.85 cm/s, and at 36 cm/s it was 2.8 ± 1.5 cm/s. The measured velocity variance of the MPs increased at higher water velocities because of turbulence and travel-path deviations arising from particle shape, density, and other environmental factors.

3.2. Field Test Result (Offline)

Camera 1 yielded the best performance (91%) in our offline lab tests; therefore, it was used for the field tests, in which MPs were released at a 230 mm distance from the camera and the water velocity was 5.0 cm/s. For these tests, the average measurement precision was 96%, with only one false positive (a natural particle detected as an MP) for the 4 mm MPs (Table 3). In the laboratory experiment conducted at a distance of 230 mm and a water velocity of 15 cm/s, the average precision was 91% using Camera 1. However, in the field, where the water velocity was only 5 cm/s, there was a 6% improvement in precision. This improvement was likely because at lower water velocity more frames could be analyzed, and, in addition, the water in the field was clearer than in the lab flume.

3.3. Real-Time Analysis Result (Lab)

The average precision for all sizes at different water velocities was 97% for real-time detection (Figure 9). The variance gradually increased at higher water velocities (Figure 9b) and was generally larger than that found when analyzing videos offline due to the higher velocities and lower image resolution. The variance in velocity at 15 cm/s was 10 ± 0.36 cm/s (avg ± std), at 25 cm/s it was 14 ± 1.9 cm/s, at 35 cm/s it was 22 ± 2.3 cm/s, and at 46 cm/s it was 21 ± 3.1 cm/s. It is evident that the calculated variance increases with increasing water velocity. The difference from the offline results was related to the difference in image resolution, since at lower resolution the distance represented by each pixel increases. For Camera 1, at a distance of 230 mm, the per-pixel distance is 0.13 mm for 1280 pixels and 0.086 mm for 1920 pixels. Consequently, even a small shift in the detected center of an MP can lead to a significant increase in variance. While higher-resolution images provide better MP detection, they also require more processing power. In real-time detection scenarios, particularly at high water velocities, faster processing is crucial. Therefore, to maintain performance with our current model at high water velocity, we reduced the image resolution from 1920 × 1080 pixels to 1280 × 720 pixels. Size estimation during real-time testing was similar to offline analysis, with average size calculation errors of approximately 10%.
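As a quick consistency check of the per-pixel distances quoted above, each is simply the horizontal coverage width at 230 mm divided by the frame width in pixels; the ~166 mm coverage width used below is inferred from the stated values rather than reported directly.

```python
# Per-pixel distance = horizontal coverage width / frame width in pixels.
coverage_width_mm = 166.0          # inferred from the stated per-pixel values, not reported
for frame_width_px in (1280, 1920):
    print(frame_width_px, round(coverage_width_mm / frame_width_px, 3), "mm/px")
# -> 1280 px: ~0.13 mm/px; 1920 px: ~0.086 mm/px, as quoted above.
```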

4. Discussion

The deployment of three distinct camera setups revealed that the Camera 1 configuration yielded the best performance. During real-time detection, this system maintained precision even under increasing water velocity and turbulence. However, this finding highlights a challenge: the variance in the MPs' measured velocity is influenced by water dynamics and the MPs' physical properties, since a constant distance from the MP to the camera is assumed. Using a 3D camera could overcome this issue because its enhanced depth perception would increase velocity measurement accuracy. However, in our experiments, the 3D camera did not work well. According to the manufacturer, this camera model should provide distance calculation at distances greater than 300 mm. In our case, the flume wall was 450 mm away; however, only reflections of the background were seen. While autofocus cameras can offer advantages in focus accuracy and adaptability to diverse applications, this can be offset by their increased complexity and slower performance compared to fixed-focus cameras. In addition, because an autofocus camera continuously adjusts its focus, it consumes more power than a fixed-focus camera.

4.1. Experimental Conditions

To achieve useful velocity measurement data, precise MP release positioning is crucial due to the turbulence generated by MPs entering the water. In our experiments, a horizontally and vertically adjustable funnel was used. To prevent adherence of MPs to the funnel, a spoon containing a small amount of water was used during the MP release process. The opening of the funnel was small to eliminate turbulence created by the funnel itself. A large opening shifts the dropping point, making the measurement unstable.
DeepSORT is designed to maintain the identity of individual objects between video frames. However, the MPs can be very small and occupy only a few pixels. This limited size results in fewer features being extractable, leading to increased identification errors in DeepSORT and inaccurate counting. To mitigate this problem, we used a virtual vertical reference line placed at one-quarter of the frame width and incremented the count each time a microplastic crossed this line.
To accurately calculate the actual size and velocity of MPs, precise annotation as close as possible to the edge of the MPs is crucial. If the provided annotations for training are inaccurate, it negatively impacts the precision of the size calculations. In our case, annotating the 2 mm MPs proved challenging due to their small size, making it difficult to achieve exact annotations consistently. During annotation, we observed instances where the bounding box size exceeded the actual MP size, leading to inflated size predictions.
Light angle and brightness play a vital role in the detection of MPs because of water’s unique optical properties, including refraction, backscatter, and color absorption, which necessitate a strategic approach to lighting. We found that ambient light caused issues with detecting MPs in both flume and field tests. In the lab environment, we attempted to reduce ambient light in the flume by turning off other light sources. In the field, where we only tested at 44 cm water depth, we blocked the sunlight from entering the viewing field using a black cloth.
While we have yet to conduct extensive tests in seawater, the system is expected to perform similarly, provided that adjustments are made for the different optical properties and potentially higher turbidity levels in marine environments. The system is designed to calculate velocity and track MPs while the camera is stationary. This method can be adapted for use on a ship in both ocean and river environments by mounting the camera and lighting systems on a stabilized platform and using real-time image processing capabilities. By incorporating the moving platform’s velocity into the equation, this system can be modified for use on moving ships. Future studies will focus on validating the system in various marine settings to confirm its robustness and adaptability.
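One way to fold the platform motion into the velocity calculation, as suggested above, is a simple frame-of-reference correction: the platform's velocity over ground (e.g., from GPS/IMU) is combined, component-wise, with the MP velocity measured in the camera frame. The sketch below is a hedged illustration of that idea, not part of the validated system.

```python
# Frame-of-reference correction for a moving platform (illustrative only).
def mp_velocity_over_ground(v_mp_camera, v_platform):
    """Add the platform velocity (same axes, same units) to the camera-frame
    MP velocity to obtain the MP velocity over ground, component-wise."""
    return tuple(vc + vp for vc, vp in zip(v_mp_camera, v_platform))

# Example: an MP drifting 12 cm/s backwards in the camera frame on a ship
# moving forward at 50 cm/s is moving ~38 cm/s forward over ground.
print(mp_velocity_over_ground((-12.0, 0.0), (50.0, 0.0)))   # -> (38.0, 0.0)
```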

4.2. Camera Limitations

There were some limitations to the camera setup used; the LED had an illumination angle of only 60 degrees, which is insufficient for imaging moving MPs. To produce equal illumination, less sharp shadows, and soft, diffused lighting, a wide-angle light source would be better. This type of lighting is especially useful for small object detection, emphasizing the characteristics of the object and reducing errors. Additionally, by reducing specular reflections on bright surfaces, a wide light source can allow more object details to be seen. While our system performs well for particles from 2 to 5 mm in clear water conditions, the accuracy of detection decreased in darker water due to lower contrast between MPs and the background. Increasing the light intensity (e.g., using a stronger light source) and potentially changing the camera (e.g., to one with higher sensitivity in low light conditions) could mitigate this issue. The system we developed was trained on commercially available MPs of various sizes and colors. While the system has performed well, implementing it in real-world scenarios will require collecting and annotating a large amount of data featuring MPs from natural sources to achieve optimal detection and counting accuracy.

4.3. Future Work

In the future, 3D camera technology will be incorporated into the existing system to enable the determination of the distance of the MPs at each position, rather than calculating velocity using a constant distance, thereby reducing the unwanted variance in the velocity calculation. In addition, a more lightweight deep learning model for MP detection needs to be developed to increase efficiency and resource optimization and to enable deployment on resource-constrained platforms such as the Jetson Nano, Google Coral, etc. To make a robust system that can differentiate natural particles from MPs, we will collect a large amount of data from natural rivers and seas to train the model. We will also explore the feasibility and advantages of implementing the entire MP detection system on field-programmable gate array (FPGA) hardware and investigate how FPGAs can contribute to real-time processing, reduced power consumption, and increased adaptability for deployment in diverse underwater environments.

5. Conclusions

This study presents a comprehensive framework for measuring MPs in aquatic ecosystems in real-time. By combining deep learning-based MP detection and advanced object-tracking algorithms, this study contributes to the development of efficient and accurate methods enabling functionalities such as counting, velocity calculation, size measurement, and path detection across diverse setups. After extensive and careful testing of three different camera configurations, Camera 1 (a 13 MP fixed-focus camera from e-con Systems) was the most efficient and dependable. The system was optimized for distances ranging from 190 mm to 330 mm from the camera. Within this range, the system demonstrated high detection accuracy, with precision rates of 97% in laboratory settings and 96% in field tests for MP sizes from 2 mm to 5 mm. Detection performance decreased significantly for distances closer than 190 mm or farther than 330 mm due to either insufficient visibility time or reduced detection capability. Our findings underscore the feasibility and effectiveness of employing fixed-focus camera systems for future research and monitoring efforts in this domain. The successful implementation of our model in both controlled lab settings and real-world scenarios shows its practical applicability and potential for broader adoption in tackling the pervasive issue of MP pollution in aquatic ecosystems.

Author Contributions

Conceptualization, M.A.B.S., M.H.I. and A.B.M.B.; data curation, M.A.B.S.; formal analysis, M.A.B.S., M.H.I., T.M.H. and A.B.M.B.; funding acquisition, M.H.I. and A.B.M.B.; methodology, M.A.B.S., M.H.I., T.M.H. and A.B.M.B.; project administration, M.H.I., T.M.H. and A.B.M.B.; software, M.A.B.S.; supervision, M.H.I., T.M.H. and A.B.M.B.; validation, M.A.B.S. and T.M.H.; visualization, M.A.B.S.; writing—original draft, M.A.B.S.; writing—review and editing, M.H.I. and T.M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research is funded by the Team Science Project Planning Grant (2022) of Clarkson University and New York State Center of Excellence (CoE) in Healthy Water Solutions at Clarkson University and SUNY ESF, grant number 102834.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data assembled or analyzed in this study are available from the corresponding author.

Acknowledgments

We would like to express our sincere gratitude to Usama Butt (Clarkson University, USA) for his invaluable assistance with the maintenance of the flume system used in the data collection for this study. We also extend our appreciation to Natalie Zhu (Brown University, an ASET REU student at Clarkson University and funded by the National Science Foundation, award No. 2244180) for her contributions to the data querying process. Additionally, we would like to thank Addrita Haque (Clarkson University) for her support and feedback throughout the various stages of this research endeavor. This study was promoted and supported by the Team Science Project Planning Grant (2022) of Clarkson University and NYS Center of Excellence (CoE) in Healthy Water Solutions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Ma, H.; Pu, S.; Liu, S.; Bai, Y.; Mandal, S.; Xing, B. Microplastics in Aquatic Environments: Toxicity to Trigger Ecological Consequences. Environ. Pollut. 2020, 261, 114089. [Google Scholar] [CrossRef] [PubMed]
  2. McNeish, R.E.; Kim, L.H.; Barrett, H.A.; Mason, S.A.; Kelly, J.J.; Hoellein, T.J. Microplastic in Riverine Fish Is Connected to Species Traits. Sci. Rep. 2018, 8, 11639. [Google Scholar] [CrossRef] [PubMed]
  3. Vivekanand, A.C.; Mohapatra, S.; Tyagi, V.K. Microplastics in Aquatic Environment: Challenges and Perspectives. Chemosphere 2021, 282, 131151. [Google Scholar] [CrossRef] [PubMed]
  4. Caruso, G. Microplastics as Vectors of Contaminants. Mar. Pollut. Bull. 2019, 146, 921–924. [Google Scholar] [CrossRef] [PubMed]
  5. Blackburn, K.; Green, D. The Potential Effects of Microplastics on Human Health: What Is Known and What Is Unknown. Ambio 2022, 51, 518–530. [Google Scholar] [CrossRef] [PubMed]
  6. Haleem, N.; Kumar, P.; Zhang, C.; Jamal, Y.; Hua, G.; Yao, B.; Yang, X. Microplastics and Associated Chemicals in Drinking Water: A Review of Their Occurrence and Human Health Implications. Sci. Total Environ. 2024, 912, 169594. [Google Scholar] [CrossRef] [PubMed]
  7. Baruah, A.; Sharma, A.; Sharma, S.; Nagraik, R. An Insight into Different Microplastic Detection Methods. Int. J. Environ. Sci. Technol. 2022, 19, 5721–5730. [Google Scholar] [CrossRef]
  8. Song, Y.K.; Hong, S.H.; Jang, M.; Han, G.M.; Rani, M.; Lee, J.; Shim, W.J. A Comparison of Microscopic and Spectroscopic Identification Methods for Analysis of Microplastics in Environmental Samples. Mar. Pollut. Bull. 2015, 93, 202–209. [Google Scholar] [CrossRef] [PubMed]
  9. Nuelle, M.-T.; Dekiff, J.H.; Remy, D.; Fries, E. A New Analytical Approach for Monitoring Microplastics in Marine Sediments. Environ. Pollut. 2014, 184, 161–169. [Google Scholar] [CrossRef]
  10. Yin, K.; Wang, D.; Zhao, H.; Wang, Y.; Guo, M.; Liu, Y.; Li, B.; Xing, M. Microplastics Pollution and Risk Assessment in Water Bodies of Two Nature Reserves in Jilin Province: Correlation Analysis with the Degree of Human Activity. Sci. Total Environ. 2021, 799, 149390. [Google Scholar] [CrossRef]
  11. Liu, T.; Yu, S.; Zhu, X.; Liao, R.; Zhuo, Z.; He, Y.; Ma, H. In-Situ Detection Method for Microplastics in Water by Polarized Light Scattering. Front. Mar. Sci. 2021, 8, 739683. [Google Scholar] [CrossRef]
  12. Tian, X.; Beén, F.; Bäuerlein, P.S. Quantum Cascade Laser Imaging (LDIR) and Machine Learning for the Identification of Environmentally Exposed Microplastics and Polymers. Environ. Res. 2022, 212, 113569. [Google Scholar] [CrossRef]
  13. Al-Faris, M.; Chiverton, J.; Ndzi, D.; Ahmed, A.I. A Review on Computer Vision-Based Methods for Human Action Recognition. J. Imaging 2020, 6, 46. [Google Scholar] [CrossRef] [PubMed]
  14. Ghofrani, J.; Kirschne, R.; Rossburg, D.; Reichelt, D.; Dimter, T. Machine Vision in the Context of Robotics: A Systematic Literature Review. arXiv 2019, arXiv:1905.03708. [Google Scholar]
  15. Sarker, M.A.B.; Butt, U.; Imtiaz, M.; Baki, A.B. Automatic Detection of Microplastics in the Aqueous Environment. In Proceedings of the IEEE 13th Annual Computing and Communication Workshop and Conference (CCWC), Piscataway, NJ, USA, 8–11 March 2023. [Google Scholar]
  16. Jiang, P.; Ergu, D.; Liu, F.; Cai, Y.; Ma, B. A Review of Yolo Algorithm Developments. Procedia Comput. Sci. 2022, 199, 1066–1073. [Google Scholar] [CrossRef]
  17. Algorry, A.M.; García, A.G.; Wofmann, A.G. Real-Time Object Detection and Classification of Small and Similar Figures in Image Processing. In Proceedings of the 2017 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 14–16 December 2017; pp. 516–519. [Google Scholar]
  18. Ahmad, F.; Ning, L.; Tahir, M. An Improved D-CNN Based on YOLOv3 for Pedestrian Detection. In Proceedings of the 2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP), Wuxi, China, 19–21 July 2019; pp. 405–409. [Google Scholar]
  19. Benjdira, B.; Khursheed, T.; Koubaa, A.; Ammar, A.; Ouni, K. Car Detection Using Unmanned Aerial Vehicles: Comparison between Faster R-CNN and YOLOv3. In Proceedings of the 2019 1st International Conference on Unmanned Vehicle Systems-Oman (UVS), Muscat, Oman, 5–7 February 2019; pp. 1–6. [Google Scholar]
  20. Wang, H.; Sun, S.; Wu, X.; Li, L.; Zhang, H.; Li, M.; Ren, P. A YOLOv5 Baseline for Underwater Object Detection. In Proceedings of the OCEANS 2021: San Diego—Porto, San Diego, CA, USA, 20–23 September 2021; pp. 1–4. [Google Scholar]
  21. Lei, F.; Tang, F.; Li, S. Underwater Target Detection Algorithm Based on Improved YOLOv5. J. Mar. Sci. Eng. 2022, 10, 310. [Google Scholar] [CrossRef]
  22. Kim, J.-H.; Kim, N.; Park, Y.W.; Won, C.S. Object Detection and Classification Based on YOLO-V5 with Improved Maritime Dataset. J. Mar. Sci. Eng. 2022, 10, 377. [Google Scholar] [CrossRef]
  23. Li, S.; Li, C.; Yang, Y.; Zhang, Q.; Wang, Y.; Guo, Z. Underwater Scallop Recognition Algorithm Using Improved YOLOv5. Aquac. Eng. 2022, 98, 102273. [Google Scholar] [CrossRef]
  24. Zhai, X.; Wei, H.; He, Y.; Shang, Y.; Liu, C. Underwater Sea Cucumber Identification Based on Improved YOLOv5. Appl. Sci. 2022, 12, 9105. [Google Scholar] [CrossRef]
  25. Mathias, A.; Dhanalakshmi, S.; Kumar, R. Occlusion Aware Underwater Object Tracking Using Hybrid Adaptive Deep SORT-YOLOv3 Approach. Multimed. Tools Appl. 2022, 81, 44109–44121. [Google Scholar] [CrossRef]
  26. Xing, C.; Sun, B.; Zhang, W. Image-Enhanced YOLOv5 and Deep Sort Underwater Multi-Moving Target Tracking Method. In Proceedings of the 2022 5th International Symposium on Autonomous Systems (ISAS), Hangzhou, China, 8–10 April 2022; pp. 1–6. [Google Scholar]
  27. Wojke, N.; Bewley, A.; Paulus, D. Simple Online and Realtime Tracking with a Deep Association Metric. In Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 17–20 September 2017; pp. 3645–3649. [Google Scholar]
  28. Wojke, N.; Bewley, A. Deep Cosine Metric Learning for Person Re-Identification. In Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA, 12–15 March 2018; pp. 748–756. [Google Scholar]
  29. Hou, X.; Wang, Y.; Chau, L.-P. Vehicle Tracking Using Deep SORT with Low Confidence Track Filtering. In Proceedings of the 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Taipei, Taiwan, 18–21 September 2019; pp. 1–6. [Google Scholar]
  30. Kapania, S.; Saini, D.; Goyal, S.; Thakur, N.; Jain, R.; Nagrath, P. Multi Object Tracking with UAVs Using Deep SORT and YOLOv3 RetinaNet Detection Framework. In Proceedings of the 1st ACM Workshop on Autonomous and Intelligent Mobile Systems, Association for Computing Machinery, New York, NY, USA, 22 January 2020; pp. 1–6. [Google Scholar]
  31. Zhang, Y.; Chen, Z.; Wei, B. A Sport Athlete Object Tracking Based on Deep Sort and Yolo V4 in Case of Camera Movement. In Proceedings of the 2020 IEEE 6th International Conference on Computer and Communications (ICCC), Chengdu, China, 11–14 December 2020; pp. 1312–1316. [Google Scholar]
  32. Pereira, R.; Carvalho, G.; Garrote, L.; Nunes, U.J. Sort and Deep-SORT Based Multi-Object Tracking for Mobile Robotics: Evaluation with New Data Association Metrics. Appl. Sci. 2022, 12, 1319. [Google Scholar] [CrossRef]
  33. Ahmed, I.; Ahmad, M.; Ahmad, A.; Jeon, G. Top View Multiple People Tracking by Detection Using Deep SORT and YOLOv3 with Transfer Learning: Within 5G Infrastructure. Int. J. Mach. Learn. Cybernetics. 2021, 12, 3053–3067. [Google Scholar] [CrossRef]
  34. Yang, H.; Chang, F.; Huang, Y.; Xu, M.; Zhao, Y.; Ma, L.; Su, H. Multi-Object Tracking Using Deep SORT and Modified CenterNet in Cotton Seedling Counting. Comput. Electron. Agric. 2022, 202, 107339. [Google Scholar] [CrossRef]
  35. McMaster-Carr. Available online: https://www.mcmaster.com/ (accessed on 20 March 2024).
  36. Sola Video 2500 Flood. Available online: https://lightandmotion.com/products/sola-video-2500-flood (accessed on 1 February 2024).
  37. See3CAM_CU135—4K USB Camera. Available online: https://www.e-consystems.com/4k-usb-camera.asp (accessed on 14 January 2024).
  38. Depth Camera D435. Available online: https://www.intelrealsense.com/depth-camera-d435/ (accessed on 14 January 2024).
  39. See3CAM_160–16MP (4K) Autofocus USB 3.1 Gen 1 Camera Board (Color). Available online: https://www.e-consystems.com/usb-cameras/16mp-sony-imx298-autofocus-usb-camera.asp (accessed on 16 August 2023).
  40. SonTek FlowTracker2 Handheld-ADV | Xylem US. Available online: https://www.xylem.com/en-us/products--services/analytical-instruments-and-equipment/flowmeters-velocimeters/flowtracker2-handheld-adv/ (accessed on 15 February 2024).
Figure 1. Firmware flowchart of the proposed system.
Figure 2. Controlled laboratory experimental setup for detection of MP in open channel flume; acoustic Doppler velocimeter (ADV).
Figure 3. Different cameras were employed in this study: (a) Camera 1, (b) Camera 2, and (c) Camera 3.
Figure 4. Experimental setup for the field test: (A) camera support, (B1, B2) LEDs, (C) laptop holder, (D) funnel, and (E) backdrop.
Figure 5. Illustration of camera coverage plane for different distances from the camera.
Figure 6. Comparison of MPs’ count detection average precision among the three cameras (Cameras 1, 2, and 3) for water velocities of (a) 15 cm/s, (b) 25 cm/s, and (c) 36 cm/s.
Figure 7. Comparison of MPs’ size computation errors among the three cameras (Cameras 1, 2, and 3) for water velocities of (a) 15 cm/s, (b) 25 cm/s, and (c) 36 cm/s (red dotted lines are the actual sizes of the MPs), and MPs’ size computation errors at Reynolds numbers of (d) 46,500, (e) 77,500, and (f) 111,600, corresponding to water velocities of 15, 25, and 36 cm/s, respectively.
Figure 8. Comparison of MP’s velocity variance at (a) 15 cm/s water velocity, (b) 25 cm/s water velocity, and (c) 36 cm/s water velocity.
Figure 9. (a) Precision change over water velocity for real-time test, (b) velocity variance over different water velocities, and (c) size detection in real-time analysis (red dotted lines are the actual size of MPs).
Table 1. Comparison of different camera specifications.

Features            | Camera 1                                  | Camera 2             | Camera 3
Manufacturer        | See3CAM [37]                              | Intel RealSense [38] | See3CAM [39]
Type                | 2D                                        | 3D                   | 2D
Sensor Resolution   | 13 MP                                     | 2 MP                 | 13 MP
Focus               | Fixed                                     | Autofocus            | Autofocus
Brightness Control  | Auto                                      | Auto                 | Auto
FPS                 | 60 fps @ 1920 × 1080; 120 fps @ 640 × 480 | 30 fps @ 1920 × 1080 | 30 fps @ 1920 × 1080
Interface           | USB 3.1                                   | USB 3.0              | USB 3.1
Table 2. The types and sizes of MPs used in this study.

Size (mm) | Shape     | Actual Size (mm) | Color  | Polymer Type      | Density (kg/m³)
5 mm      | Spherical | 4.96             | Cyan   | Polystyrene       | 1050
4 mm      | Spherical | 3.98             | White  | Cellulose Acetate | 1300
3 mm      | Spherical | 2.96             | Green  | Acrylic           | 1190
2 mm      | Spherical | 1.98             | Orange | Cellulose Acetate | 1300
Table 3. Field test result for Camera 1.

Size | Actual Count | Software Count | Precision (%)
2 mm | 5            | 5              | 100
3 mm | 5            | 5              | 100
4 mm | 5            | 6              | 83
5 mm | 5            | 5              | 100

