Next Article in Journal
Image Vignetting Correction Using a Deformable Radial Polynomial Model
Previous Article in Journal
Prediction of Actual from Climatic Precipitation with Data Collected from Northern Poland: A Statistical Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Long-Term Indoor-Outdoor PM2.5 Measurements Using PurpleAir Sensors: An Improved Method of Calculating Indoor-Generated and Outdoor-Infiltrated Contributions to Potential Indoor Exposure

1
Independent Researcher, 428 Woodley Way, Santa Rosa, CA 95409, USA
2
Department of Civil and Environmental Engineering, Stanford University, 1008 Cardiff Lane, Redwood City, CA 94061, USA
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(3), 1160; https://doi.org/10.3390/s23031160
Submission received: 15 December 2022 / Revised: 14 January 2023 / Accepted: 16 January 2023 / Published: 19 January 2023
(This article belongs to the Section Environmental Sensing)

Abstract

:
Low-cost monitors make it possible now for the first time to collect long-term (months to years) measurements of potential indoor exposure to fine particles. Indoor exposure is due to two sources: particles infiltrating from outdoors and those generated by indoor activities. Calculating the relative contribution of each source requires identifying an infiltration factor. We develop a method of identifying periods when the infiltration factor is not constant and searching for periods when it is relatively constant. From an initial regression of indoor on outdoor particle concentrations, a Forbidden Zone can be defined with an upper boundary below which no observations should appear. If many observations appear in the Forbidden Zone, they falsify the assumption of a single constant infiltration factor. This is a useful quality assurance feature, since investigators may then search for subsets of the data in which few observations appear in the Forbidden Zone. The usefulness of this approach is illustrated using examples drawn from the PurpleAir network of optical particle monitors. An improved algorithm is applied with reduced bias, improved precision, and a lower limit of detection than either of the two proprietary algorithms offered by the manufacturer of the sensors used in PurpleAir monitors.

Graphical Abstract

1. Introduction

Airborne fine particles are the single most important environmental cause of mortality. Outdoor and indoor sources are roughly equally important, with annual estimated deaths in the millions [1,2]. The London smog of the 1950s, which killed thousands of persons over a period of a week, was instrumental in activating many countries to monitor outdoor particle concentrations [3]. Indoor risks were perhaps slower to be recognized and still receive only a fraction of the attention and resources devoted to monitoring and controlling sources of outdoor particle pollution. However, most people spend most of their lives indoors [4]. Therefore, multiple studies have been carried out to measure either personal or indoor exposures to particles [5,6,7,8,9,10,11,12]. However, these studies have been limited to short-term measurements (days or weeks), since the expense of maintaining research-grade instrumentation in homes, together with the noise and clutter of the instruments, has made long-term (months or years) studies with actual measurements of indoor concentrations nearly impossible.
Recently, however, the advent of low-cost monitors has made it possible to measure PM2.5 continuously for months or even years inside and outside residences. Multiple manufacturers have produced optical particle counters employing tiny noiseless sensors to estimate fine particle concentrations. These monitors have been tested for response to particles produced by multiple common indoor sources and many have performed well [13,14]. Their unobtrusive size and lack of noise has made them attractive to homeowners interested in their exposure to fine particles.
An important goal in studies of human exposure to particles has been to differentiate particles produced by indoor sources from those that infiltrate homes from outdoors. Multiple methods have been adopted. In the pioneering Harvard 6-City Study, indoor air concentrations were regressed on outdoor concentrations [15]. The slope of the regression was interpreted as an infiltration factor Finf that could be multiplied by the outdoor concentration to determine the contribution of outdoor particles to total indoor concentration.
Alternative methods have also been applied, such as using a tracer (typically sulfur) with few indoor sources to reduce the indoor-generated particle concentrations [16,17,18,19]; using a “censoring” method on continuous indoor measurements to eliminate indoor sources [20,21,22,23]; using a recursive modeling approach based on dynamic modeling of near-continuous measurements [24,25]; measuring deposition and penetration directly [26]; applying mass balance analysis to estimate resuspension [27,28,29]; and using LOESS regression to detect minima in the indoor/outdoor ratio [30].
However, the regression method is still being widely used. A careful statistical analysis of the regression method, called the Random Component Superposition (RCS) method is provided in [31]. In this approach, the following model is used:
Cin(t) = Pa/(a + k) Cout(t) + G(t)
where Cin and Cout are the instantaneous indoor and outdoor concentrations (µg/m3), P is the penetration factor (dimensionless number describing the fraction of outdoor particles able to cross the building envelope), a is the air exchange rate (h−1), k is the deposition rate (h−1), and G is the instantaneous concentration at time t caused by indoor activities.
The term Pa/(a + k) is the infiltration factor Finf and describes the fraction of outdoor PM that enters the home. In the case of a time-averaged monitoring period, the equation may be written
<Cin> = Finf <Cout> + <G>
where now <Cin> and <Cout> are the averaged values, <G> is the average contribution of indoor activities to the indoor concentration, and Finf is considered to be constant. This equation is in the form of a simple linear regression, where the intercept is the average contribution of indoor-generated particles to the total indoor concentration, and the slope is the infiltration factor Finf. Multiplying the observed outdoor average concentration by Finf determines the indoor average concentration of infiltrated particles, or Cinf:
<Cinf>= Finf <Cout>
Health effects depend on exposure. Remembering that exposure is defined as the contact of a pollutant with a receptor (in this case fine particles in the breathing zone of a human), for a person over time T, a rough estimate of exposure would be given by
<Cin>T = Finf <Cout>T + <G>T
Of course, people are not always indoors, so Equation (4) is not a measure of their exposure. It is, however, a measure of the potential exposure of a person indoors over time.
In particular, the first term on the right-hand side of Equation (4) is described by epidemiologists as the exposure to particles of ambient origin. Epidemiologists then seek a relationship between health effects and this exposure to particles of ambient origin. The second term is the potential exposure to indoor-generated particles. Epidemiologists do not generally include this second term, since G is usually completely unknown.
There are three sources of error in this formulation. First, indoor-generated concentrations are seldom known. These can sometimes be substantial, as in smoking tobacco or marijuana, cooking on gas or electric stoves, burning candles, or simply moving about and resuspending particles [32,33,34,35,36,37,38]. Secondly, little is known about the times the residents are not at home. Thirdly, the equation assumes a constant infiltration factor, despite a number of influences such as the indoor-outdoor temperature difference, wind speed or direction, and, in particular, occupant behaviors such as opening a window or running a kitchen fan [39,40,41,42].
The first source of error can be rectified by measuring indoor and outdoor concentrations. This is now being done for an ever-growing number of homes using low-cost particle sensors. The second source of error can only be dealt with statistically at present, due to the lack of low-cost personal monitors. The third source of error can be reduced by using the RCS model to identify periods when different infiltration factors may occur for a given location. This is the primary focus of our investigation.
We also present an improved method for determining PM2.5 from the raw data provided by the PurpleAir monitors. Although the Plantower manufacturer of the sensors used in the Purpleair monitors provides its own algorithms for determining PM2.5, no information is provided on the composition, density, or size distribution of the aerosol used for calibrating the sensors. Therefore we use an alternative algorithm based on the number of particles reported by the sensors. The algorithm is described fully in Section 2.3.
Finally, we employ a method of determining the limit of detection (LOD) of the PurpleAir monitors under real-life conditions. The LOD is of interest, particularly for indoor air, since it is a boundary below which the measured values are not significantly different from zero. Indoor air PM2.5 typically is at a lower concentration than outdoor air, and the fraction of particles below the LOD is typically higher than for outdoor air. The method is described fully in Section 2.4.

2. Materials and Methods

2.1. PurpleAir Monitors

PurpleAir monitors employ optical particle sensors manufactured by Plantower, a Chinese company. The monitors use one or two lasers at about 650 nm to scatter off the aerosols, which are brought into the monitor housing using a small fan. There is also a device providing the temperature and relative humidity (RH). PurpleAir recommends its PA-II monitor, which has two independent Plantower PMS-5003 sensors, for outdoor use, and its PA-I monitor, which has one PMS-1003 sensor, for indoor use. A photograph of a custom mounting of a PurpleAir CA-II outdoor monitor is provided (Figure S1 in the Supplement). Using one of two proprietary Plantower algorithms, estimated values of PM1, PM2.5, and PM10 are provided. In addition, estimated particle numbers in six size categories are provided. All monitors send data to the Web, from which the observations can be downloaded either from the PurpleAir map page (https://www2.purpleair.com/, accessed on 1 January 2023) or the PurpleAir API site (https://api.purpleair.com/, accessed on 1 January 2023). All PurpleAir data used in this study employed an alternative algorithm (ALT-CF3) based on the particle numbers in the three smallest size categories. This algorithm is described fully in Section 2.3.

2.2. RCS Model

The RCS model assumes a constant infiltration factor over the time the house is monitored. The concentration due to particles of ambient origin is given by Equation (3) above. Suppose there is a set of average indoor concentrations and corresponding average outdoor concentrations. A regression of the indoor on the outdoor concentrations results in a regression line with slope Finf. Now a second line is drawn showing the contribution of outdoor particles to indoor PM2.5. That line is the solution to Equation (3) above for the outdoor-infiltrated particles only. The line is parallel to the slope of the regression and passes through the origin. Considering that the indoor contribution <G> cannot be negative, the line through the origin defines a “Forbidden Zone” where few observations are expected (Figure 1). For any given observation, the total indoor air concentration consists of two components: the outdoor-penetrated and indoor-generated particles. The outdoor-penetrated particles (vertical red line in Figure 1) are determined by the upper boundary of the Forbidden Zone. The indoor-generated particles (vertical blue line in Figure 1) are calculated by subtracting this value from the observed total indoor concentration.
A major weakness of the regression approach (including the RCS method) is the assumption of a constant infiltration factor, despite the influences mentioned above that alter the infiltration factor. For example, a major EPA study measured 37 homes in the Research Triangle Park of North Carolina for personal, indoor, and outdoor concentrations of sulfur and fine particles [11,17,18,19]. Each home was monitored for 7 days in each of the four seasons. The infiltration factor was considerably lower (0.49) in summer compared to the fall, winter, and spring values of 0.64, 0.62, and 0.59, respectively. The authors attributed this effect to the increased use of air conditioning during the summer.
The existence of the Forbidden Zone is a feature that allows a check on the assumption of a constant infiltration factor. If many observations appear in the Forbidden Zone, the infiltration factor was not constant. However, it may then be possible to divide the dataset into two or more datasets, each with its own constant infiltration factor.
This study deals with the problem of a variable infiltration factor. First, the RCS approach is applied to an entire dataset of matched indoor and outdoor daily average PM2.5 measurements. If there are multiple violations of the Forbidden Zone, subsets of the data are then sought that will have few or no violations. A rough estimate of the variability of the infiltration factor is the daily indoor/outdoor (I/O) ratio as a function of time. If there is a marked variation, it may suggest subsets of the data likely to have more stable ratios. For example, a seasonal variation (due to open windows in spring or the use of air conditioning in summer) might appear. If the resulting RCS regressions on each subset (or at least one of the subsets) show no or few violations of the Forbidden Zone, the result can be accepted as a robust estimate of the infiltration factor for that subset and thus a good estimate of the relative contributions of indoor-generated and outdoor-penetrated particles to the total indoor PM2.5.
The RCS approach is illustrated using matched indoor and outdoor long-term measurements (hundreds of days) at six California locations. The indoor sites are in Menlo Park, Oakmont (in Santa Rosa), Redwood City, and the Outer Sunset area in NW San Francisco. The outdoor sites are in those four locations and also at Bennett Valley, near Santa Rosa, and Alexander Avenue, in Redwood City.

2.2.1. Menlo Park

This site has one outdoor PA-II monitor (with two Plantower PMS 5003 sensors) and one indoor PA-I monitor (with one Plantower PMS 1003 sensor). The monitors operated from 13 November 2019 to 4 December 2021. The site is a private residence occupied by a renter. Between 2 July 2020 and 14 February 2021, the renter was out of the country and the residence was unoccupied. Daily means were calculated for all days using the ALT-CF3 algorithm. At least 540 of the 720 possible measurements per day were required (18 h/day). For every two-minute measurement made by the outdoor monitors with two identical sensors A and B, a measure of precision (abs (A − B)/(A + B)) was calculated. Precision was required to be less than 0.2 to be included in the analysis. This resulted in the loss of about 8% of the data. The final database consisted of 270 days with matched indoor and outdoor daily means.

2.2.2. Oakmont (in Santa Rosa)

Oakmont is a community within Santa Rosa, CA. The site is a home with two occupants. Two indoor PA-II monitors 1 and 2 were set up on 23 July 2019 and operated until 18 June 2020; at that point, two additional PA-II monitors (3 and 4) were deployed. Monitors 1 and 4 were operated exclusively indoors, and monitor 3 (except for one month) outdoors for the next 19 months to 28 September 2022. Daily means were calculated for all days using the ALT-CF3 algorithm. At least 360 of the 720 possible measurements per day were required. The final database consisted of 804 days with matched indoor and outdoor daily means.

2.2.3. Redwood City

A site in Redwood City, CA operated two PurpleAir PA-II monitors indoors and one PurpleAir PA-II monitor outdoors from 1 May 2021 until 16 October 2022. The site is a home with one occupant. The final database consisted of 411 days with matched indoor and outdoor daily means.

2.2.4. Alexander Avenue

Indoor and outdoor PurpleAir PM2.5 measurements at a residence on Alexander Avenue in Redwood City were collected between 5 December 2017 and 7 September 2022 using the ALT-CF3 algorithm. Daily means were calculated for all days with the requirement that at least half of the 720 possible measurements per day were available at an acceptable precision (abs (A − B)/(A + B)) of better than 20%. The final database consisted of 1645 days with matched indoor and outdoor daily means.

2.2.5. Bennett Valley

Outdoor PurpleAir PM2.5 measurements at the Bennett Valley PurpleAir site in Sonoma County, CA were collected using the ALT-CF3 algorithm and compared to the Oakmont site in Santa Rosa (about 4 km distant) covering the period from 19 June 2020 to 27 September 2022. Daily means were calculated for all days with the requirement that at least half of the 720 possible measurements per day were available at an acceptable precision (abs (A − B)/(A + B)) of better than 20%. In total, 821 days met these requirements.

2.2.6. Outer Sunset (San Francisco)

An outdoor and corresponding indoor site is located in the Outer Sunset area in northwest San Francisco. Data from 20 October 2019 to 13 August 2021 were examined. Daily means were calculated for all days with the requirement that at least half of the 720 possible measurements per day were available at an acceptable precision (abs (A − B)/(A + B)) of better than 20%. In total, 409 days met these requirements.

2.3. Algorithm for Calculating PM2.5

Although two algorithms (CF1 and CF_ATM) for calculating PM2.5 are offered by the Plantower (https://www.plantower.com/en/, accessed on 1 January 2023) manufacturer of the sensors used in the PurpleAir monitors, we reject both. There are several reasons for rejection:
  • Both algorithms are biased high, from 40 to 90% [43,44,45].
  • Both algorithms assign values of zero to some results below an arbitrary value. Yet there are no cases in which the reported number of particles is zero. This problem is particularly prevalent for indoor PM2.5 measurements, typically below outdoor levels. In one study with >900,000 indoor measurements, more than 200,000 values were reported as zero by the Plantower algorithms (Supplementary Information).
  • No information is provided by Plantower regarding the calibration aerosol properties, such as the chemical makeup and its density.
  • The precision of measurements is poor, in part because one or both sensors within the PA-II monitor often return zero values.
  • The limit of detection (LOD) is elevated, again in part because of poor precision caused by excessive zero values.
The ALT-CF3 algorithm was developed to provide a transparent and reproducible alternative to the “black box” algorithms provided by Plantower. It is described fully in [46]. The algorithm uses the particle numbers reported for the three size fractions 0.3–0.5 µm, 0.5–1 µm, and 1–2.5 µm. By assigning an average diameter to each size fraction, the total particle volume is calculated. In our approach, we selected as an average diameter the geometric mean of the upper and lower boundaries of these three size categories. The total PM2.5 mass is then determined in two steps: first, a density is assigned (in this case the density of water) and then a calibration factor is determined by comparison to nearby regulatory monitors employing Federal Reference or Federal Equivalent Methods (FRM/FEM). The original calibration factor for 33 PurpleAir PA-II monitors within 0.5 km of 27 regulatory monitors was 3.0, leading to the naming of this alternative algorithm ALT-CF3. The equation giving the ALT-CF3 estimate of PM2.5 is
PM2.5 = 3(0.00030418 N1 + 0.0018512 N2 + 0.2069706 N3)
where N1, N2, and N3 are the number of particles per deciliter in the three smallest size categories.
The ALT-CF3 method has been found to provide less bias, better precision, and a lower limit of detection than either the CF1 or CF_ATM algorithms [47,48]. This method is presently available on the PurpleAir mapping page at the PurpleAir Website (https://www2.purpleair.com/, accessed on 1 January 2023) as an alternative to the proprietary algorithms offered by Plantower. The method is also available on the PurpleAir API page (https://api.purpleair.com/, accessed on 1 January 2023) under a different name: “PM2.5_alt”. All PM2.5 data in this paper were determined using the ALT-CF3 algorithm.

2.4. Calculation of the Limit of Detection (LOD)

A method for calculating the limit of detection for continuous monitors has been developed [49]. By analogy with integrated methods such as weighing particle masses on filters, it is required that the mean of multiple collocated instruments of one type all measuring the same environment at some low concentration exceeds 3 times the standard deviation to be considered as evidence at the 95% confidence level of a non-zero concentration. This approach can be applied to a single PurpleAir monitor with two sensors A and B. Every data point has an associated standard deviation based on the two sensors. The data can be ordered by the mean of the two sensors. At very low mean values, it is likely that the ratio of the mean to the standard deviation (µ/σ) will be <3 for many of the observations. For higher mean concentrations, the fraction of cases with µ/σ < 3 will decline. When this fraction falls below 5%, the associated mean concentration will be the LOD. Operationally, this approach can be implemented by considering successive batches of, say, 100 measurements of the ordered data. When the number of ratios <3 falls to fewer than 5 of the 100 values in the batch, the associated mean of that batch may be considered a provisional LOD. However, a batch at a higher concentration may have 5 or more ratios <3, in which case the LOD will be greater than the provisional LOD and the search for the LOD must continue to higher concentrations. This problem can be solved by analyzing all possible batches of 100 and picking the highest concentration with only 5 out of 100 values having µ/σ < 3.
In our study, however, it was found that some of the very large multiple-day datasets, including 100,000 to 975,000 measurements, gave clearly erroneous high values for the LOD, likely due to random variation. We therefore tweaked the method to employ “batches” of 1000 (instead of 100), with 50 (instead of 5) µ/σ values <3 to constitute the 5% cutoff for determining the LOD. This approach was sufficient to smooth out the function and allow the identification of the LOD. The approach is described in an example in the Alexander Avenue section of the Supplementary Information.

3. Results

3.1. Menlo Park

The data for the time of occupancy (270 days) consist of 372,621 two-minute average PM2.5 concentrations (Table 1). Note the loss of nearly 70,000 data points for the Plantower CF1 algorithm (bottom three rows) compared to the ALT-CF3 algorithm (top five rows). This is partly due to the Plantower decision to assign values of zero to concentrations below an arbitrary cutoff. No values of zero ever occur in the ALT-CF3 algorithm, since there are never occasions when the number of particles between 0.3 and 0.5 µm is zero. The CF1 algorithm overestimates the outdoor PM2.5 concentration by about 85% (8.3 vs. 4.4 µg/m3) and the indoor concentration by 50% (3.3 vs. 2.2 µg/m3). The mean PM2.5 concentration due to indoor activities (1.78 µg/m3) is larger than the mean concentration (0.78 µg/m3) attributable to penetration of ambient PM2.5. The Spearman correlation coefficient was reasonably high at 0.67.
For the 270-day period when the residence was occupied, the RCS estimate of the infiltration factor is 0.0849 (Figure 2). The RCS-predicted Forbidden Zone created by the product of Finf and the outdoor concentration is not violated by a single daily measurement. This suggests that the estimate of a single constant value for the infiltration factor throughout the entire period is robust.

Limit of Detection (LOD) at Menlo Park

Using the approach to calculate the LOD described in [49], the ALT-CF3 algorithm resulted in an estimated LOD of 1.10 µg/m3, with 73,458 (16.6%) PM2.5 ALT-CF3 values below the LOD. For the Plantower CF1 algorithm, the LOD was almost four times higher at 4.15 µg/m3 and nearly half (48.6%) of all values fell below the LOD.

3.2. Oakmont, Santa Rosa, CA

A table of results for the Plantower algorithms CF1 and CF_ATM for comparison with the ALT-CF3 algorithm is provided (Table 2). The Plantower algorithms overestimated PM2.5 values by 50% (CF1) and 30% (CF_ATM) compared to the ALT-CF3 algorithm, which has been calibrated against multiple regulatory monitors [30]. The Plantower algorithms resulted in a major loss of data (18%) due to the Plantower practice of assigning values of zero to results below an arbitrary cutoff. (The ALT-CF3 algorithm resulted in no zero values.) The mean precision of 22% for both Plantower algorithms is nearly three times the mean of 8% shown by the ALT-CF3 algorithm.
Quality assurance of the data was performed by setting a limit of 0.2 (20%) on the precision of both the indoor and outdoor measurements. This resulted in ~9% of the ALT-CF3 data and 22% of the CF1/CF_ATM data being deleted. In addition, several experiments resulting in elevated PM2.5 for periods of several hours in one room of the house (which included at least one of the indoor monitors) were removed from consideration. This resulted in a further removal of ~0.6% of the data. The final database consisted of 804 days with matched indoor and outdoor daily means.
At the Oakmont site, monitors 1 and 4 were always indoors, monitor 2 was mostly indoors, and monitor 3 was outdoors except for one month. The total number of zeros reported by the Plantower CF1 (and CF_ATM) algorithm for PM2.5 was >100,000 (17–22%) for the two indoor monitors and less than about 40,000 (6–7%) for monitor 3, which was mostly outdoors (Table 3). This indicates how the lower concentrations found indoors lead to increased numbers of zeros reported by the Plantower algorithms compared to outdoors.

Limit of Detection (LOD) for Oakmont

The LOD was calculated for each monitor location for both the Alt-CF3 and Plantower CF1 algorithms (Table 4). For the ALT-CF3 algorithm, the LOD ranged from 0.60 to 1.32 µg/m3 and was always exceeded by a majority of the observations. For the Plantower CF1 algorithm, the LOD ranged from 2.9 to 9.9 µg/m3, and fewer than half the observations exceeded the LOD except for a one-month period when monitor 3 was moved indoors.
The regression of the indoor on outdoor PM2.5 using the ALT-CF3 algorithm is provided (Figure 3). However, there are 120 of 804 (15%) values in the Forbidden Zone, suggesting that the estimate of the infiltration factor is not robust.
Analysis of the monthly regressions showed a clear pattern of low infiltration factors (mean 0.11, SE 0.02) for the cool wet months of December through May (227 days), and high infiltration factors (mean 0.37, SE 0.04) for the warm dry months of June through November (285 days) (Figure 4). The 6 months (June-November) with high infiltration factors and 6 months (December through May) with low infiltration factors match up very well both with temperature and rainfall. The 6 months with the highest average temperatures in Santa Rosa (59–66 °F) are May–October, while the 6 months with the lowest temperatures (46–55 °F) occur in November–April. Rainfall is also limited to the November–April time frame, with the six months from May to October seldom experiencing rain. Doors and windows are more likely to be open when the weather is warm and dry, resulting in a higher infiltration factor in the warm months. The I/O values appear to be a sine wave with a period of one year, showing peaks in the summer months and troughs in the winter months.
As a result, the data were split into two 6-month periods. The 6-month period from December to May had few points falling into the Forbidden Zone (Figure 5). Therefore the estimated infiltration factor of 0.1355 is accepted, resulting in an average contribution of 1.54 µg/m3 made by indoor-generated particles during this period.
During the remaining 6 months of June–November, once again few points fall into the Forbidden Zone (Figure 6). The estimated infiltration factor was 0.340, more than twice as high as the winter value of 0.1355, and the estimated contribution of indoor-generated particles was 2.09 µg/m3.

3.3. Redwood City, CA

A site in Redwood City, CA operated two PurpleAir PA-II monitors indoors and one PurpleAir PA-II monitor outdoors. The 2 min average indoor data for the period from 1 May 2021 until 16 October 2022 are provided for both the Plantower CF1 and ALT-CF3 algorithms in Table 5. The Plantower CF1 algorithm overestimated PM2.5 by about 50% (6.2 vs. 4.1 µg/m3). The mean precision of 17–21% in the CF1 algorithm was about twice the 9–10% observed for the ALT-CF3 algorithm. Applying an upper bound of 20% to the precision resulted in about 15% of the PM2.5 CF3 data being removed from the analysis, compared to 40% of the Plantower CF1 data.
The extreme loss of data from the use of the Plantower CF1 algorithm is largely due to the practice of setting all PM2.5 values falling below a cutoff to zero. The number of zeros reported by the Plantower CF1 algorithm ranged from 49,013 to 72,442 (7–11% of all data). No zeros are ever reported by the ALT-CF3 algorithm, since particles are always present in the 0.3–0.5 µm size category.
Daily means were calculated for all days with the requirement that at least half of the 720 possible measurements per day were available at an acceptable precision (abs (A − B)/(A + B)) of better than 20%. The final database consisted of 411 days with matched indoor and outdoor daily means.
The RCS approach for the entire database results in 120 (29%) values in the Forbidden Zone, indicating that the infiltration factor varies too much to be estimated using the entire database (Figure 7).
The I/O ratio is shown for the entire period (Figure 8). It suggests a sine wave with a period of one year, with peaks occurring in the summer months and troughs in the winter months. A period of low infiltration factors appears between about November and March.
The RCS regression on the November–March period results in only 15 points falling into the Forbidden Zone, and 12 of these seem to “hug” the line (Figure 9). Therefore the estimated infiltration factor of 0.2329 and mean contribution of 0.556 µg/m3 made by indoor-generated particles during this period appears robust.
An effort was made to estimate the infiltration factor for the summer months (June–July–August). However, the effort failed, suggesting too much variability of the infiltration factor. Two of these months were strongly affected by wildfires.

Limit of Detection (LOD)

For monitor 1 indoors, the LOD for the ALT-CF3 algorithm was about 1.4 µg/m3, corresponding to about 97% of all observations exceeding the LOD. For the same monitor, the Plantower CF1 algorithm resulted in an estimated LOD of 4.75 µg/m3, with only 42% of observations exceeding the LOD.

3.4. Alexander Avenue

The I/O ratio is shown for the entire period between 5 December 2017 and 7 September 2022 (Figure S2). The final database consisted of 1625 days with matched indoor and outdoor daily means. Unlike our previous findings with homes in Santa Rosa and Redwood City, the I/O ratio shows little or no seasonal variation. There is, however, an apparent sudden switch to less variable values occurring near January 2020.
The RCS approach to estimate the infiltration factor for the entire database resulted in 516 of 1625 (32%) values in the Forbidden Zone (Figure 10).
Based on the apparently smaller variability of the I/O ratios in the latter part of the period from about 1 January 2020 to 7 September 2022 (938 days), the RCS approach is applied to that period (Figure 11). In this period, 75 (<8%) of the observations fall into the Forbidden Zone. This is somewhat marginal. However, if applicable, it would suggest an infiltration factor of 0.08 and an average contribution of about 0.51 µg/m3 from indoor-generated particles. The contribution of particles of ambient origin was 0.44 µg/m3, so the indoor-generated fraction of the total indoor potential exposure was about 54%.
An effort was made to estimate the infiltration factor for the period from 2017 through 2019. However, this effort was unsuccessful.
For this location, there appears to have been a period before 2020 with a higher infiltration factor and a period from 2020 to the present with a lower infiltration factor. Only the latter period allowed the finding of a likely stable infiltration factor and therefore an estimate of the relative importance of indoor-generated and outdoor-penetrated particles.

3.5. Bennett Valley

The matched Bennett Valley outdoor and Oakmont indoor measurements numbered 821 days However, an unusually high number (280) of the Bennett Valley measurements showed precision poorer than 20%. Only 21 of the 821 corresponding Oakmont measurements had precision poorer than 20%. The final database consisted of 522 days with matched indoor and outdoor daily means and precision better than 20%.
The I/O ratio is provided for the entire period (Figure S3). There are two strong troughs in winter 2021 and 2022, with peaks in the summer of 2020 and 2021.
The RCS approach to estimate the infiltration factor for the entire database results in 172 of 522 (33%) values in the Forbidden Zone, indicating that the infiltration factor varies too much to be estimated using the entire database (Figure 12).
The variation of the I/O ratio suggests that a period of low infiltration factors might occur in the winter and perhaps the spring. Indeed, when the RCS regression is performed on the winter months (December through February) and the spring months (April–June), few points fall into the Forbidden Zone (Figure 13). Therefore the estimated infiltration factor of 0.2195 appears robust. An average contribution of 1.1 µg/m3 is made by indoor-generated particles.
The corresponding regression on the summer–fall months (August–November) again produced few observations violating the Forbidden Zone, with an estimated infiltration factor of 0.3637 and an estimated contribution of indoor-generated particles of 2.10 µg/m3 (Figure 14).
Plotting the indoor-outdoor ratio (I/O) vs. time was useful in showing the seasonal change in the infiltration factor. Two periods (winter–spring and summer–fall) were found to have a stable infiltration factor, allowing good estimates of both the infiltration factor and the relative contribution of indoor-generated and outdoor-penetrated particles to total indoor potential exposure. The good results for sites 4 km apart are notable, suggesting that outdoor sites may often have little spatial variation. This opens up the possibility of using distant outdoor sites (e.g., those with high-quality Federal Reference Method (FRM) PM2.5 measurements) to calculate infiltration factors at many indoor sites.

3.6. Outer Sunset (San Francisco)

Data from 20 October 2019 to 13 August 2021 (409 days) were examined (Figure 15). This site in San Francisco had very few points falling in the Forbidden Zone. The intercept of 10.3 µg/m3 is an unusually strong estimate of the dominance of the (unknown) indoor source(s).

4. Discussion

The LODs for five of the six sites are provided for both the ALT-CF3 and CF1 algorithms in Table 6. The mean CF1 LOD of 4.24 µg/m3 was four times larger than the ALT-CF3 LOD of 1.04 µg/m3. Moreover, the lowest of 11 values found for the CF1 algorithm was 2.69 µg/m3, while the highest value for the ALT-CF3 algorithm was 1.83 µg/m3, indicating no overlap between the two algorithms.
Many investigators have estimated LODs for a number of low-cost monitors. Many of these are chamber studies, in which they can compare the low-cost monitors to a reference instrument at a very low concentration (e.g., <1 µg/m3). Chamber studies have shown low LODs of <1 µg/m3 to about 2 µg/m3 [50,51]. A laboratory investigation used the method in [49] to calculate the LOD for five low-cost monitors [51]. All monitors were below 1 µg/m3. The PurpleAir monitor had the lowest LOD (near zero!).
Typically, chamber studies under careful control of the particle composition, and perhaps size, temperature, humidity, and a nearby reference instrument, show lower LODs than field investigations. For example, a field and laboratory study of the PMS 1003 sensor found that it ranged from 1 to 3.22 µg/m3 under laboratory conditions but increased to 10.5 µg/m3 under ambient conditions [52].
Our method for determining the LOD under field conditions has been adopted by several investigators [51,53]. One study included 58 Speck monitors during two outdoor and one indoor 3-day campaigns and found a LOD for 1 min data of 10 µg/m3, or 9 µg/m3 for hourly average data [53]. In 2016–2017, a field study of two Plantower MS 1003 sensors took place over 320 days, with the addition of a 5003 sensor in 2017 [54]. In 2017, the PMS LODs ranged from 2.62–11.5 µg/m3. This range is very similar to our range of 2.69–9.9 µg/m3 using the Plantower algorithms.
It may not be well understood how low typical exposures to PM2.5 are. Comparing the mean LOD of 4.2 for the CF1 (or CF_ATM) algorithm to actual measured levels of PM2.5 in a tristate area over the last 4.7 years, it is well above the median value for indoor hourly average PM2.5 concentrations of 3.5 µg/m3 [55]. This means that the use of the Plantower algorithm would result in the conclusion that well over half of the indoor daily average PM2.5 concentrations experienced by persons in ~9000 residences over 4.7 years would not be significantly different from zero. Use of the ALT-CF3 LOD of 1 µg/m3 results in only about 15% of the measured population exposures falling below that figure. As mentioned in the Materials and Methods section, we found that our original method for determining the LOD, which employed inspecting the µ/σ ratios in batches of 100 contiguous concentrations, failed for some of the datasets. By changing to batches of 1000, we reduced the noise sufficiently to achieve a reasonably smooth function (Figure 16).
A summary of the estimated preliminary and final infiltration estimates, along with the calculated outdoor-infiltrated and indoor-generated concentrations for all six cases is offered (Table 7).
In Table 7, the values shown in red represent the four cases in which the Forbidden Zone was violated by numerous observations. These results have been discarded and replaced by the values shown in the bottom eight rows for subsets of the data with many fewer violations of the Forbidden Zone.
In two cases, (Menlo Park and Outer Sunset), the number of observations violating the Forbidden Zone were near zero, and thus provided reassurance that the entire data set (last two columns, top four rows) could be used to determine robust estimates for the infiltration factor and also for calculating the contributions to total indoor PM2.5 concentration of indoor-generated and outdoor-penetrated particles. In these two cases, the indoor-generated particles greatly outweighed (80–85%) particles of ambient origin. In the remaining four cases, the regression on the full dataset failed to identify a stable infiltration factor (red typeface, first four columns, top four rows). For two of these four cases (Oakmont and Bennett Valley), the use of the I/O ratio identified two 6-month periods (subsets 1 and 2, first two columns, bottom eight rows) that could be used to calculate robust estimates of the infiltration factor for each period. In the remaining two cases (Redwood City and Alexander Avenue), it was possible to identify one subset with a stable infiltration factor (columns 3–4, rows 5–8), although the remaining subset could not be quantitatively analyzed.
The importance of determining the correct infiltration factors can be seen by considering the first column in Table 7 for Oakmont. Using the entire dataset (with the many violations of the Forbidden Zone), the contribution of indoor-generated particles was 34%, indicating that the outdoor-infiltrated particles (66%) were about twice as important. However, using the two subsets 1 and 2 with good estimates of the infiltration factor shows that the actual contribution of the indoor-generated particles was 48% in one case and 80% in the other, an average share of 64% or about twice as important as the particles of ambient origin (36%).
This example leads to the following general conclusion: When a regression of indoor on outdoor fine particles leads to multiple violations of the Forbidden Zone, the contribution of indoor-generated particles to the total indoor concentration may be underestimated. This is illustrated using the example of the Oakmont site. By coloring the winter–spring (Dec–May) and summer–fall (June–November) measurements differently (combining Figure 3, Figure 5 and Figure 6), the violations of the Forbidden Zone for all data combined (points below the upper boundary given by the equation y = 0.3437x) can be seen to consist mostly of the winter–spring measurements (blue) (Figure 17). When these data are isolated, the resulting slope of 0.1355 is almost a factor of three below the slope of the regression on all datapoints, meaning the influence of particles of ambient origin is reduced by the same factor. Or, in other words, the influence of indoor-generated particles on total indoor exposure is increased by the same factor.
One recent study has calculated long-term exposures for more than 3000 PurpleAir indoor sites in three states over a 4.7-year period using RCS regressions on about 10,000 outdoor sites and found that indoor-generated particles make about the same contribution to total indoor exposures as particles of ambient origin [55]. Presuming that some of these regressions resulted in violations of the Forbidden Zone, the conclusion that indoor-generated particles constitute about half of all potential indoor exposure may be an underestimate of the actual contribution.
The Alt-CF3 algorithm outperformed the Plantower CF1 algorithm in every aspect considered. Precision was better, the limit of detection was lower, the number of values > LOD was higher, and no zero values were recorded compared to hundreds of thousands of zeros for the Plantower CF1 algorithm. Although the Plantower CF_ATM algorithm was not directly considered in all analyses, it is identical to the Plantower CF_1 algorithm for all values less than ~28 µg/m3, and since these constitute the majority of observations, and include all the zeros reported by Plantower CF_1, the findings regarding precision, LOD, and number of zeros for the CF_1 algorithm would be equally applicable to CF_ATM. Since all the alternative algorithms (or “conversion factors”) other than ALT-CF3 offered by PurpleAir depend on one or another Plantower algorithm, they are all equally vulnerable to the Plantower shortcomings.
In Oakmont, Santa Rosa, a PurpleAir PA-II monitor measured indoor concentrations of PM2.5 continuously from 1/10/19 to 8/21/22, collecting 988,825 eighty-second or two-minute averages. The data were analyzed using five conversion factors made available by PurpleAir on their map page or their API link. The conversion factors include the CF1 algorithm provided by Plantower, the ALT-CF3 algorithm (also described as “PM2.5 alt” on the API site) [30,46], the EPA model [44], the LRAPA model used in Lane County, Oregon, (https://www.lrapa.org/, accessed on 1 January 2023), and the Australian woodsmoke model [56]. A sixth conversion factor is not presently available from PurpleAir [45]. Five of the six algorithms agree well at high PM2.5 concentrations (Figure 18).
Only the CF1 algorithm is out of step, reading about 90% higher than the other five models. At low concentrations, however, all algorithms except the ALT-CF3 algorithm are unable to chart the actual concentrations due to the CF1 assignment of a value of zero to these concentrations. The number of zeros reported by the CF1 algorithm was 205,682 for sensor A and 204,996 for sensor B, or about 21% of all data being lost by using the Plantower algorithms. Since all the models shown other than the ALT-CF3 model depend on the CF1 or CF_ATM algorithm, they are all affected by the Plantower arbitrary assignment of values to zero. Three models include an intercept that provides nonzero estimates for all cases, but these estimates do not seem to be based on real data. The ALT-CF3 model, however, suggests that these lower 20% of concentrations are in fact measurable and represent a reasonable continuation of the roughly log-normal distribution shown for the other 80% of the data.

5. Conclusions

Epidemiological studies without indoor measurements are unable to arrive at estimates of indoor-generated particles, yet some of these particles can have health effects. The advent of low-cost particle sensors can improve our knowledge of indoor potential exposure to PM2.5. These sensors make it possible for the first time to estimate long-term (months to years) PM2.5 concentrations from indoor sources. The present study has shown how to use the Forbidden Zone to evaluate the quality of the estimate of the infiltration factor. In particular, the use of the simple I/O ratio can often suggest subsets of the data to analyze with a chance of identifying periods with a stable infiltration factor, thus providing a more accurate determination of both indoor-generated and outdoor-penetrated PM2.5. Finally, a recently developed algorithm for calculating PM2.5 from PurpleAir measurements was found to be superior to the proprietary algorithms offered by the manufacturer of the sensors used in PurpleAir monitors. The use of this algorithm reduced the limit of detection by a factor of four compared to the LOD as calculated using the proprietary Plantower algorithms.
Practical Implications
  • A simple method is developed to determine whether PM2.5 infiltration factors found by regression are reliably constant. If not, the method can help identify subsets of the data that may have constant infiltration factors.
  • The method is illustrated by using long-term (hundreds of days) indoor and outdoor PM2.5 measurements made by PurpleAir low-cost particle monitors.
  • An algorithm for calculating PM2.5 is used that avoids entirely the proprietary algorithms of the manufacturer of the Plantower sensors used in PurpleAir monitors. This algorithm (ALT—CF3) is shown to have reduced bias, increased precision, and much lower limits of detection (LOD).
  • An improved method of determining the LOD of any continuous particle monitor using only real-life (field) data is presented.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/s23031160/s1, (References [30,44,45,46,47,48,49,55,56] are cited in the Supplementary Materials).

Author Contributions

Conceptualization, L.W. and W.O.; Methodology, L.W. and W.O.; Software, L.W. and W.O.; Validation, L.W. and W.O.; Formal Analysis, L.W. and W.O.; Investigation L.W. and W.O.; Resources, L.W. and W.O.; Data Curation, L.W. and W.O.; Writing—Original Draft Preparation, L.W.; Writing—Review and Editing, L.W. and W.O.; Visualization, L.W. and W.O.; Supervision, L.W. and W.O.; Project Administration, L.W. and W.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All raw data publicly available from PurpleAir websites (https://www2.purpleair.com/, accessed on 1 January 2023), (https://api.purpleair.com/, accessed on 1 January 2023). All RCS regression analyses are available on request from the corresponding author.

Acknowledgments

We acknowledge the contribution to science made by the PurpleAir organization in making all public data available on the Internet.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Murray, C.J.L. The Global Burden of Disease Study at 30 years. Nat. Med. 2022, 28, 2019–2026. [Google Scholar] [CrossRef] [PubMed]
  2. WHO. 2022. Available online: https://www.who.int/news-room/fact-sheets/detail/household-air-pollution-and-health (accessed on 1 January 2023).
  3. Britannica. 2022. Available online: https://www.britannica.com/event/Great-Smog-of-London (accessed on 1 January 2023).
  4. Klepeis, N.E.; Nelson, W.C.; Ott, W.R.; Robinson, J.; Tsang, A.M.; Switzer, P.; Behar, J.V.; Hern, S.; Engelmann, W. The National Human Activity Pattern Survey (NHAPS): A resource for assessing exposure to environmental pollutants. J. Expos. Anal. Environ. Epidemiol. 2001, 11, 231–252. [Google Scholar] [CrossRef] [Green Version]
  5. Koutrakis, P.; Briggs, S.L.K.; Leaderer, B.P. Source apportionment of indoor aerosols in Suffolk and Onondaga counties, New York. Environ. Sci. Technol. 1992, 26, 521–527. [Google Scholar] [CrossRef]
  6. Özkaynak, H.; Xue, J.; Spengler, J.D.; Wallace, L.A.; Pellizzari, E.D.; Jenkins, P. Personal Exposure to Airborne Particles and Metals: Results from the Particle TEAM Study in Riverside, CA. J. Expos. Anal. Environ. Epidemiol. 1996, 6, 57–78. [Google Scholar]
  7. Wheeler, A.J.; Wallace, L.A.; Kearney, J.; Van Ryswyk, K.; You, H.; Kulka, R.; Brook, J.R.; Xu, X. Personal, indoor, and outdoor concentrations of fine and ultrafine particles using continuous monitors in multiple residences. Aerosol. Sci. Technol. 2011, 45, 1078–1089. [Google Scholar] [CrossRef] [Green Version]
  8. Allen, R.; Larson, T.; Sheppard, L.; Wallace, L.; Liu, L.-J.S. Use of real-time light scattering data to estimate the contribution of infiltrated and indoor-generated particles to indoor air. Environ. Sci. Technol. 2003, 37, 3484–3492. [Google Scholar] [CrossRef] [PubMed]
  9. Allen, R.; Larson, T.; Sheppard, L.; Wallace, L.; Liu, L.-J.S. Estimated hourly personal exposures to ambient and non-ambient particulate matter among sensitive populations in Seattle, Washington. J. Air Waste Manag. Assoc. 2004, 54, 1197–1211. [Google Scholar] [CrossRef] [Green Version]
  10. Liu, L.-J.S.; Box, M.; Kalman, D.; Kaufman, J.; Koenig, J.; Larson, T.; Lumley, T.; Sheppard, L.; Wallace, L.A. Exposure assessment of particulate matter for susceptible populations in Seattle. Environ. Health Perspect 2003, 111, 909–918. [Google Scholar] [CrossRef] [Green Version]
  11. Wallace, L.; Williams, R.; Suggs, J. Estimating Contributions of Outdoor Fine Particles to Indoor Concentrations and Personal Exposures: Effects of Household Characteristics and Personal Activities. 2006; US EPA Report 600/R-06/023. Available online: https://nepis.epa.gov/Exe/ZyPDF.cgi/9100CCPM.PDF?Dockey=9100CCPM.PDF (accessed on 1 January 2023).
  12. Diapouli, E.; Chaloulakou, A.; Koutrakis, P. Estimating the concentration of indoor particles of outdoor origin: A review. J. Air Waste Manag. Assoc. 2013, 63, 1113–1129. [Google Scholar] [CrossRef]
  13. Wang, Z.; Delp, W.W.; Singer, B.C. Performance of low-cost indoor air quality monitors for PM2.5 and PM10 from residential sources. Build Environ. 2020, 174, 106654. [Google Scholar] [CrossRef]
  14. Singer, B.C.; Delp, W.W. Response of consumer and research grade indoor air quality monitors to residential sources of fine particles. Indoor Air 2018, 28, 624–639. [Google Scholar] [CrossRef] [PubMed]
  15. Dockery, D.W.; Spengler, J.D. Indoor-outdoor relationships of respirable sulfates and particles. Atmos. Environ. 1981, 15, 335–343. [Google Scholar] [CrossRef]
  16. Sarnat, J.A.; Long, C.M.; Koutrakis, P.; Coull, B.A.; Schwartz, J.; Suh, H.H. Using sulfur as a tracer of outdoor fine particulate matter. Environ. Sci. Technol. 2002, 36, 5305–5314. [Google Scholar] [CrossRef] [PubMed]
  17. Wallace, L.A.; Williams, R.; Rea, A.; Croghan, C. Continuous weeklong measurements of personal exposures and indoor concentrations of fine particles for 37 health-impaired North Carolina residents for up to four seasons. Atmos. Environ. 2006, 40, 399–414. [Google Scholar] [CrossRef]
  18. Wallace, L.; Williams, R. Use of personal–indoor–outdoor sulfur concentrations to estimate the infiltration factor and outdoor exposure factor for individual homes and persons. Environ. Sci. Technol. 2005, 39, 1707–1714. [Google Scholar] [CrossRef]
  19. Williams, R.; Suggs, J.; Rea, A.; Sheldon, L.; Rodes, C.; Thornburg, J. The Research Triangle Park particulate matter panel study: Modeling ambient source contribution to personal and residential PM mass concentrations. Atmos. Environ. 2003, 37, 5365–5378. [Google Scholar] [CrossRef]
  20. Kearney, J.; Wallace, L.; MacNeill, M.; Xu, X.; Van Ryswyk, K.; You, H.; Kulka, R.; Wheeler, A.J. Residential indoor and outdoor ultrafine particles in Windsor, ON. Atmos. Environ. 2011, 45, 7583–7593. [Google Scholar] [CrossRef]
  21. MacNeill, M.; Wallace, L.; Kearney, J.; Allen, R.W.; Van Ryswyk, K.; Judek, S.; Xu, X.; Wheeler, A.J. Factors Influencing Variability in the Infiltration of PM2.5 Mass and its Components. Atmospheric. Environ. 2012, 61, 518–532. [Google Scholar] [CrossRef]
  22. MacNeill, M.; Kearney, J.; Wallace, L.; Gibson, M.; Heroux, M.E.; Kuchta, J.; Guernsey, J.R.; Wheeler, A.J. Quantifying the contribution of ambient and indoor-generated fine particles to indoor air in residential environments. Indoor Air 2014, 24, 362–375. [Google Scholar] [CrossRef]
  23. Kearney, J.; Wallace, L.; MacNeill, M.; Heroux, M.-E.; Kindzierski, W.; Wheeler, A. Residential infiltration of fine and ultrafine particles in Edmonton. Atmos. Environ. 2014, 94, 793–805. [Google Scholar] [CrossRef]
  24. Switzer, P.; Ott, W. Derivation of an indoor air averaging time model from the mass balance equation for the case of independent source inputs and fixed air exchange rates. J. Expos. Anal. Environ. Epidemiol. 1992, 2, 113–135. [Google Scholar]
  25. Allen, R.; Wallace, L.A.; Larson, T.; Sheppard, L.; Liu, L.-J.S. Evaluation of the recursive model approach for estimating particulate matter infiltration efficiencies using continuous light scattering data. J. Expos. Sci. Environ. Epidemiol. 2007, 17, 468–477. [Google Scholar] [CrossRef]
  26. Thatcher, T.L.; Layton, D.W. Deposition, resuspension, and penetration of particles within a residence. Atmos. Environ. 1995, 29, 1487–1497. [Google Scholar] [CrossRef]
  27. Ferro, A.R.; Kopperud, R.J.; Hildemann, L.M. Source strengths for indoor human activities that resuspend particulate matter. Environ. Sci. Technol. 2004, 38, 1759–1764. [Google Scholar] [CrossRef] [PubMed]
  28. Hänninen, O.O.; Lebret, E.; Ilacqua, V.; Katsouyanni, K.; Kunzli, N.; Sram, R.J.; Jantunen, M. Infiltration of ambient PM2.5 and levels of indoor generated non-ETS PM2.5 in residences of four European cities. Atmos. Environ. 2004, 38, 6411–6423. [Google Scholar] [CrossRef]
  29. Kopperud, R.J.; Ferro, A.R.; Hildemann, L.M. Outdoor versus indoor contributions to indoor particulate matter (PM) determined by mass balance methods. J. Air Waste Manag. Assoc. 2004, 54, 1188–1196. [Google Scholar] [CrossRef]
  30. Bi, J.; Wallace, L.; Sarnat, J.A.; Liu, Y. Characterizing outdoor infiltration and indoor contribution of PM2.5 with citizen-based low-cost monitoring data. Environ. Pollut. 2021, 276, 116793. [Google Scholar] [CrossRef]
  31. Ott, W.; Wallace, L.; Mage, D. Predicting particulate (PM10) personal exposure distributions using a random component superposition statistical model. J. Air Waste Manag. Assoc. 2000, 50, 1390–1406. [Google Scholar] [CrossRef] [Green Version]
  32. Repace, J.L.; Lowrey, A.H. Indoor air pollution, tobacco smoke, and public health. Science 1980, 208, 464. [Google Scholar] [CrossRef]
  33. Ott, W.R.; Wallace, L.A.; Cheng, K.-C.; Hildemann, L.M. Measuring PM2.5 concentrations from secondhand tobacco vs. marijuana smoke in 9 rooms of a detached 2-story house. Sci. Total Environ. 2022, 852, 158244. [Google Scholar] [CrossRef]
  34. Ott, W.R.; Zhao, T.; Cheng, K.-C.; Wallace, L.A.; Hildemann, L.M. Measuring indoor fine particle concentrations, emission rates, and decay rates from cannabis use in a residence. Atmos. Environ. X 2021, 10, 100106. [Google Scholar] [CrossRef]
  35. Wallace, L.; Ott, W.; Zhao, T.; Cheng, K.-C.; Hildemann, L. Secondhand exposure from vaping marijuana: Concentrations, emissions, and exposures determined using both research-grade and low-cost monitors. Atmos. Environ. X 2020, 10, 100093. [Google Scholar] [CrossRef]
  36. Zhao, T.; Cheng, K.-C.; Ott, W.R.; Wallace, L.A.; Hildemann, L.M. Characteristics of secondhand cannabis smoke from common smoking methods: Calibration factor, emission rate, and particle removal rate. Atmos. Environ. 2020, 242, 117731. [Google Scholar] [CrossRef]
  37. Wallace, L.; Jeong, S.-G.; Rim, D. Dynamic behavior of indoor ultrafine particles (2.3–64 nm) due to burning candles in a residence. Indoor Air 2019, 29, 1018–1027. [Google Scholar] [CrossRef]
  38. Wallace, L.A.; Emmerich, S.J.; Howard-Reed, C. Source strengths of ultrafine and fine particles due to cooking with a gas stove. Environ. Sci. Tech. 2004, 38, 2304–2311. [Google Scholar] [CrossRef]
  39. Wallace, L.A.; Emmerich, S.J.; Howard-Reed, C. Effect of central fans and in-duct filters on deposition rates of ultrafine and fine particles in an occupied townhouse. Atmos. Environ. 2004, 38, 405–413. [Google Scholar] [CrossRef]
  40. Howard-Reed, C.; Wallace, L.A.; Emmerich, S.J. Effect of ventilation systems and air filters on decay rates of particles produced by indoor sources in an occupied townhouse. Atmos. Environ. 2003, 37, 5295–5306. [Google Scholar] [CrossRef]
  41. Wallace, L.A.; Howard-Reed, C.H.; Emmerich, S.J. Continuous measurements of air change rates in an occupied house for one year: The effect of temperature, wind, fans, and windows. Expos. Anal. Environ. Epidemiol. 2002, 12, 296–306. [Google Scholar] [CrossRef] [Green Version]
  42. Howard-Reed, C.H.; Wallace, L.A.; Ott, W.R. The effect of opening windows on air change rates in two homes. J. Air Waste Manag. Assoc. 2002, 52, 147–159. [Google Scholar] [CrossRef]
  43. AQ-SPEC 2016. Available online: http://www.aqmd.gov/docs/default-source/aq-spec/field-evaluations/purpleair—field-evaluation.pdf (accessed on 19 December 2020).
  44. Barkjohn, K.; Gantt, B.; Clements, A.L. Development and application of a United States wide correction for PM2.5 data collected with the PurpleAir sensor. Atmos. Meas. Tech. 2020, 4, 105194. [Google Scholar] [CrossRef]
  45. Liang, Y.; Sengupta, D.; Campmier, M.J.; Lunderberg, D.M.; Apte, J.S.; Goldstein, A. Wildfire smoke impacts on indoor air quality assessed using crowdsourced data in California. Proc. Natl. Acad. Sci. USA 2021, 118, e2106478118. [Google Scholar] [CrossRef] [PubMed]
  46. Wallace, L.; Bi, J.; Ott, W.R.; Sarnat, J.A.; Liu, Y. Calibration of low-cost PurpleAir outdoor monitors using an improved method of calculating PM2.5. Atmos. Environ. 2021, 256, 118432. [Google Scholar] [CrossRef]
  47. Wallace, L.; Zhao, T.; Klepeis, N.E. Calibration of PurpleAir PA-I and PA-II monitors using daily mean PM2.5 concentrations measured in California, Washington, and Oregon from 2017 to 2021. Sensors 2022, 22, 4741. [Google Scholar] [CrossRef] [PubMed]
  48. Wallace, L. Intercomparison of PurpleAir sensor performance over three years indoors and outdoors at a home: Bias, precision, and limit of detection using an improved algorithm for calculating PM2.5. Sensors 2022, 22, 2755. [Google Scholar] [CrossRef] [PubMed]
  49. Wallace, L.A.; Wheeler, A.; Kearney, J.; Van Ryswyk, K.; You, H.; Kulka, R.; Rasmussen, P.; Brook, J.; Xu, X. Validation of continuous particle monitors for personal, indoor, and outdoor exposures. J. Expos. Sci. Environ. Epidemiol. 2010, 21, 49–64. [Google Scholar] [CrossRef] [Green Version]
  50. Austin, E.; Novosselov, I.; Seto, E.; Yost, M.G. Laboratory evaluation of the Shinyei PPD42NS low-cost particulate matter sensor. PLoS ONE 2017, 10, e0137789. [Google Scholar] [CrossRef]
  51. Bulot, F.M.J.; Russell, H.S.; Rezaei, M.; Johnson, M.S.; Ossont, S.J.J.; Morris, A.K.R.; Cox, S.J. Laboratory comparison of low-cost particulate matter sensors to measure transient events of pollution. Sensors 2020, 20, 2219. [Google Scholar] [CrossRef] [Green Version]
  52. Kelly, K.E.; Whitaker, J.; Petty, A.; Widmer, C.; Dybwad, A.; Sleeth, D.; Martin, R.; Butterfield, A. Ambient and laboratory evaluation of a low-cost particulate matter sensor. Environ. Pollut. 2017, 221, 491–500. [Google Scholar] [CrossRef]
  53. Zikova, N.; Hopke, P.K.; Ferro, A.R. Evaluation of new low-cost particle monitors for PM2.5 concentrations measurements. J. Aerosol Sci. 2017, 105, 24–34. [Google Scholar] [CrossRef]
  54. Sayahi, T.; Butterfield, A.; Kelly, K.E. Long-term field evaluation of the Plantower PMS low-cost particulate matter sensors. Environ. Pollut. 2019, 245, 932–940. [Google Scholar] [CrossRef]
  55. Wallace, L.A.; Zhao, T.; Klepeis, N.R. Indoor contribution to PM2.5 exposure using all PurpleAir sites in Washington, Oregon, and California. Indoor Air 2022, 32, 13105. Available online: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.13105 (accessed on 19 December 2020). [CrossRef] [PubMed]
  56. Robinson, D. Accurate, Low Cost PM2.5 Measurements Demonstrate the Large Spatial Variation in Wood Smoke Pollution in Regional Australia and Improve Modeling and Estimates of Health Costs. Atmosphere 2020, 11, 856. [Google Scholar] [CrossRef]
Figure 1. Regression of indoor on outdoor PM2.5 measured over 343 days for a home. The slope of the regression line (blue) is the infiltration factor Finf. The parallel line (red) is the product of the infiltration factor Finf and the outdoor concentration. This line forms the upper boundary of a “Forbidden Zone”. No observed point should appear in this zone if Finf is constant and applies to all data, because it would indicate a negative indoor-generated concentration.
Figure 1. Regression of indoor on outdoor PM2.5 measured over 343 days for a home. The slope of the regression line (blue) is the infiltration factor Finf. The parallel line (red) is the product of the infiltration factor Finf and the outdoor concentration. This line forms the upper boundary of a “Forbidden Zone”. No observed point should appear in this zone if Finf is constant and applies to all data, because it would indicate a negative indoor-generated concentration.
Sensors 23 01160 g001
Figure 2. Regression of daily outdoor PM2.5 on indoor PM2.5 during the time the Menlo Park residence was occupied (13 November 2019 to 2 July 2020 and 14 February 21 to 4 December 2021).
Figure 2. Regression of daily outdoor PM2.5 on indoor PM2.5 during the time the Menlo Park residence was occupied (13 November 2019 to 2 July 2020 and 14 February 21 to 4 December 2021).
Sensors 23 01160 g002
Figure 3. Random Component Superposition (RCS) regression for Oakmont.
Figure 3. Random Component Superposition (RCS) regression for Oakmont.
Sensors 23 01160 g003
Figure 4. Indoor-outdoor (I/O) ratio of daily means at the Santa Rosa site. Data have been fit (red line) using distance-weighted regression.
Figure 4. Indoor-outdoor (I/O) ratio of daily means at the Santa Rosa site. Data have been fit (red line) using distance-weighted regression.
Sensors 23 01160 g004
Figure 5. RCS regression of indoor on outdoor air for the December through May data in Oakmont.
Figure 5. RCS regression of indoor on outdoor air for the December through May data in Oakmont.
Sensors 23 01160 g005
Figure 6. RCS regression of indoor on outdoor air for the June through November data in Oakmont.
Figure 6. RCS regression of indoor on outdoor air for the June through November data in Oakmont.
Sensors 23 01160 g006
Figure 7. Random Component Superposition (RCS) regression for the Redwood City site. The slope (Finf) of 0.3944 results in many datapoints falling into the Forbidden Zone (below the estimated outdoor-penetrated particles given by the red line).
Figure 7. Random Component Superposition (RCS) regression for the Redwood City site. The slope (Finf) of 0.3944 results in many datapoints falling into the Forbidden Zone (below the estimated outdoor-penetrated particles given by the red line).
Sensors 23 01160 g007
Figure 8. Indoor-outdoor (I/O) ratio of daily means at the Redwood City site.
Figure 8. Indoor-outdoor (I/O) ratio of daily means at the Redwood City site.
Sensors 23 01160 g008
Figure 9. Random Component Superposition (RCS) regression for the November through March data at Redwood City.
Figure 9. Random Component Superposition (RCS) regression for the November through March data at Redwood City.
Sensors 23 01160 g009
Figure 10. Regression of Alexander Avenue indoor PM2.5 on Alexander Avenue outdoor PM2.5.
Figure 10. Regression of Alexander Avenue indoor PM2.5 on Alexander Avenue outdoor PM2.5.
Sensors 23 01160 g010
Figure 11. RCS regression for the period from 2020 to 2022.
Figure 11. RCS regression for the period from 2020 to 2022.
Sensors 23 01160 g011
Figure 12. RCS regression of indoor values at Oakmont on outdoor values at Bennett Valley, about 4 km from Oakmont.
Figure 12. RCS regression of indoor values at Oakmont on outdoor values at Bennett Valley, about 4 km from Oakmont.
Sensors 23 01160 g012
Figure 13. Regression of Oakmont indoor concentrations on Bennett Valley outdoor concentrations for the winter and spring months (December–June (March omitted)).
Figure 13. Regression of Oakmont indoor concentrations on Bennett Valley outdoor concentrations for the winter and spring months (December–June (March omitted)).
Sensors 23 01160 g013
Figure 14. Regression of Oakmont indoor concentrations on Bennett Valley outdoor concentrations for the summer and fall months (August–November).
Figure 14. Regression of Oakmont indoor concentrations on Bennett Valley outdoor concentrations for the summer and fall months (August–November).
Sensors 23 01160 g014
Figure 15. RCS regression of indoor on outdoor data at two nearby sites in the Outer Sunset area of San Francisco.
Figure 15. RCS regression of indoor on outdoor data at two nearby sites in the Outer Sunset area of San Francisco.
Sensors 23 01160 g015
Figure 16. The number of mean/SD ratios per thousand “batches” of ordered contiguous measured concentrations steadily declines with increasing concentration. When this function falls below 50 and does not exceed 50 at any higher concentration, the LOD is considered to be the “last” (largest) concentration with at least 50 ratios <3.
Figure 16. The number of mean/SD ratios per thousand “batches” of ordered contiguous measured concentrations steadily declines with increasing concentration. When this function falls below 50 and does not exceed 50 at any higher concentration, the LOD is considered to be the “last” (largest) concentration with at least 50 ratios <3.
Sensors 23 01160 g016
Figure 17. Regressions of Oakmont indoor PM2.5 measurements on outdoor values. The equation for the combined data is y = 0.3437x + 1.56 µg/m3 (light black dashed line) and the resulting upper boundary of the Forbidden Zone is y = 0.3437x (thick black line passing through the origin). The equation for the summer–fall data (red) is 0.3401x + 1.56. The equation for the winter–spring data (blue) is 0.1335 + 1.54.
Figure 17. Regressions of Oakmont indoor PM2.5 measurements on outdoor values. The equation for the combined data is y = 0.3437x + 1.56 µg/m3 (light black dashed line) and the resulting upper boundary of the Forbidden Zone is y = 0.3437x (thick black line passing through the origin). The equation for the summer–fall data (red) is 0.3401x + 1.56. The equation for the winter–spring data (blue) is 0.1335 + 1.54.
Sensors 23 01160 g017
Figure 18. Comparison of 6 algorithms for determining PM2.5 estimates.
Figure 18. Comparison of 6 algorithms for determining PM2.5 estimates.
Sensors 23 01160 g018
Table 1. PM2.5 (µg/m3) and associated precision at the Menlo Park site using the ALT-CF3 and Plantower CF1 algorithms.
Table 1. PM2.5 (µg/m3) and associated precision at the Menlo Park site using the ALT-CF3 and Plantower CF1 algorithms.
N ObsMeanStd.Err.Lower
Quartile
MedianUpper
Quartile
Max
ALT-CF3
Mean outdoor PM2.5372,6214.49.2 × 10−31.42.75.1311
Indoor PM2.5369,6102.20.0150.651.32.41050
Outdoor-infiltrated372,6210.781.6 × 10−30.250.50.955
Indoor-generated369,6101.780.0150.210.61.51049
Precision outdoor372,6210.067.3 × 10−50.020.10.10.2
Plantower CF1
Mean outdoor PM2.5304,3458.30.0192.55.19.6552
Indoor PM2.5301,6253.30.0310.611.83.81874
Precision outdoor304,3450.079.1 × 10−50.030.10.10.2
Table 2. All PM2.5 data from Monitor 1 at the Oakmont site between 23 July 2019 and 28 September 2022.
Table 2. All PM2.5 data from Monitor 1 at the Oakmont site between 23 July 2019 and 28 September 2022.
Valid NMeanStd. Err.MinLower QuartileMedianUpper QuartileMax
MEAN 1 CF1593,7934.410.0300.1051.13.281678
MEAN 1 CF ATM593,7933.90.0200.1051.13.281118
MEAN 1 ALT-CF3593,7932.970.020.00630.5431.1672.39865
precision 1 CF1488,117 *0.220.000400.040.0980.251
precision 1 CF ATM488,117 *0.220.000400.0390.0980.251
precision 1 ALT-CF3593,6950.080.00013.02 × 10−70.0290.0620.110.87
* Missing 105,676 observations assigned a value of zero by the Plantower CF1 or CF_ATM algorithms.
Table 3. Number of zeros reported by CF1 (and CF_ATM) and percentage of total measurements.
Table 3. Number of zeros reported by CF1 (and CF_ATM) and percentage of total measurements.
Monitor/SensorLocationZerosTotal%
1aIndoors127,195593,80421
1bIndoors129,114593,80422
4aIndoors114,186593,68819
4bIndoors102,616593,68817
2aSome outdoors52,000587,3099
2bSome outdoors121,399587,30921
3aMostly outdoors40,682593,8137
3bMostly outdoors33,836593,8136
Table 4. LODs and percentage of observations exceeding the LOD for Oakmont comparing the ALT-CF3 algorithm to the Plantower CF1 algorithm.
Table 4. LODs and percentage of observations exceeding the LOD for Oakmont comparing the ALT-CF3 algorithm to the Plantower CF1 algorithm.
MonitorLocationValid NALT-CF3 LOD (µg/m3)ALT-CF3 N > LODALT-CF3 (% > LOD)CF1 LOD (µg/m3)CF1 N > LODCF1 (% > LOD)
1Indoors406,1080.99233,900582.9177,90844
2Outdoors253,4540.92203,384809.939,48716
2Indoors146,2290.72110,674763.244,28930
3Outdoors363,7970.6334,973924.4156,85043
3Indoors42,3040.5242,11499.62.932,15076
4Indoors406,0921.32215,872535.379,37120
Table 5. All indoor PM2.5 data (µg/m3) and associated precision for monitors 1 and 2 compared for the ALT-CF3 and Plantower CF1 algorithms for the 19-month period 7/3/19 to 2/6/22.
Table 5. All indoor PM2.5 data (µg/m3) and associated precision for monitors 1 and 2 compared for the ALT-CF3 and Plantower CF1 algorithms for the 19-month period 7/3/19 to 2/6/22.
N obs.MeanStd.Err.Lower QuartileMedianUpper QuartileMax
ALT-CF3
Mean 1662,7734.20.0200.931.83.3735
Mean 2662,6354.00.0170.921.83.4539
precision 1662,7190.099.2 × 10−50.0320.0690.120.98
precision 2662,5610.101.2 × 10−40.0430.0800.130.99
Plantower CF1
Mean 1662,7736.30.0330.742.35.11149
Mean 2662,6336.10.0290.702.35.1883
precision 1626,6670.173.1 × 10−40.0360.0830.181
precision 2617,0160.213.2 × 10−40.0660.120.221
Table 6. Calculations of the LODs (µg/m3) for 5 of 6 sites.
Table 6. Calculations of the LODs (µg/m3) for 5 of 6 sites.
SiteLocationALT-CF3CF1/CF_ATM
Menlo ParkOutdoors1.104.15
OakmontIndoors0.992.90
Outdoors0.929.90
Indoors0.723.20
Outdoors0.604.40
Indoors0.522.90
Indoors1.325.30
Redwood CityIndoors1.404.75
AlexanderOutdoors0.972.69
Outer SunsetOutdoors1.123.27
Outer SunsetIndoors1.833.18
Statistics
Mean1.044.24
SD0.382.06
RSD0.360.49
Table 7. Infiltration factors (Finf) and mean PM2.5 for outdoor-infiltrated (Outinf) and indoor-generated (Ingen) particles, including the percent contribution to total indoor PM2.5 from the indoor-generated particles (Ingen (%)). These variables are shown for the entire datasets and for the subsets (1 and 2) created when the Forbidden Zone was violated by too many observations.
Table 7. Infiltration factors (Finf) and mean PM2.5 for outdoor-infiltrated (Outinf) and indoor-generated (Ingen) particles, including the percent contribution to total indoor PM2.5 from the indoor-generated particles (Ingen (%)). These variables are shown for the entire datasets and for the subsets (1 and 2) created when the Forbidden Zone was violated by too many observations.
Outdoor SiteOakmontBennett ValleyRedwood CityAlexander AveMenlo ParkOuter Sunset
Indoor sitesameOakmontsamesamesamesame
Finf0.340.380.360.170.0850.33
PM2.5 Outinf3.22.283.00.990.312.6
PM2.5 Ingen1.71.32.60.431.810
Ingen (%)343646318580
Finf 10.340.360.230.080NANA
PM2.5 Outinf 12.24.102.90.44NANA
PM2.5 Ingen 12.12.22.30.51NANA
Ingen 1 (%)48354554NANA
Finf 20.140.22NANANANA
PM2.5 Outinf 20.380.75NANANANA
PM2.5 Ingen 21.51.1NANANANA
Ingen 2 (%)8059NANANANA
Outinf = outdoor-infiltrated; Ingen = indoor-generated; 1, 2 = subsets 1, 2; NA = Not Applicable.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wallace, L.; Ott, W. Long-Term Indoor-Outdoor PM2.5 Measurements Using PurpleAir Sensors: An Improved Method of Calculating Indoor-Generated and Outdoor-Infiltrated Contributions to Potential Indoor Exposure. Sensors 2023, 23, 1160. https://doi.org/10.3390/s23031160

AMA Style

Wallace L, Ott W. Long-Term Indoor-Outdoor PM2.5 Measurements Using PurpleAir Sensors: An Improved Method of Calculating Indoor-Generated and Outdoor-Infiltrated Contributions to Potential Indoor Exposure. Sensors. 2023; 23(3):1160. https://doi.org/10.3390/s23031160

Chicago/Turabian Style

Wallace, Lance, and Wayne Ott. 2023. "Long-Term Indoor-Outdoor PM2.5 Measurements Using PurpleAir Sensors: An Improved Method of Calculating Indoor-Generated and Outdoor-Infiltrated Contributions to Potential Indoor Exposure" Sensors 23, no. 3: 1160. https://doi.org/10.3390/s23031160

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop