1. Introduction
Respiratory diseases are a major cause of human morbidity and mortality all over the world. In 2015, lower respiratory diseases were the 3rd highest cause of years of life lost, while COPD was the 10th and asthma the 37th in terms of global burden of disease [
1]. There are major public health consequences associated with asthma as a well-known respiratory disease, including high morbidity and mortality in severe cases for both children and adults [
2]. Around 334 million people worldwide suffer from asthma [
3]. Asthma affects 8.4% of Americans, compared with 4.3% of the worldwide population, and its prevalence is on the rise both domestically and internationally. In minorities and lower socioeconomic groups, the impact is most noticeable [
4].
Globally, chronic respiratory diseases remain a major cause of death and disability. Chronic respiratory diseases affected over 544 million people worldwide in 2017, an increase of 39.8% since 1990 [
5]. The world over, chronic obstructive pulmonary disease (COPD) is a common respiratory disease associated with high morbidity and mortality rates [
6]. A number of factors will likely influence the future development of COPD, including longer life expectancy, occupational and environmental exposures, and the increasing prevalence of asthma. Other factors are expected to continue to influence future trends globally, including air pollution and chronic respiratory infections such as tuberculosis [
7].
Studies show that air pollution can cause serious health consequences. In urban areas, especially in developing nations, air pollution is a major concern due to its harmful effects on respiratory health [
8]. A study examined the association between respiratory emergency department (ED) visits (2011–2015) and biorefinery exposure in residences. A significant increase in respiratory ED visits was reported among residents living within 10 km of biorefineries [
9].The causes of respiratory diseases are multiple. Many such diseases are infectious, while others are due to exposure to air pollution or exposure to toxic chemicals. Rates of smoking, poverty and other personal behavioral patterns within different populations are major factors that influence geographic rate differences. While these disparities are particularly pronounced globally, there are also distinct differences in patterns of respiratory diseases within smaller geographic areas.
We have examined the patterns for emergency room visits for respiratory diseases other than cancer within the counties in New York State (NYS). There are 62 counties in New York, and they vary widely in size and population density. They also differ greatly in rates of air pollution, poverty, and smoking. While it would be far preferable to be able to analyze the patterns of respiratory diseases at a smaller geographic unit, data on total levels of air pollution on such a scale as zip codes or census tracts are not available. We have utilized data from the National Emissions Inventory (NEI), which provide results of road, non-road, point source, and non-point source air pollution. Only point source air levels are available at the zip code level, and point source emissions are less than 5% of total air pollution. Therefore, even though counties are relatively large and there will be diverse sources of air pollution within each county, this is the only geographic level for which complete data is available. We have chosen to study asthma, COPD, acute lower respiratory diseases, and acute upper respiratory infections for which patients commonly go to emergency rooms.
1.1. Asthma
Asthma is a chronic disease caused, aggravated, or exacerbated by various chemical, physical, and biological exposures both indoors and outdoors [
10]. The prevalence of asthma is a consequence of several factors, including genetics, allergies, and environmental exposures. Among environmental risk factors are allergies to dust mites and cockroaches, tobacco, air pollution, microbes, and stress [
11]. According to the hygiene hypothesis, early life excessive cleanliness can lead to an immune system that overreacts to triggers [
12]. There is a large body of evidence showing that various air pollutants are associated with an increased risk of asthma attacks. Even at levels below the current limit values, long-term exposure to air pollution, specifically from fossil fuel combustion, is associated with adult-onset asthma. A study with a mean follow-up period of 16.6 years for 98,326 participants in three cohorts in Denmark and Sweden revealed that 1965 participants had asthma and reported the associations in fully adjusted models with hazard ratios for particulate matter (PM
2.5), nitrogen dioxide (NO
2), and black carbon (BC) [
13]. Another study of 4,154,887 respiratory admissions in Italy between 2006 and 2015 showed that short-term exposure to particulate matter (PM) harms the respiratory system, especially for asthma and COPD [
14].
1.2. Chronic Obstructive Pulmonary Disease (COPD)
COPD has been a major public health problem for decades, and there are numerous studies that show air pollution increases the risk of COPD. Over 3 million people die worldwide every year from COPD [
15]. COPD is a group of severe lung diseases such as chronic bronchitis and emphysema that cause difficulty breathing by preventing air from flowing through the airways. The most common cause of COPD in developed countries is smoking [
16]. It has been suggested that outdoor air pollution exposure is associated with COPD prevalence and incidence. COPD patients experience increased respiratory symptoms when exposed to outdoor air pollutants [
17]. Studies have shown that short-term exposure to air pollution increases the risk of emergency room (ER) visits for lower respiratory diseases in environments with relatively low air pollutants. In one study, the association between ER visits and ambient air pollutants for COPD and acute lower respiratory diseases was tested using a case-crossover design. NO
2, PM
2.5, and sulfur dioxide (SO
2) all showed positive results in males, while ozone (O
3) and SO
2 showed positive results in females [
18]. A study using data from the UK Biobank on 303,887 individuals aged 40–69 years found that ambient air pollution results in reduced lung function and an increased prevalence of COPD. Moreover, PM
2.5, particulate matter (PM
10), and NO
2 concentrations are all associated with a higher prevalence of COPD [
19].
1.3. Acute Lower Respiratory Diseases
In this paper, we studied acute bronchitis and bronchiolitis as acute lower respiratory diseases. Several studies have found that exposure to gaseous air pollution increases the risk of acute lower respiratory diseases. A study on children who visited ER services for bronchiolitis during the period 1997–2001 in Paris suggests that air pollution may contribute to acute severe bronchiolitis. Air pollutants such as PM
10, black smoke, SO
2, and NO
2 were positively associated with hospitalizations and consultations after adjusting for public holidays, holidays, and meteorological variables [
20]. Another study on 29,688 children’s hospital admissions in Hong Kong from 2008 to 2017 revealed that elevated NO
2 concentration was associated with higher hospitalization rates for acute bronchiolitis; the risk of hospitalization for acute bronchiolitis increased statistically significantly with cumulative NO
2 exposure over the range 66.2–119.6 µg/m
3 [
21].
1.4. Acute Upper Respiratory Diseases
This paper examines acute upper respiratory diseases including acute laryngopharyngitis, acute upper respiratory infections of other multiple sites, and acute upper respiratory infections of unspecified sites. Epidemiological studies have shown an association between exposure to air pollution and an increased risk of upper respiratory diseases. A study on ER visits data from three hospitals in Lanzhou, China between January 2014 and December 2018 revealed that an increase in ER visits for upper respiratory tract infections was linked to short-term exposure to air pollution, particularly SO
2 and carbon monoxide (CO). At lag 07, the percentage increase for each 1mg/m
3 increase in CO was 20.8% (95% CI: 11.8, 30.5) [
22]. According to another study on pediatric ER visits in 66 hospitals in Shanghai, China from 2016 to 2018, short-term exposure to air pollutants such as ultrafine particles could increase the risk of ER visits for upper respiratory tract infections among children (1.14, 95% CI: 1.02–1.28) [
23]. Several studies indicate that ambient air pollution increases the risk of upper respiratory tract infections. According to a study conducted at Wuhan University Hospital in Wuhan, China, gaseous pollutants negatively impact health more than solid pollutants. A significant number of clinic visits for upper respiratory tract infections were caused by SO
2 and NO
2, to which women seemed more susceptible than men [
24]. Based on data on hospital visits from 1 October 2010 to 30 September 2012 from the Beijing Medical Claim data, researchers found that a 10 µg/m
3 increase in PM
2.5 concentration was associated with 0.84% (95% confidence interval, 0.05–1.64%) increases in hospital admissions for upper respiratory tract infections at lag 0–3 days, but not with emergency rooms or outpatients [
25].
While respiratory infections are clearly due to infectious agents, the vulnerability to infection varies with factors such as poverty, immune system status, and access to and quality of health care. In this paper, we seek to clarify the relationships between air pollution, smoking, and poverty with respiratory diseases, to gain a better understanding of the importance of the role these factors have in human health.
2. Methods
2.1. Health Data
Hospitals located in and regulated by NYS are required to report all diagnoses, up to 15 per inpatient or ER visit, to the NYS Department of Health (NYS DOH) upon discharge. This diagnostic report system is known as the Statewide Planning and Research Cooperative System (SPARCS), and was established in 1979. The SPARCS data that we have contains all ER visits to state-regulated ERs (all but Veterans Administration or Indian Health Service clinics) from 2010–2018. With the help of R studio software version 2022.07.1 Build 554, the total number of ER visits and primary diagnoses of respiratory diseases for each year was calculated.
The International Classification of Diseases, Ninth Revision, and Clinical Modification (ICD-9-CM) and The International Classification of Diseases, Tenth Revision, and Clinical Modification (ICD-10-CM,) and the SPARCS Data Dictionary were used to find the appropriate code and category for each respiratory disease. Codes are as follows: asthma (ICD9 = 493; ICD10 = J45), COPD (ICD9 = 490, 491, 492, 494, 496; ICD10 = J40, J41, J42, J43, J44, J47), acute lower respiratory infections (acute bronchitis and bronchiolitis) (ICD9 = 466; ICD10 = J20, J21), and acute upper respiratory infections of multiple or unspecified sites (ICD9 = 465; ICD10 = J06).
The number of ER visits for each disease for each year was calculated according to the ICD code. In the next step, this data was categorized based on each county to have the crude number of ER visits for each disease per year at the county level. The Federal Information Processing Standard (FIPS) code was used to categorize data at the county level. In the last step, we calculated the rate of ER visits for each county base on the crude number of ER visits and the population of that county.
2.2. Population Data
The total population in each county of NYS was downloaded from the United States Census Bureau database for 2010 and the estimated population for 2018 was downloaded from NYS DOH. We interpolate the population numbers between 2010 and 2018 using the Equation (1) below.
The rates of ER visits for respiratory diseases were calculated by dividing the crude number of ER visits for each respiratory disease in each county by the population of that county. The total rate of ER visits from 2010–2018 was calculated by adding the rate of ER visits for each respiratory disease in the years 2010–2018. The rate of ER visits for each respiratory disease by county is given by the following Equation (2).
2.3. Exposure Data
The exposure data came from the United States Environmental Protection Agency (US EPA) National Emissions Inventory (NEI), a data set of air emissions and pollutants from point, non-point, road, and non-road sources by county. The FIPS code was used to merge data for each pollutant at the county level from all four sources of pollution. The total pollution was calculated by adding together the data for each county of NYS for 2011, 2014, and 2017 divided by 3 for each chemical—total volatile organic compounds (VOCs), PM2.5, acetaldehyde, formaldehyde, benzene, ethylbenzene, toluene, xylene, SO2, NO2, CO, and lead—from all four different sources of pollution (point, non-point, road, and non-road). Total pollution for lead was only available for point and non-point sources. Land area in square miles for each county was found in the NYS DOH database. Pollution per square mile was calculated by dividing the total pollution for each chemical by the land area in square miles.
Point sources include estimates of emissions from large, fixed, stationary sources such as industrial facilities, electric power plants, airports, as well as smaller industrial, commercial, and non-industrial sources. In addition to point sources, non-point sources include sources that are too small in magnitude to be reported as point sources, such as household heating, commercial combustion, asphalt paving, and commercial and consumer solvent use. Emissions from on-road vehicles running on gasoline, diesel, or other fuels are included in the NEI on-road sources. Light-duty and heavy-duty vehicles operating on roads, highway ramps, and while idling are also considered on-road sources. Fuel sources that use gasoline, diesel, and other fuels are included among NEI non-road sources. There are several types of non-road sources, including construction equipment, lawn, and garden equipment, aircraft ground support equipment, locomotives, and commercial marine vessels.
With the help of ArcGIS software version Desktop 10.8.1, maps were created to show the total air pollution (other than lead) per square mile. The shape files were downloaded from the Topologically Integrated Geographic Encoding and Referencing system (TIGER) that represents the U.S. Census Bureau’s geographic spatial data.
2.4. Poverty Data
Each county’s percentage of poverty was downloaded from the United States Census Bureau database for 2010 to 2018 separately. The data is called the Small Area Income and Poverty Estimates (SAIPE) Program which gives the percentage of people in poverty. The SAIPE program estimates are based on several data sources, such as aggregate tax, Supplemental Nutrition Assistance Program (SNAP) benefits, and poverty incidence data. Some of these data sources are publicly available, while others are not. In SAIPE estimates, poverty incidence is defined similarly to that in the American Community Survey (ACS). Depending on family size and composition, the Census Bureau determines who is poor based on money income thresholds. Poverty is defined when a family’s total income is less than the threshold for a family. In this paper, we calculated the total percentage of poverty by adding the data together for each county in NYS from 2010 to 2018 and dividing by 9, to obtain annual rates.
2.5. Smoking Data
The crude rate of smoking in each county was downloaded from The Behavioral Risk Factor Surveillance System (BRFSS) database for 2016 and 2018. The total rate of smoking was determined by adding the crude rate of smoking in 2016 and 2018 together for each county divided by 2. We defined different statistical models to determine the impact of smoking on the rate of ER visits for respiratory diseases.
2.6. Statistical Analysis
R studio software version 2022.07.1 Build 554 was used to create figures. Correlation analysis was used to assess the relationships between 12 different air pollutants and quantify how closely related any two variables are. A positive correlation occurs when air pollutants change in the same direction. The FIPS code was used to merge health data and pollution data to have the rate of ER visits for respiratory diseases and pollution per square mile at the county level in one dataset.
Regression analysis aims to determine how the rate of ER visits for respiratory diseases is impacted by exposure to 12 air pollutants (total VOCs, PM2.5, acetaldehyde, formaldehyde, benzene, ethylbenzene, toluene, xylene, SO2, NO2, CO, and lead). All these chemicals were aggregated as a separate variable, and regression models were developed to assess the association between air pollution exposure and the rate of ER visits for respiratory diseases.
In order to determine the best fitting model for air pollution adjusted for smoking and poverty, we compared different family regression models. For the purpose of assessing the quality of regression models and choosing the best fitting model, we used a performance package. To measure the performance of a regression model, different statistical regression metrics were calculated, including root mean squared error (RMSE) and residual standard error (RSE), or sigma. The regression models were conducted with p < 0.05 as the threshold for statistical significance. After choosing the best family model to be quasi-Poisson regression, we ran different models in our study to assess the association between air pollution and ER visits for respiratory diseases, and to determine the impact of smoking and poverty on the rate of ER visits for respiratory diseases.
Among unadjusted models, our response variable is the rate of ER visits; the predictor variable in the first group of models is air pollution, in the second group of models it is poverty, and in the third group of models it is smoking.
Among adjusted models, the first group of multiple regression models assesses the association between exposure to air chemicals and the rate of ER visits adjusted for poverty. The second group of multiple regression models examines the association between air pollution and the rate of ER visits adjusted for smoking, and the third group of multiple regression analyses was run to find the relationship between air pollutants and respiratory diseases adjusted for both poverty and smoking.
In the last step, models adjusted for smoking and poverty were compared to unadjusted models using the performance package. Different statistical regression metrics including Nagelkerke R2, RMSE, and RSE (known as sigma), were calculated to compare these different regression models.
4. Discussion
4.1. Respiratory Diseases Are Very Serious Contributors to Global Morbidity and Mortality
In this research, air pollution has been shown to be the greatest risk factor for asthma attacks. A higher level of total air pollution was associated with more asthma ER visits. The results of our research are in line with many studies that confirm the link between exposure to air pollution and the increased rate of asthma. While the results of this current study provide interesting information about the geographic differences in ER visits for respiratory diseases in one US state, it is clear that none of the models is adequate because we have been unable to clearly detect the expected adverse effect of air pollution on any of the diseases studied other than asthma. There are several possible reasons for this. There are likely to be other variables beyond smoking and poverty that influence the results. Age may be one, as residents in many rural communities are older than urban residents and are also much less diverse than in urban areas. We have not controlled for race/ethnicity or sex. County-level data may be just too large a geographic area and the sources of air pollution and the residents too diverse. However, we clearly find the expected associated associations with smoking, especially with COPD, and poverty with all diseases.
4.2. Patterns of Respiratory Diseases in NYS: Role of Urban vs. Rural Areas, and of Population Density
The amount of air pollutants released in NYS is by far the greatest in the New York City counties, which is not surprising because of the greater population density here and the very high level of automobile and truck traffic, especially diesel buses and trucks. There are also clear elevations in the other larger urban areas, from the Buffalo/Niagara Falls area in the west, the upstate urban areas in Rochester, Syracuse, and the city area of Albany, Schenectady, and Troy in the East. In urban and rural settings, pollutants vary in level and type. Several factors can affect air pollution in an urban area, including the layout of the city, the road density, the population density, and the localized population concentration [
26]. The quality of the air improves with a shift from urban to rural areas. Compared with large central metropolitan counties, noncore counties had fewer mean number of PM
2.5 days, decreasing from a total of 11.21 to 0.95. Compared with large central metropolitan counties, nonmetropolitan counties had lower mean PM concentrations, decreasing from 11.15 µg/m
3 to 8.87 µg/m
3 [
27].
The most striking observation of the levels of air pollutants is the great degree to which their concentrations are correlated. As it was shown in
Figure 2b, of the 12 air pollutants studied, most had correlations of 0.99 to 1.00. Only two showed less than a 0.95 correlation, one being lead, which isn’t surprising since there are few sources of lead to the environment now. The other is SO
2, a product of burning high sulfur coal or gas. The tight correlation among the other air pollutants is strong evidence that oil and gas combustion is the common source of formation and release of all of the remaining chemicals, including the other five criteria air pollutants and the various VOCs.
There are two major implications of these results. Because these pollutants are all released together, it is very difficult if not impossible to identify which one or ones are responsible for any particular disease. The second is that the major source is vehicle traffic and fossil fuel combustion. This further indicates how important moving away from fossil fuel combustion is, if we are to both reduce climate change and protect the public from the health hazards resulting from air pollution.
4.3. Role of Poverty
Poverty is well known to be one of the strongest risk factors for ill health. This is a consequence of several different factors. Poor people often live in areas with more contamination, have less access to quality health care, and practice less healthy lifestyles. Our data show a large and consistent increase in all respiratory diseases we have monitored in relation to poverty, including both upper and lower respiratory infections. Our results are consistent with other studies and COPD is more common in rural areas. A study in New Hampshire reported that rural counties had higher rates of COPD exacerbation encounters than non-rural counties when poverty and other sociodemographic factors were taken into account [
28].
While to a certain degree, respiratory infections may be elevated in people living in poverty because of greater exposure to infectious agents, this elevation may primarily reflect access to care. Poor people often do not have health insurance or access to primary preventive health care. When they are ill, they then go to the local emergency room. We have no way to determine the degree to which this is a factor with the present data, but we consider it likely to be a very important factor.
Another important consideration in this study is that poor people are more likely to use emergency rooms for primary care than is the case for more affluent people. There are many people in the US that do not have health insurance, and for these people the ER is their major source of health care. There are two major federal healthcare programs. Medicaid provides access to health care for poor people and people with some disabilities and Medicare provides access to health care for old people. But others must depend on private insurance, which is expensive. The result is that many working poor and otherwise healthy younger people lack any form of health insurance and use emergency rooms when ill. Thus, the population of individuals found in our SPARCS ER dataset does not reflect a general cross-section of the state population.
4.4. Role of Smoking
We had not expected to see such a clear difference in rates of smoking in relation to urban as compared to rural counties. Smoking rates in all of the primary urban counties are not high. This is especially true for NYC and is particularly striking in that smoking rates in the upstate counties are highly correlated with poverty levels, but that is not true in NYC where there is a lot of poverty. Despite high rates of poverty, the smoking rates in NYC and even in the surrounding counties are very low, and are much lower again than in the upstate urban areas.
Our study shows that COPD is highly associated with smoking, and these findings are consistent with other studies. According to Strzelak et al., tobacco smoke causes inflammation, allergies, and other lung diseases by altering immune responses because exposure to tobacco smoke results in an imbalance of antioxidants and oxidants that leads to oxidative stress, inflammation of the mucosa, as well as increased expression of inflammatory cytokines [
29].
4.5. Strengths
This study provides information on outdoor air pollution from all sources, including road, non-road, point source, and non-point source. Another important strength herein is the availability of a very large number of state-wide ER visits, with data from 2010 to 2018. This gives it strong statistical significance. Taking advantage of its geographical targeting, policy and public health leaders in New York can better understand the prevalence and distribution of respiratory diseases.
4.6. Limitations
Other than asthma, we have been unable to document the known effect of outdoor air pollution on respiratory diseases even with the use of multiple statistical models. This must be due to confounders that we have not controlled for or inadequacies of the models we have used. A wide range of respiratory diseases is linked to air pollution, as described in the Introduction. We have no control over indoor air pollution, which is probably more important than outdoor air pollution. We do not have information about people’s occupations and the air pollutants that they might be exposed to in their jobs. The data on ozone is not available in the NEI-EPA data set, and in our analysis of the adverse effect of air pollution, we did not consider ozone. Another limitation is that we find each of the 12 different air pollutants to be so highly correlated at the county level that this prevents us from identifying which specific contaminants are most associated with the different respiratory diseases.
4.7. Suggestions for Future Research
Further study of the effects of air pollutants on respiratory health in other states or countries would be helpful to confirm some of the results of this study. It would be interesting to compare findings in order to have a deeper understanding of the adverse effects of air pollution on human health. The model could be improved by adding other confounders such as age, sex, or race. We recommend this study be conducted in a smaller geographical area, such as zip codes, although there the problem of individual air pollutants not being commonly available in smaller geographic areas would remain. There needs to be further research in order to elucidate the contribution of specific VOCs (especially benzene and formaldehyde) and ozone as causes of respiratory diseases.