Next Article in Journal
Assessing Neighborhood Characteristics and Their Association with Prenatal Maternal Stress, Depressive Symptoms, and Well-Being in Eight Culturally Diverse Cities: A Cross-Sectional Study
Previous Article in Journal
Evaluation of the Reversibility of Cadmium-Induced Testicular Toxicity Following Recovery Alone or with Zinc Supplementation
Previous Article in Special Issue
Leukemia Incidence by Occupation and Industry: A Cohort Study of 2.3 Million Workers from Ontario, Canada
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Geospatial Analysis of the Lung Cancer Burden in Philadelphia, Using Pennsylvania Cancer Registry Data from 2008–2017

by
Russell K. McIntire
1,*,
Katherine Senter
2,
Christine Shusted
3,
Rickisa Yearwood
4,
Julie Barta
3,
Scott W. Keith
5 and
Charnita Zeigler-Johnson
4
1
College of Health, Lehigh University, Bethlehem, PA 18015, USA
2
College of Population Health, Thomas Jefferson University, Philadelphia, PA 19107, USA
3
Division of Pulmonary and Critical Care Medicine, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
4
Fox Chase Cancer Center, Temple University Health System, Philadelphia, PA 19111, USA
5
Division of Biostatistics and Bioinformatics, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2025, 22(3), 455; https://doi.org/10.3390/ijerph22030455
Submission received: 20 December 2024 / Revised: 10 March 2025 / Accepted: 15 March 2025 / Published: 20 March 2025
(This article belongs to the Special Issue Cancer Causes and Control)

Abstract

:
(1) Background: Lung cancer is the deadliest and second most prevalent cancer in Pennsylvania (PA), and African American patients are disproportionately affected. Lung cancer morbidity and mortality in Philadelphia County are among the highest in PA. Geographic information systems (GIS) are useful to explore geospatial variations in the cancer burden and risk factors. Therefore, we used GIS to analyze the lung cancer burden in Philadelphia to assess which areas of the city have the highest morbidity and mortality, identify potential clusters, and determine which census tract-level characteristics were associated with higher tract-level cancer burden. (2) Methods: Using secondary data from the Pennsylvania Cancer Registry, age-adjusted standardized incidence and mortality ratios (SIR and SMR) were calculated by census tract, and choropleth maps were created to visualize geographic variations in the disease burden. Two geostatistical methods were used to determine the presence of lung cancer clusters. Multivariable regression analyses were performed to identify which census-tract level characteristics correlated with a higher lung cancer burden. (3) Results: Three distinct geographical lung cancer clusters were identified. After controlling for demographics and other covariates, adult smoking prevalence, prevalence of chronic obstructive pulmonary disease, and percentage of residential addresses vacant were positively associated with higher lung cancer SIR and SMR. (4) Conclusions: Our findings may inform cancer control efforts within the region and guide future municipal-level GIS analyses of the lung cancer burden.

1. Introduction

1.1. Lung Cancer Morbidity and Mortality: A Global and Local Public Health Issue

Lung cancer is the leading cause of cancer-related death for both men and women in developed countries [1,2]. In 2024, an estimated 234,580 new lung cancer cases will be diagnosed, and 125,070 Americans will die from lung cancer [3]. Recently, lung cancer incidence has begun to decline in men across the United States; however, lung cancer rates have remained stagnant for women, indicating a persistent public health threat despite the overall decline of tobacco use [1]. Despite the nationwide decline, lung cancer mortality exceeds the national average in 21 states, including Pennsylvania [4].
Pennsylvania’s lung cancer mortality has decreased from 54.5 per 100,000 to 32.2 in 2022 [5]. As with the national trend, mortality amongst men has decreased but has lagged for women. Cancer of the lung and bronchus continues to be the most prevalent and the deadliest invasive cancer in Pennsylvania, costing approximately USD 73,800 for initial care and upwards of USD 124,000 for end-of-life care per patient in the U.S. [6]. Moreover, clear racial disparities in the lung cancer burden affect the state as well, with an incidence of 69.6 per 100,000 for African American Pennsylvanians compared to only 60.4 per 100,000 cases for White Pennsylvanians in 2023 and higher mortality in African American (38.5 per 100,000) compared to White (33.9 per 100,000) Pennsylvanians from 2016 to 2020 [7,8].
From 2012 to 2016, Philadelphia County had among the highest lung cancer incidence and mortality rates in the state compared to other counties—and significantly higher rates than the U.S. overall [9]. Further, a recent cluster analysis of lung cancer in the Greater Philadelphia area identified zip codes within Philadelphia County as having significantly higher lung cancer incidence rates than zip codes in the surrounding counties [10,11]. Further analysis at a more local level may illuminate how lung cancer is distributed in individual Philadelphia neighborhoods and thereby inform neighborhood-level cancer control efforts [12,13,14,15].

1.2. Emerging Emphasis on Social and Neighborhood Determinants of Cancer

Despite decades of research that have identified several well-known risk factors for lung cancer, such as smoking, indoor and outdoor air pollution, and occupational and environmental carcinogens [1,16], translating this knowledge into effective mitigation strategies across diverse populations continues to present a public health challenge. Studies have investigated the social and neighborhood factors associated with cancer morbidity and mortality to illuminate how to more effectively target interventions [17,18,19,20]. One analysis of California Cancer Registry data from 2000 to 2013 determined that the stage of diagnosis, neighborhood socioeconomic status, and marital status were most strongly associated with racial and ethnic disparities in cancer survival [17]. Racial and ethnic disparities in lung cancer mortality were more pronounced in women compared to men [17]. Another study of California Cancer Registry data found that despite an overall decrease in lung cancer incidence, the reduction in female lung cancer cases was most significant in areas with higher neighborhood socioeconomic status [19]. Thus, understanding which factors impact lung cancer morbidity and mortality at the neighborhood level may inform why improvements in lung cancer outcomes are not equally distributed among neighborhoods.
In a retrospective analysis of lung cancer incidence using electronic medical record (EMR) data from an Ohio healthcare system, Adie et al. created a “deprivation index” to approximate neighborhood living conditions using 15 census-tract level socioeconomic variables [21]. The study concluded that even when controlling for key demographic characteristics and smoking history, increases in the neighborhood deprivation score were significantly associated with a higher hazard ratio for the presence of a lung cancer diagnosis. The authors conclude that consideration of neighborhood socioeconomic characteristics may inform health promotion and screening efforts and that their findings warrant further study of the mechanism of how neighborhood-level deprivation affects lung cancer risk [21].

1.3. Geographic Information Systems and the Assessment of Cancer Burden

One means of studying neighborhood factors and their association with cancer diagnoses is to geographically analyze the distribution of cancer in an area through geographic information systems (GIS), which allow users to assess the burden of disease visually and statistically in a designated geographic context [22,23]. For example, researchers in Shandong Province, China, assessed urban–rural differences in the geospatial distribution of lung cancer and identified clusters of lung cancer mortality within the province [24,25].
At the municipal level, an analysis of the geospatial distribution of breast, cervical, and colorectal cancer incidence in Baltimore found that incidence and neighborhood-level factors varied by cancer type across neighborhoods [15]. In Philadelphia, McIntire et al. created prostate [14] and breast cancer [13] composite scores, composed of morbidity, mortality, and aggressiveness, to identify census tracts with a high cancer burden and thereby best prioritize cancer control efforts. However, to our knowledge, a parallel geospatial analysis of the lung cancer burden restricted to a single city has not yet been published, despite clear evidence that known lung cancer risk factors such as air pollution and environmental carcinogens may influence the geospatial distribution of disease, in addition to neighborhood-level socioeconomic status and behavioral patterns [15,24].
Prior studies provided a precedent for analysis of neighborhood characteristics and lung cancer burden in state cancer registry data but mainly comprised of statewide multivariable analyses [19] with limited application in the context of local lung cancer prevention, screening, and treatment strategies. Given the high rates of lung cancer morbidity and mortality in Philadelphia, understanding the spatial distribution of lung cancer in Philadelphia is essential to best target limited healthcare resources. A local-level geospatial analysis of lung cancer burden might also aptly inform additional analyses at NCI-designated cancer centers and their catchment areas around the country, especially given the limited number of prior geospatial cancer analyses conducted at the neighborhood scale or using residential locations [22].
Therefore, we hypothesized that by conducting a geospatial analysis, we would be able to (1) determine the spatial distribution of lung cancer burden in Philadelphia, including identification of potential lung cancer clusters, and (2) identify potential neighborhood correlates associated with the lung cancer burden in Philadelphia, to better guide cancer control efforts in the region and to inform municipal geospatial lung cancer control assessments.
The purpose of this study was to spatially characterize lung cancer morbidity and mortality in Philadelphia, to identify any lung cancer clusters, and to assess the relationship between census tract-level demographic, socioeconomic, morbidity, and behavioral characteristics with lung cancer morbidity and mortality. We sought to inform cancer control efforts in the region and further the theoretical basis for geographically prioritizing area-level interventions to reduce the burden of lung cancer.

2. Materials and Methods

2.1. Data Sources

We conducted a secondary analysis of ten years of Pennsylvania Cancer Registry data from 2008 to 2017. The registry data included residential address, age, race, marital status, and cancer history of all reported Pennsylvania lung cancer cases. For the present study, only cases from Philadelphia County (n = 12,013) were included. The data was cleaned, then addresses were geocoded using Environmental Research Systems Research Institute’s (ESRI) ArcMap 10.8.1 software, building an address locator based on Philadelphia Streets Department street centerlines data (2021) to accurately match cases to their geographic coordinates on the base map of Philadelphia. For addresses that did not automatically match with our locator, we manually verified the address and corrected any typographical errors before attempting rematch to ultimately achieve a 98% match rate (n = 11,767).
The matched lung cancer cases were aggregated by census tract using the 2019 TigerLine shapefiles from the U.S. Census Bureau [26]. Eleven census tracts were excluded and marked as nonresidential, defined as tracts with a 2010 Census population of less than 300, to ensure that any tract identified as an area of concern would include sufficient inhabitants to engage in potential interventions [13,14]. After excluding nonresidential areas, 373 census tracts were included in the analysis. A subset of the data was created for cases that were reported to have died due to cancer of the lung and bronchus per applicable International Classification of Diseases (ICD)-10 codes, n = 6978.
To measure census tract-level characteristics, additional data were obtained from Policy Map and the Centers for Disease Control and Prevention (CDC) 500 Cities Project, which provided geographically linked data from government data sources for socioeconomic and health-related indicators [27,28]. To account for census tract-level characteristics, we used U.S. Census data (2010) and American Communities Survey data (2010–2014) to measure median household income, percentage of population African American, percentage of population White, percentage of population Asian, percentage of population Hispanic, percentage lacking health insurance, homeownership rate, and percentage of residential addresses vacant. Additional census tract-level health indicators were obtained from the 2017 CDC 500 Cities data, including adult smoking prevalence; prevalence of adults diagnosed with chronic obstructive pulmonary disease (COPD), emphysema, or chronic bronchitis; percentage of adults reporting a routine physical exam in the past year; and percentage of adults reporting 14 or more poor physical health days during the past 30 days [28]. Census-tract level health indicators from Policy Map’s Behavioral Risk Factor Surveillance System (BRFSS)/U.S. Census 2018 dataset included the percentage of adults with obesity and the percentage of adults achieving the recommended level of physical activity.

2.2. Statistical Analysis

Using indirect age-adjustment methodology, standardized incidence ratios (SIR) and standardized mortality ratios (SMR) were calculated by census tract using the 2010 Census population counts for Philadelphia census tracts and the Surveillance Epidemiology and End Results (SEER) U.S. incidence rates (2013–2017) and U.S. mortality rates (2014–2018) for cancer of the lung and bronchus [29,30]. Lung cancer age-adjusted SIR and SMR (referred to as SIR and SMR, respectively, for the remainder of this analysis) by Philadelphia census tract were visualized on graduated choropleth maps to assess areas of higher and lower than expected lung cancer burden after the indirect age-adjustment method. As part of the city’s plan for rejuvenation, Philadelphia has been divided into 18 planning districts. These districts contain many neighborhoods, numerous census tracts, and cross zip codes. Planning districts will be used for descriptive purposes in this paper to orient the reader to particular areas of focus.
For the cluster analysis, lung cancer SIR and SMR “hot spots and cold spots” were identified using Getis Ord Gi* statistical calculations within ArcMap to detect local groupings of census tracts with non-random spatial distribution of high or low values, i.e., instances where neighboring tracts demonstrated similarly high or low disease burdens with statistical significance at the p ≤ 0.10, p ≤ 0.05, and p ≤ 0.01 levels [31]. Cluster and outlier maps of SIR and SMR were created using Local Indicators of Spatial Association (LISA), which incorporate the Local Moran’s I Statistic values of all census tracts within the study area to determine groupings of high- and low-value census tracts and to compare the value of each tract to the global (non-neighboring) tracts, to thereby identify both clusters and outliers [32]. Local Moran’s I and Getis Ord Gi* calculations were used to identify census tracts with statistically significant high lung cancer burdens per both methods. Census tracts were classified as part of lung cancer clusters if they were designated as both “hot spots” per Getis Ord Gi* z-scores and “high–high” per Local Moran’s I calculation to triangulate our measurement of spatial association [15,32,33]. Cluster maps for SIR and SMR were created to display census tracts identified as areas of concern that had an overlap between the two clustering techniques.
Lastly, multivariable linear regression models were built using IBM SPSS Statistics 26 [34] to determine which census tract-level sociodemographic characteristics were associated with census tract lung cancer SIR and SMR. Univariate descriptive statistics were calculated to assess the mean and standard deviation of SIR, SMR, and independent variables for each of the 373 census tracts. Bivariate Pearson correlations were calculated for SIR and SMR with each independent variable. Multivariable linear regression models were then used to estimate regression coefficients (β) and confidence intervals (CI) to identify which census tract-level characteristics were associated with higher lung cancer SIR and SMR at the α = 0.05 level.
This analysis was one of several to be conducted under a Thomas Jefferson University Institutional Review Board-approved protocol, which aimed to analyze Pennsylvania Cancer Registry lung cancer data.

3. Results

3.1. Geographic Distribution of SIR and SMR

The mean lung cancer SIR for the 373 Philadelphia census tracts was 1.46 (SD = 0.55). The majority of census tracts (304; 81.2%) had an SIR of greater than 1.00 (i.e., higher than expected in the age-adjusted population after standardizing by national lung cancer rates). Several city planning districts, such as Lower North and River Wards, included groups of census tracts with an SIR of 1.5 to 2.0 or 2.1 to 3.0 (Figure 1). The mean SMR was 1.23 (SD = 0.54), and the SMR was greater than 1.00 for 236 out of 373 tracts. Visual comparison of the choropleth maps of SIR and SMR indicated a similar geographic distribution of higher and lower values, although the map of SMR included more census tracts with values of less than or equal to one, primarily concentrated in parts of Northwest and Northeast Philadelphia (Figure 1 and Figure 2). Upon visual inspection, the following planning districts demonstrated groupings of census tracts with the lowest SIR and SMR (less than or equal to 1.0): Lower and Upper Northwest, West Park (along the northwest border of the district), Central, and Upper Far Northeast. Although tracts with SIR and SMR > 1.5 were dispersed across many planning districts, groupings of census tracts with the highest SIR and SMR (>2.0) were apparent in the Lower North, West, University Southwest, South, and River Wards planning districts.
Fifty-five census tracts met both Moran’s I and Getis-Ord Gi* criteria to be classified as part of SIR clusters, and 40 census tracts met the criteria for SMR clusters. For both SIR and SMR, two distinct geographic clusters of high-burden census tracts were apparent: one cluster in the North/Lower North planning districts bordering Fairmount Park, and one cluster spanning parts of the River Wards, Lower Northeast, and North Delaware planning districts (Figure 3 and Figure 4). For SIR only, a third cluster was apparent in the western half of the South Philadelphia planning district.

3.2. Multivariable Model of Census Tract-Level Predictors of SIR and SMR

Table 1 shows the mean, standard deviation (SD), and bivariate Pearson correlations with lung cancer SIR and SMR for each tract-level characteristic. This table indicates that the following variables demonstrated a positive unadjusted association (p < 0.05) with both SIR and SMR: percentage of population African American, home ownership rate, percentage with 14 or more reported poor health days over the past 30 days, and percentage of residential addresses vacant. Median household income, percentage of population White, percentage of population Asian, and percentage who achieved the recommended amount of physical activity demonstrated negative correlations with lung cancer SIR and SMR.
While controlling for covariates in our multivariable regression models (Table 2), several census tract-level variables demonstrated statistically significant increases in both census tract lung cancer SIR and SMR associated with 10 percent increases in adult smoking prevalence (SIR β = 0.654; 95% CI: 0.163, 1.128; SMR β = 0.673; 95% CI: 0.183, 1.163), COPD prevalence (SIR β = 2.805; 95% CI: 0.580, 5.030; SMR β = 2.995; 95% CI: 0.734, 5.255), and residential addresses vacant (SIR β = 0.130; 95% CI: 0.050, 0.209; SMR β = 0.112; 95% CI: 0.031, 0.192). A 10 percent increase in the census-tract level population with 14 or more reported poor health days over the past 30 days was negatively associated with lung cancer SIR and SMR (SIR β = −1.918; 95% CI: −3.499, −0.337; SMR β = −2.144; 95% CI: −3.750, −0.538). Additionally, increases in SIR were associated with a 10 percent increase in census-tract level obesity (SIR β = 0.613; 95% CI: 0.012, 1.214).

4. Discussion

Through geospatial analysis and multivariable modeling of lung cancer SIR and SMR at the census tract level, our study geographically characterized the burden of lung cancer in Philadelphia and identified several factors associated with a higher lung cancer burden at the census tract level. Our census tract-level analyses highlighted the extent to which lung cancer morbidity and mortality in Philadelphia were higher than expected compared to national age-adjusted lung cancer rates, underlining the high lung cancer burden for Philadelphia County [9]. The mean census tract-level age-adjusted lung cancer SIR of 1.46 indicates that lung cancer incidence in the average census tract in Philadelphia from 2008 to 2017 was 46% higher than expected, based on the age distribution of people in the census tracts, after standardizing with national lung cancer rates. Similarly, the SMR of 1.23 signifies that the average census tract-level lung cancer mortality rate is 23% higher than expected. These results emphasize the ongoing need for heightened lung cancer control efforts in Philadelphia.
The identification of three distinct lung cancer SIR clusters may inform the geographic prioritization of lung cancer prevention, screening, and treatment efforts in Philadelphia. Evidence-based interventions to decrease lung cancer incidence and mortality aim to reduce smoking initiation and increase smoking cessation, as well as improve efforts to screen individuals with significant smoking history using low-dose computed tomography (LDCT) scans. [1]. Despite prior development of evidence-based lung cancer screening protocols, an earlier analysis found that significantly more adults were screened inappropriately for lung cancer than were screened according to the 2013 U.S. Preventative Services Task Force (USPSTF) guidelines [35]. Discordance between real-world uptake and implementation of lung cancer screening and that recommended by clinical practice guidelines reflects a need for comprehensive and centralized approaches to improve screening access and effectiveness [36]. Revised USPTSF guidelines from 2021 recommend annual LDCT screening for adults ages 50–80 (previously ages 55–80) with a 20 pack-year smoking history (previously 30 pack-years) who either currently smoke or quit in the past 15 years. These individuals should receive annual LDCTs until they either quit smoking for 15 years or develop a health condition that “substantially limits life expectancy” [37]. More recent analyses showed that the USPSTF 2021 guideline changes broadened lung cancer screening eligibility and reduced some racial disparity in access to screening [38]. In addition to previous studies highlighting varying screening eligibility among Philadelphia neighborhoods [39], the identification of lung cancer clusters may inform the placement of additional screening outreach, resources, and/or protocol improvements in current facilities to improve lung cancer outcomes in high-burden census tracts.
The similar geographic distribution of lung cancer incidence and mortality rates likely reflects the poor survival outcomes of lung cancer. From 2012 to 2018 in the U.S., 5-year relative survival for non-small cell lung cancer for all stages at diagnosis combined was 28%, and 5-year survival for small cell lung cancer was only 7% [40]. Therefore, it can be expected that areas with higher lung cancer incidence over a ten-year period would have similarly high lung cancer mortality, as the SIR and SMR maps and cluster maps displayed. Given that even within one city planning district, several census tracts may form a cluster, while the remaining tracts in the district do not demonstrate significant spatial autocorrelation, our analysis illustrates the benefit of using neighborhood-level units of area to identify more specific areas for public health intervention, instead of using larger units such as zip codes or planning districts.
However, several differences between lung cancer incidence and mortality in Philadelphia were identified. Most notably, a third lung cancer cluster was apparent in South Philadelphia for SIR but not for SMR. Additionally, while SIR and SMR were higher than 1.0 in the majority of census tracts, fewer tracts had an SMR of >1.0 (236) than had an SIR of >1.0 (304). This contrast between the geographic areas affected by higher-than-expected incidence compared to mortality may indicate that while lung cancer is diagnosed at high rates throughout most of Philadelphia, more localized factors may impact the rate at which residents die from lung cancer. These factors may include potential neighborhood differences in the stage at diagnosis and access to health care [17]. For example, higher uptake of lung cancer screening or more intensive follow-up of incidental lung nodules can lead to a higher proportion of early-stage compared with late-stage lung cancer diagnoses [41]. Notably, our model of SMR does not incorporate the stage at diagnosis, which is a key predictor of lung cancer mortality [40].
Many of the associated census tract-level variables identified in the multivariable models of lung cancer SIR and SMR are already well-established in the literature, including smoking prevalence and the presence of a COPD diagnosis, both positively associated [1]. The association between the percentage of residential addresses vacant and the lung cancer burden is not unexpected since neighborhoods with higher vacancies tend to have worse health outcomes [42,43]. However, this may warrant further investigation as a social determinant of lung cancer outcomes since this variable remained significant when controlling for race, sex, and income. The negative association between the lung cancer burden and the percentage of the census tract reporting 14 or more poor health days over the last 30 days was unanticipated, given that self-reported health quality of life is associated with worse cancer outcomes [44,45,46]. This discrepancy may point to a limitation of ecological analyses, such that we cannot assume that patients who reported poor health were also those who were diagnosed with lung cancer [15].
Despite African American Pennsylvanians having higher lung cancer incidence and mortality compared to White Pennsylvanians, the census tract-level percentage of African Americans was no longer significant in either model after controlling for other potential confounders, indicating that neighborhood and socioeconomic factors may drive racial disparities in cancer burden, as identified in prior research [17]. The multivariable models in this analysis accounted for only 37.9% and 28.6% of variability in lung cancer SIR and SMR, respectively; a more robust, local comparison of high and low-burden tracts may illuminate additional sources of variability. As described by McIntire et al. in their analysis of the census tract-level prostate cancer burden in Philadelphia, a dual approach of targeting cancer control efforts based on the known geographic distribution of cancer rates, while also informed by health and demographic characteristics, may be the most logical application of our findings [14].

Strengths and Limitations

The Pennsylvania Cancer Registry includes all new cancer cases diagnosed and treated in the state, making it a comprehensive representation of Philadelphia lung cancer patients from 2008 to 2017. However, since mortality data in the Pennsylvania Cancer Registry only includes the primary cause of death, our calculations of mortality likely underreport cancer-related deaths because some lung cancer patients may have died due to cancer complications that were reported as the primary cause of death, e.g., pneumonia. Moreover, the results may not mirror the current burden of lung cancer in Pennsylvania given the ongoing changes over time of the lung cancer burden in the U.S. [19]. Similarly, while the relevance of our results is limited outside of Philadelphia, our methods are reproducible and provide a valuable framework for local public health professionals to assess the cancer burden in their respective communities to appropriately target interventions.
Our statistical modeling was limited to risk factor data that is aggregated to the census tract level. Therefore, we could not measure associations between the lung cancer burden and other known risk factors, including air quality, radon, asbestos exposure, and other environmental factors, as this data is not available at the census tract level. Additionally, associations found in the cancer burden at the census tract level do not necessarily translate to the individual level. Lastly, this study is observational, and causality cannot be definitively determined from identified associations.

5. Conclusions

In a geographic and statistical analysis of ten years of Pennsylvania Cancer Registry data, several city planning districts in Philadelphia County with statistically significant groupings of high lung cancer incidence and mortality (clusters) were identified. Although the geographic distribution of high and low values were similar for lung cancer SIR and SMR, an additional SIR cluster was found in South Philadelphia, suggesting that certain neighborhood factors impact lung cancer survival in Philadelphia. We also identified several census tract-level predictors that were significant for both lung cancer SIR and SMR, including adult smoking, COPD prevalence, and percent residential addresses vacant. These findings may help inform cancer control efforts in the area, such as the implementation or augmentation of lung cancer screening and prevention services in the geographic areas identified as clusters.

Author Contributions

Conceptualization, R.K.M., K.S. and C.Z.-J.; Data curation, C.Z.-J.; Formal analysis, R.K.M. and K.S.; Investigation, R.K.M., K.S., C.S. and S.W.K.; Methodology, R.K.M. and K.S.; Project administration, R.K.M. and C.Z.-J.; Resources, R.K.M. and C.Z.-J.; Software, K.S. and C.S.; Supervision, R.K.M.; Visualization, K.S. and C.S.; Writing—original draft, R.K.M., K.S., C.S., R.Y. and J.B.; Writing—review and editing, R.K.M., K.S., C.S., R.Y., J.B., S.W.K. and C.Z.-J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Thomas Jefferson University Institutional Review Board (protocol code 17D.150; date of approval 21 June 2019).

Informed Consent Statement

As the study was a secondary quantitative data analysis, patient consent was not necessary.

Data Availability Statement

The data presented in this study are available on request from the corresponding author, conditional on Thomas Jefferson University IRB and PA Cancer Registry agreements.

Acknowledgments

We would like to thank the PA Cancer Registry for their help with data acquisition.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Torre, L.A.; Bray, F.; Siegel, R.L.; Ferlay, J.; Lortet-Tieulent, J.; Jemal, A. Global cancer statistics, 2012. CA Cancer J. Clin. 2015, 65, 87–108. [Google Scholar] [CrossRef] [PubMed]
  2. Bray, F.; Laversanne, M.; Sung, H.; Ferlay, J.; Siegel, R.L.; Soerjomataram, I.; Jemal, A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2024, 74, 229–263. [Google Scholar]
  3. Siegel, R.L.; Giaquinto, A.N.; Jemal, A. Cancer statistics, 2024. CA Cancer J. Clin. 2024, 74, 12–49. [Google Scholar]
  4. National Center for Health Statistics. Age-Adjusted Lung Cancer Death* Rates, by State—National Vital Statistics System, United States, 2018. Morb. Mortal. Wkly. Rep. 2020, 69, 1273. [Google Scholar] [CrossRef]
  5. U.S. Centers for Disease Control and Prevention. Lung Cancer Statistics. Available online: https://www.cdc.gov/lung-cancer/statistics/?CDC_AAref_Val=https://www.cdc.gov/cancer/lung/statistics/index.htm (accessed on 26 October 2024).
  6. Mariotto, A.B.; Enewold, L.; Zhao, J.; Zeruto, C.A.; Yabroff, K.R. Medical Care Costs Associated with Cancer Survivorship in the United States. Cancer Epidemiol. Biomark. Prev. 2020, 29, 1304–1312. [Google Scholar] [CrossRef]
  7. American Lung Association. State of Lung Cancer: Pennsylvania. State Data. Available online: https://www.lung.org/research/state-of-lung-cancer/states/pennsylvania (accessed on 26 October 2024).
  8. Pennsylvania Department of Health. Pennsylvania Healthy People, State Level. C-02: Lung Cancer Death Rate. 2023. Available online: https://www.pa.gov/content/dam/copapwp-pagov/en/health/documents/topics/healthstatistics/healthypeople/documents/current/state/c-02-lung-cancer-death-rate.html (accessed on 26 October 2024).
  9. Pennsylvania Department of Health. The Burden of Cancer in Pennsylvania: A Report of the Cancer Control, Prevention and Research Advisory Board; PA Department of Health: Harrisburg, PA, USA, 2019.
  10. Camiña, N.; McWilliams, T.L.; McKeon, T.P.; Penning, T.M.; Hwang, W.-T. Identification of spatio-temporal clusters of lung cancer cases in Pennsylvania, USA: 2010–2017. BMC Cancer 2022, 22, 555. [Google Scholar]
  11. Zhu, Y.; McKeon, T.P.; Tam, V.; Vachani, A.; Penning, T.M.; Hwang, W.-T. Geographic differences in lung cancer incidence: A study of a major metropolitan area within Southeastern Pennsylvania. Int. J. Environ. Res. Public Health 2020, 17, 9498. [Google Scholar] [CrossRef]
  12. Barta, J.A.; Erkmen, C.P.; Shusted, C.S.; Myers, R.E.; Saia, C.; Cohen, S.; Wainwright, J.; Zeigler-Johnson, C.; Dako, F.; Wender, R.; et al. The Philadelphia Lung Cancer Learning Community: A multi-health-system, citywide approach to lung cancer screening. JNCI Cancer Spectr. 2023, 7, pkad071. [Google Scholar] [CrossRef] [PubMed]
  13. McIntire, R.K.; Juon, H.-S.; Keith, S.W.; Simone, N.L.; Waters, D.; Lewis, E.; Zeigler-Johnson, C. A novel method for measuring the burden of breast cancer in neighborhoods. Prev. Med. Rep. 2023, 33, 102218. [Google Scholar]
  14. McIntire, R.K.; Keith, S.W.; Boamah, M.; Leader, A.E.; Glanz, K.; Klassen, A.C.; Zeigler-Johnson, C.M. A prostate cancer composite score to identify high burden neighborhoods. Prev. Med. 2018, 112, 47–53. [Google Scholar] [CrossRef]
  15. Torres, A.Z.; Phelan-Emrick, D.; Castillo-Salgado, C. Evaluating neighborhood correlates and geospatial distribution of breast, cervical, and colorectal cancer incidence. Front. Oncol. 2018, 8, 471. [Google Scholar] [CrossRef] [PubMed]
  16. McGee-Avila, J.K.; Richmond, J.; Henry, K.A.; Stroup, A.M.; Tsui, J. Disparities in geospatial patterns of cancer care within urban counties and structural inequities in access to oncology care. Health Serv. Res. 2023, 58, 152–164. [Google Scholar] [CrossRef] [PubMed]
  17. Ellis, L.; Canchola, A.J.; Spiegel, D.; Ladabaum, U.; Haile, R.; Gomez, S.L. Racial and ethnic disparities in cancer survival: The contribution of tumor, sociodemographic, institutional, and neighborhood characteristics. J. Clin. Oncol. 2018, 36, 25–33. [Google Scholar] [CrossRef] [PubMed]
  18. Fu, Y.; Liu, J.; Chen, Y.; Liu, Z.; Xia, H.; Xu, H. Gender disparities in lung cancer incidence in the United States during 2001–2019. Sci. Rep. 2023, 13, 12581. [Google Scholar] [CrossRef]
  19. Patel, M.I.; McKinley, M.; Cheng, I.; Haile, R.; Wakelee, H.; Gomez, S.L. Lung cancer incidence trends in California by race/ethnicity, histology, sex, and neighborhood socioeconomic status: An analysis spanning 28 years. Lung Cancer 2017, 108, 140–149. [Google Scholar] [CrossRef]
  20. Sung, H.; Nisotel, L.; Sedeta, E.; Islami, F.; Jemal, A. Racial and ethnic disparities in survival among people with second primary cancer in the US. JAMA Netw. Open 2023, 6, e2327429. [Google Scholar] [CrossRef]
  21. Adie, Y.; Kats, D.J.; Tlimat, A.; Perzynski, A.; Dalton, J.; Gunzler, D.; Tarabichi, Y. Neighborhood disadvantage and lung cancer incidence in ever-smokers at a safety net health-care system: A retrospective study. Chest 2020, 157, 1021–1029. [Google Scholar] [CrossRef]
  22. Korycinski, R.W.; Tennant, B.L.; Cawley, M.A.; Bloodgood, B.; Oh, A.Y.; Berrigan, D. Geospatial approaches to cancer control and population sciences at the United States cancer centers. Cancer Causes Control 2018, 29, 371–377. [Google Scholar] [CrossRef]
  23. Fadiel, A.; Eichenbaum, K.D.; Abbasi, M.; Lee, N.K.; Odunsi, K. Utilizing geospatial artificial intelligence to map cancer disparities across health regions. Sci. Rep. 2024, 14, 7693. [Google Scholar] [CrossRef]
  24. Fu, Z.; Li, Y.; Chu, J.; Sun, J.; Lu, Z.; Zhang, J.; Chen, X.; Zhang, G.; Xue, F.; Guo, X. Lung cancer mortality clusters in urban and rural areas of Shandong Province, China: A spatial scan statistical analysis. Precis. Radiat. Oncol. 2019, 3, 15–22. [Google Scholar] [CrossRef]
  25. Jiang, F.; Fu, Z.; Lu, Z.; Chu, J.; Xu, A.; Guo, X.; Ma, J. Cancer survival analysis and spatial distribution during 2014–2016 in Shandong Province, China. Sci. Rep. 2023, 13, 10324. [Google Scholar]
  26. United States Census Bureau. TIGER/Line Shapefiles. Available online: https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.html (accessed on 26 October 2024).
  27. Policy Map. Policy Map: About Us. Available online: https://policymap.wpengine.com/about (accessed on 26 October 2024).
  28. U.S. Centers for Disease Control and Prevention. Welcome to the 500 Cities & PLACES Data Portal. Available online: https://chronicdata.cdc.gov/browse?category=500+Cities+%26+Places&sortBy=newest&utf8= (accessed on 2 February 2023).
  29. Surveillance Research Program. SEER*Explorer: An interactive website for SEER cancer statistics. Available online: https://seer.cancer.gov/explorer/ (accessed on 26 October 2024).
  30. U.S. Census Bureau. Decennial Census by Decade. Available online: https://www.census.gov/programs-surveys/decennial-census/decade.2010.html (accessed on 26 October 2024).
  31. Moore, K.D. Healthy Places: Towards a Transdisciplinary Mapping of the Theoretical Landscape. Enq. ARCC J. Archit. Res. 2020, 17, 1–20. [Google Scholar]
  32. Moore, J.X.; Royston, K.J.; Langston, M.E.; Griffin, R.; Hidalgo, B.; Wang, H.E.; Colditz, G.; Akinyemiju, T. Mapping hot spots of breast cancer mortality in the United States: Place matters for Blacks and Hispanics. Cancer Causes Control 2018, 29, 737–750. [Google Scholar]
  33. Nassel, A.F.; Root, E.D.; Haukoos, J.S.; McVaney, K.; Colwell, C.; Robinson, J.; Eigel, B.; Magid, D.J.; Sasson, C. Multiple cluster analysis for the identification of high-risk census tracts for out-of-hospital cardiac arrest (OHCA) in Denver, Colorado. Resuscitation 2014, 85, 1667–1673. [Google Scholar]
  34. IBM Corp. IBM SPSS Statistics for Windows; Version 26; IBM Corp.: Armonk, NY, USA, 2019. [Google Scholar]
  35. Richards, T.B.; Doria-Rose, V.P.; Soman, A.; Klabunde, C.N.; Caraballo, R.S.; Gray, S.C.; Houston, K.A.; White, M.C. Lung cancer screening inconsistent with US Preventive Services Task Force recommendations. Am. J. Prev. Med. 2019, 56, 66–73. [Google Scholar]
  36. Kane, G.C.; Barta, J.A.; Shusted, C.S.; Evans, N.R. Now is the time to make screening for lung cancer reportable. Ann. Intern. Med. 2022, 175, 888–889. [Google Scholar] [PubMed]
  37. U.S. Preventive Services Task Force. Lung Cancer: Screening; American Medical Association: Chicago, IL, USA, 2021; p. 962. Available online: https://www.uspreventiveservicestaskforce.org/uspstf/recommendation/lung-cancer-screening (accessed on 19 December 2024).
  38. Pu, C.Y.; Lusk, C.M.; Neslund-Dudas, C.; Gadgeel, S.; Soubani, A.O.; Schwartz, A.G. Comparison between the 2021 USPSTF lung cancer screening criteria and other lung cancer screening criteria for racial disparity in eligibility. JAMA Oncol. 2022, 8, 374–382. [Google Scholar] [CrossRef] [PubMed]
  39. McIntire, R.K.; Lewis, E.; Zeigler-Johnson, C.; Shusted, C.; Barta, J.; Juon, H.-S.; Keith, S.W.; Klein, G. Estimating eligibility for lung cancer screening by neighborhood in Philadelphia using previous and current USPSTF guidelines. Popul. Health Manag. 2022, 25, 254–263. [Google Scholar] [CrossRef]
  40. American Cancer Society. Lung Cancer Survival Rates. Available online: https://www.cancer.org/cancer/lung-cancer/detection-diagnosis-staging/survival-rates.html (accessed on 26 October 2024).
  41. Vachani, A.; Carroll, N.M.; Simoff, M.J.; Neslund-Dudas, C.; Honda, S.; Greenlee, R.T.; Rendle, K.A.; Burnett-Hartman, A.; Ritzwoller, D.P. Stage migration and lung cancer incidence after initiation of low-dose computed tomography screening. J. Thorac. Oncol. 2022, 17, 1355–1364. [Google Scholar]
  42. Wang, K.; Immergluck, D. The geography of vacant housing and neighborhood health disparities after the US foreclosure crisis. Cityscape 2018, 20, 145–170. [Google Scholar]
  43. Snider, N.G.; Hastert, T.A.; Nair, M.; Kc, M.; Ruterbusch, J.J.; Schwartz, A.G.; Peters, E.S.; Stoffel, E.M.; Rozek, L.S.; Purrington, K.S. Area-level socioeconomic disadvantage and cancer survival in metropolitan Detroit. Cancer Epidemiol. Biomark. Prev. 2023, 32, 387–397. [Google Scholar] [CrossRef] [PubMed]
  44. Efficace, F.; Bottomley, A.; Smit, E.; Lianes, P.; Legrand, C.; Debruyne, C.; Schramel, F.; Smit, H.; Gaafar, R.; Biesma, B. Is a patient’s self-reported health-related quality of life a prognostic factor for survival in non-small-cell lung cancer patients? A multivariate analysis of prognostic factors of EORTC study 08975. Ann. Oncol. 2006, 17, 1698–1704. [Google Scholar] [PubMed]
  45. Sofianidi, A.; Karadimou, A.; Charpidou, A.; Syrigos, K.N. The Gap of Health Inequalities Amongst Lung Cancer Patients of Different Socioeconomic Status: A Brief Reference to the Greek Reality. Cancers 2024, 16, 906. [Google Scholar] [CrossRef] [PubMed]
  46. Hong, Y.J.; Han, S.; Lim, J.U.; Kang, H.S.; Kim, S.K.; Kim, J.W.; Lee, S.H.; Kim, S.J.; Yeo, C.D. Association between quality of life questionnaire at diagnosis and survival in patients with lung cancer. Clin. Lung Cancer 2023, 24, 459–466. [Google Scholar]
Figure 1. Lung Cancer Standardized Incidence Ratio (SIR) by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Figure 1. Lung Cancer Standardized Incidence Ratio (SIR) by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Ijerph 22 00455 g001
Figure 2. Lung Cancer Standardized Mortality Ratio (SMR) by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Figure 2. Lung Cancer Standardized Mortality Ratio (SMR) by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Ijerph 22 00455 g002
Figure 3. Lung Cancer Standardized Incidence Ratio (SIR) Clusters by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Figure 3. Lung Cancer Standardized Incidence Ratio (SIR) Clusters by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Ijerph 22 00455 g003
Figure 4. Lung Cancer Standardized Mortality Ratio (SMR) Clusters by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Figure 4. Lung Cancer Standardized Mortality Ratio (SMR) Clusters by Philadelphia, PA Census Tract for Pennsylvania Cancer Registry Data 2008 to 2017.
Ijerph 22 00455 g004
Table 1. Univariate Summary of Philadelphia Census Tracts and Bivariate Analyses Between Lung Cancer Standardized Incidence and Mortality Ratios (SIR and SMR) and Census Tract-Level Characteristics in Philadelphia, PA (n = 373).
Table 1. Univariate Summary of Philadelphia Census Tracts and Bivariate Analyses Between Lung Cancer Standardized Incidence and Mortality Ratios (SIR and SMR) and Census Tract-Level Characteristics in Philadelphia, PA (n = 373).
Census Tract CharacteristicMean (SD)Pearson
Correlation (SIR)
p-Value (SIR)Pearson
Correlation (SMR)
p-Value
(SMR)
Lung cancer SIR a1.46 (0.55)N/AN/AN/AN/A
Lung cancer SMR a1.23 (0.54)N/AN/AN/AN/A
Median household income ($) b41,030.73 (20,864.95)−0.325<0.001 *−0.335<0.001 *
Black or African American (%) b44.44 (35.61)0.1770.001 *0.1760.001 *
White (%) b41.17 (33.00)−0.1490.004 *−0.1650.003 *
Asian (%) b6.01 (7.78)−0.1670.001 *−0.1530.033 *
Hispanic (%) b10.92 (16.34)−0.0210.6880.0150.769
Male (%) b47.16 (4.08)−0.0010.9840.0070.896
Homeownership (%) b52.71 (18.74)0.249<0.001 *0.244<0.001 *
Uninsured (%) c13.54 (6.17)−0.0230.662−0.0320.544
Adult smoking prevalence (%) d23.64 (6.22)0.457<0.001 *0.452<0.001 *
Diagnosed with COPD (%) d7.24 (2.29)0.341<0.001 *0.313<0.001 *
Obesity (%) e34.09 (5.55)0.198<0.001 *0.186<0.001 *
Achieved recommended physical activity (%) e20.58 (2.37)−0.1210.019 *−0.1050.042 *
Routine exam in past year (%) d73.85 (4.89)0.0080.882−0.0110.837
Reported 14+ poor health days (%) d14.92 (4.37)0.321<0.001 *0.310<0.001 *
Residential addresses vacant (%) c13.37 (7.49)0.323<0.001 *0.305<0.001 *
* Correlation has p < 0.05 (2-tailed); a Pennsylvania Cancer Registry, 2008–2017; b U.S. Census Bureau, 2010 Census via PolicyMap; c U.S. Census Bureau, ACS 2010–2014 via PolicyMap; d CDC 500 Cities: BRFSS 2017 and 2010 Census; e Policy Map: BRFSS 2018 and 2010 Census.
Table 2. Results from Multivariable Linear Regressions of Lung Cancer Standardized Incidence and Mortality Ratios (SIR and SMR) on Census Tract-Level Characteristics in Philadelphia, PA (n = 373).
Table 2. Results from Multivariable Linear Regressions of Lung Cancer Standardized Incidence and Mortality Ratios (SIR and SMR) on Census Tract-Level Characteristics in Philadelphia, PA (n = 373).
Census Tract CharacteristicSIR β Coefficient
(95% CI)
t-Statistic
(SIR)
p-Value
(SIR)
SMR β Coefficient
(95% CI)
t-Statistic
(SMR)
p-Value
(SMR)
Median household income−0.014 (−0.061, 0.034)−0.5680.570−0.021 (−0.069, 0.027)−0.8550.393
African American (%)0.077 (−0.285, 0.439)0.4170.677−0.113 (−0.481, 0.255)−0.6040.546
White (%)0.086 (−0.253, 0.425)0.5000.617−0.123 (−0.468, 0.221)−0.7030.482
Asian (%)0.132 (−0.266, 0.530)0.6540.513−0.077 (−0.482, 0.327)−0.3750.708
Hispanic (%)0.041 (−0.231, 0.313)0.2960.767−0.036 (−0.312, 0.240)−0.2550.799
Male (%)−0.112 (−0.294, 0.069)−1.2200.223−0.152 (−0.336, 0.032)−1.6200.106
Homeownership (%)0.011 (−0.030, 0.052)0.5200.6030.018 (−0.023, 0.060)0.8640.388
Uninsured (%)0.027 (−0.079, 0.133)0.5000.618−0.002 (−0.110, 0.106)−0.0350.972
Adult smoking prevalence (%)0.645 (0.163, 1.128)2.6320.0090.673 (0.183, 1.163)2.7030.007
Diagnosed with COPD (%)2.805 (0.580, 5.030)2.4790.0142.995 (0.734, 5.255)2.6050.010
Obesity (%)0.613 (0.012, 1.214)2.0060.0460.499 (−0.112, 1.109)1.6060.109
Achieved recommended physical activity (%)0.450 (−0.400, 1.299)1.0410.2990.530 (−0.333, 1.393)1.2080.228
Routine exam in past year (%)−0.603 (−1.271, 0.064)−1.7770.076−0.525 (−1.203, 0.154)−1.5210.129
Reported 14+ poor health
days (%)
−1.918 (−3.499, −0.337)−2.3860.018−2.144 (−3.750, −0.538)−2.6250.009
Residential addresses
vacant (%)
0.130 (0.050, 0.209)3.2060.0010.112 (0.031, 0.192)2.7160.007
Note: Median household income was scaled to reflect a USD 10,000 change per unit in the outcome variable. All % variables were scaled to reflect a 10% change per unit in the outcome variable. SIR R2 = 0.379, SMR R2 = 0.342.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

McIntire, R.K.; Senter, K.; Shusted, C.; Yearwood, R.; Barta, J.; Keith, S.W.; Zeigler-Johnson, C. A Geospatial Analysis of the Lung Cancer Burden in Philadelphia, Using Pennsylvania Cancer Registry Data from 2008–2017. Int. J. Environ. Res. Public Health 2025, 22, 455. https://doi.org/10.3390/ijerph22030455

AMA Style

McIntire RK, Senter K, Shusted C, Yearwood R, Barta J, Keith SW, Zeigler-Johnson C. A Geospatial Analysis of the Lung Cancer Burden in Philadelphia, Using Pennsylvania Cancer Registry Data from 2008–2017. International Journal of Environmental Research and Public Health. 2025; 22(3):455. https://doi.org/10.3390/ijerph22030455

Chicago/Turabian Style

McIntire, Russell K., Katherine Senter, Christine Shusted, Rickisa Yearwood, Julie Barta, Scott W. Keith, and Charnita Zeigler-Johnson. 2025. "A Geospatial Analysis of the Lung Cancer Burden in Philadelphia, Using Pennsylvania Cancer Registry Data from 2008–2017" International Journal of Environmental Research and Public Health 22, no. 3: 455. https://doi.org/10.3390/ijerph22030455

APA Style

McIntire, R. K., Senter, K., Shusted, C., Yearwood, R., Barta, J., Keith, S. W., & Zeigler-Johnson, C. (2025). A Geospatial Analysis of the Lung Cancer Burden in Philadelphia, Using Pennsylvania Cancer Registry Data from 2008–2017. International Journal of Environmental Research and Public Health, 22(3), 455. https://doi.org/10.3390/ijerph22030455

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop