1. Introduction
The COVID-19 pandemic has unmasked profound racial and ethnic health disparities, a phenomenon well documented in extant literature. Racial and ethnic minority groups have been adversely and disproportionately impacted by the pandemic, experiencing higher rates of infection, severe disease, and mortality [
1]. These disparities extend beyond genetic predispositions and are intricately linked to the social determinants of health, including differential access to healthcare, socioeconomic inequities, and disparate living conditions [
2]. In the realm of mitigating factors, vitamin D has emerged as a variable of interest in the epidemiological landscape of COVID-19. Known for its immunomodulatory and immunostimulatory properties, vitamin D plays a pivotal role in immune response regulation [
3].
Epidemiological studies have consistently highlighted vitamin D deficiency as a prevalent issue among racial and ethnic minorities, a disparity caused by factors such as skin pigmentation and dietary variations [
4,
5,
6]. Vitamin D deficiency has been associated with conditions such as diabetes, rheumatoid arthritis, and acute respiratory tract infections [
7]. The association between vitamin D status and COVID-19 outcomes has garnered significant attention in recent research. Some studies indicate a potential clinically significant association between vitamin D deficiency and increased severity of COVID-19 symptoms [
8]. Conversely, adequate vitamin D levels or supplementation have been associated with less severe disease courses in some cohorts [
9]. This evidence suggests a possible role of vitamin D in modulating the impact of COVID-19, particularly among populations already at elevated risk due to systemic health disparities.
Vitamin D supplementation, an easily accessible (over the counter), predominantly safe, and affordable intervention, could be an important tool to assist in the prevention of SARS-CoV-2 infection among those facing the highest burden, (i.e., racial/ethnic minority groups). Extant literature presents complex and sometimes contradictory findings on the relationship between vitamin D level and SARS-CoV-2 infection [
5]. Some research indicates that vitamin D deficiency may increase the risk of serious SARS-CoV-2 infection, while others indicate that vitamin D was not associated with a higher risk of infection [
10,
11]. These discrepancies may be attributed to varying study designs (cross-sectional vs. longitudinal), different classifications of vitamin D deficiency (multiple standards exist), and difference in sample population (minority populations vs. predominantly White populations). Additionally, there is a lack of an examination of the role of pre-existing comorbidities and geographic location on the relationship between vitamin D and SARS-CoV-2 infection. This inconsistency underscores the need for a nuanced understanding of the multifaceted relationship between vitamin D levels, SARS-CoV-2 infection, and the social determinants of health, especially in racially and ethnically diverse populations. Our study aims to fill this gap by focusing on this historically marginalized population.
People in medically underserved communities have a higher proportion of racial/ethnic minorities—the groups that are disproportionally affected by COVID-19 [
12], vitamin D deficiency [
13,
14], and comorbidities [
15]—which may modify the relationships between vitamin D and susceptibility to SARS-CoV-2 infection. Here we present the results from the Minority and Rural Coronavirus Insights Study (MRCIS) conducted among medically underserved communities across five US states [
1]. MRCIS is the largest multi-site, community-based, epidemiologic investigation of the social and structural drivers of health, clinical, environmental, and genetic factors associated with the COVID-19 pandemic in minority and rural communities in the US. Here, we describe the relationships between circulating vitamin D levels and the evidence of SARS-CoV-2 infection in this largely vaccine-naïve study population. This investigation is crucial for informing public health strategies and interventions tailored to the needs of diverse populations.
2. Methods
We conducted a descriptive analysis to examine the relationship between circulating serum vitamin D levels and SARS-CoV-2 infection among the MRCIS cohort. MRCIS aimed to examine various physiological and social demographic factors associated with the molecular and serological assessment of SARS-CoV-2 infection. A detailed description of the study methods and cohort has been published elsewhere [
1].
2.1. Study Enrollment
Briefly, MRCIS was launched as a five-year cohort study by the National Minority Quality Forum (NMQF) to examine risk factors associated with the disproportionate impact of COVID-19 on minority and rural communities. The study was initiated in November 2020, and follow-up procedures will be completed in April 2026. All participants were recruited between November 2020 and April 2021. Through NMQF’s existing partnerships, Federally Qualified Health Centers (FQHCs), funded through the Health Resources & Services Administration (HRSA), were invited to participate in MRCIS as community-based healthcare providers. FQHCs primarily provide services for underserved communities that have historically been underrepresented in clinical research investigations. Thus, the inclusion of FQHCs was instrumental in ensuring the participation of medically underserved communities. Participants were recruited from five FQHCs: Osceola Community Health Services (Kissimmee, FL, USA), Teche Action Clinic (Franklin, LA, USA), John Wesley County Hospital (Los Angeles, CA, USA), Aunt Martha, (Olympia Fields, IL, USA), and Primary One (Columbus, OH, USA) [
1]. These specific FQHCs were selected given their access to minority populations and/or their rural site designation by HRSA.
Convenience sampling techniques were implemented to recruit adult (18 years and older) volunteers. Multiple recruitment strategies were implemented: in-person recruitment (at study health centers, recreational centers, low-income housing complexes, fire departments, and homeless shelters) and by advertising through marketing flyers distributed to the community. Eligible volunteers completed an informed consent form and a baseline social demographic survey, and underwent sample collections. The study was approved by the WIRB-Copernicus Group Institutional Review Board (WCG IRB) under protocol number [#1292174].
2.2. Data Collection
The baseline survey assessed demographic information (e.g., race, ethnicity, age, sex), social drivers of health (e.g., access to utilities, access to transportation, housing instability), COVID-19 mitigation behaviors and practices (e.g., social distancing, mask-wearing), and medical history (i.e., presence of symptoms and comorbidities). Study coordinators involved in data collection received rigorous training on data collection techniques to improve data quality and completeness.
Laboratory Measurements
At enrollment, clinical research staff obtained nasal swabs from participants for a SARS-CoV-2 RNA nucleic acid amplification test (NAAT) and a peripheral venous blood specimen, collected in EDTA-anticoagulated and serum separator evacuated tubes. Laboratory testing was performed through the regional Quest Diagnostics laboratory facility that serviced each FQHC clinical research site. All biological specimens were collected on the same day.
Main Outcome: The SARS-CoV-2 IgG antibody testing was performed using one of the following US Food and Drug Administration Emergency Use Authorization testing platforms: (i) Abbott Architect SARS-CoV-2 IgG (Abbot Park, IL, USA); or (ii) Ortho-Clinical Diagnostics VITROS Anti-SARS-CoV-2 IgG test (San Diego, CA, USA). Anterior nasal swab sample SARS-CoV-2 RNA NAAT testing used one of the following US Food and Drug Administration Emergency Use Authorization testing platforms: (i) a Quest Diagnostics laboratory developed test modeled on the US Centers for Disease Control and Prevention (CDC) methodology, (ii) Cobas® (Roche Molecular Systems, Inc., Pleasanton, CA, USA), (iii) Panther Fusion, (Hologic, Inc., Marlborough, MA, USA), or (iv) Aptima (Hologic, Inc., Marlborough, MA, USA). Prevalence of SARS-CoV-2 infection was determined as a positive result for either or both molecular or serologic assessments. If at the time of specimen collection a participant was eligible for the COVID-19 vaccine based on age requirements by state, then only the (i) SARS-CoV-2 RNA NAAT and (ii) Abbott Architect SARS-CoV-2 nucleocapsid-protein IgG test results were used to assess SARS-CoV-2 infection status.
Main Exposure: The circulating level of vitamin D was assessed using the FDA-cleared total 25 hydroxyvitamin D 25(OH)D Atellica® Solution immunoassay method from Siemens Healthineers (Malvern, PA, USA).
2.3. Statistical Analysis
Descriptive analysis was conducted to examine all variables. Circulating vitamin D, self-reported age, and number of comorbidities were the only continuous variables included in the analysis and were summarized using mean, median, standard deviation, range, interquartile range, and skewness. The linearity of the association between SARS-CoV-2 and vitamin D was examined by plotting the prevalence of infection in each decile. Vitamin D levels were further categorized into three groups based on existing guidelines [
16]: deficient—<20 ng/mL, insufficient—≥20 and <30 ng/mL, and optimal—≥30 ng/mL. Participant age was further categorized into three groups: 18–39, 40–59, and ≥60 years. Number of self-reported comorbidities was further categorized into four groups: 0, 1–2, 3–4 and ≥5. Frequencies and percentages were calculated for all categorical variables. Race and ethnicity information was categorized into four mutually exclusive groups: Hispanic/Latino, non-Hispanic Black/African American, non-Hispanic White, and non-Hispanic Other. A “region” variable was created to recategorize the five study sites: Southeast (Osceola Community Health Services, and Teche Action Clinic), Midwest (Aunt Martha, and Primary One), and West (John Wesley County Hospital).
The study population characteristics were compared by vitamin D status (deficient: <20 ng/mL, insufficient: 20–30 ng/mL, optimal: >30 ng/mL) and SARS-CoV-2 infection status (positive, negative) using Chi-squared testing. Logistic regression models were used to assess the adjusted odds ratios (ORs) and their 95% confidence intervals (CIs). Due to the low number of respondents in certain categories, the sample used for logistic regression was restricted to exclude the following categories: ethnicity—“non-Hispanic Other” and sex—“Other”. Additionally, due to the violation of the assumption of linearity (relationship between vitamin D levels and SARS-CoV-2 infection), participants in the tenth decile of vitamin D levels were removed for the logistic regression analysis. The main effect model included age, race/ethnicity, and vitamin D level (scaled to the 10 ng/mL interval). The referent group race/ethnicity was White non-Hispanic. The main effect model was then used to assess the strength of the association of the remaining variables by individually adding each variable to the main effect model. Data were analyzed using SAS version 9.4 (SAS Institute, Inc., Cary, NC, USA). Data visualization was carried out using R and R Studio version 4.3.1 (Boston, MA, USA).
3. Results
The final analytical sample included 3187 participants for whom vitamin D level test results and SARS-CoV-2 test results ((i) SARS-CoV-2 RNA NAAT, (ii) Abbott Architect SARS-CoV-2 IgG or (iii) Ortho-Clinical Diagnostics VITROS Anti-SARS-CoV-2 IgG) were available (
Supplementary Materials, Figure S1). The study population included male (39.5%) and female (60.5%) adults (age range: 18 to 96 years, mean age = 50.3 years, standard deviation = 15.8 years), who identified as Black non-Hispanic (25.2%), Hispanic or Latino (48.9%), Other (2.6%), or White non-Hispanic (20.1%) (
Table 1). Of these, 825 (25.9%) tested positive for SARS-CoV-2 infection (
Table 1). Most participants were classified as either vitamin D deficient (46.5%) or insufficient (29.7%), and the remaining 23.8% of participants had optimal vitamin D levels (
Table 1). Bivariate analysis demonstrated a statistically significant (
p ≤ 0.05) relationship between vitamin D categorical status and SARS-CoV-2 infection (
Table 1). Vitamin D status was significantly associated with key SARS-CoV-2 infection risk factors such as age (
p < 0.01), race and ethnicity (
p < 0.01), pre-existing co-morbidities (number of comorbidities (
p < 0.01) or pre-existing cardiometabolic co-morbidities (
p < 0.01)), and geographic location (residing in a rural setting (
p < 0.01) or residing in the West, Midwest, or Southeast (
p < 0.01)) (
Table 2). Medical history such as diabetes (
p < 0.01), history of heart attack (
p = 0.02), high blood pressure (
p < 0.01), and cardiac conditions (
p < 0.01) were also associated with vitamin D status (
Table 3).
Vitamin D was also examined as a continuous variable. As mean vitamin D levels increased across deciles (one through nine), the proportion of participants with SARS-CoV-2 infection decreased (
Figure 1). Racial and ethnic differences were also apparent (
Table 4). White non-Hispanic participants reported the highest mean levels of vitamin D (26.3 ± 23.0) and had the lowest proportion of SARS-CoV-2 infection (15.6%) (
Table 4). Black non-Hispanic participants reported the second highest mean levels of vitamin D (25.5 ± 14.9) and had the second highest proportion of SARS-CoV-2 infection (26.7%) (
Table 4). Hispanic participants reported the lowest mean levels of vitamin D (21.9 ± 12.1) and had the highest proportion of SARS-CoV-2 infection (29.5%) (
Table 4). Logistic regression results indicated that for every 10 unit increase in vitamin D levels, the odds of SARS-CoV-2 infection decreased by 12% when adjusting for race/ethnicity and age (
Table 5). Additionally, participants who identified as Hispanic/Latino or Black non-Hispanic had approximately two fold increased odds of SARS-CoV-2 infection when adjusting for age and vitamin D levels (odds ratio (OR): 2.01, confidence interval (CI): 1.56–2.59 (Hispanic/Latino vs. White non-Hispanic) and OR: 1.91, CI: 1.44–2.53 (Black non-Hispanic vs. White non-Hispanic)) (
Table 5). Age had minimal effect on the relationship between vitamin D levels and SARS-CoV-2 infection (OR: 0.99, CI: 0.98–0.99) (
Table 5). This main effect model was also analyzed using categorical vitamin D status and the relationship between vitamin D status and SARS-CoV-2 infection was not significant (
Supplementary Materials, Table S1).
The main effect model (age, race/ethnicity, and vitamin D level (scaled to the 10 ng/mL interval);
Table 5) was used to independently assess the relationship between region and the presence of pre-existing comorbidities and SARS-CoV-2 infection. Vitamin D levels varied by region, with the highest levels in the Southeast and the lowest in the West (
Table 6). When adjusting for age, race/ethnicity, and vitamin D levels, the region was significantly associated with SARS-CoV-2 infection (OR: 4.04, CI: 3.08–5.28 (Midwest vs. Southeast); OR: 4.51, CI: 3.65–5.57 (West vs. Southeast)); however, vitamin D did not remain significant in this model (
Table 6). Residing in a rural setting was significantly associated with SARS-CoV-2 infection (OR: 0.45, CI: 0.35–0.58), however, vitamin D did not remain significant in this model (
Table 6). Conversely, when adjusting for age, race/ethnicity, and vitamin D levels neither cardiometabolic outcomes (OR: 1.03, CI: 0.84–1.26) nor the number of comorbidities were significantly associated with SARS-CoV-2 infection (OR: 1.04, CI: 0.85–1.26 (1–2 comorbidities vs. 0 comorbidities); OR: 0.74, CI: 0.54–1.03 (3–4 comorbidities vs. 0 comorbidities); OR: 0.86, CI: 0.45–1.63 (≥5 comorbidities vs. 0 comorbidities)) (
Table 6). Interaction terms were not significant (region × vitamin D, rural setting × vitamin D).
4. Discussion
This study demonstrated that among a large population of medically underserved persons, for every 10 unit increase in vitamin D levels the odds of SARS-CoV-2 infection decreased by 12%. Our findings are consistent with extant literature that has demonstrated that vitamin D deficiency increases the risk of SARS-CoV-2 infection [
17]. Kaufman et al. conducted a large retrospective analysis that included more than 190,000 patients from all 50 states [
4]. They concluded that SARS-CoV-2 infection was significantly associated with vitamin D levels, i.e., as vitamin D levels decreased SARS-CoV-2 positivity rates increased [
4]. This relationship was similar across all racial/ethnic groups in the sample. Further examination is suggested of the effectiveness of vitamin D supplementation as a prevention method for SARS-CoV-2 infection and in the management of disease progression. Both the affordability and the availability of vitamin D add to the allure of such a strategy.
We also concluded that racial/ethnic minority groups had lower levels of vitamin D and two times increased odds of SARS-CoV-2 infection compared to White non-Hispanics. These differences may be a result of socioeconomic, nutritional, and/or genetic factors that might influence these outcomes. African Americans individuals experience increased burden of many COVID-19 risk factors such as lower socioeconomic status and higher rates of comorbidities such as diabetes and cardiovascular disease [
18]. Additionally, African American people experience lower serum vitamin D levels [
19]. Our findings are consistent with those of Rodriguez and colleagues who concluded that Hispanics experience increased odds of SARS-CoV-2 infection [
20].
Of note, though the association between vitamin D levels and SARS-CoV-2 infection was significant in the main effect model (SARS-CoV-2 infection = age + race/ethnicity + vitamin D level), this association did not remain significant when additional variables (region, residing in a rural setting, cardiometabolic outcomes, number of comorbidities) were added to the main effect model. Interaction terms (region × vitamin D, rural setting × vitamin D) were not statistically significant. The non-significance of vitamin D levels in the minimally adjusted models could potentially be attributed to unmeasured factors such as the degree of sun exposure that directly impact vitamin D levels and vary by region. Thus, the importance of vitamin D’s role in SARS-CoV-2 infection must be interpreted carefully within the context of varying geographic settings. Additionally, there was minimal effect of age on the relationship between vitamin D levels and SARS-CoV-2 infection. Given the findings in extant literature that demonstrates age-related immune function changes and increases in the risk of SARS-CoV-2 infection among older adults [
21], this finding warrants further examination in future studies.
The findings presented here should be interpreted thoughtfully considering the study’s limitations. The cross-sectional study design did not allow for examination of the direction of the relationship between vitamin D levels and SARS-CoV-2 infection. The study survey did not capture data about vitamin D supplementation; thus, the effect of this variable could not be assessed. Furthermore, participant recruitment for MRCIS occurred over a six-month period across five states. Differences in daylight exposure were not accounted for. The use of convenience sampling, though a suitable technique at the time of recruitment, may have resulted in sampling bias and limited the external validity of the study’s findings. These findings may not be applicable to persons who seek care at facilities other than FQHCs and do not face barriers to accessing care. Due to the limitations in the sample size, participants in the “Other” racial/ethnic group were not included in the final regression analyses to maintain statistical power and achieve model convergence. This did now allow for detailed examination of significantly understudied populations which includes American Indians/Alaskan Natives and Native Hawaiians/Other Pacific Islanders. Despite this limitation, the MRCIS cohort includes a predominantly racially and ethnically diverse sample and is one of the only longitudinal studies that examines COVID-19 in this medically underserved population that includes serologic assessment of markers of disease. The longitudinal nature of MRCIS also allows for future research to examine potential changes in vitamin D levels and the relationship with COVID-19, an area where there is a significant paucity of research.
Future research examining the relationship between vitamin D levels and COVID-19 should consider longitudinal study designs to assess the potential causal influence of vitamin D. Additionally, intervention trials are needed to further elucidate the effects of vitamin D supplementation as a preventive measure for COVID-19 and its effects as a component of COVID-19 treatment plan [
22].