2. Methods
This observational study was approved by our Institutional Review Board (Trinity Health of New England Institutional Review Board, Assurance #FWA00020300, approval 28 September 2021) and informed consent was obtained from all participants. Our primary aim was to evaluate longitudinal impairments in health status following COVID-19 infection. Our secondary aim was to evaluate factors related to health status impairment.
Inclusion criteria were: (1) Adults discharged from our hospital or its emergency department (ED) to home after treatment for COVID-19; (2) Confirmation of COVID-19 infection by laboratory testing; and (3) Consent to participate at the initial and subsequent telephone encounters. Exclusion criteria were: (1) A medical event that led to the index health care encounter other than the COVID-19 infection; and (2) Cognitive issues or a perceived language barrier (determined from the initial record review) that would preclude informed consent or adequate participation in the surveys.
We chose survey questionnaires to be brief and amenable to a telephone survey format. These included:
The modified Medical Research Council (mMRC) dyspnea questionnaire (0–4 scale) [
7]. We considered an mMRC rating of 3 or 4 to indicate severe dyspnea.
The Patient Health Questionnaire for Anxiety and Depression (PHQ-4). This screening instrument has 4 questions, 2 focusing on anxiety and 2 on depression symptoms, plus a composite score [
8]. Severity scores of 3 or greater in either anxiety or depression subscales or a PHQ-4 composite score ≥ 6 were considered elevated.
The Cognitive Change Questionnaire (CC8) [
9] is a screening tool with 8 questions rating cognitive impairment. A score ≥ 2 was used as a marker for cognitive impairment.
The Short Fatigue Questionnaire (SFQ) [
10] rates fatigue using 4 questions, each rated on a 1–7-point scale. This is a visual scale with extremes and middle anchored by descriptors. Scores ≥ 18 were used to represent high levels of fatigue.
The Medical Outcomes Study Short-Form General Health Survey (SF-20) [
11]. This health status instrument has 20 questions summarized in 6 subscales: physical functioning, role functioning, social functioning, mental health, health perceptions, and pain. Scores are scaled to range from 0 (worst) to 100 (best). Since there are no thresholds indicating abnormality with this instrument, we included for comparison data from the closest cohort of individuals we could find in the medical literature [
12].
The telephone survey also included 10 review of systems (ROS) questions, evaluating the presence of cough, excessive sleepiness, chest pain, lower extremity edema, frequent headaches, numbness of feet, arthritis or joint pain, difficulty walking, trouble sleeping and the loss of the sense of taste or smell. Dyspnea, fatigue, and memory assessment was not included in the ROS since they were assessed by questionnaire. If there was a “yes” response to a particular ROS item, the patient was then queried as to whether that symptom was worse or not-worse compared to the time before being infected with COVID-19. These ROS responses (1 = yes, 0 = no) were summed to give a ROS symptom score.
Demographic data abstracted from the electronic record included age, sex, insurance status (Medicaid or no insurance was used as a marker for low socioeconomic status (SES)), and self-identified race and ethnicity.
Acute COVID-19 clinical severity markers from the record review included: (1) Peak supplemental oxygen requirement ≥4 L/min (or equivalent); (2) Discharge home with a prescription for supplemental oxygen; (3) Multilobar pneumonia; (4) Intravenous corticosteroid therapy for COVID-19; (5) Remdesivir treatment; (6) Clinical or laboratory cardiac ischemia; (7) Decompensated heart failure; (8) Shock; (9) Use of non-invasive ventilation; and (10) Endotracheal intubation and placement on a respirator. We created an arbitrary severity score by summing the above 10 scores (0 = no, 1 = yes), with the result ranging from 0 (least) to 10 (greatest) acute COVID-19 severity.
Comorbidity was evaluated using a revised and weighted Charlson Comorbidity Index [
13]. An earlier version of this was proven predictive of severe COVID-19 outcome [
14]. In our analysis the Charlson score was tested as both a continuous variable and after dichotomization to scores above and below 1. In addition to the Charlson Comorbidity Index variables, we included systemic hypertension, obesity, and asthma as other potentially important comorbid factors predicting outcome.
While the original protocol was to have investigators contact and perform telephone surveys at 1, 2, and 3 months, we had unanticipated difficulty in contacting some patients after they initially agreed to participate in the study, especially for the third interview. Because of delays in eventually contacting patients or not reaching some patients at all, the time range for the surveys was broadened and we used the last data available (from either second or third interview) to assess trajectory of outcome data, and added time from discharge to the survey as a covariate. For the purpose of our data presentation the last survey (whether the second or third) was called the second survey.
We used SAS 9.4 for Windows 10 (SAS Institute, Inc., Cary, NC, USA) for statistical analyses, including descriptive analyses. The significance of changes in survey data was determined using paired t-tests. Multivariate stepwise regression (SAS PROC GLMSELECT), (SAS Institute, Inc., Cary, NC, USA) was used to evaluate predictors of each component of health status assessed in each of the two surveys. Independent variables included age, race/ethnicity, obesity (clinical diagnosis), SES, Charlson comorbidity, the composite acute disease severity score, questionnaire-rated dyspnea, psychological variables, fatigue, and cognitive status, and ROS symptom score. In the analyses, the questionnaire variables were dichotomized based on whether they were above or not-above the normal range (1 vs. 0) established for the instrument. Charlson Comorbidity Index scores with dichotomized to above and below 1. Severity scores and ROS summary scores were kept as continuous variables. For those analyses of Survey 2 health status, the corresponding value from survey 1 was added as a covariate to the model and time from the health care discharge to the survey was included as a covariate. A p < 0.05 was considered significant.
3. Results
Figure 1 outlines patient recruitment for the study. Of 96 potentially eligible patients from the record review, 54 provided consent and completed Survey 1. The causes of non-inclusion in Survey 1 included not being reachable by telephone for the survey after having previously provided consent (
n = 35), withdrawing consent (
n = 4), and a language barrier identified at the time of the initial contact (
n = 3). Of the 54 completing survey 1, 40 (74%) participated in Survey 2. Reasons for non-participation (dropouts) included not being reachable by telephone (
n = 12) and withdrawal of consent (
n = 2). Compared to those who completed the surveys dropouts were younger (43 ± 17 vs. 53 ± 15 years,
p = 0.04) and had a lower acute COVID-19 severity score (2.1 ± 2.2 vs. 3.3 ± 1.8,
p = 0.04). Data from 40 patients completing the two surveys were analyzed. The time between health care discharge and the first and second surveys were 31 ± 8 and 89 ± 25 days, respectively.
Patient characteristics are listed in
Table 1, while individual COVID-19 severity variables are listed in
Table 2. Age ranged from 25 to 84, half were female, almost 72% were non-Hispanic Black or White, and 40% had low SES. Seventy percent were diagnosed with obesity. Forty percent had at least one comorbid condition; the mean COVID-19 severity score was 3.3 ± 1.8.
Table 3 lists the percentages of patients with elevated dyspnea, fatigue, and cognitive and psychological scores at the two surveys. Although the percentages with values above the normal range decreased numerically in all outcome areas, none of the changes were significant.
Table 4 shows the SF-20 health subscale scores of each survey. Except for the pain score, all subscales showed significant increases (i.e., improvement) at the second survey. In addition, mean subscale scores from a published cohort of Canadians (age range, 66–70 years) living at home [
12] is provided in this table for comparison. Although this cohort was considerably older than our sample, it represents the closest age group to ours we could find in the medical literature. Role Functioning, Social Functioning, Mental Health and Health Perceptions appeared considerably lower in our sample, although statistical comparisons could not be performed.
Table 5 shows the results of stepwise regression models for each SF-20 subscale in the two surveys. In Survey 1, a high fatigue score was significantly related to health perception and physical and role function. An ROS summary score was related to health perception, social functioning, and mental health. In contrast, baseline high dyspnea was related to all six subscale scores in Survey 2 even when controlling for the baseline score of the corresponding subscale. High fatigue and ROS summary score at baseline were not related to subscale scores in Survey 2.
Table 6 provides data from the ROS questions. Patients responding “yes” to a question were then asked to state whether that symptom was worse or not worse compared to the time before the COVID-19 infection began. At Survey 1, the most frequently reported symptoms were trouble walking (55%), cough (50%), headaches (45%), arthritis/joint pain (40%), and chest pain (37.5%). Cough, chest pain, frequent headaches, and loss of or decrease in the sense of smell or taste showed the most improvement from Survey 1 to Survey 2.
4. Discussion
A key gap listed in a National Institutes of Health Workshop on Postacute COVID-19 is insufficient knowledge of the natural history of post-acute COVID-19, which would require longitudinal analyses [
15]. Accordingly, we evaluated the trajectory of health status and common COVID-19 symptoms using two telephone surveys separated by a mean of approximately 57 days. These surveys took place during the second wave of the pandemic in Connecticut in late winter and spring 2021. Patient demographics differ from those hospitalized during the first pandemic wave to hit Connecticut in 2020: those hospitalized in the first wave (that peaked in the spring of 2020) [
16] were predominately older Whites, especially nursing home residents. In contrast, our sample was younger, had a relatively high percentage (40%) with low SES, and had disproportionately high Black and Latino representation. For perspective, 22% of Connecticut citizens had Medicaid insurance coverage in 2018, with Black and Latino groups comprising 10% and 16% of the Connecticut population, respectively [
17,
18]. Racial, ethnic, and socioeconomic discrepancies in hospitalized COVID-19 patients have been noted elsewhere in the United States [
19,
20].
As expected, based on the existing medical literature on the long-term effects of COVID-19 infection, we demonstrated abnormalities in all the symptom questionnaires out to approximately one and three months. Thus, at the one-month survey, high levels of dyspnea were present in 40%, fatigue in 53%, anxiety in 33%, depression in 20%, and cognitive dysfunction in 18%. Additionally, the review of symptoms analysis at one month was positive in more than 25% of patients for cough, excessive sleepiness, chest pain, frequent headaches, arthritis/joint pain, difficulty walking, and trouble sleeping. Also noteworthy is that while most of symptom categories showed numerical improvement at approximately three months, none of the questionnaire variables showed statistical improvement. These findings point to the pervasive and long-lasting effects of this disease.
Our primary focus was on long-term effects on heath status, since relatively limited information is published on this important outcome area. Previous investigations have demonstrated persistent abnormalities in other health status instruments, including the respiratory-specific St. George’s Respiratory Questionnaire (adapted for COVID-19) [
21], the COPD Assessment Test, the EQ-5D-5L, and the Short Form 36 health status questionnaire [
22,
23]. We chose direct administration of the generic measure of health status, the SF-20, to better capture the wide-ranging morbidity from this disease [
5]. This instrument, which takes a few minutes to complete and can be administered by telephone survey, has six subscales: physical function, role functioning, social functioning, mental health, health perception, and pain.
Since normative values for the SF-20 are not established and threshold values denoting abnormality for this instrument do not exist, we included for comparison published data from a cohort of male and female Canadians living at home who completed the survey. It is of note that the closest age group in this comparison was of the age range 66–70 years. Despite our patients being considerably younger, the six SF-20 component score means were lower (lower is worse) than in this comparative group, although, of course, statistical comparisons or inferences on minimal clinically-important differences cannot be made. Nonetheless, substantial differences in the subscales of Role Functioning, Social Functioning, Mental Health, and Health Perception are evident, attesting to the long-term impacts of COVID-19 in areas other than on merely physical health. These findings likely reflect added long-lasting psychological and social issues, such as fear, stigma, and social isolation, on the individual.
On multivariable analyses, no single variable stood out in predicting health status components at Survey 1 (fatigue was most prevalent as a predictor, being present in three of six SF-20 components). However higher levels of dyspnea (i.e., mMRC of three or four) was significantly predictive of impairment in all six SF-20 component scores at Survey 2. The reason(s) for these associations are not clear. Our patient sample had severe acute lung disease, as reflected by a 73% incidence of multilobar pneumonia and the fact that 70% required high supplemental oxygen at some point during their acute care encounter. It is recognized that Post-COVID-19 Condition may be due in part to long-term lung injury or unresolved inflammation, as suggested by its association with acute disease severity, early dyspnea, and some biomarkers such as D-dimer [
24]. It is also possible that other symptoms, such as fatigue, might resolve more quickly over time than dyspnea, leaving the latter as a predictor in later follow-up. However, while persistent dyspnea, as a marker for ongoing lower respiratory disease and/or underlying comorbidity, might be expected to explain impairments in health perception or in physical, role, or social functioning, it is difficult to explain the R
2 of 0.45 as the sole significant predictor of Pain at Survey 2. An R
2 of 0.45 indicates that 45% of the inter-subject variation in pain is explained by an mMRC of 3 or greater. These associations cannot be fully explained by the data on hand. Physiologic data at the time of the surveys might have been helpful, but were not part of this investigation.
While improvement in the symptom questionnaires was minimal and not statistically significant over the mean of 57 days, five of the SF-20 subscales improved significantly over this interval, some approaching those of the (older) comparator group. It is not clear why the observed changes in the generic health status questionnaire were more robust than those of the symptom questionnaires. This may reflect simply differences in individual responsiveness to change among the instruments, although direct comparisons in this measurement property are not available. Alternatively, Social Functioning and Health Perception, which showed the greatest numerical and statistical improvement, may have captured decreasing limitation in physical and social activities and greater self-efficacy perception as the pandemic surge waned and vaccinations became more widespread. Quarantine practices for this disease reduce directly-measured physical activity [
25] and adversely affect mental health outcomes [
26,
27]. Our surveys took place when vaccinations were becoming available to the public. Since vaccination for COVID-19 not only reduces health care utilization and mortality risk, it also appears to reduce long COVID-19 symptoms at 120 days [
28], knowledge of vaccination status would be useful in interpreting these results. Unfortunately, our original protocol did not provide for a query on vaccination status, so we cannot evaluate whether vaccinations were a significant factor in these outcomes.
A secondary goal of our study was to evaluate those factors which impact health status. To accomplish this we related demographic factors, socioeconomic status, acute COVID-19 severity, baseline comorbidity, dyspnea, fatigue, psychological variables, cognitive function, and current symptoms (ROS) in multivariate analyses as potential predictors of the health status subscales (
Table 5). A number of independent variables entered into the multivariate models predicting health status subscales, attesting to the multidimensional nature of this measurement. However, only high dyspnea entered into every SF-20 subscale model (including Pain)—but only for Survey 2. The reason(s) behind this observation are not clear.
One limitation of this study and other COVID-19 observational studies evaluating persistent effects of this disease is that pre-infection, baseline data are not available for comparison. For instance, we demonstrated elevated dyspnea to be present in 40% at approximately one month after health care discharge. What portion of this 40% is new and secondary to COVID-19 and what portion is not new but rather due to pre-existing disease? Adding to this dilemma is the fact that COVID-19 infection is more common in those with a pre-existing disease burden and therefore more likely to have chronic symptoms.
Another limitation of our study is that we surveyed only a small number of patients in one health care system over a relatively short period of time and with discharge to a home setting only. Only about one-half of the patients eligible for inclusion in the study were able to be contacted by telephone, and of those who completed the first survey, 26% did not participate in the second. Indeed, dropouts were younger and had a lower acute severity score, creating problems with interpretation of the data. Our surveys coincided with the tail end of the second COVID-19 surge that peaked in Connecticut in mid-January, 2021 [
29]; in contrast to the first surge, it tended to produce clinical disease in younger individuals. Adding to this limitation of generalizability was the availability of COVID-19 vaccines (which were not analyzed in our study) over the span of the surveys. Although the number receiving vaccines was probably low as our study commenced, uptake was probably higher by the second survey. While vaccination for this disease reduces both infectivity rate and clinical severity of the infection, limited data suggest that full vaccination may also reduce symptoms out to 28 days or longer in those with breakthrough infections [
30], possibly influencing our results.
As mentioned earlier, since our severity score was not validated, inferences on acute severity (not) predicting the persistence of symptoms must be made with caution. Additionally, the comorbid conditions making up the Charlson Comorbidity Index may not be relevant in predicting sequelae to COVID-19 infection, although another version of the Charlson Index did demonstrate that a score above 0 was related to an increased risk of severe COVID-19 and death [
14]. It is of note that none of the other potentially relevant coexisting conditions that we included—obesity, hypertension, and asthma—predicted long-term impairment in health status.
In summary, our findings are similar to other published observational studies in that we demonstrated the long-term effects of COVID-19 infection across multiple symptom PRO areas and health status. Our data differ somewhat in that we measured outcomes at two times, approximately 57 days apart, giving insight into longitudinal changes in post-acute COVID-19. Health status, which may capture a wider aspect of impact from the disease and its sequelae, seemed to show more improvement over this interval. The implication from our study is that clinicians should be cognizant of the long-term effects of the disease in some individuals in order to initiate therapy, when indicated.
In conclusion, we demonstrated substantial impairments in multiple aspects of general health status out to one month and, to lesser degree, out to three months. Persistent dyspnea appears to be a factor in prolonged health status limitation. Taking this into consideration, comprehensive pulmonary rehabilitation may be of benefit for those with ongoing health status impairment. This intervention, which includes exercise training, promotion of self-efficacy through collaborative self-management, and care coordination exerts its often-substantial beneficial effects though ameliorating the effects of systemic or comorbid illness [
31,
32]. Recognizing the long-term symptoms and functional limitation of some COVID-19 patients, the University of Leicester has recently started a rehabilitation program for these patients [
33]. Further investigation of its effectiveness in this particular situation needs to be determined.