Next Article in Journal
Parallelogram Excision: An Algorithmic Approach for Excision Designs in High-Tension Skin Areas
Next Article in Special Issue
Associations between Preoperative Patient Socioeconomic Status and Pain-Related Outcomes with Pain and Function in Patients Undergoing Rotator Cuff Repairs
Previous Article in Journal
Orthodontic Internal Resorption Assessment in Periodontal Breakdown—A Finite Elements Analysis (Part II)
Previous Article in Special Issue
Responsiveness of the Arabic Quick Disabilities of the Arm, Shoulder and Hand in Patients with Upper Extremity Musculoskeletal Disorders
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Responsiveness and Minimal Important Change of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) in Patients with Upper Extremity Musculoskeletal Disorders

Department of Rehabilitation Sciences, College of Applied Medical Sciences, King Saud University, P.O. Box 10219, Riyadh 11433, Saudi Arabia
Healthcare 2023, 11(19), 2623; https://doi.org/10.3390/healthcare11192623
Submission received: 7 August 2023 / Revised: 13 September 2023 / Accepted: 24 September 2023 / Published: 26 September 2023
(This article belongs to the Special Issue Musculoskeletal Disorders, Physical Therapy and Rehabilitation)

Abstract

:
The aim of this study was to examine the responsiveness of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) and to quantify its minimal important change (MIC) for improvement. People with upper extremity musculoskeletal problems who were receiving physical therapy were evaluated at baseline and again during a follow-up appointment, with a median time frame of 7 days between the two testing sessions (range of 6 to 72 days). The participants completed the Arabic DASH, Global Assessment of Function (GAF), Numeric Pain Rating Scale (NPRS) and Global Rating of Change Scale (GRC). The responsiveness of the Arabic DASH was assessed by examining the pre-specified hypotheses. The MIC for improvement was determined using the receiver operating characteristic method (MICROC) and the predictive modeling method (MICpred). As hypothesized, a change in the Arabic DASH demonstrated a significant positive correlation with changes in the GAF (r = 0.69), NPRS (r = 0.68) and GRC (r = 0.73). Consistent with our hypotheses, the DASH change scores could be used to differentiate between participants who improved and those who did not improve (area under the receiver operating characteristic curve = 0.87), and they showed a large magnitude of change (effect size = 1.53, standardized response mean = 1.42) in patients who improved. All the hypotheses specified a priori were supported by the results. The Arabic DASH MICROC and MICpred were estimated to be 14.22 and 14.85. The interaction between the DASH change and baseline score was not a significant predictor of status (improved vs. not improved) (p = 0.75), indicating that the DASH MIC was not baseline-dependent. The Arabic DASH demonstrated sufficient responsiveness, supporting the idea that the Arabic DASH is capable of detecting changes in upper extremity function over time. The value of the Arabic DASH MIC was similar when estimated using the predictive modeling and ROC methods, and the MIC was not dependent on baseline status.

1. Introduction

The Disabilities of the Arm, Shoulder and Hand (DASH) is a commonly used patient-reported outcome measure (PROM) to measure upper extremity function and symptoms [1,2]. Recent systematic reviews of translated versions of the DASH indicated that most translated versions underwent only partial evaluation of psychometric properties and that the majority of studies that examined the psychometric properties of these translated versions had a poor methodological quality [3,4,5]. These reviews point to the need for a rigorous methodology and further assessments of translated DASH psychometric properties, especially responsiveness and interpretability.
Responsiveness refers to the ability of a PROM to detect changes over time in the construct measured [6]. Hypothesis testing is the recommended method for assessing responsiveness according to the consensus-based standards for the selection of health measurement instruments (COSMIN) guidelines [7,8]. A large number of studies that examined the responsiveness of the DASH and its translated versions lack specific hypotheses to examine whether the DASH change scores actually reflect changes in upper extremity function [9,10,11,12,13,14,15,16]. Currently, there is only one study that reported an assessment of the responsiveness of the Arabic version of the DASH [9]. In this study, the authors used effect size indices and the area under the receiver operating characteristic curve to examine the responsiveness. This study lacked any specific pre-defined hypotheses regarding DASH change scores, and the use of effect size indices is inappropriate when examining responsiveness based on COSMIN [7,8].
A PROM’s minimal important change (MIC) is defined as “the smallest change in score in the construct to be measured which patients perceive as important” [7,17]. Determining the MIC is not part of responsiveness testing but rather is related to the interpretability of PROM change scores [7]. The MIC can be determined using distribution- or anchor-based methods, with the latter believed to be superior [17,18,19]. Numerous studies determined the MIC for various translated versions of the DASH using the receiver operating characteristic (ROC), which is an anchor-based method [9,10,20,21,22,23,24,25]. Recently, a predictive modeling method, which is also an anchor-based method, has been reported to be more precise compared to the commonly used ROC method [26]. Additionally, the majority of studies that estimated the DASH MIC, including the Arabic DASH, had unbalanced proportions of participants who improved and those who did not, which is known to bias the MIC estimate [27].
To date, no prior study has examined the responsiveness of the Arabic DASH using the recommended method of hypothesis testing, and no study has determined the Arabic DASH MIC using the more precise predictive modeling method. Thus, the aims of the current study were (1) to examine the responsiveness of the Arabic DASH in patients with upper extremity musculoskeletal disorders and (2) to quantify the MIC for improvement in the Arabic DASH and to examine whether the MIC value is dependent on baseline status. We hypothesized that the Arabic DASH would have sufficient responsiveness in detecting changes over time in upper extremity function.

2. Materials and Methods

2.1. Setting and Participants

The participants in the current study were recruited using convenience sampling from four outpatient physical therapy departments (King Saud University Medical City, Security Forces Hospital, King Abdulaziz Medical City and Physiotri Clinic) in Riyadh, Saudi Arabia. Approval to conduct the study was obtained from the Institutional Review Boards of King Saud University Medical City (18/0372/IRB) and Security Forces Hospital (H-01-R-069). The purpose and procedure of the study were explained to all participants, and signed informed consent forms were obtained before participation. The inclusion criteria used in the current study were (1) primary complaint of upper extremity musculoskeletal disorder and (2) ≥18 years of age. The exclusion criteria used in the current study were (1) not able to read and understand the Arabic language and (2) neurological, cardiovascular, pulmonary or spinal disorders reported by the participant as a cause of functional limitation. In the current study, we aimed to include participants in which the major cause of their functional limitation was upper extremity musculoskeletal disorder. Thus, participants with additional concomitant disorders (neurological, cardiovascular, pulmonary or spinal disorders) were excluded only if these disorders were perceived by the patients to cause functional limitation.

2.2. Procedure

The current study was a prospective cohort study with two assessment time points (baseline and follow-up assessments). The baseline assessment was conducted during the first visit of the participant to the outpatient physical therapy department, while the follow-up assessment was conducted during a follow-up visit. Between the baseline and follow-up assessments, participants were under physical therapy care, which was the responsibility of the treating physical therapist. Given that the current study was an observational study, the type and specifics of the physical therapy interventions were not determined or controlled by the research team. The physical therapy treatment provided included but was not limited to patient education, therapeutic exercises, therapeutic modalities, articular and soft tissue manual therapy, taping and dry needling. The research team had no involvement in the type or details of the physical therapy care provided to the participants. The participants completed the Arabic DASH [9], Numeric Pain Rating Scale [28] and Global Assessment of Function [29,30] during the baseline assessment. During the follow-up assessment, the participants completed the same outcome measures, in addition to the Global Rating of Change Scale [31,32].

2.3. Outcome Measures

2.3.1. Disabilities of the Arm, Shoulder and Hand (DASH)

The DASH is a PROM designed to quantify upper extremity function and symptoms using 30 items [1,2]. The scoring of the DASH items ranged from no functional limitation and no symptoms (score of 1) to functional inability and extreme symptoms (score of 5). The DASH was reported using a total score from 0 to 100, with a higher score indicating the worst function and symptoms. A prior research report supported the validity and reliability of the Arabic version of the DASH [9].

2.3.2. Numeric Pain Rating Scale (NPRS)

The NPRS was used to measure the average pain intensity at the site of the upper extremity dysfunction [33]. The NPRS score ranged from no pain (score of 0) to worst pain imaginable (score of 10) [33]. Prior research studies supported the measurement properties of the Arabic version of the NPRS [28,30].

2.3.3. Global Assessment of Function (GAF)

The GAF is a self-reported outcome used to measure participants’ overall functional ability [30,34]. The GAF score ranged from inability to perform any activity of daily living (score of 0) to ability to perform all activities of daily living without difficulty (score of 100) [30,34]. The measurement properties of the GAF have been supported by prior research studies [30,34].

2.3.4. Global Rating of Change Scale (GRC)

The participants rated their perceived change in upper extremity function during the second assessment relative to the baseline assessment using the GRC. The GRC score ranged from a very great deal worse (score of −5) to a very great deal better (score of 5) [30,32,34]. The GRC was used to stratify participants into those with improved upper extremity function (GRC of 3 “moderately better”, 4 “great deal better” or 5 “very great deal better”) and those with no improvement (GRC ≤ 2 “little bit better”).

2.3.5. Statistical Analysis

Five pre-defined hypotheses were used to examine the responsiveness of the Arabic DASH [17,35] (Table 1). These hypotheses were formulated based on the argument that the DASH is a measure of upper extremity function and symptoms and that changes in DASH scores reflect changes in the same constructs. In the current study, the Arabic DASH was deemed to have sufficient responsiveness if at least 4 out of the 5 hypotheses were supported, surpassing the recommended threshold of 75% [8,36]. Pearson’s and Spearman’s correlation coefficients were used to examine the hypothesized correlation between changes in the Arabic DASH and changes in the other measures (Table 1). The use of either Pearson’s or Spearman’s correlation coefficients was determined after assessing data conformation to a normality assumption using histograms visual inspection and the Shapiro–Wilk test. In the current study, positive change scores indicated improvement, while negative change scores indicated worsening. Multiple dependent t-tests (with no correction for possible inflation of type 1 error) were used to examine the differences in DASH, NPRS and GAF scores between the baseline and follow-up assessments. The effect size (ES) was computed using the mean change divided by the standard deviation of the baseline score, while the standardized response mean (SRM) was computed using the mean change divided by the standard deviation of the change score [37]. ES and SRM were used as part of hypothesis testing and were also used to quantify the magnitude of change observed in the PROM following intervention. ES and SRM values of 0.20, 0.50 and 0.80 were interpreted as small, medium and large effect sizes [38].
To enhance the interpretability of Arabic DASH change scores, the Arabic DASH MIC for improvement was determined using the predictive modeling method (MICpred) and the receiver operating characteristic curve (ROC) method (MICROC) [26,39]. Logistic regression analysis was used to determine the MICpred with the GRC anchor (improved vs. not improved) as the dependent variable and the DASH change score as the independent variable. After running the logistic regression analysis, the MICpred was determined using the following equation: MICpred = (ln(Oddspre) − C)/B, where ln(Oddspre) is a natural logarithm for the odds of improvement before knowing the DASH change score, C is the logistic regression intercept and B is the logistic regression coefficient [26]. To account for the possible influence of patients’ baseline status on the value of the MIC, the predictive modeling method was used to determine the MICpred for patients with a low baseline severity and those with a high baseline severity [26]. The predictive modeling method enables the computation of the MICpred for both subgroups using the data of the whole sample without dividing the sample into subgroups. The baseline median NPRS score was used to stratify the participants into low vs. high baseline severity to avoid the reported bias in determining the MIC when the baseline score of the measure of interest, DASH, is used for stratification [40]. To determine whether baseline severity influenced the MICpred, a logistic regression model was used to predict the GRC anchor (improved vs. not improved) which included the DASH change score, baseline score and interaction between the DASH change score and baseline score. A significant interaction between the DASH change score and baseline score was used to indicate the baseline dependency of the DASH MICpred.
The MICROC was determined to be the optimal DASH change score cutoff point in the ROC curve, best discriminating between participants who improved and those who did not improve and providing the best balance between specificity and sensitivity with the least misclassifications ((1 − specificity) + (1 − sensitivity)) [17]. The ROC curve was created by graphing the rate of false positive on the horizontal axis and the rate of true positive on the vertical axis, using various DASH change score thresholds. The ROC curve was also used in the current study to examine the responsiveness of the DASH. An area under the curve (AUC) of at least 0.70 was used to support the ability of the DASH change score to differentiate between participants with a reported improvement in upper extremity function and those with no improvement (based on the GRC anchor), thus indicating sufficient responsiveness (Table 1) [8,36]. All statistical analyses were conducted using IBM SPSS Statistics 26 (IBM Corp., Armonk, NY, USA) and JASP software (Version 0.17.2).

2.4. Sample Size Estimation

A sample size of 100 participants has been recommended to determine the MIC of a patient-reported outcome measure [19]. A sample size of 100 participants is also sufficient to examine the responsiveness of a PROM using hypothesis testing according to the COSMIN recommendations [41]. Based on that, a sample size of 100 was considered sufficient for the purpose of the current study to examine the responsiveness of the Arabic DASH and to determine its MIC value.

3. Results

One hundred and fifteen participants were recruited in the current study (Table 2). The site of dysfunction detailed in Table 2 refers to the primary site of upper extremity dysfunction that was the reason for seeking physical therapy care. No interaction was observed in terms of site of dysfunction and all participants presented with only one dysfunction site. The descriptive statistics of the outcome measures in the baseline and follow-up assessments are listed in Table 3. Seventy-three participants missed DASH item 21, two participants missed item 5, and one participant missed items 25, 26, 27 and 28 in the baseline assessment. Seventy-eight participants missed DASH item 21 and one participant missed item 10 in the follow-up assessment. The median time interval between the baseline and follow-up assessments was 7 days (range of 6 to 72 days). Based on the GRC completed during the follow-up assessment, 56 participants reported improved upper extremity function, while 59 participants reported no improvement in upper extremity function (Table 4). No floor or ceiling issues were identified in the DASH scores in both the baseline and follow-up assessments (Table 3).
The participants had a significant improvement in upper extremity function and symptoms, as indicated by the significant reduction in the DASH scores over time (p < 0.001) (mean difference: 17.14 points; 95% CI of the difference: 12.57–21.71), the significant increase in the GAF scores over time (p < 0.001) (mean difference: 17.14 points; 95% CI of the difference: 13.05–21.23) and the significant reduction in the NPRS scores over time (p < 0.001) (mean difference: 1.84 points; 95% CI of the difference: 1.30–2.37) (Table 3).
The DASH change scores had a significant positive correlation with the GAF and NPRS change scores (Table 5). The DASH change scores also demonstrated a significant positive correlation with the GRC (Table 5) (Figure 1). All of the reported correlations showed point estimates and 95% confidence intervals above the hypothesized levels. The DASH demonstrated a medium to large effect size in the whole sample but a large effect size in participants with a reported improvement in upper extremity function (GRC ≥ 3) (Table 3). The DASH change score demonstrated an AUC of 0.87 (95% CI: 0.81–0.94) (p < 0.001) (Figure 2), indicating that the DASH change scores differentiated between participants who improved and those who did not. The MICROC for the DASH was 14.22, with a sensitivity of 0.80 and a specificity of 0.85. The MICpred for the DASH was 14.85 (95% CI: 9.52–20.90). The MICpred for patients with a low severity was 13.58 (95% CI: 6.391–21.74), while the MICpred for patients with a high severity was 16.87 (95% CI: 7.01–28.59). The interaction between the DASH change score and baseline score in the logistic regression model was not a significant predictor of patient status (improved vs. not improved) (p = 0.75), indicating that the DASH MIC was not dependent on the baseline score.

4. Discussion

The aims of the current study were to examine the responsiveness of the Arabic DASH, to quantify the MIC for improvement in the Arabic DASH (using predictive modeling and ROC methods) and to examine whether the MIC value is dependent on patients’ baseline status. The Arabic DASH demonstrated sufficient responsiveness, supporting the idea that the Arabic DASH is capable of detecting changes over time in upper extremity function. The value of the Arabic DASH MIC was similar when estimated using the predictive modeling and ROC methods, and the MIC was not dependent on baseline status.
To the best of our knowledge, this is the first study that examined the responsiveness of the Arabic DASH using hypothesis testing with very specific hypotheses in accordance with the COSMIN guidelines [6]. Five pre-defined hypotheses were formulated for the purpose of assessing the responsiveness of the DASH (Table 1). The responsiveness of the Arabic DASH was determined to be sufficient given that the results supported all five pre-defined hypotheses [8,36].
Positive change scores in the Arabic DASH indicating an improvement in upper extremity function were hypothesized to correlate with positive change scores in the GAF, which also indicate an improvement in functional ability. The findings of the current study supported the direction and magnitude of the hypothesized correlation. During the development of the original DASH, a change in the DASH moderately correlated with a global measure of function (r = 0.69), consistent with the findings of the current study [2]. The same correlation pattern was also reported in other translated versions of the DASH. In the Norwegian version, DASH change scores correlated with changes in the Shoulder Pain and Disability Index function (r = 0.82) and changes in SF-36 physical functioning (r = 0.61) [21]. Similarly, the Dutch DASH change exhibited a moderate correlation with changes in other measures of function, such as Constant–Murley activities of daily living (r = 0.64), SF-36 physical component summary (r = −0.56), SF-36 physical functioning (r = 0.57) and the usual activities domain of the five European quality of life dimensions (r = 0.50) [22].
Given that the DASH represents both upper extremity function and symptoms, positive change scores in the Arabic DASH indicating an improvement in upper extremity function and reduced symptoms were hypothesized to correlate with positive change scores in the NRPS, indicating a reduced pain intensity. The results of the current study supported this hypothesized correlation in terms of direction and magnitude. A correlation between changes in the DASH and changes in pain intensity has been reported previously in the literature, supporting the findings of the current study. During the development of the original scale, DASH change scores moderately correlated with changes in pain intensity (r = 0.65), consistent with our finding [2]. A translated version also demonstrated the same pattern, where changes in the Norwegian DASH showed a moderate correlation with changes in the NPRS (r = 0.69), changes in Shoulder Pain and Disability Index pain (r = 0.77) and changes in SF-36 bodily pain (r = 0.53) [21]. Similarly, an improvement in the Dutch DASH score was associated with a reduced pain intensity measured using a visual analogue scale (r = 0.55), the Constant–Murley pain domain (r = 0.45), SF-36 bodily pain (r = 0.47) and the pain domain of the five European quality of life dimensions (r = 0.41) [22].
The GRC was used in the current study as an external anchor to help in determining what was perceived as an important change. To make sure that patients’ reports using the GRC actually reflected a change in a construct similar to that measured with the DASH, i.e., upper extremity function, the wording of the GRC question was formulated to enquire about the perceived change in upper extremity function. Based on the argument that DASH change scores and the GRC reflect changes in the same construct, a hypothesized moderate positive correlation was formulated a priori. This hypothesized correlation was supported by the findings of the current study. A correlation between changes in DASH scores over time and a Global Rating of Change (7-point GRC) has been reported in patients with various upper extremity musculoskeletal disorders (r = 0.66, r = 0.72) [20,42]. In patients with shoulder disorders, the Persian and Danish DASH change scores moderately correlated (r = 0.54; r = 0.52) with a 7-point GRC [10,23], while the Norwegian DASH change scores showed a similar correlation with a 3-point GRC (r = 0.61) [21]. Furthermore, changes in DASH scores demonstrated a moderate correlation with a 15-point GRC (r = 0.60) in patients with lateral elbow tendinopathy [24]. Despite the variation in the format of the GRC in the studies discussed above, DASH change scores correlated with these GRC scales in a manner that supports the responsiveness of the DASH.
Based on the COSMIN guidelines, effect size indices by themselves are not measures of a PROM’s responsiveness; rather, they reflect the magnitude of change [7]. A specific hypothesis regarding the expected magnitude of the effect size was determined a prior in the current study. This use of the effect size as part of the specific hypothesis testing to examine responsiveness is supported according to the COSMIN guidelines [7,17]. Participants who reported an improvement in upper extremity function (based on the GRC score) were hypothesized to have at least a moderate effect size (≥0.5) given that this magnitude of change was believed to represent the clinically significant change that was expected to occur in this cohort. In the current study, a large effect size was observed in patients who improved, supporting our pre-defined hypothesis. Even when all participants in the current study were considered, the observed effect size was higher than the hypothesized magnitude of the medium effect size. During the development of the original scale, the DASH demonstrated effect sizes similar to those reported in the current study in patients receiving surgical intervention for shoulder, wrist and hand injuries with a 12-week follow-up [2]. Additionally, the Thai version of the DASH had effect sizes (SRM = 0.88 and ES  = 0.76) close to those reported in the current study [11]. When only participants who improved were considered, Schmitt et al. reported a lower magnitude of change in the DASH (SRM = 1.26 and ES = 1.21) than ours in patients who received physical therapy and occupational therapy for upper extremity musculoskeletal disorders with a 3-month follow-up [43].
After physical therapy and occupational therapy for upper extremity musculoskeletal disorders, Alotaibi et al. reported higher effect sizes (ES = 1.39, SRM = 1.51) for the Arabic DASH than those reported in the current study. This could be attributed to the shorter follow-up period employed in the current study compared to that of Alotaibi et al. (30–36 days) [9]. On the other hand, Alotaibi et al. used effect size indices to examine the responsiveness of the Arabic DASH with no pre-defined hypothesis regarding the expected magnitude and direction of change, which is considered an inappropriate method of assessing responsiveness based on the COSMIN guidelines [7,17]. A number of previous research studies reported larger effect size indices of the DASH than those reported in this study in patients who received surgical and conservative treatments [22,24,44]. These studies had longer follow-up periods than the current study, and similar to the majority of the literature, the use of effect size indices was not associated with a hypothesized magnitude and direction of these effect sizes.
The area under the ROC curve was used in the current study as part of the hypothesis testing to assess the responsiveness of the Arabic DASH. The Arabic DASH change score was able to differentiate between participants with a reported improvement in upper extremity function and those with no improvement as indicated by the AUC, which was higher than the recommended threshold of 0.7 [8,45]. This discriminatory ability of the DASH change score supports the notion that the Arabic DASH is able to detect changes in upper extremity function over time (responsiveness). The AUC reported in the current study is close to that reported by Alotaibi et al. (AUC = 0.82) for the Arabic DASH, but the 95% confidence interval of the AUC estimate was narrower in the current study, providing greater confidence in the AUC estimate [9]. The AUC reported in the current study is consistent with prior reports in the literature, such as the Italian version of the DASH (AUC = 0.87) [20], but the discriminatory ability of the Arabic DASH seems to be better than that of other versions, such as the Nepali (AUC = 0.69) [20], Dutch (AUC = 0.66) [22], Persian (AUC = 0.77) [23] and Danish versions (AUC = 0.76) [10].
The DASH MIC for improvement was determined in the current study using two methods, namely the predictive modeling method and the ROC method. Predictive modeling is believed to be the preferred method, given that it is more precise compared with the commonly used ROC method. In addition, predictive modeling allows for the inclusion of additional factors that may influence the MIC value, such as baseline status, without the need to split the sample into smaller subgroups [26]. The DASH MIC estimates determined in the current study using both the predictive modeling and ROC methods were very close. To the best of our knowledge, this is the first study that determined the MIC of the DASH using the predictive modeling method. The MIC value estimated in our study using both methods is close to that reported by Alotaibi et al. (MIC = 15) for the Arabic DASH using the ROC method, but with better diagnostic accuracy, sensitivity and specificity [9]. A change in an individual patient’s score of at least the MIC value determined in the current study (14.85) can be interpreted confidently as an important true change in the patient’s upper extremity function and could be used as a threshold for improvement to achieve during the care of individuals with upper extremity musculoskeletal disorders. This magnitude of change is a true change and not related to a measurement error, given that the MIC value exceeds the value of the Arabic DASH minimal detectable change determined previously (9.28) [9]. The MIC of the DASH reported in the current study is consistent with the estimates (MIC = 15, 20) reported during the development of the original DASH using the ROC method [2]. The DASH MIC reported in the current study falls within the MIC value range (4.4 to 25.4) reported in the literature [10,20,21,22,23,24,25,46,47]. This wide range in the literature could be attributed to the variations among the studies in participants’ characteristics, treatments received, the external anchor used and the definition of improvement vs. no improvement. A common limitation in the available studies that estimated the DASH MIC value is the unbalanced proportions of participants who improved and those who did not improve, which is known to introduce bias in the MIC estimate [27].
To the best of our knowledge, no prior studies have examined whether the MIC of the DASH is dependent on the baseline status using an appropriate methodology. Rysstad et al. reported the MIC values for patients with low and high baseline severities by subgrouping their small sample size into even smaller subgroups, which was carried out using the baseline score of the DASH [21]. Using the baseline score of the measure of interest, DASH, to split the sample into subgroups is known to cause false baseline dependence in the MIC estimates; thus, this method is not recommended and was not used in the current study [40]. An alternative method that involves using the baseline score of another PROM related to the measure of interest for stratification (low vs. high baseline severity) is recommended [40]. In line with this recommendation, the baseline NPRS score was used for stratification, given that it represents a construct related to that measured by the DASH and that it had a moderate correlation with the baseline DASH score.
The general aim of this study was to examine the responsiveness of the Arabic DASH and to quantify its MIC, and the scope of the current study was limited to patients with upper extremity musculoskeletal disorders attending physical therapy clinics in Riyadh, Saudi Arabia. Some limitations in the current study are worth mentioning. The median time interval between the baseline and follow-up assessments was relatively short; thus, some of the study findings, such as the effect size, should be interpreted by taking this time interval into account. Despite the relatively short time interval between testing sessions in some participants, about half of the participants reported an improvement in their upper extremity function. Additionally, the current study followed the COSMIN guidelines using specific pre-defined hypotheses to examine the responsiveness with an adequate sample size. This study also used the currently recommended predictive modeling method in determining the MIC and examined possible baseline dependence.

5. Conclusions

The aims of the current study were to examine the responsiveness of the Arabic DASH, to quantify the MIC for improvement in the Arabic DASH (using predictive modeling and ROC methods) and to examine whether the MIC value was dependent on patients’ baseline status. The Arabic DASH demonstrated sufficient responsiveness, supporting the idea that the Arabic DASH is capable of detecting changes in upper extremity function over time. The value of the Arabic DASH MIC was similar when estimated using the predictive modeling and ROC methods, and the MIC was not dependent on baseline status.

Funding

Deputyship for Research and Innovation, “Ministry of Education” in Saudi Arabia (IFKSUOR3-114-2).

Institutional Review Board Statement

This study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Boards of King Saud University Medical City (18/0372/IRB) and Security Forces Hospital (H-01-R-069).

Informed Consent Statement

Informed consent was obtained from all subjects involved in this study.

Data Availability Statement

The data presented in this study are available from the corresponding author on reasonable request.

Acknowledgments

The author extend their appreciation to the Deputyship for Research and Innovation, “Ministry of Education” in Saudi Arabia, for funding this research (IFKSUOR3-114-2).

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Hudak, P.L.; Amadio, P.C.; Bombardier, C. Development of an upper extremity outcome measure: The DASH (disabilities of the arm, shoulder and hand) [corrected]. The Upper Extremity Collaborative Group (UECG). Am. J. Ind. Med. 1996, 29, 602–608. [Google Scholar] [CrossRef]
  2. Beaton, D.E.; Katz, J.N.; Fossel, A.H.; Wright, J.G.; Tarasuk, V.; Bombardier, C. Measuring the whole or the parts? Validity, reliability, and responsiveness of the Disabilities of the Arm, Shoulder and Hand outcome measure in different regions of the upper extremity. J. Hand Ther. 2001, 14, 128–146. [Google Scholar] [CrossRef]
  3. de Klerk, S.; Buchanan, H.; Jerosch-Herold, C. The validity and clinical utility of the Disabilities of the Arm Shoulder and Hand questionnaire for hand injuries in developing country contexts: A systematic review. J. Hand Ther. 2018, 31, 80–90.e1. [Google Scholar] [CrossRef]
  4. Wajngarten, D.; Campos, J.; Garcia, P. The Disabilities of the Arm, Shoulder and Hand scale in the evaluation of disability—A literature review. Med. Lav. 2017, 108, 314–323. [Google Scholar]
  5. Sigirtmac, I.C.; Oksuz, C. Systematic review of the quality of the cross-cultural adaptations of Disabilities of the Arm, Shoulder and Hand (DASH). Med. Lav. 2021, 112, 279–291. [Google Scholar]
  6. Mokkink, L.B.; Terwee, C.B.; Patrick, D.L.; Alonso, J.; Stratford, P.W.; Knol, D.L.; Bouter, L.M.; de Vet, H.C. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J. Clin. Epidemiol. 2010, 63, 737–745. [Google Scholar] [CrossRef]
  7. Mokkink, L.B.; Terwee, C.B.; Knol, D.L.; Stratford, P.W.; Alonso, J.; Patrick, D.L.; Bouter, L.M.; de Vet, H.C. The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: A clarification of its content. BMC Med. Res. Methodol. 2010, 10, 22. [Google Scholar] [CrossRef] [PubMed]
  8. Prinsen, C.A.C.; Mokkink, L.B.; Bouter, L.M.; Alonso, J.; Patrick, D.L.; de Vet, H.C.W.; Terwee, C.B. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual. Life Res. 2018, 27, 1147–1157. [Google Scholar] [CrossRef] [PubMed]
  9. Alotaibi, N.M.; Aljadi, S.H.; Alrowayeh, H.N. Reliability, validity and responsiveness of the Arabic version of the Disability of Arm, Shoulder and Hand (DASH-Arabic). Disabil. Rehabil. 2016, 38, 2469–2478. [Google Scholar] [CrossRef] [PubMed]
  10. Lundquist, C.B.; Dossing, K.; Christiansen, D.H. Responsiveness of a Danish version of the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire. Dan. Med. J. 2014, 61, A4813. [Google Scholar] [PubMed]
  11. Chanthana, P.; Atthakomol, P.; Manosroi, W.; Wongpakaran, T.; Kraisarin, J.; Sananpanich, K. Comparison of patient preferences and responsiveness among common patient-reported outcome measures for hand/wrist injuries or disorders. J. Orthop. Traumatol. 2023, 24, 2. [Google Scholar] [CrossRef]
  12. Gummesson, C.; Atroshi, I.; Ekdahl, C. The disabilities of the arm, shoulder and hand (DASH) outcome questionnaire: Longitudinal construct validity and measuring self-rated health change after surgery. BMC Musculoskelet. Disord. 2003, 4, 11. [Google Scholar] [CrossRef] [PubMed]
  13. Greenslade, J.R.; Mehta, R.L.; Belward, P.; Warwick, D.J. Dash and Boston questionnaire assessment of carpal tunnel syndrome outcome: What is the responsiveness of an outcome questionnaire? J. Hand Surg. Br. 2004, 29, 159–164. [Google Scholar] [CrossRef] [PubMed]
  14. Schonnemann, J.O.; Larsen, K.; Hansen, T.B.; Soballe, K. Reliability and validity of the Danish version of the disabilities of arm, shoulder, and hand questionnaire in patients with fractured wrists. J. Plast. Surg. Hand Surg. 2011, 45, 35–39. [Google Scholar] [CrossRef] [PubMed]
  15. Mehta, S.P.; Tiruttani, R.; Kaur, M.N.; MacDermid, J.; Karim, R. Psychometric Properties of the Hindi Version of the Disabilities of Arm, Shoulder, and Hand: A Pilot Study. Rehabil. Res. Pract. 2015, 2015, 482378. [Google Scholar] [CrossRef]
  16. Imaeda, T.; Toh, S.; Nakao, Y.; Nishida, J.; Hirata, H.; Ijichi, M.; Kohri, C.; Nagano, A. Validation of the Japanese Society for Surgery of the Hand version of the Disability of the Arm, Shoulder, and Hand questionnaire. J. Orthop. Sci. 2005, 10, 353–359. [Google Scholar] [CrossRef]
  17. De Vet, H.C.; Terwee, C.B.; Mokkink, L.B.; Knol, D.L. Measurement in Medicine: A Practical Guide; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
  18. Revicki, D.; Hays, R.D.; Cella, D.; Sloan, J. Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. J. Clin. Epidemiol. 2008, 61, 102–109. [Google Scholar] [CrossRef]
  19. Terwee, C.B.; Peipert, J.D.; Chapman, R.; Lai, J.-S.; Terluin, B.; Cella, D.; Griffith, P.; Mokkink, L.B. Minimal important change (MIC): A conceptual clarification and systematic review of MIC estimates of PROMIS measures. Qual. Life Res. 2021, 30, 2729–2754. [Google Scholar] [CrossRef]
  20. Franchignoni, F.; Vercelli, S.; Giordano, A.; Sartorio, F.; Bravini, E.; Ferriero, G. Minimal clinically important difference of the disabilities of the arm, shoulder and hand outcome measure (DASH) and its shortened version (QuickDASH). J. Orthop. Sports Phys. Ther. 2014, 44, 30–39. [Google Scholar] [CrossRef]
  21. Rysstad, T.; Røe, Y.; Haldorsen, B.; Svege, I.; Strand, L.I. Responsiveness and minimal important change of the Norwegian version of the Disabilities of the Arm, Shoulder and Hand questionnaire (DASH) in patients with subacromial pain syndrome. BMC Musculoskelet. Disord. 2017, 18, 248. [Google Scholar] [CrossRef]
  22. Mahabier, K.C.; Hartog, D.D.; Theyskens, N.; Verhofstad, M.H.; Van Lieshout, E.M.; Bos, P.K.; Bronkhorst, M.W.; Bruijninckx, M.M.; De Haan, J.; Hoed, P.T.D.; et al. Reliability, validity, responsiveness, and minimal important change of the Disabilities of the Arm, Shoulder and Hand and Constant-Murley scores in patients with a humeral shaft fracture. J. Shoulder Elb. Surg. 2017, 26, e1–e12. [Google Scholar]
  23. Negahban, H.; Behtash, Z.; Sohani, S.M.; Salehi, R. Responsiveness of two Persian-versions of shoulder outcome measures following physiotherapy intervention in patients with shoulder disorders. Disabil. Rehabil. 2015, 37, 2300–2304. [Google Scholar] [CrossRef]
  24. Farzad, M.; MacDermid, J.C.; Shafiee, E.; Beygi, A.S.; Vafaei, A.; Varahra, A.; Beikpour, H. Clinimetric testing of the Persian version of the Patient-Rated Tennis Elbow Evaluation (PRTEE) and the Disabilities of the Arm, Shoulder, and Hand (DASH) questionnaires in patients with lateral elbow tendinopathy. Disabil. Rehabil. 2022, 44, 2902–2907. [Google Scholar] [CrossRef]
  25. Kc, S.; Sharma, S.; Ginn, K.; Almadi, T.; Subedi, H.; Reed, D. Cross-cultural adaptation and measurement properties of the Nepali version of the DASH (disability of arm, shoulder and hand) in patients with shoulder pain. Health Qual. Life Outcomes 2019, 17, 51. [Google Scholar]
  26. Terluin, B.; Eekhout, I.; Terwee, C.B.; de Vet, H.C. Minimal important change (MIC) based on a predictive modeling approach was more precise than MIC based on ROC analysis. J. Clin. Epidemiol. 2015, 68, 1388–1396. [Google Scholar] [CrossRef]
  27. Terluin, B.; Eekhout, I.; Terwee, C.B. The anchor-based minimal important change, based on receiver operating characteristic analysis or predictive modeling, may need to be adjusted for the proportion of improved patients. J. Clin. Epidemiol. 2017, 83, 90–100. [Google Scholar] [CrossRef]
  28. Alghadir, A.H.; Anwer, S.; Iqbal, Z.A. The psychometric properties of an Arabic numeric pain rating scale for measuring osteoarthritis knee pain. Disabil. Rehabil. 2016, 38, 2392–2397. [Google Scholar] [CrossRef]
  29. Alnahdi, A.H.; Alrashid, G.I.; Alkhaldi, H.A.; Aldali, A.Z. Cross-cultural adaptation, validity and reliability of the Arabic version of the Lower Extremity Functional Scale. Disabil. Rehabil. 2016, 38, 897–904. [Google Scholar] [CrossRef]
  30. Alnahdi, A.H. Validity and reliability of the Arabic quick disabilities of the arm, Shoulder and Hand (QuickDASH-Arabic). Musculoskelet. Sci. Pract. 2021, 53, 102372. [Google Scholar]
  31. Aljathlani, M.F.; Alshammari, M.O.; Alsuwaygh, M.A.; Al-Mutairi, M.S.; Aljassir, F.F.; Bindawas, S.M.; Alnahdi, A.H. Cross-cultural adaptation and validation of the Arabic version of the upper extremity functional index. Disabil. Rehabil. 2022, 44, 5656–5662. [Google Scholar]
  32. Kamper, S.J.; Maher, C.G.; Mackay, G. Global rating of change scales: A review of strengths and weaknesses and considerations for design. J. Man. Manip. Ther. 2009, 17, 163–170. [Google Scholar] [CrossRef] [PubMed]
  33. Hawker, G.A.; Mian, S.; Kendzerska, T.; French, M. Measures of adult pain: Visual Analog Scale for Pain (VAS Pain), Numeric Rating Scale for Pain (NRS Pain), McGill Pain Questionnaire (MPQ), Short-Form McGill Pain Questionnaire (SF-MPQ), Chronic Pain Grade Scale (CPGS), Short Form-36 Bodily Pain Scale (SF-36 BPS), and Measure of Intermittent and Constant Osteoarthritis Pain (ICOAP). Arthritis Care Res. 2011, 63 (Suppl. S11), S240–S252. [Google Scholar]
  34. Alnahdi, A.H. Measurement properties of the 15-item Arabic lower extremity functional scale. Disabil. Rehabil. 2021, 43, 3839–3844. [Google Scholar] [CrossRef] [PubMed]
  35. Mokkink, L.B.; De Vet, H.C.W.; Prinsen, C.A.C.; Patrick, D.L.; Alonso, J.; Bouter, L.M.; Terwee, C.B. COSMIN Risk of Bias checklist for systematic reviews of Patient-Reported Outcome Measures. Qual. Life Res. 2018, 27, 1171–1179. [Google Scholar] [CrossRef] [PubMed]
  36. Prinsen, C.A.C.; Vohra, S.; Rose, M.R.; Boers, M.; Tugwell, P.; Clarke, M.; Williamson, P.R.; Terwee, C.B. How to select outcome measurement instruments for outcomes included in a “Core Outcome Set”—A practical guideline. Trials 2016, 17, 449. [Google Scholar] [CrossRef] [PubMed]
  37. Portney, L.G.; Watkins, M.P. Foundations of Clinical Research: Applications to Practice, 3rd ed.; Pearson/Prentice Hall: Upper Saddle River, NJ, USA, 2009. [Google Scholar]
  38. Cohen, J. Statistical Power Analysis for the Behavioral Sciences; Routledge: London, UK, 2013. [Google Scholar]
  39. Deyo, R.A.; Centor, R.M. Assessing the responsiveness of functional scales to clinical change: An analogy to diagnostic test performance. J. Chronic Dis. 1986, 39, 897–906. [Google Scholar] [CrossRef]
  40. Terluin, B.; Roos, E.M.; Terwee, C.B.; Thorlund, J.B.; Ingelsrud, L.H. Assessing baseline dependency of anchor-based minimal important change (MIC): Don’t stratify on the baseline score! Qual. Life Res. 2021, 30, 2773–2782. [Google Scholar] [CrossRef]
  41. Terwee, C.B.; Mokkink, L.B.; Knol, D.L.; Ostelo, R.W.J.G.; Bouter, L.M.; de Vet, H.C.W. Rating the methodological quality in systematic reviews of studies on measurement properties: A scoring system for the COSMIN checklist. Qual. Life Res. 2012, 21, 651–657. [Google Scholar] [CrossRef]
  42. Schmitt, J.; Di Fabio, R.P. The validity of prospective and retrospective global change criterion measures. Arch. Phys. Med. Rehabil. 2005, 86, 2270–2276. [Google Scholar] [CrossRef]
  43. Schmitt, J.S.; Di Fabio, R.P. Reliable change and minimum important difference (MID) proportions facilitated group responsiveness comparisons using individual threshold criteria. J. Clin. Epidemiol. 2004, 57, 1008–1018. [Google Scholar] [CrossRef]
  44. MacDermid, J.C.; Tottenham, V. Responsiveness of the disability of the arm, shoulder, and hand (DASH) and patient-rated wrist/hand evaluation (PRWHE) in evaluating change after hand therapy. J. Hand Ther. 2004, 17, 18–23. [Google Scholar] [CrossRef]
  45. Terwee, C.B.; Bot, S.D.M.; de Boer, M.R.; van der Windt, D.A.W.M.; Knol, D.L.; Dekker, J.; Bouter, L.M.; de Vet, H.C.W. Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol. 2007, 60, 34–42. [Google Scholar] [CrossRef] [PubMed]
  46. Hao, Q.; Devji, T.; Zeraatkar, D.; Wang, Y.; Qasim, A.; Siemieniuk, R.A.; Vandvik, P.O.; Lähdeoja, T.; Carrasco-Labra, A.; Agoritsas, T.; et al. Minimal important differences for improvement in shoulder condition patient-reported outcomes: A systematic review to inform a BMJ Rapid Recommendation. BMJ Open 2019, 9, e028777. [Google Scholar] [CrossRef] [PubMed]
  47. Wojnicz, W.; Sobierajska-Rek, A.; Zagrodny, B.; Ludwicki, M.; Jabłońska-Brudło, J.; Forysiak, K. A New Approach to Assess Quality of Motion in Functional Task of Upper Limb in Duchenne Muscular Dystrophy. Appl. Sci. 2022, 12, 12247. [Google Scholar] [CrossRef]
Figure 1. Boxplot of the DASH change scores stratified with the Global Rating of Change (GRC) scores.
Figure 1. Boxplot of the DASH change scores stratified with the Global Rating of Change (GRC) scores.
Healthcare 11 02623 g001
Figure 2. Receiver operating characteristic curve of the DASH change scores. The vertical axis shows the sensitivity, while the horizontal axis shows the 1 − specificity.
Figure 2. Receiver operating characteristic curve of the DASH change scores. The vertical axis shows the sensitivity, while the horizontal axis shows the 1 − specificity.
Healthcare 11 02623 g002
Table 1. Pre-defined hypotheses to examine the responsiveness of the Arabic DASH.
Table 1. Pre-defined hypotheses to examine the responsiveness of the Arabic DASH.
Pre-Defined HypothesesHypothesis Supported
1. DASH change scores demonstrate a positive correlation with GAF change scores (≥0.4).Yes
2. DASH change scores demonstrate a positive correlation with NPRS change scores (≥0.4). Yes
3. DASH change scores demonstrate a positive correlation with GRC scores (≥0.4).Yes
4. The DASH demonstrates an effect size (ES, SRM) of at least 0.5 in participants with a reported improvement in upper extremity function (GRC ≥ 3).Yes
5. DASH change scores differentiate between participants with a reported improvement in upper extremity function and those with no improvement (area under the ROC curve (AUC) ≥ 0.70).Yes
DASH = Disabilities of the Arm, Shoulder and Hand; GAF = Global Assessment of Function; NPRS = Numeric Pain Rating Scale; GRC = Global Rating of Change Scale; ES = effect size; SRM = standardized response mean; ROC = receiver operating characteristic; AUC = area under the curve.
Table 2. Characteristics of participants (N = 115).
Table 2. Characteristics of participants (N = 115).
VariableMean ± SD or N (%)
Age (year)38.18 ± 13.98
Sex
          Male72 (62.6)
          Female43 (37.4)
Height (m)1.68 ± 0.09
Mass (kg)76.39 ±16.56
Body mass index (kg/m2)27.26 ± 5.99
Site of dysfunction
          Shoulder and arm50 (43.5)
          Elbow and forearm21 (18.3)
          Wrist and hand44 (38.3)
Upper extremity surgery
          Yes51 (44.3)
                    Time after surgery (months)1.84 (2.76) *
          No64 (55.7)
                    Duration of symptoms (months)2.99 (7.71) *
* = median (interquartile range).
Table 3. Outcome measures at baseline and follow-up.
Table 3. Outcome measures at baseline and follow-up.
VariableBaseline Scores
Mean ± SD
Follow-Up Scores
Mean ± SD
Change Scores
Mean ± SD
ESSRMBaselineFollow-Up
FloorCeilingFloorCeiling
DASH (0–100)47.47 ± 21.1730.33 ± 23.6617.14 ± 24.730.810.690%0%0%10.4%
Improved (N = 56)52.52 ± 21.5019.53 ± 20.4632.99 ± 23.231.531.42
Unchanged (N = 58)42.25 ± 19.7740.22 ± 22.032.03 ± 14.880.100.14
GAF (0–100)58.70 ± 19.6575.83 ± 20.7417.14 ± 22.160.870.770%0%0%12.2%
NPRS (0–10)5.00 ± 2.413.17 ± 2.811.84 ± 2.900.760.632.6%3.5%1.7%25.2%
ES = effect size; SRM = standardized response mean; DASH = Disabilities of the Arm, Shoulder and Hand; GAF = Global Assessment of Function; NPRS = Numeric Pain Rating Scale. Floor represents the percentage of participants with the worst score (worst status), while ceiling represents the percentage of participants with the best score (best status).
Table 4. Participants according to their global rating of change score at follow-up.
Table 4. Participants according to their global rating of change score at follow-up.
VariableN (%)
GRC
     5 (Very great deal better)25 (21.7)
     4 (Great deal better)15 (13.0)
     3 (Moderately better)16 (13.9)
     2 (Little bit better)26 (22.6)
     1 (A tiny bit better, almost the same)14 (12.2)
     0 (No change)13 (11.3)
     −1 (Tiny bit worse, almost the same)5 (4.3)
     −2 (Little bit worse)0 (0.0)
     −3 (Moderately worse)1 (0.9)
     −4 (Great deal worse)0 (0.0)
     −5 (Very great deal worse)0 (0.0)
Change over time status according to GRC score
          Unchanged vs. Improved vs. Worsened
Unchanged (GRC −2 to 2)58 (50.4)
Improved (GRC ≥ 3)56 (48.7)
Worsened (GRC ≤ -3)1 (0.9)
          Improved vs. Not improved
Improved (GRC ≥ 3)56 (48.7)
Not improved (GRC ≤ 2)59 (51.3)
GRC = Global Rating of Change Scale.
Table 5. Correlation between the DASH change score and change in other measures.
Table 5. Correlation between the DASH change score and change in other measures.
Variabler (95% CI)p
GAF change0.69 (0.55 to 0.81)<0.001
NPRS change0.68 (0.57 to 0.77)<0.001
GRC0.73 (0.64 to 0.80) *<0.001
r = Pearson’s correlation coefficient; CI = confidence interval; DASH = Disabilities of the Arm, Shoulder and Hand; GAF = Global Assessment of Function; NPRS = Numeric Pain Rating Scale; GRC = Global Rating of Change Scale. * Spearman’s correlation coefficient.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alnahdi, A.H. Responsiveness and Minimal Important Change of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) in Patients with Upper Extremity Musculoskeletal Disorders. Healthcare 2023, 11, 2623. https://doi.org/10.3390/healthcare11192623

AMA Style

Alnahdi AH. Responsiveness and Minimal Important Change of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) in Patients with Upper Extremity Musculoskeletal Disorders. Healthcare. 2023; 11(19):2623. https://doi.org/10.3390/healthcare11192623

Chicago/Turabian Style

Alnahdi, Ali H. 2023. "Responsiveness and Minimal Important Change of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) in Patients with Upper Extremity Musculoskeletal Disorders" Healthcare 11, no. 19: 2623. https://doi.org/10.3390/healthcare11192623

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop