Next Article in Journal
BRAF Detection in FNAC Combined with Semi-Quantitative 99mTc-MIBI Technique and AI Model, an Economic and Efficient Predicting Tool for Malignancy in Thyroid Nodules
Previous Article in Journal
Circulating IgG Fragments for Gastric Cancer and Esophageal Cancer
Previous Article in Special Issue
White Matter Hyperintensity in Patients with Sudden Sensorineural Hearing Loss
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Accuracy and Consistency of Confidence Limits for Monosyllable Identification Scores Derived Using Simulation, the Harrell–Davis Estimator, and Nonlinear Quantile Regression

1
Department of Medical Rehabilitation Sciences, College of Applied Medical Sciences, King Khalid University, Abha 61481, Saudi Arabia
2
Speech Language Pathology Unit, College of Applied Medical Sciences, King Khalid University, Abha 61481, Saudi Arabia
3
Department of Speech Pathology and Audiology, Amrutha Institute of Medical Sciences, Kochi 682041, Kerala, India
4
Department of Audiology/Prevention of Communication Disorders, All India Institute of Speech and Hearing, Manasagangothri, Mysore 570006, Karanataka, India
5
Cambridge Hearing Group, Department of Psychology, University of Cambridge, Cambridge CB2 1TN, UK
*
Author to whom correspondence should be addressed.
Diagnostics 2024, 14(13), 1397; https://doi.org/10.3390/diagnostics14131397
Submission received: 9 June 2024 / Revised: 27 June 2024 / Accepted: 27 June 2024 / Published: 30 June 2024
(This article belongs to the Special Issue Hearing Loss: From Diagnosis to Pathology)

Abstract

:
Background: Audiological diagnosis and rehabilitation often involve the assessment of whether the maximum speech identification score (PBmax) is poorer than expected from the pure-tone average (PTA) threshold. This requires the estimation of the lower boundary of the PBmax values expected for a given PTA (one-tailed 95% confidence limit, CL). This study compares the accuracy and consistency of three methods for estimating the 95% CL. Method: The 95% CL values were estimated using a simulation method, the Harrell–Davis (HD) estimator, and non-linear quantile regression (nQR); the latter two are both distribution-free methods. The first two methods require the formation of sub-groups with different PTAs. Accuracy and consistency in the estimation of the 95% CL were assessed by applying each method to many random samples of 50% of the available data and using the fitted parameters to predict the data for the remaining 50%. Study sample: A total of 642 participants aged 17 to 84 years with sensorineural hearing loss were recruited from audiology clinics. Pure-tone audiograms were obtained and PBmax scores were measured using monosyllables at 40 dB above the speech recognition threshold or at the most comfortable level. Results: For the simulation method, 6.7 to 8.2% of the PBmax values fell below the 95% CL for both ears, exceeding the target value of 5%. For the HD and nQR methods, the PBmax values fell below the estimated 95% CL for approximately 5% of the ears, indicating good accuracy. Consistency, estimated from the standard deviation of the deviations from the target value of 5%, was similar for all the methods. Conclusions: The nQR method is recommended because it has good accuracy and consistency, and it does not require the formation of arbitrary PTA sub-groups.

1. Introduction

Speech identification scores (SISs) for phonetically balanced (PB) word or syllable lists are often included in an audiological test battery. If time permits, SIS can be measured at different sensation levels. The highest score obtained is denoted PBmax and it represents the best score for words in quiet that the patient can achieve when unaided. Due to time restrictions, SISs are commonly estimated in clinical practice at a single high level, chosen to be well above the detection threshold. The SIS at that level is also often denoted PBmax [1,2,3], although the true PBmax may be higher than this measured value. In addition to indicating a person’s receptive communication abilities, PBmax is useful in the differential diagnosis of auditory disorders and the fitting of hearing aids [4].
The pure-tone average (PTA) hearing threshold level (HTL), usually averaged across the frequencies 0.5, 1, 2, and 4 kHz, is the most commonly used index of peripheral hearing loss [5,6], and it has been used as a predictor of word scores, including PBmax values, in many studies [7,8]. Knowing if the PBmax obtained at a suprathreshold level is poorer than would be expected from the PTA is important when using PBmax for differential diagnoses [9,10]. For example, a poorer-than-expected PBmax might indicate a retro-cochlear disorder or auditory neuropathy. To determine whether the PBmax is lower than expected from the PTA, the lower boundary of the PBmax value expected for a given PTA must be estimated.
Yellin et al. [11] first proposed using the lower boundary for PBmax to determine whether the PBmax for an individual with sensorineural hearing loss was poorer than would be expected from the PTA. They referred to this as being “disproportionately poor”. They calculated the PTA as the average HTL across 1, 2, and 4 kHz. From the scatterplot relating PBmax to PTA, a lower boundary (a straight line relating PBmax to PTA) was constructed such that approximately 98% of the observed values fell above the boundary. A PBmax score was considered “disproportionately poor” if it fell below this empirically derived boundary. However, the relationship between PTA and PBmax was non-linear and many of the PBmax values were at the ceiling (100%). Due to this, it is inappropriate to determine the lower boundary of PBmax associated with a specific PTA by fitting a linear function to the data [7,8].
Dubno, Lee, Klein, Matthews, and Lam [7] adopted a different method for estimating the lower boundary of PBmax as a function of PTA. They considered 407 hearing-impaired ears from 212 participants, and divided them into 11 sub-groups based on PTA. The number of participants within each sub-group ranged from 18 to 50, which meant that the 95% confidence limit (CL) for each sub-group could not be determined accurately. To deal with this problem, a computer simulation method was used. The measured standard deviations (SDs) of the PBmax scores within each sub-group were always larger than the SDs estimated from the binomial distribution (see Equation (1) in Dubno, Lee, Klein, Matthews, and Lam [7]), on average by a factor of 1.62. Hence, for each PTA group, 2500 samples of normally distributed PBmax scores were generated using the mean determined from the experimental data and 1.62 times the SD estimated from the binomial distribution. Within each PTA sub-group, a one-tailed 95% CL was estimated from the simulation data. The values obtained in this way are denoted 95% CLS. A function was then fitted to the simulated 95% CLS values to predict the 95% CLS as a continuous function of PTA. Hereafter, this is referred to as the simulation method. The simulation method was adopted by Margolis et al. [12] for estimating 95% CL values for three sets of recorded speech materials, Q/MASS Tillman, VA, and Auditec [13].
In addition to the simulation method, several robust distribution-free methods have been proposed for estimating the 95% CL for non-normal distributions [14,15]. Such methods may be more accurate than the simulation method since the distribution of PBmax values varies with PTA and is rarely normal [7,8]. Among the various distribution-free methods, the Harrell–Davis (HD) estimator [16] performs better for small datasets and extreme quantiles [14,15,17]. Like the simulation method, the Harrell–Davis method requires the formation of sub-groups with different PTA values.
Non-linear quantile regression (nQR) [18,19] is another method that does not require assumptions about normality, and it has the additional advantage that it does not require the formation of sub-groups with different PTA values, which eliminates the sampling errors that occur with sub-grouping. Narne, Möller, Wolff, Houmøller, Loquet, Hammershøi, and Schmidt [8] used the nQR method to determine one-tailed 95% CL, denoted 95% CLQR, for a word list in Danish, using data from a large group of young and old participants with sensorineural hearing loss. They found that approximately 5% of the PBmax values fell below the estimated 95% CLQR values, indicating that the method did converge on the target 95% CL.
The estimation of an extreme quantile like 95% is affected by the sample distribution, sample size, and the estimation method [17,20,21]. Previous studies have assessed the performance of a given method for estimating the 95% CL by measuring the percentage of PBmax values that fell below the estimated 95% CL; ideally, this should be close to 5%. However, another important aspect of the performance of a method for determining the 95% CL is its consistency. This can be estimated by applying the method to data obtained from different samples of patients and assessing how well the estimated 95% CL values agree across the samples.
The present study compared three methods for estimating the 95% CL as a function of PTA, namely simulation, HD, and nQR. The performance of each method was assessed using two measures: (1) how accurately the method estimated the 95% CL, i.e., how close to 5% was the number of PBmax values falling below the estimated 95% CL; (2) how consistent the method was in estimating 95% CL values when data from different samples of participants were used. To perform these analyses, the available dataset was randomly split into two, such that each dataset had 50% of the data. One set, called the fitting set, was used to estimate 95% CL as a function of PTA. The fitted function was then applied to the remaining 50% of the data, called the test set, and the proportion of cases falling below the fitted 95% CL was determined. This whole procedure was repeated 25 times, each with a different random draw of 50% of the data for the fitting set. The deviation (DEV) of the percentage of cases falling below the fitted 95% CL from the target value of 5% was determined. The mean of DEV across the 25 draws gives a measure of the accuracy and the SD gives a measure of the consistency.
The simulation and HD methods depend on the division of participants into PTA sub-groups. A final objective was to assess how this grouping affected the estimated 95% CL values. To do this, 95% CL was estimated using the simulation and HD methods for various PTA sub-groups.

2. Materials and Methods

2.1. Participants

Patients referred for hearing aid treatment with various degrees of hearing loss were recruited from the JSS Institute of Speech and Hearing, Mysore. All the participants underwent pure-tone audiometry and an otolaryngologic examination. It was determined that no participants required any surgical or medical treatment for their hearing loss apart from hearing aids. Initially, 720 participants were recruited. Seventy-eight participants had an air–bone gap 15 dB for at least two frequencies in the frequency range 0.5 to 2 kHz, indicating a conductive component of the hearing loss, and these were excluded, leaving 642 participants, 386 men and 256 women. Their ages ranged from 17 to 84 years with a mean of 52, median of 68, and SD of 20 years. Of these, 185 were older than 60 years and 447 were 60 years old or younger.

2.2. Procedure

2.2.1. Basic Audiological Evaluation

All the audiological evaluations were carried out in double-walled sound-treated rooms. A calibrated two-channel diagnostic audiometer (Piano, Inventis, Padova, Italy) with TDH-39 headphones (Telephonics, Farmingdale, NY, USA) was used to determine the pure-tone air-conduction HTLs and a B-71 bone vibrator (Radioear, Middelfart, Denmark) was used for bone-conduction HTLs. HTLs were estimated using the modified Hughson–Westlake procedure [22]. The PTA was calculated for each ear separately by taking the average of the air-conduction HTLs at 0.5, 1, 2, and 4 kHz.

2.2.2. Speech Recognition Threshold (SRT)

The speech recognition threshold (SRT) in quiet was measured in the same session as pure-tone audiometry using paired words developed by the All-India Institute of Speech and Hearing in the Kannada language. The stimuli were presented using monitored (via a sound-level meter) live voice via the Piano audiometer and Sennheiser HDA-200 headphones. The SRT was estimated separately for each ear using the method proposed by the American Speech–Language–Hearing Association [23]. A modified Hughson–Westlake method was used. After two or three correct answers out of three, the level was reduced by 10 dB. After 0 or 1 correct answer out of 3, the level was increased by 5 dB. The final SRT was taken as the lowest level giving 2/3 correct.

2.2.3. Maximum Speech Identification Score (PBmax)

PBmax was obtained for each ear using two phonemically balanced lists developed by Mayadevi and Vyasamurthy [24], each list containing 25 non-meaningful monosyllabic consonant–vowel (CV) stimuli. The second list used the same stimuli as the first list, but in a different randomized order. One list was used for each ear. The stimuli were presented through TDH-39 headphones using monitored live voice. Although recorded materials would be preferable for measuring PBmax, many audiologists use monitored live voice presentations [2,3,25,26,27]. Testing began with the better-hearing ear (based on the PTA) for the participants with asymmetrical hearing loss; if the PTA was the same for the two ears, testing began with the right ear. The list was presented in quiet at 40 dB above the SRT measured in quiet or at the most comfortable level in cases of severe hearing loss, where loudness discomfort prevented the use of levels 40 dB above the SRT. Masking in the non-test ear was used when necessary according to standard masking rules [28].

2.2.4. Statistical Analyses

All the statistical analyses were performed using the R software version 3.6 [29] and programmed with RStudio, V 2022.12 [30]. Quantile regression was performed using the “quantreg” package, V 5.95 [31]. Confidence limits using the HD method were estimated in the R software using the “WRS” package, V 0.40 [32].

2.3. Derivation of 95% CL Using Three Methods

Initially, the entire dataset was used to characterize the distribution of the PBmax values as a function of PTA, and to assess how accurately the 95% CLs were estimated for each PTA group using the simulation and HD methods.

2.4. Distribution of PBmax Values as a Function of PTA

The PBmax and PTA values for the right and left ears were analyzed separately. To characterize the distribution of PBmax as a function of PTA, the PTA values were assigned to sub-groups: <15, 16–25, 26–35… The distributions of the PBmax values for each PTA sub-group for the left and right ears are shown in Figure 1. The PBmax values were highly skewed for several of the PTA sub-groups for both ears. For PTAs up to 55 dB HL, many PBmax scores were higher than 90%. The distributions were generally similar for the left and right ears, except for the PTA group 66–75 dB HL. The left-most columns of Table 1 and Table 2 present the mean PTA and number of ears for each PTA group, and, for PBmax, the mean, SD, skewness, and kurtosis for the right ear (Table 1) and the left ear (Table 2).

2.5. Method 1: Deriving the Means and 95% CL for PBmax Using the Simulation Method

The means and 95% CLS values were derived using a modification of the simulation method of Dubno, Lee, Klein, Matthews, and Lam [7] separately for the left and right ears. The PBmax scores within each PTA group were assumed to be normally distributed for the purpose of the computer simulations, as was done by Dubno et al. [7], even though the distributions clearly deviated from normal. A normal distribution with parameters mean and SD corresponding to the measured values for right and left ears shown in Table 1 and Table 2, respectively, was used to generate simulated data for each PTA sub-group using 50,000 simulations per sub-group. Note that the SDs here were estimated directly from the data, whereas Dubno et al. estimated SDs as 1.62 times the SDs estimated from the binomial distribution. The simulated PBmax values were set to 100 if they were >100 and were set to 0 if they were less than 0. The PBmax value below which 2500 or 5% of 50,000 runs fell for each PTA group was taken as the 95% CLS. Since the PBmax scores were obtained using lists of 25 nonsense syllables, when expressed as percentages, the scores were integer multiples of 4. Since the 95% CLS values estimated using the simulation method were not the integer multiples of 4, the values obtained were rounded to the closest integer multiple of 4.
The means, SDs, and 95% CLS values are shown in the middle columns of Table 1 and Table 2, for right and left ears, respectively. The simulated mean values differed slightly from the measured mean values, mainly because the simulated data were truncated when they were outside the range of 0–100%. The discrepancies in the SDs were larger, but not dramatically so.

2.6. Method 2: Deriving the Median and 95% CL for PBmax Using the HD Method [16]

The median (50% CLHD) and 95% CLHD were estimated using the HD method [16] from bootstrapped samples using the procedure detailed by Harris and Boyd [15] and Wilcox [14]. The resulting values for each PTA group are shown in the right-most columns in Table 1 and Table 2.
In addition to 50% CLHD and 95% CLHD, the standard errors (SEs) of the 95% CLHD values were estimated using the bootstrap approach described by Harris and Boyd [15]. The SEs of the 95% CLS values were estimated using the procedure described by Dowd [33]. The outcomes are shown in Table 1 and Table 2. The SEs of the 95% CLS values were consistently higher than the SEs of the 95% CLHD values.

2.7. Fitting the Means or Medians and 95% CL Values as a Function of PTA for the Simulation and HD Methods

To estimate the means, medians, and 95% CLs derived using the simulation and HD methods as continuous functions of PTA, it was necessary to select a function with a suitable general form. The function used by Dubno, Lee, Klein, Matthews, and Lam [7] fitted their data well for the range of PTA values that they considered (up to about 70 dB HL), but flattened off for larger PTA values, whereas our data indicated that the PBmax values approached zero quite steeply as the PTA approached 90 dB HL. Several functions were tried, and it was found that good fits could be obtained with the following function:
P B m a x = β 1 × 2 e β 2   × P T A β 3
where β1 is the value of PBmax for PTA = 0 dB HL, and β2 and β3 are the constants that determine the slope and curvature of the function with increasing PTA. For the values of PTA less than zero, PBmax was set to β1. When Equation (1) called for a value of PBmax less than zero, PBmax was set to zero.

2.8. Method 3: Non-Linear Quantile Regression

For nQR, the function relating PBmax to PTA as a continuous independent variable was similar in form to Equation (1), as specified by Equation (2).
Y i = β 1 × 2 e β 2   τ × x i β 3 τ + e i
where Yi is the ith observation of PBmax, xi is the ith observation of the PTA, τ denotes the percentile, β1 is the value of PBmax for PTA = 0 dB HL, and β2 [ τ ] and β3 [ τ ] are the constants that determine the slope and curvature of the function with increasing PTA. The error terms, ei, were assumed to have a normal distribution with zero mean: e i   ~   N ( 0 , σ e 2 ) . An interior point algorithm [19] was used for fitting the quantiles τ = 0.5 (Median), and τ = 0.05 (i.e., lower boundary, corresponding to the one-tailed 95% CL). For values of PTA less than zero, PBmax was set to β1. When Equation (2) called for a value of PBmax less than zero, PBmax was set to zero.
The goodness of fit of a nQR model is measured by R1(τ) [34,35], which is an analog of R2 for linear regression. R1(τ) is defined by Equation (3).
R 1 τ = i = 1 n τ y i ý i   τ i = 1 n τ y i y τ  
where y(τ) is the τth quantile of the sample distribution of y, ý i τ is the predicted PBmax value from the nQR model, y i represents the measured value of PBmax, and n is the number of samples.

2.9. Evaluation of the Three Methods Using Random Draws

The initial evaluation of the accuracy of each method was based on the entire dataset. The measure of accuracy was the percentage of cases falling below the estimated 95% CL, which should be close to 5% if a method is accurate. To provide an additional measure of accuracy and to assess consistency, a random sample of 50% of the data (i.e., 321 participants), denoted the fitting set, was taken and was fitted using each method. Then, the fitted parameters were applied to the remaining 50% of the data, the test set, to estimate the 95% CL values for an independent dataset. This process was repeated 25 times using different random draws of 50% of the data. The deviation (DEV) of the percentage of cases falling below the fitted 95% CL from the target value of 5% was determined. The mean of DEV across the 25 draws gives a measure of the accuracy and the SD of the DEV values gives a measure of the consistency.

3. Results

The mean values of PBmax for the measured and simulated data (based on the entire dataset), and the 95% CLS for the simulated data, were estimated as continuous functions of PTA using Equation (1) for both the simulation and HD methods by finding the values of β1, β2, and β3 that minimized the mean square deviation between the fitted functions and the simulated means. The resulting 95% CL values are shown in Table 1 and Table 2. The best-fitting parameter values are given in Table 3. The values of β1 and β3 were similar for the two ears but the values of β2 differed somewhat across the ears. The goodness of fit, measured by the proportion of variance accounted for, R2, is also shown in Table 3. The curves in Figure 2 show the best-fitting functions (long dashed—mean measured data; short dashed—mean simulated data; solid—estimated 95% CLS). The curves in Figure 3 show the best-fitting functions (dashed—median; solid—estimated 95% CLHD).
The fitted nQR functions for the median and the 95% CLQR values are shown in Figure 4. The estimated parameters, standard error of the parameters, and R1 values of the nQR model are given in Table 3. Note that the R1 values are not comparable to the R2 values for the simulation and HD methods, because the former are based on the fits to the data for the individual participants whereas the latter are based on the fits to the measured and simulated values for the PTA sub-groups.

3.1. Comparison of the 95% CL Predicted Using the Three Methods for All Data

The lower left and right panels of Figure 5 compare the 95% CL as a function of PTA estimated using the simulation, HD, and nQR methods including all the data. The 95% CLs were very similar for the HD and nQR methods. The upper panels show the differences between the simulation and HD methods and the simulation and nQR methods. Compared to the HD and nQR methods, the simulation method led to slightly higher 95% CL values for low to medium PTA values for the right ear and for medium to high PTA values for the left ear. As a result, a greater percentage of the ears fell below the estimated 95% CLs for the simulation method than for the HD or nQR methods. For the simulation method, 6.8% and 6.7% of the PBmax values fell below the 95% CLS curve for the right and left ears, respectively, values that are above the target value of 5%. In contrast, for the HD method, 4.9% of the PBmax values fell below the 95% CLHD curve for both right and left ears and for the nQR method, 5.3% of the PBmax values fell below the 95% CLQR curve for both right and left ears, both close to the desired value of 5%.

3.2. Accuracy and Consistency of the 95% CL Predicted Using the Three Methods

This section presents the results of the analyses based on fitting the data for a random sample of 50% of the total data (the fitting set) and then assessing the accuracy and consistency of the predictions for the test set, the whole procedure being repeated 25 times using different randomly selected draws. The key measure was the difference between the percentage of cases falling below the estimated 95% CL for the test dataset and the target percentage of 5%, denoted DEV. The smaller the mean value of DEV, the more accurate the method. The smaller the SD of DEV, the more consistent the method. Figure 6 shows box plots of the values of DEV for each method and each ear, separately for each PTA sub-group and overall. Each small circle shows the results for one random draw of the fitting and test sets. DEV values falling below 0 indicate that the proportion of the PBmax values falling below the estimated 95% CL is greater than the target value of 5%.
Overall, the simulation method showed the poorest accuracy. The HD and nQR methods showed similar accuracy, except for the PTA sub-group with the highest PTA, for which the nQR method showed better accuracy than the HD method, especially for the left ear. A mixed analysis of variance (ANOVA) was performed on the DEV values with ear and method as within-subject factors and the PTA sub-group as a between-subject factor. The Greenhouse–Geissler correction was applied to the degrees of freedom when the condition of sphericity was not met, but the uncorrected degrees of freedom are reported. There were significant main effects of method [F(2, 192) = 18.1, p < 0.0001, η 2 = 0.045], ear [F(1, 96) = 34.0, p < 0.0001, η 2 = 0.036] and PTA sub-group [F(3, 96) = 16.7, p < 0.0001, η 2 = 0.32]. There were significant two-way interactions of ear and method [F(2, 192) = 13.8, p < 0.001, η 2 = 0.033], method and PTA sub-group [F(6, 192) = 29.7, p < 0.0001, η 2 = 0.189], and ear and PTA sub-group [F(3, 96) = 7.1, p < 0.0001, η 2 = 0.023]. The three-way interaction of method, PTA sub-group, and ear was also significant [F(6, 192) = 11.1, p < 0.0001, η 2 = 0.077]. The main interest here is the effect of the method. Averaged across PTA sub-groups and ears, the values of DEV were −3.2, −1.8, and −1.1% for the simulation, HD, and nQR methods, respectively. Pair-wise comparisons with Bonferroni correction showed that there were significant differences between the simulation method and the HD (t = −3.7, p < 0.001) and nQR (t = −5.9, p < 0.001) methods, but no significant difference between the nQR and HD methods (t = −2.1, p = 0.098).
A second ANOVA was conducted based on the overall DEV values with within-subject factors ear and method. This showed significant main effects of method [F(2, 48) = 18.5, p < 0.0001, η 2 = 0.165] and ear [F(1, 24) = 14.6, p < 0.001, η 2 = 0.093]. The two-way interaction between method and ear was also significant [F(2, 48) = 4.9, p = 0.01, η 2 = 0.077]. Averaged across ears, the mean values of DEV were −2.7, −0.9, and −1.1 for the simulation, HD, and nQr methods, respectively. Pair-wise comparisons with Bonferroni correction showed that there were significant differences between the simulation method and the HD (t = −5.8, p < 0.001) and nQR methods (t = −4.3, p < 0.001), but no significant difference between the nQR and HD methods (t = 1.54, p = 0.12).
The SDs of the values of DEV provide a measure of consistency. Consider first the results for the PTA sub-groups. For the simulation method, the SD ranged from 2.3 to 6.3, with a mean of 3.9. For the HD method, the SD ranged from 1.9 to 6.6, with a mean of 3.1. For the nQR method, the SD ranged from 2.5 to 6.5, with a mean of 4.0. Based on variance-ratio (F) tests, the differences in SD across all the pairs of methods were not significant. For the overall data averaged across ears, the values of the SDs of the DEV values were 2.4, 1.6, and 2.2 for the simulation, HD, and nQr methods, respectively. The differences in SD across all the pairs of methods were not significant.
It can be concluded that the three methods had similar consistency but both the HD and nQR methods had greater accuracy than the simulation method. The HD and nQR methods showed similar accuracy, but the nQR method gave smaller DEV values for the PTA sub-group 67.6–90 dB HL, which had the smallest number of participants. Thus, the nQr method is preferable over the HD method.

3.3. Effect of Sub-Group Formation on the Outcome of the Simulation and HD Methods

The results of the simulation and HD methods depend on how the ears are assigned to PTA groups, whereas the nQR method does not depend on the formation of sub-groups. To assess the effect of grouping criteria on the 95% CL obtained using the simulation and HD methods, four sets of PTA groups were used, as shown in Table 4; these are denoted Group 1 to Group 4. The 95% CLS and 95% CLHD for each sub-group for each grouping method were generated and 95% CL was estimated as a continuous function of the PTA for each grouping method using Equation (1). These analyses were based on the entire dataset.
Fitted functions obtained using the simulation method for each of the four grouping methods are plotted in the lower panels of Figure 7, separately for the right and left ears. Group 1 is the same as used previously in the analysis of the simulation and HD methods. The differences between Group 1 and Groups 2–4 are plotted in the upper panels of Figure 7. The differences range from about −4 to 4 percentage points, with RMS differences ranging from 0.53 to 2.3 percentage points for the right ear, and from 1.2 to 1.6 percentage points for the left ear. This is unfortunate, because it is not obvious what grouping is the most appropriate one to use. For Groups 1 to 4, approximately 6.8%, 6.3%, 7.3%, and 7.8% of the PBmax values fell below the estimated 95% CLS, respectively, for the right ears and approximately 6.7%, 7.1%, 8.1%, and 8.2% of the PBmax values fell below the estimated 95% CLS for the left ears. Thus, regardless of grouping, the percentage of PBmax values below the estimated CLS values was greater than the target value of 5%.
The fitted functions obtained using the HD method for each of the four PTA grouping methods are plotted in the lower panels of Figure 8, separately for the right and left ears. The differences between Group 1 and Groups 2–4 are plotted in the upper panels of Figure 8. The differences range from about −3 to 10 percentage points, with the RMS differences ranging from 2.0 to 3.7 percentage points for the right ear, and from 1.3 to 3.0 percentage points for the left ear. For Groups 1 to 4, approximately 5.4%, 5.3%, 4.9%, and 5.3% of the PBmax values fell below the estimated 95% CLHD, respectively, for the right ears and approximately 4.9%, 4.8%, 4.9%, and 4.8% of the PBmax values fell below the estimated 95% CLHD for the left ears. Thus, regardless of grouping, the percentage of PBmax values below the estimated CLHD values was close to the target value of 5%.

4. Discussion

The first objective of this study was to compare the accuracy of the three methods. The 95% CL values estimated using HD and nQR methods were reasonably accurate overall, since for the entire dataset, approximately 5% of the PBmax values fell below the estimated 95% CL values for PTA values up to 76 dB HL, and for the analysis based on 25 random draws of 50% of the data, the DEV values were close to zero; mean values of DEV were −1.8 and −1.1% for the HD and nQR methods, respectively. For the simulation method, the percentage of the PBmax values falling below the estimated 95% CL values was greater than the target value of 5%; the accuracy was poorer. The mean value of DEV was −3.2 (corresponding to 8.2% of the cases falling below the estimated 95% CLS), more than double the value for the HD and nQR methods. The relatively poor accuracy of the simulation method may be related to the sub-grouping, to the small number of samples in some sub-groups, and to the observation that the PBmax values within each PTA group were often not normally distributed [17,20,36].
It should be noted that Dubno, Lee, Klein, Matthews, and Lam [7] reported that about 5% of the PBmax values fell below the estimated 95% CLS values. The difference from our results may be related to the sub-groups used; Dubno et al. used the PTA groups of variable width so as to obtain a reasonable number of participants within each group, but even so, some groups had a relatively small number of participants. Small numbers in each group can result in sampling errors and inaccurate estimates of a variety of measures [37,38,39]. Another possible contributor to the difference is that the SDs of the PBmax values here were estimated directly from the data, whereas Dubno et al. estimated SDs as 1.62 times the SDs estimated from the binomial distribution.
The second objective of the present study was to compare the consistency of the three methods. This was accomplished by comparing the SDs of the DEV values for the three methods. The SDs did not differ significantly across methods, indicating similar consistency. The third objective was to assess the effect of the formation of sub-groups, which is required for the simulation and HD methods. For the simulation method, the 95% CLS values depended somewhat on the way that the sub-groups were formed. While the overall percentage of ears falling below the 95% CLS curve did not vary markedly depending on the grouping method, the shape of the 95% CLS curve did vary with the grouping method, the difference being most pronounced between Groups 1 and 4 (see Figure 7). For the HD method, the shapes of the 95% CLHD curves also depended somewhat on the sub-group formation (see Figure 8). This is problematic, as there is no obvious way of deciding on the optimal grouping. The nQR method has the advantage that it does not depend on the arbitrary formation of sub-groups.
Overall, the nQR method seems to be preferable to the simulation or HD methods because (1) the nQR method does not depend on the formation of PTA sub-groups; (2) the percentage of ears falling below the 95% CLQR curve was close to 5% for the nQR method, whereas it was above 5% for the simulation method.
In diagnostic audiology, PBmax scores may be useful for identifying specific disorders, such as central disorders or auditory neuropathy. For example, Rance [40] compared the PBmax scores of participants with auditory neuropathy as a function of PTA with the norms provided by Yellin, Jerger, and Fifer [11]. Approximately 60% of the participants had PBmax scores below the norms. It may also be the case that changes in PBmax scores over time are more effective than HTLs in identifying changes in hearing resulting from an underlying disorder.
Unaided PBmax scores can provide useful information for the fitting of hearing aids and for counseling patients and their relatives about the likely benefits of hearing aids. Kirkwood [41] reported that 92% of the audiologists surveyed used some type of speech audiometric procedure. It is generally believed that individuals with hearing loss who have high PBmax scores are likely to derive good benefits from the amplification provided by hearing aids, whereas those with very low PBmax scores are likely to gain lower benefits, or at least to achieve lower overall aided performance. Some investigators have shown that hearing aid usage and satisfaction are significantly related to unaided PBmax values [42,43,44,45], whereas other investigators have shown no or only a weak relationship [46,47,48,49].
Most studies assessing the relationship of unaided PBmax scores with hearing aid outcome measures have employed simple correlation or regression to assess the relationship. It might be more appropriate to employ nQR for studying the relationship. For example, individuals could be classified based on the quantile range that their PBmax scores fall into given their PTA, for example, above 75%, between 75% to 50%, between 50% to 25%, and falling below 25%. It can be expected that those who fall in the top quartile will gain considerable benefit from hearing aids, while those falling in the bottom quartile will receive less benefit.

5. Conclusions

The nQR and HD methods are more accurate than the simulation method in estimating the lower 95% CL for PBmax as a function of PTA. Furthermore, the nQR method does not depend upon the division of participants into sub-groups based on their PTA. Based on the nQR method, for the Kannada nonsense syllables used here, the lower 95% CL for PBmax is about 86% correct when the PTA is 0 dB HL, 55% when the PTA is 50 dB HL, and 27% when the PTA is 70 dB HL.

Author Contributions

V.K.N.: conceptualization, methodology, software, validation, formal analysis, and writing—original draft preparation; D.M. and S.D.A.: investigations, data curation, and resources; S.J.: writing—original draft preparation, writing—review and editing, and formal analysis; S.K.R.: funding acquisition, project administration, and visualization; K.Y. and A.A.: supervision, project administration, and visualization; B.C.J.M.: writing—review and editing, supervision, and project administration. All authors have read and agreed to the published version of the manuscript.

Funding

The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through Large Research Project under grant number RGP2/471/45.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board (or Ethics Committee) of King Khalid University (ECM#2023-2130, date of approval was 30 May 2023).

Informed Consent Statement

Informed consent was obtained from all the subjects involved in the study and written informed consent was obtained from the patient(s) to publish this paper.

Data Availability Statement

Data are available with the corresponding author mentioned in this research paper.

Acknowledgments

The authors thank the Department of Audiology, JSS Institute of Speech and Hearing and the Department of Medical Rehabilitation Sciences, King Khalid University, for continuous encouragement and support and the participants for their co-operation. The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through Large Research Project under grant number RGP2/471/45.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Martin, F.; Morris, L. Current audiologic practices in the United States. Hear. J. 1989, 42, 25–44. [Google Scholar]
  2. DeBow, A.; Green, W.B. A Survey of Canadian Audiological Practices: Pure Tone and Speech Audiometry Enquete sur les pratiques en audiologie au Canada: Audiometrie tonale liminaire et vocale. J. Speech Lang. Pathol. Audiol. 2000, 24. [Google Scholar]
  3. Martin, F.N.; Champlin, C.A.; Chambers, J.A. Seventh survey of audiometric practices in the United States. J. Am. Acad. Audiol. 1998, 9, 95–104. [Google Scholar] [PubMed]
  4. McArdle, R.; Hnath-Chisolm, T. Speech audiometry. In Handbook of Clinical Audiology, 7th ed.; Katz, J., Burkard, R.F., Medwetsky, L., Hood, L., Eds.; Lippincott Williams & Wilkins: Baltimore, MD, USA, 2010; pp. 61–76. [Google Scholar]
  5. Gelfand, S. Pure Tone Audiometry. In Essentials of Audiology, 4th ed.; Thieme Medical Publishers: New York, NY, USA, 2016. [Google Scholar]
  6. Schlauch, R.S.; Nelson, P. Puretone Evaluation. In Handbook of Clinical Audiology, 7th ed.; Katz, J., Chasin, M., English, K.M., Hood, L.J., Tillery, K.L., Eds.; Lippincott Williams & Wilkins: Philadelphia, PA, USA, 2015; pp. 29–47. [Google Scholar]
  7. Dubno, J.R.; Lee, F.S.; Klein, A.J.; Matthews, L.J.; Lam, C.F. Confidence limits for maximum word-recognition scores. J. Speech Lang. Hear. Res. 1995, 38, 490–502. [Google Scholar] [CrossRef] [PubMed]
  8. Narne, V.K.; Möller, S.; Wolff, A.; Houmøller, S.S.; Loquet, G.; Hammershøi, D.; Schmidt, J.H. Confidence Limits of Word Identification Scores Derived Using Nonlinear Quantile Regression. Trends Hear. 2021, 25, 2331216520983110. [Google Scholar] [CrossRef] [PubMed]
  9. Mueller, H.G.; Hornsby, B.W.Y. 20Q: Word recognition testing—Let’s just agree to do it right! Audiol. Online 2020, 26478. [Google Scholar]
  10. Thibodeau, L.M. Speech audiometry. In Audiology: Diagnosis, 2nd ed.; Roeser, R.J., Michael, V., Hosford-Dunn, H., Eds.; Thieme: New York, NY, USA, 2007; pp. 288–313. [Google Scholar]
  11. Yellin, M.W.; Jerger, J.; Fifer, R.C. Norms for disproportionate loss in speech intelligibility. Ear Hear. 1989, 10, 231–234. [Google Scholar] [CrossRef]
  12. Margolis, R.H.; Wilson, R.H.; Saly, G.L. Clinical Interpretation of Word-Recognition Scores for Listeners with Sensorineural Hearing Loss: Confidence Intervals, Limits, and Levels. Ear Hear. 2023, 44, 1133–1139. [Google Scholar] [CrossRef] [PubMed]
  13. Causey, G.D.; Hermanson, C.L.; Hood, L.J.; Bowling, L.S. A comparative evaluation of the Maryland NU 6 auditory test. J. Speech Hear. Disord. 1983, 48, 62–69. [Google Scholar] [CrossRef]
  14. Wilcox, R.R. Introduction to Robust Estimation and Hypothesis Testing, 5th ed.; Elsevier Science: San Diego, CA, USA, 2021. [Google Scholar]
  15. Harris, E.K.; Boyd, J.C. Statistical Bases of Reference Values in Laboratory Medicine; CRC Press LLC: Boca Raton, FL, USA, 2019. [Google Scholar]
  16. Harrell, F.E.; Davis, C.E. A New Distribution-Free Quantile Estimator. Biometrika 1982, 69, 635–640. [Google Scholar] [CrossRef]
  17. Dielman, T.; Lowry, C.; Pfaffenberger, R. A comparison of quantile estimators. Commun. Stat. Simul. Comput. 1994, 23, 355–371. [Google Scholar] [CrossRef]
  18. Feng, X.; He, X.; Hu, J. Wild Bootstrap for Quantile Regression; Oxford University Press: Oxford, UK, 2011; Volume 98, pp. 995–999. [Google Scholar]
  19. Koenker, R.; Park, B.J. An interior point algorithm for nonlinear quantile regression. J. Econom. 1996, 71, 265–283. [Google Scholar] [CrossRef]
  20. Parrish, R.S. Comparison of Quantile Estimators in Normal Sampling. Biometrics 1990, 46, 247–257. [Google Scholar] [CrossRef]
  21. Gerke, O. Nonparametric Limits of Agreement in Method Comparison Studies: A Simulation Study on Extreme Quantile Estimation. Int. J. Environ. Res. Public Health 2020, 17, 8330. [Google Scholar] [CrossRef]
  22. Carhart, R.; Jerger, J. Preferred method for clinical determination of pure-tone thresholds. J. Speech Hear. Disord. 1959, 24, 330–345. [Google Scholar] [CrossRef]
  23. American Speech-Language-Hearing Association. Determining threshold level for speech (No. GL1988-00008). ASHA 1988, 30, 85–89. [Google Scholar] [CrossRef]
  24. Mayadevi; Vyasamurthy, M. N. Development and Standardization of a Common Speech Discrimination Test for Indians; University of Mysore: Mysore, India, 1974. [Google Scholar]
  25. Thakor, H. South African Audiologists’ Use of Speech-in-Noise Testing for Adults with Hearing Difficulties. Master’s Thesis, University of the Witwatersrand, Johannesburg, South Africa, 2020. [Google Scholar]
  26. Myles, A.J. The clinical use of Arthur Boothroyd (AB) word lists in Australia: Exploring evidence-based practice. Int. J. Audiol. 2017, 56, 870–875. [Google Scholar] [CrossRef]
  27. Parmar, B.J.; Rajasingam, S.L.; Bizley, J.K.; Vickers, D.A. Factors Affecting the Use of Speech Testing in Adult Audiology. Am. J. Audiol. 2022, 31, 528–540. [Google Scholar] [CrossRef]
  28. Roeser, R.J.; Clark, J.L. Clinical Masking. In Audiology: Diagnosis, 2nd ed.; Roeser, R.J., Michael, V., Hosford-Dunn, H., Eds.; Thieme: New York, NY, USA, 2007; pp. 602–658. [Google Scholar]
  29. R Core Team. R: A Language and Environment for Statistical Computing; 4.2.1 (2022); R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
  30. RStudio Team. RStudio: Integrated Development for R, 2022.12.0+353; RStudio, Inc: Boston, MA, USA, 2022. [Google Scholar]
  31. Koenker, R. Quantreg: Quantile Regression in R Package (Version 5.95), R Package Version 5.51; 2023. Available online: https://cran.r-project.org/web/packages/quantreg/quantreg.pdf (accessed on 8 June 2024).
  32. Wilcox, R.R.; Schönbrodt, F.D. The WRS Package for Robust Statistics in R (Version 0.40). 2022. Available online: https://rdrr.io/rforge/WRS/ (accessed on 8 June 2024).
  33. Dowd, K. Model Risk. In Measuring Market Risk; Wiley: Hoboken, NJ, USA, 2005; pp. 351–363. [Google Scholar] [CrossRef]
  34. Koenker, R.; Machado, J.A.F. Goodness of Fit and Related Inference Processes for Quantile Regression. J. Am. Stat. Assoc. 1999, 94, 1296–1310. [Google Scholar] [CrossRef]
  35. Xu, Q.; Niu, X.; Jiang, C.; Huang, X. The Phillips curve in the US: A nonlinear quantile regression approach. Econ. Model. 2015, 49, 186–197. [Google Scholar] [CrossRef]
  36. Hughey, R.L. A survey and comparison of methods for estimating extreme right tail-area quantiles. Commun. Stat. Theory Methods 1991, 20, 1463–1496. [Google Scholar] [CrossRef]
  37. Lê Cook, B.; Manning, W.G. Thinking beyond the mean: A practical guide for using quantile regression methods for health services research. Shanghai Arch. Psychiatry 2013, 25, 55–59. [Google Scholar] [CrossRef] [PubMed]
  38. Selvin, S. Two issues concerning the analysis of grouped data. Eur. J. Epidemiol. 1987, 3, 284–287. [Google Scholar] [CrossRef] [PubMed]
  39. Robinson, W. Ecological Correlations and the Behavior of Individuals*. Int. J. Epidemiol. 2009, 38, 337–341. [Google Scholar] [CrossRef] [PubMed]
  40. Rance, G. Auditory neuropathy/dys-synchrony and its perceptual consequences. Trends Amplif. 2005, 9, 1–43. [Google Scholar] [CrossRef] [PubMed]
  41. Kirkwood. When it comes to hearing aids, “more” was the story on ‘04. Hear. J. 2005, 58, 28–37. [Google Scholar] [CrossRef]
  42. Chang, Y.S.; Choi, J.; Moon, I.J.; Hong, S.H.; Chung, W.H.; Cho, Y.S. Factors associated with self-reported outcome in adaptation of hearing aid. Acta Oto-Laryngol. 2016, 136, 905–911. [Google Scholar] [CrossRef] [PubMed]
  43. Brännström, K.J.; Lantz, J.; Nielsen, L.H.; Olsen, S. Prediction of IOI-HA scores using speech reception thresholds and speech discrimination scores in quiet. J. Am. Acad. Audiol. 2014, 25, 154–163. [Google Scholar] [CrossRef] [PubMed]
  44. Wang, X.; Zheng, Y.; Li, G.; Lu, J.; Yin, Y. Objective and Subjective Outcomes in Patients with Hearing Aids: A Cross-Sectional, Comparative, Associational Study. Audiol. Neurotol. 2022, 27, 166–174. [Google Scholar] [CrossRef]
  45. Dörfler, C.; Hocke, T.; Hast, A.; Hoppe, U. Speech recognition with hearing aids for 10 standard audiograms. HNO 2020, 68, 93–99. [Google Scholar] [CrossRef]
  46. Dornhoffer, J.R.; Meyer, T.A.; Dubno, J.R.; McRackan, T.R. Assessment of Hearing Aid Benefit Using Patient-Reported Outcomes and Audiologic Measures. Audiol. Neurotol. 2020, 25, 215–223. [Google Scholar] [CrossRef] [PubMed]
  47. Humes, L.E.; Wilson, D.L.; Humes, A.C. Examination of differences between successful and unsuccessful elderly hearing aid candidates matched for age, hearing loss and gender. Int. J. Audiol. 2003, 42, 432–441. [Google Scholar] [CrossRef] [PubMed]
  48. Fereczkowski, M.; Neher, T. Predicting Aided Outcome With Aided Word Recognition Scores Measured With Linear Amplification at Above-conversational Levels. Ear Hear. 2023, 44, 155–166. [Google Scholar] [CrossRef] [PubMed]
  49. Houmøller, S.S.; Wolff, A.; Möller, S.; Narne, V.K.; Narayanan, S.K.; Godballe, C.; Hougaard, D.D.; Loquet, G.; Gaihede, M.; Hammershøi, D.; et al. Prediction of successful hearing aid treatment in first-time and experienced hearing aid users: Using the International Outcome Inventory for Hearing Aids. Int. J. Audiol. 2022, 61, 119–129. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Distribution of the PBmax values for each PTA group, separately for the right ear (left side) and left ear (right side).
Figure 1. Distribution of the PBmax values for each PTA group, separately for the right ear (left side) and left ear (right side).
Diagnostics 14 01397 g001
Figure 2. Results using the simulation method for the right ear (left side) and the left ear (right side). The large symbols show the measured means (circles) and simulated means (up-pointing diamonds) and simulated lower 95% CLS (down-pointing triangles) for each PTA sub-group. The long-dashed curves and short-dashed curves show functions fitted to the measured and simulated means, respectively, for the nine PTA sub-groups. The solid curves show functions fitted to the 95% CLS values for the nine PTA sub-groups. The small green filled triangles show individual PBmax values below the 95% CLS and small open circles show PBmax values above the 95% CLS.
Figure 2. Results using the simulation method for the right ear (left side) and the left ear (right side). The large symbols show the measured means (circles) and simulated means (up-pointing diamonds) and simulated lower 95% CLS (down-pointing triangles) for each PTA sub-group. The long-dashed curves and short-dashed curves show functions fitted to the measured and simulated means, respectively, for the nine PTA sub-groups. The solid curves show functions fitted to the 95% CLS values for the nine PTA sub-groups. The small green filled triangles show individual PBmax values below the 95% CLS and small open circles show PBmax values above the 95% CLS.
Diagnostics 14 01397 g002
Figure 3. Median and 95% CLHD values estimated using the Harrell–Davis method for the right ear (left side) and the left (right side). The large filled symbols show the medians (circles) and lower 95% CLHD values (down-pointing triangles) for each PTA sub-group. The dashed curves show the functions fitted to the medians for the nine PTA sub-groups. The solid curves show the functions fitted to the 95% CLHD values for the nine PTA sub-groups. The small green filled triangles show individual PBmax values below the estimated 95% CLHD curve and small open circles show PBmax values above the 95% CLHD curve.
Figure 3. Median and 95% CLHD values estimated using the Harrell–Davis method for the right ear (left side) and the left (right side). The large filled symbols show the medians (circles) and lower 95% CLHD values (down-pointing triangles) for each PTA sub-group. The dashed curves show the functions fitted to the medians for the nine PTA sub-groups. The solid curves show the functions fitted to the 95% CLHD values for the nine PTA sub-groups. The small green filled triangles show individual PBmax values below the estimated 95% CLHD curve and small open circles show PBmax values above the 95% CLHD curve.
Diagnostics 14 01397 g003
Figure 4. Results using the nQR method for the right ear (left side) and the left ear (right side). The dashed lines show the fitted median and the solid lines show the fitted lower 95% CLQR. The small green filled triangles show individual PBmax values below the estimated 95% CL and the small open circles show PBmax values above the 95% CL.
Figure 4. Results using the nQR method for the right ear (left side) and the left ear (right side). The dashed lines show the fitted median and the solid lines show the fitted lower 95% CLQR. The small green filled triangles show individual PBmax values below the estimated 95% CL and the small open circles show PBmax values above the 95% CL.
Diagnostics 14 01397 g004
Figure 5. The lower panels compare the lower 95% CL estimated using the simulation method (dashed green line), the Harrell–Davis method (red dotted line), and the nQR method (black solid line) for the right ear (left side) and the left ear (right side). The upper panels show the differences between the simulation method and the two other methods.
Figure 5. The lower panels compare the lower 95% CL estimated using the simulation method (dashed green line), the Harrell–Davis method (red dotted line), and the nQR method (black solid line) for the right ear (left side) and the left ear (right side). The upper panels show the differences between the simulation method and the two other methods.
Diagnostics 14 01397 g005
Figure 6. Box plots of the difference (DEV) between the percentage of cases falling below the estimated 95% CL for the test dataset and the target percentage of 5% for each of the three methods. The results are shown for the overall data and for each PTA sub-group, separately for the right ear (left side) and left ear (right side).
Figure 6. Box plots of the difference (DEV) between the percentage of cases falling below the estimated 95% CL for the test dataset and the target percentage of 5% for each of the three methods. The results are shown for the overall data and for each PTA sub-group, separately for the right ear (left side) and left ear (right side).
Diagnostics 14 01397 g006
Figure 7. The lower panels show lower 95% CLS estimated using the simulation method for the four PTA groups for the right ear (left two panels) and the left ear (right two panels). The upper panels show differences for specific pairs of PTA groups.
Figure 7. The lower panels show lower 95% CLS estimated using the simulation method for the four PTA groups for the right ear (left two panels) and the left ear (right two panels). The upper panels show differences for specific pairs of PTA groups.
Diagnostics 14 01397 g007
Figure 8. The lower panels show lower 95% CLHD estimated using the Harrell–Davis method for the four PTA groups for the right ear (left two panels) and the left ear (right two panels). The upper panels show differences for specific pairs of PTA groups.
Figure 8. The lower panels show lower 95% CLHD estimated using the Harrell–Davis method for the four PTA groups for the right ear (left two panels) and the left ear (right two panels). The upper panels show differences for specific pairs of PTA groups.
Diagnostics 14 01397 g008
Table 1. The table shows, for each PTA group, the number of participants (N), and for the right ear only, the mean PTA, and, for the PBmax values, the mean, SD, skewness, and kurtosis for the measured data, and the mean, SD, 95% CL, standard error (SE) of the 95% CL, and the percentage of PBmax values below each 95% CL value for the simulation method and the Harrell–Davis method.
Table 1. The table shows, for each PTA group, the number of participants (N), and for the right ear only, the mean PTA, and, for the PBmax values, the mean, SD, skewness, and kurtosis for the measured data, and the mean, SD, 95% CL, standard error (SE) of the 95% CL, and the percentage of PBmax values below each 95% CL value for the simulation method and the Harrell–Davis method.
PTA RangeNPTA
(dB HL)
Measured DataSimulation MethodHarrell–Davis
MeanSDSkewnessKurtosisMeanSD95% CLSSE of 95% CLSPercent below 95% CLSMedian95% CLHDSE of 95% CLHDPercent below 95% CLHD
<1515110.296.25.6−1.33.894.64.1880.959.3 100.0 84.1 0.15 3.7
16–2511520.893.08.9−2.413.190.86.680 1.8 3.5 95.0 78.0 0.45 3.6
26–359630.189.98.4−1.04.788.46.676 1.8 6.3 90.0 74.1 0.46 3.4
36–459140. 884.313.2−1.24.982.010.264 2.9 6.6 85.1 56.8 0.48 4.5
46–556950.781.214.4−1.24.178.911.960 3.7 7.3 83.5 49.3 0.19 5.5
56–655860.272.315.3−0.42.370.912.348 4.3 5.2 72.5 45.7 0.32 5.1
66–753270.850.821.70.42.350.816.024 8.1 3.1 46.0 20.9 0.17 6.3
76–851481.344.623.50.92.845.416.516 13.8 0.0 42.3 21.1 0.19 15.4
86–1201691.221.620.50.31.725.314.58 10.8 13.3 21.4 0.1 0.0 13.3
Table 2. The table shows, for each PTA group, the number of participants (N), and for the left ear only, the mean PTA, and, for PBmax values, the mean, SD, skewness and kurtosis for the measured data, and the mean, SD, 95% CL, standard error (SE) of the 95% CL, and the percentage of PBmax values below each 95% CL value for the simulation method and the Harrell–Davis method.
Table 2. The table shows, for each PTA group, the number of participants (N), and for the left ear only, the mean PTA, and, for PBmax values, the mean, SD, skewness and kurtosis for the measured data, and the mean, SD, 95% CL, standard error (SE) of the 95% CL, and the percentage of PBmax values below each 95% CL value for the simulation method and the Harrell–Davis method.
PTA RangeNPTA
(dB HL)
Measured DataSimulation MethodHarrell–Davis
MeanSDSkewnessKurtosisMeanSD95% CLSSE of 95% CLSPercent below 95% CLSMedian95% CLHDSE of 95% CHDPercent below 95% CLHD
<1512612.196.25.5−1.55.094.64.188 1.1 6.3 100.0 84.7 0.06 4.0
16–2514719.993.97.8−2.210.991.95.880 1.4 8.2 95.0 79. 9 0.45 3.4
26–3510430.390.19.3−0.93.988.27.276 1.9 4.8 91.4 74.1 0.38 4.8
36–458340.885.811.8−1.55.683.79.268 2.7 6.0 89.0 60.1 0.25 4.8
46–558450.680.414.7−1.03.578.011.556 3.4 10.7 82.5 49.1 0.52 3.5
56–654861.269.318.0−0.53.367.414.040 5.5 6.3 70.0 35.3 0.55 6.3
66–751670.661.616.20.01.960.813.536 8.6 6.3 61.4 27.6 0.09 6.3
76–851880.742.815.61.65.743.213.320 7.7 0.0 37.7 18.6 0.03 5.6
86–1201591.321.722.91.02.826.215.78 12.5 13.3 21.2 1.1 0.00 13.3
Table 3. Parameter values (with standard errors in parentheses) of the functions fitted to the mean measured data, the mean simulated data, and the simulated 95% CL for each PTA group using the simulation method, and the median and 95% CL for the Harrell–Davis and the nonlinear QR methods. R2 and R1 values are measures of the goodness of fit. The results for the right and left ears are shown in the top and bottom halves, respectively.
Table 3. Parameter values (with standard errors in parentheses) of the functions fitted to the mean measured data, the mean simulated data, and the simulated 95% CL for each PTA group using the simulation method, and the median and 95% CL for the Harrell–Davis and the nonlinear QR methods. R2 and R1 values are measures of the goodness of fit. The results for the right and left ears are shown in the top and bottom halves, respectively.
Right Ear
Simulation methodHarrell–DavisNonlinear QR
β1β2β3R2 β1β2β3R2 β1β2β3R1
Mean (Measured)96.3 (2.7)0.0000312 (0.000051)2.17 (0.27)0.98
Mean (Simulated)94.1 (2.7)0.0000978 (0.00018)1.91 (0.27)0.98Median100 (4.37)0.000154
(0.00023)
1.82 (0.34)0.97Median100 (1.2)0.0000900 (0.000060)1.96 (0.15)0.67
95% CIS91.8 (6.9)0.00182 (0.0024)1.33 (0.31)0.9795% CLHD92.07 (7.73)0.00613 (0.00063)1.04 (0.23)0.9895% CLQR88.8 (4.6)0.00216 (0.0030)1.29 (0.36)0.76
Left Ear
Simulation methodHarrell–DavisNonlinear QR
β1β2β3R2 β1β2β3R2 β1β2β3R1
Mean (Measured)95.8 (1.9)0.0000241 (0.000012)2.23 (0.10)0.99
Mean (Simulated)94.1 (1.3)0.0000694 (0.000044)1.97 (0.14)0.98Median100
(4.53)
0.000159
(0.00024)
1.82 (0.34)0.97Median99.7 (0.8)0.0000500 (0.00001)2.06 (0.05)0.64
95% CIS93.8 (3.9)0.00348 (0.0021)1.16 (0.14)0.9995% CLHD97.1 (4.68)0.0103
(0.0051)
0.92
(0.11)
0.9995% CLQR93.2 (5.1)0.00536 (0.0057)1.08 (0.24)0.78
Table 4. PTA groups based on four sub-grouping criteria. The table shows: the number of participants (N) in each PTA group and the mean PTA for each group in dB HL for the right ears (top) and left ears (bottom).
Table 4. PTA groups based on four sub-grouping criteria. The table shows: the number of participants (N) in each PTA group and the mean PTA for each group in dB HL for the right ears (top) and left ears (bottom).
Right Ear
Group 1Group 2Group 3Group 4
PTA RangeNMeanPTA RangeNMeanPTA RangeNMeanPTA RangeNMean
<1515110.4<10787.5<10787.5<1515110.4
16–2511520.811–2013615.811–157313.516–3016923.1
26–359630.421–3010626.016–206318.431–4513338.4
36–459140.731–408936.021–255223.746–6010453.0
46–556950.541–507945.326–305428.261–755567.6
56–655860.251–606955.631–354233.476–1203086.6
66–753270.761–704165.736–404738.4
76–851481.671–802074.341–454443.1
86–1201690.881–1202488.646–503548.0
51–553453.0
56–603558.0
61–704165.7
71–802074.3
81–1202488.6
Left Ear
Group 1Group 2Group 3Group 4
PTA RangeNMeanPTA RangeNMeanPTA RangeNMeanPTA RangeNMean
<1512612.1<10308.1<10308.1<1512612.1
16–2514719.911–2018915. 611–159613.316–3020922.4
26–3510430.321–3011625.916–209317.931–4512538.2
36–458340.631–409536.421–255423.446–6010752.4
46–558450.641–507646.526–306228.261–754166.2
56–654861.251–606155.431–354233.576–1203485.0
66–751670.661–703364.436–405338.8
76–851980.771–801976.141–453043.7
86–1201591.281–1202388.746–504648.3
51–553853.4
56–602358.7
61–703364.4
71–801976.1
81–1202388.7
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Narne, V.K.; Mohan, D.; Avileri, S.D.; Jain, S.; Ravi, S.K.; Yerraguntla, K.; Almudhi, A.; Moore, B.C.J. Accuracy and Consistency of Confidence Limits for Monosyllable Identification Scores Derived Using Simulation, the Harrell–Davis Estimator, and Nonlinear Quantile Regression. Diagnostics 2024, 14, 1397. https://doi.org/10.3390/diagnostics14131397

AMA Style

Narne VK, Mohan D, Avileri SD, Jain S, Ravi SK, Yerraguntla K, Almudhi A, Moore BCJ. Accuracy and Consistency of Confidence Limits for Monosyllable Identification Scores Derived Using Simulation, the Harrell–Davis Estimator, and Nonlinear Quantile Regression. Diagnostics. 2024; 14(13):1397. https://doi.org/10.3390/diagnostics14131397

Chicago/Turabian Style

Narne, Vijaya Kumar, Dhanya Mohan, Sruthi Das Avileri, Saransh Jain, Sunil Kumar Ravi, Krishna Yerraguntla, Abdulaziz Almudhi, and Brian C. J. Moore. 2024. "Accuracy and Consistency of Confidence Limits for Monosyllable Identification Scores Derived Using Simulation, the Harrell–Davis Estimator, and Nonlinear Quantile Regression" Diagnostics 14, no. 13: 1397. https://doi.org/10.3390/diagnostics14131397

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop