Next Article in Journal
Feasibility and Safety of the Routine Distal Transradial Approach in the Anatomical Snuffbox for Coronary Procedures: The ANTARES Randomized Trial
Next Article in Special Issue
Accuracy Analysis of the Multiparametric Acoustic Voice Indices, the VWI, AVQI, ABI, and DSI Measures, in Differentiating between Normal and Dysphonic Voices
Previous Article in Journal
Acute Effect of Positive Airway Pressure on Heart Rate Variability in Obstructive Sleep Apnea
Previous Article in Special Issue
Determining the Mouth-to-Microphone Distance in Rigid Laryngoscopy: A Simple Solution Based on the Newly Measured Values of the Depth of Endoscope Insertion into the Mouth
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Validation of the Acoustic Breathiness Index in Speakers of Finnish Language

by
Elina Kankare
1,* and
Anne-Maria Laukkanen
2
1
Department of Rehabilitation and Psychosocial Support, Logopedics, Phoniatrics, Tampere University Hospital, 33520 Tampere, Finland
2
Speech and Voice Research Laboratory, Tampere University, 33100 Tampere, Finland
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2023, 12(24), 7607; https://doi.org/10.3390/jcm12247607
Submission received: 14 October 2023 / Revised: 27 November 2023 / Accepted: 7 December 2023 / Published: 10 December 2023
(This article belongs to the Special Issue New Advances in the Management of Voice Disorders)

Abstract

:
Breathiness (perception of turbulence noise in the voice) is one of the major components of hoarseness in dysphonic voices. This study aims to validate a multiparameter analysis tool, the Acoustic Breathiness Index (ABI), for quantification of breathiness in the speaking voice, including both sustained vowels and continuous speech. One hundred and eight speakers with dysphonia (28 M, 80 F, mean age 50, SD 15.4 years) and 87 non-dysphonic controls (18 M, 69 F, mean age 42, SD 14 years) volunteered as participants. They read a standard text and sustained vowel /a:/. Acoustic recordings were made using a head-mounted microphone. Acoustic samples were evaluated perceptually by nine voice experts of different backgrounds (speech therapists, vocologists and laryngologists). Breathiness (B) from the GRBAS scale was rated. Headphones were used in the perceptual analysis. The dysphonic and non-dysphonic speakers differed significantly from each other in the auditory perceptual evaluation of breathiness. A significant difference was also found for ABI, which had a mean value of 2.26 (SD 1.15) for non-dysphonic and 3.07 (SD 1.75) for dysphonic speakers. ABI correlated strongly with B (rs = 0.823, p = 0.01). ABI’s power to distinguish the groups was high (88.6%). The highest sensitivity and specificity of ABI (80%) was obtained at threshold value 2.68. ABI is a valid tool for differentiating breathiness in non-dysphonic and dysphonic speakers of Finnish.

1. Introduction

1.1. What Is Breathiness

Breathiness is a characteristic in many disordered voices [1,2]. It occurs in organic voice disorders as well as in functional or neurological voice disorders [1,3]. Some breathiness can also be heard in the softly produced healthy voice, especially in women [4,5]. Breathiness refers to the auditory perception of air turbulence and is caused by air leakage from the glottis [6,7,8].

1.2. Perceptual Tools to Detect Breathiness

Many auditory perceptual tools have been developed to evaluate voice quality and at the same time breathiness in the voice. These tools include, for example, the GRBAS scale [6], the Australian Perceptual Voice Profile [9], the Swedish Stockholm Voice Evaluation [10], the CAPE-V (Consensus Auditory Perceptual Evaluation of Voice) [8] and the Danish Dysphonia Assessment [11]. Although an experienced listener can estimate the amount of breathiness in the voice by perceptual analysis, this is a subjective estimation [12]. An objective measure to evaluate voice quality and the amount of breathiness is needed, especially in clinical work and in the rehabilitation of patients with varying voice disorders [13,14].

1.3. Acoustic Tools to Detect Breathiness

Various signal analysis methods have been applied to predict perceived breathiness from acoustic characteristics. All of the methods aim to measure the amount of periodic and non-periodic components in a sound signal. These methods include, e.g., harmonic-to-noise ratio (HNR) [15], noise-to-harmonic ratio (NHR), voice turbulence index and soft phonation index [16,17], signal periodicity, first harmonic amplitude and spectral tilt and cepstral peak prominence (CPP and its smoothed version CPPS) [1,18]. Periodicity or, rather, reduced periodicity with increased jitter and shimmer (i.e., irregular variation in period duration or amplitude, respectively) has been found to predict well perceived breathiness both in non-dysphonic and in dysphonic voices [1,18]. On the other hand, jitter and shimmer are characteristics that are related to irregular vocal fold vibration, whose main perceptual correlate is “roughness” [6]. Furthermore, jitter, shimmer and spectrum based measures of noise, like HNR, are affected by the pitch and intensity of the voice, which impairs their reliability in dysphonia detection [19,20,21]. CPPS, which is based on the spectrum of the logarithmic spectrum [18], is independent of pitch and intensity. It has been found to show highest correlations with perceived hoarseness, roughness and breathiness [22,23]. Thus, there seemed to be a need to develop an index that would be able to focus more on the acoustic characteristics of breathiness rather than those of roughness which refers to irregular vocal fold vibration, and to distinguish non-dysphonic and dysphonic breathiness. The Acoustic Breathiness Index (ABI) has been developed to meet these needs [12,24].

1.4. What Is ABI?

The ABI is a multidimensional method with nine separate acoustic measures for detecting breathiness in the voice. Measures used in the ABI are smoothed cepstral peak prominence (CPPs), jitter local (Jit), glottal-to-noise excitation ratio (GNE), high frequency noise of 6000 Hz (Hfno), harmonic-to-noise ratio of Dejonckere (HNR-D), the amplitude difference between the first two harmonics (H1-H2), two measures of shimmer (Shim dB and Shim%) and period standard deviation (PSD) [12]. CPPs is the distance between the cepstral peak that corresponds to the first harmonic and the point with equal quefrency (inverse of frequency) on the regression line through the smoothed cepstrum [12]. The higher the value of CPPs, the more periodic, i.e., the clearer and more noiseless, the sound is in terms of auditory perception. CPPs is affected by both turbulence noise and signal perturbation (jitter and shimmer). Jit, i.e., jitter local, is the mean difference between successive periods, divided by the average period length. GNE [25] indicates whether a voice signal originates from vocal fold vibrations or from turbulent noise. GNE is independent of jitter and shimmer. A clear, nonbreathy voice results in high GNE. Hfno (up to 6000 Hz) indicates the spectral level difference between the ranges of 0–6 kHz and 6–10 kHz. A breathy voice with more noise in the high-frequency range has a smaller Hfno. HNR-D from Dejonckere and Lebacq [26] analyses the harmonic structure against noise in the long-term average spectrum in the formant zone between 500 Hz and 1500 Hz. A cepstrum is calculated to determine F0. A higher value of HNR-D indicates a less breathy voice. H1–H2, i.e., the difference in level between the first two harmonics, is greater in breathy voices. Shimmer measures the amplitude perturbation through the difference between successive periods divided by the mean amplitude. The value is calculated both in dB and in percentages. PSD is a perturbation measure revealing the variation in the standard deviation of periods [12].
Since the ABI is calculated from both sustained vowels and continuous speech, it needs to be validated in different languages [24]. The ABI has so far been validated in eight different languages, Dutch [12], German [24], Japanese [27], Korean [28], Brazilian Portuguese [29], Spanish [30], South Indian [31] and Persian [32].

1.5. Aim and Research Questions of the Present Study

The present article introduces a study where we aimed to validate the ABI in a Finnish speaking population. In this study, we sought an answer to two main questions: (1) Is the ABI a valid robust method to distinguish dysphonic breathy voice quality in a Finnish speaking population? (2) What is the best threshold value for ABI analysis in a Finnish speaking population?

2. Materials and Methods

2.1. Participants

The present study applied the ABI and auditory perceptual evaluation of breathiness to 195 Finnish speaking participants. The voice material of this study is the same as in the validation study of the Acoustic Voice Quality Index version 03.01 in Finnish.

2.2. Dysphonic Participants

One hundred and eight dysphonic participants were volunteer patients in the phoniatric department in the university hospital. Twenty-eight of the patients were males (mean age 51 years, SD 13.8, range 19–75). Eighty of the patients were females (mean age 51 years, SD 16.2, range 19–84). Table 1 shows the diagnoses of the participants.

2.3. Non-Dysphonic Controls

Eighty-seven vocally healthy persons with no diagnosis of dysphonia participated as controls. Eighteen of the participants were males (mean age 49 years, SD 9.9, range 32–60), and 69 were females (mean age 40 years, SD 14.5, range 19–67). Seventy-nine of the healthy participants scored under 38 points on the VAPP questionnaire (Voice Activity and Participation Profile) [33] and eight of them scored over 38 which has been considered the threshold value for voice disorder [34]. However, all participants considered themselves to be vocally healthy.

2.4. Recordings

All voice samples were recorded with an AKG C544L (AKG, Vienna, Austria) head-mounted condenser microphone with the Focusrite iTrack Solo (Focusrite PLC, High Wycombe, England) audio interface and using Praat software (version 6.2.23) in the computer. The recording used a sampling frequency of 44.1 kHz and the amplitude resolution was 16 bits. The distance of the microphone was 4 cm from the right side of the corner of the mouth at an angle of 45 degrees. The distance and position of the microphone were checked for each participant by measuring the distance from the corner of the mouth with a ruler.
The voice material for the study was collected for patients and 49 vocally healthy participants in a quiet treatment room at Tampere University Hospital. Thirty-eight of the healthy voices were recorded in studio conditions at Tampere University. The mean signal-to-noise ratio of the recordings (i.e., the difference in level between the sample and that of the background noise level) was 39.8 dB (SD 5.6 dB). In all samples the SNR was well over the recommended norm of SNR > 30 dB, so it can be confirmed that the recording conditions were acceptable.

2.5. Voice Samples

As voice samples, the standard text “Pohjantuuli ja aurinko” (North wind and the sun) was read aloud and a sustained vowels [a:] was produced three times. The participants were asked to use a voice pitch and intensity that suited them best, and the length of the sustained vowel was suggested to be five seconds. The participants were asked to produce the vowel in a spoken manner rather than singing.
For the ABI analysis the first 31 syllables from the read text and three seconds from the middle of the second sustained vowel were used. In the Finnish AVQI validation study, it was confirmed that 31 syllables of Finnish language text readings correspond on average to three-second long vowels [35]. The confirmation of the 31 syllables was executed for Finnish language the same way as described when finding out the syllable count of the Dutch sample [36]. Therefore, the index to be obtained would consist of a balanced duration of speech and sustained vowel phonation. For the analysis, connected speech sample was marked “cs” and the three-second sustained vowel sample “sv”. The ABI analysis was executed with VOXplot Acoustic Voice Quality Analysis software, version 2.0.0 [12,37]. The equation to calculate the ABI in the VOXplot software was the one presented by Barsties v. Latoszek et al. 2017 [12]: ABI = (5.0447740915 − [0.172 × CPPs] − [0.193 × Jit] − [1.283 × GNEmax − 4500 Hz] − [0.396 × Hno − 6000 Hz] + [0.01 × HNR –D] + [0.017 × H1 − H2] + [1.473 × Shim − dB] − [0.088 × Shim] − [68.295 × PSD]) × 2.9257400394. The ABI analysis gives the result of an index value between 0 and 10, a value of 0 meaning that there is no breathy sound in the voice and the higher the index number, the breathier sound there is in the voice sample.

2.6. Auditory–Perceptual Analysis

In order to validate the ABI analysis of the Finnish language, a listening analysis was performed. Nine voice experts in the field of voice (three phoniatrician/otolaryngologists, three speech therapists and three vocologists) listened to the voice samples and gave their evaluation of B from the GRBAS scale [6]. The scale is from 0 to 3, 0 signifying “no breathiness at all” and 3 signifying “very much breathiness” in the voice. The listening samples consisted of 31 syllables of continuous speech from the beginning of the text reading and three seconds of a sustained vowel. The length of one sample was thus six seconds and there was a total of 220 samples. For the intra-rater reliability analysis, 25 samples were rated twice. Before the listening test, listeners’ ears were calibrated with the anchor voice samples [38,39]. In the calibration, there were two anchor voice samples for each category of the degree of breathiness 0–3, i.e., there were in total eight anchor samples. The anchor voice samples were selected from the voice material of the present study by one experienced speech therapist and one experienced vocologist. The listening test was conducted on each listener’s own computer with around-ear headphones. Listeners received the voice material and instructions for the listening analysis on a memory stick. They were asked to make a judgement from a combination of continuous speech and sustained vowel phonation and mark the results in an Excel table. During the listening analysis the raters were able to listen to the samples as many times as they felt necessary; moreover, they were asked to listen to the anchor voice samples at least once after every 32 samples. This was carried out to prevent the listeners from losing focus and straining their hearing too much. Reminders about the anchors were marked in the Excel table as was the instruction to take a short break after listening to 128 samples, about halfway through the task. The interval for listening to the anchors was chosen on the basis of previous listening analyses. The listening analysis took on average from two to three hours.

2.7. Statistical Analysis

The statistical analysis was conducted using SPSS for Windows version 26 (IBM Corp., Armonk, NY, USA). All the results were considered statistically significant at p ≤ 0.05. In the Finnish validation of the ABI, first the intra-rater reliability of the perceptual raters was analysed with the Cohen kappa (Cκ) and secondly the raters’ inter-rater reliability was analysed with the Fleiss kappa () [40]. Raters with intra-rater reliability ≥ 0.41 were selected for inclusion in the study. The inter-rater reliability between the perceptual rates was defined to be at least ≥0.41 [40]. Next in the validation process, the relationship between the mean values of ABI and the mean values of the perceptual evaluation of breathiness were tested with the Spearman’s rank order correlation coefficient (rs, r2). Finally, the diagnostic accuracy of the ABI was evaluated with ROC (receiver operating characteristic) curve. The diagnostic accuracy was evaluated according to the sensitivity of the ABI to distinguish between disordered voice and heathy voice, and specificity to detect voices without breathiness. A nonbreathy voice was defined as a voice that received a perceptual mean rating of B 0–0.49. Additionally, the ability of the ABI to distinguish between normal and dysphonic breathiness was evaluated by the area under ROC curve (AROC). The ROC curve and the Youden index were used to differentiate the best threshold level for the ABI to differentiate healthy and dysphonic voices in the Finnish language. Likelihood ratios (LR+ and LR−) were used to differentiate the probability of persons with breathy voice having ABI value above the threshold level (LR+) or persons with nonbreathy voice having ABI value below the threshold level. To define the optimal threshold level, both the positive and negative likelihood ratio and the sensitivity/(1 − specificity) and (1 − sensitivity/specificity were used).

3. Results

3.1. Reliability of the Perceptual Evaluation

The intra-rater reliability of the listening analysis of the breathiness in Ck was between 0.395 and 0.809. One rater, however, presented a Ck value lower than the acceptable 0.41 and was excluded from the analysis. The remaining eight raters reported Ck values between 0.451 and 0.809. This group of eight raters showed reasonable inter-rater reliability (Fk = 0.435) and therefore the mean of their listening analysis represents the auditory assessment of breathiness in this study. The mean distribution of the auditory perceptual rating is seen in Figure 1. It is possible to deduce from Figure 1 that some breathiness was also heard in some of the non-dysphonic voices.
The listeners in this study represented three different occupational groups: phoniatrician/otolaryngologists, speech therapists and vocologists. The vocologists’ evaluation differed significantly from that of the other two groups (Mann–Whitney U test, phoniatrician/otolaryngologists vs. vocologists p = 0.002, speech therapists vs. vocologists p = 0.000). Vocologists rated more breathiness in the voice than the raters in the other groups.

3.2. Results for the ABI and Perceptual Evaluation of Breathiness

ABI results correlated strongly with auditory perceptual rating of breathy voice quality (Spearman’s rho 0.823, p = 0.01) (Figure 2). Non-dysphonic and dysphonic groups differed significantly from each other in both ABI and perceptual results (Mann–Whitney test, p < 0.001) (Table 2).

3.3. Sensitivity and Specificity of ABI

The ability of the ABI to distinguish between breathy and nonbreathy voices was evaluated with ROC analysis. AROC = 0.886 (i.e., 88.6%) showed high discriminatory power to distinguish nonbreathy voices from breathy voices (Figure 3). The highest Youden’s index was 0.60, where the best sensitivity of 80% and specificity of 80% were obtained at the cut point value 2.68. In the likelihood ratio the statistical guideline values were not reached (likelihood ratios LR+ 4.00 and LR− 0.25). Table 3 shows the threshold values of the eight previous ABI validation studies and the cut-point value of the Finnish validation study, as well as the statistical values of sensitivity, specificity, likelihood ratios, and the correlations between the ABI and the perceptual evaluation.

4. Discussion

This study aimed to investigate whether the acoustic breathiness index (ABI) is a valid and robust method to distinguish dysphonic breathiness from healthy voices in a Finnish speaking population, and, if so, what the best threshold value for ABI would be. These research questions are important; while breathiness is one of the main characteristics in dysphonia and the first component of hoarseness [41], it is also frequently found in the non-dysphonic population. This requires a more focused, multi-parameter tool for the detection of true breathiness and to be able to distinguish dysphonic breathiness. Breathiness appears to be perceived better in females’ voices but reduced loudness of voice increases its presence in both genders [42]. It has been found to be related to perceptions of femininity and attractiveness in female voice quality [5,43], and it may also be related to attractiveness in male voices [44], although voice characteristics evoke different evaluations in different cultures [45,46].
Some breathiness was perceived in some of the healthy voices in the present study. This is to be expected, as breathiness is also a cultural characteristic. In particular, vocologists who work with normal and supranormal (trained) voices were more sensitive than clinicians (phoniatrician/otolaryngologists and speech therapists) in rating breathiness. The main reason for including raters from different professional groups was to get a larger distribution of evaluations, which would also take into account the existence of some breathiness in normal voices. Furthermore, breathiness was the only characteristic that was rated in the present study; thus, the raters had to focus on this particular characteristic, which the acoustic tool was also specifically developed to measure.
The results of the present study show that the dysphonic voices scored significantly higher both on perceived breathiness and ABI, although the dysphonic group included patients with very different diagnoses and thus with different acoustic characteristics. This suggests that ABI measures what it is intended to measure. Perceived breathiness correlated strongly with ABI (rs 0.823, p 0.01) suggesting the ecological validity of the index. The discriminatory power of the ABI was high (88.6), showing that the method successfully differentiated between the dysphonic and non-dysphonic groups. The highest sensitivity (80%) and specificity (80%) in differentiation was obtained at ABI = 2.68. This can be thus used as a threshold for the clinical analysis of breathiness in a Finnish speaking population. Other studies [12,24,27,28,29,30,31,32] have reported slightly higher threshold values than what was found in the present study (Table 3). The reason may be related to language and cultural differences [29]. The Finnish language has a high prevalence of vowels, and a lack of linguistically breathy vowels that occur for instance in Gujarati, Mon-Khmer and Jalapa Mazatec [47], or sonorous fricatives. Finns may be more sensitive in perceiving breathiness. It may be speculated whether there could be a connection to some earlier findings where breathy voice quality seemed to convey an impression of emotional instability and implausibility of the speaker among Finnish listeners [46,48].
The mean age in the dysphonic group of the present study was somewhat higher than that of the non-dysphonic group and, in both groups, females were in the majority. These characteristics reflect the clinical reality that dysphonic patients are typically not very young and that females form the majority of voice patients [49,50,51]. However, earlier findings have shown that the ABI is not significantly dependent on age or sex [22,27].
The use of only one perceptual variable in the listening evaluation may be seen as a limitation in the study since then the presence of other potential characteristics of hoarseness remain unknown. This is, however, the policy that other ABI validation studies have adopted [12,24,28,30,32]. Further study of the average ABI results for different diagnostic groups is warranted. This would require larger numbers of participants in different diagnostic groups. Such a study should also address further the capability of listeners and ABI to differentiate between breathiness and other components of dysphonia by including other perceptual variables than merely the B.

5. Conclusions

The present study showed that the ABI is a robust and valid tool for use with a Finnish-speaking population. It distinguishes well between healthy and dysphonic voices. The threshold value for breathiness in Finnish healthy and dysphonic voices was 2.68.

Author Contributions

E.K. and A.-M.L. contributed equally to the study and the article. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of Tampere University Hospital (R15014) 15 January 2015.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

Conflicts of Interest

The authors declare no conflict of interest. No funder had any role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Hillenbrand, J.M.; Houde, R.A. Acoustic Correlates of Breathy Vocal Quality: Dysphonic Voices and Continuous Speech. J. Speech Hear Res. 1996, 39, 311–321. [Google Scholar] [CrossRef] [PubMed]
  2. Colton, R.H.; Casper, J.K.; Leonard, R. Understanding Voice Problems: A Physiological Perspective for Diagnosis and Treatment.; Lippincott Willams & Wilkins: Baltimore, MD, USA, 2006. [Google Scholar]
  3. Dejonckere, P.H.; Obbens, C.; De Moor, G.M.; Wieneke, G.H. Perceptual Evaluation of Dysphonia: Reliability and Relevance. Folia Phoniatr. Logop. 1993, 45, 76–83. [Google Scholar] [CrossRef] [PubMed]
  4. Mendoza, E.; Valencia, N.; Muñoz, J.; Trujillo, H. Differences in Voice Quality between Men and Women: Use of the Long-Term Average Spectrum (LTAS). J. Voice 1996, 10, 59–66. [Google Scholar] [CrossRef] [PubMed]
  5. Van Borsel, J.; Janssens, J.; De Bodt, M. Breathiness as a Feminine Voice Characteristic: A Perceptual Approach. J. Voice 2009, 23, 291–294. [Google Scholar] [CrossRef] [PubMed]
  6. Hirano, M. Psycho-Acoustic Evaluation of Voice. In Book Disorders of Human Communication 5. Clinical Examination of Voice; Hirano, M., Arnold, G.E., Winckel, F., Wyke, B.D., Eds.; Springer: Vienna, Austria, 1981. [Google Scholar]
  7. Dejonckere, P.H.; Wieneke, G.H. Cepstra of Normal and Pathological Voices: Correlation with Acoustic, Aerodynamic and Perceptual Data. Adv. Clin. Phon. 1996, 6, 217–226. [Google Scholar]
  8. Kempster, G.B.; Verdolini Abbott, K.; Barkmeier-Kraemer, J.; Hillman, R. Consensus Auditory-Perceptual Evaluation of Voice: Development of a Standardized Clinical Protocol. Am. J. Speech Lang. Pathol. 2009, 8, 124–132. [Google Scholar] [CrossRef]
  9. Oates, J.; Russell, A. Learning Voice Analysis Using an Interactive Multi-Media Package: Development and Preliminary Evaluation. J. Voice 1998, 12, 500–512. [Google Scholar] [CrossRef]
  10. Hammarberg, B. Voice Research and Clinical Needs. Folia Phoniatr. Logop. 2000, 52, 93–102. [Google Scholar] [CrossRef]
  11. Iwarsson, J.; Bingen-Jakobsen, A.; Johansen, D.S.; Kølle, I.E.; Pedersen, S.G.; Thorsen, S.L.; Petersen, N.R. Auditory-Perceptual Evaluation of Dysphonia: A Comparison Between Narrow and Broad Terminology Systems. J. Voice 2018, 32, 428–436. [Google Scholar] [CrossRef]
  12. Latoszek, B.B.v.; Maryn, Y.; Gerrits, E.; De Bodt, M. The Acoustic Breathiness Index (ABI): A Multivariate Acoustic Model for Breathiness. J. Voice 2017, 31, 511.e11–511.e27. [Google Scholar] [CrossRef]
  13. Penido, F.A.; Gama, A.C.C. Accuracy Analysis of the Multiparametric Acoustic Indices AVQI, ABI, and DSI for Speech-Language Pathologist Decision-Making. J. Voice, 2023; in press. [Google Scholar] [CrossRef]
  14. Kankare, E.; Latoszek, B.B.v.; Maryn, Y.; Asikainen, M.; Rorarius, E.; Vilpas, S.; Rantala, L.; Laukkanen, A.M. The Acoustic Voice Quality Index Version 02.02 in the Finnish-Speaking Population. Logoped. Phoniatr. Vocol. 2020, 45, 49–56. [Google Scholar] [CrossRef]
  15. De Krom, G. A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals. J. Speech Lang. Hear Res. 1993, 36, 254–266. [Google Scholar] [CrossRef] [PubMed]
  16. Deliyski, D. Acoustic Model and Evaluation of Pathological Voice Production. In Proceedings of the Third European Conference on Speech Communication and Technology Eurospeech, Berlin, Germany, 22–25 September 1993; pp. 1969–1972. [Google Scholar]
  17. Bhuta, T.; Patrick, L.; Garnett, J.D. Perceptual Evaluation of Voice Quality and Its Correlation with Acoustic Measurements. J. Voice 2004, 18, 299–304. [Google Scholar] [CrossRef]
  18. Hillenbrand, J.M.; Cleveland, R.A.; Erickson, R.L. Acoustic Correlates of Breathy Vocal Quality. J. Speech Hear Res. 1994, 37, 769–778. [Google Scholar] [CrossRef] [PubMed]
  19. Glaze, L.E.; Bless, D.M.; Susser, R.D. Acoustic Analysis of Vowel and Loudness Differences in Children’s Voice. J. Voice 1990, 4, 37–44. [Google Scholar] [CrossRef]
  20. Orlikoff, R.F.; Kahane, J.C. Influence of Mean Sound Pressure Level on Jitter and Shimmer Measures. J. Voice 1991, 5, 113–119. [Google Scholar] [CrossRef]
  21. Brockmann-Bauser, M.; Bohlender, J.E.; Mehta, D.D. Acoustic Perturbation Measures Improve with Increasing Vocal Intensity in Individuals with and without Voice Disorders. J. Voice 2018, 32, 162–168. [Google Scholar] [CrossRef]
  22. Latoszek, B.B.v.; Maryn, Y.; Gerrits, E.; De Bodt, M. A Meta-Analysis: Acoustic Measurement of Roughness and Breathiness. J. Speech Lang. Hear Res. 2018, 61, 298–323. [Google Scholar] [CrossRef]
  23. Maryn, Y.; Roy, N.; De Bolt, M.; Van Cauwenberge, P.; Corthals, P. Acoustic Measurement of Overall Voice Quality: A Meta-Analysis. J. Acoust. Soc. Am. 2009, 126, 2619–2634. [Google Scholar] [CrossRef]
  24. Latoszek, B.B.v.; Lehnert, B.; Janotte, B. Validation of the Acoustic Voice Quality Index Version 03.01 and Acoustic Breathiness Index in German. J. Voice 2020, 34, 157.e17–157.e25. [Google Scholar] [CrossRef]
  25. Michaelis, D.; Gramss, T.; Strube, H.W. Glottal-to-Noise Excitation Ratio—A New Measure for Describing Pathological Voices. Acta Acust. United Acust. 1997, 83, 700–706. [Google Scholar]
  26. Dejonckere, P.H.; Lebacq, J. Harmonic Emergence in Formant Zone of a Sustained [a] as a Parameter for Evaluating Hoarseness. Acta Otorhinolaryngol. Belg. 1987, 41, 988–996. [Google Scholar] [PubMed]
  27. Hosokawa, K.; von Latoszek, B.B.; Ferrer-Riesgo, C.A.; Iwahashi, T.; Iwahashi, M.; Iwaki, S.; Kato, C.; Yoshida, M.; Umatani, M.; Miyauchi, A.; et al. Acoustic Breathiness Index for the Japanese-Speaking Population: Validation Study and Exploration of Affecting Factors. J. Speech Lang. Hear Res. 2019, 62, 2617–2631. [Google Scholar] [CrossRef] [PubMed]
  28. Kim, G.-H.; von Latoszek, B.B.; Lee, Y.-W. Validation of Acoustic Voice Quality Index Version 3.01 and Acoustic Breathiness Index in Korean Population. J. Voice 2021, 35, 660.e9–660.e18. [Google Scholar] [CrossRef] [PubMed]
  29. Englert, M.; Lima, L.; Behlau, M. Acoustic Voice Quality Index and Acoustic Breathiness Index: Analysis With Different Speech Material in the Brazilian Portuguese. J. Voice 2020, 34, 810.e11–810.e17. [Google Scholar] [CrossRef]
  30. Delgado Hernández, J.; León Gómez, N.M.; Jiménez, A.; Izquierdo, L.M.; Latoszek, B.B.v. Validation of the Acoustic Voice Quality Index Version 03.01 and the Acoustic Breathiness Index in the Spanish Language. Annal. Otol. Rhinol. Laryngol. 2018, 127, 317–326. [Google Scholar] [CrossRef] [PubMed]
  31. Jayakumar, T.; Benoy, J. Validation of Acoustic Breathiness Index (ABI) in the South Indian Population. J. Voice, 2022; in press. [Google Scholar] [CrossRef] [PubMed]
  32. Aghajanzadeh, M.; Saeedi, S.; Jalaie, S.; Esarian, K.; Latoszek, B.B.v. Validation of the Acoustic Voice Quality Index and the Acoustic Breathiness Index in the Persian Language. J. Voice, 2023; in press. [Google Scholar] [CrossRef]
  33. Ma, E.P.-M.; Yiu, E.M.-L. Voice Activity and Participation Profile: Assessing the Impact of Voice Disorders on Daily Activities. J. Speech Lang. Hear Res. 2001, 44, 511–524. [Google Scholar] [CrossRef]
  34. Kleemola, L.; Helminen, M.; Rorarius, E.; Isotalo, E.; Sihvo, M. Voice Activity and Participation Profile in Assessing the Effects of Voice Disorders on Quality of Life: Estimation of the Validity, Reliability and Responsiveness of the Finnish Version. Folia Phoniatr. Logop. 2010, 63, 113–121. [Google Scholar] [CrossRef]
  35. Kankare, E.; Rantala, L.; Ikävalko, T.; Latoszek, B.B.v.; Laukkanen, A.-M. Akustisen Äänenlaatuindeksin (AVQI) Version 03.01 Validointi Suomenkielisille Puhujille. Puhe Kieli 2020, 40, 165–182. [Google Scholar] [CrossRef]
  36. Barsties, B.; Maryn, Y. The Improvement of Internal Consistency of the Acoustic Voice Quality Index. Am. J. Otolaryngol. 2015, 36, 647–656. [Google Scholar] [CrossRef] [PubMed]
  37. Gierlich, J.; Latoszek, B.B.v. Test-Retest Reliability of the Acoustic Voice Quality Index and the Acoustic Breathiness Index. J. Voice, 2023; in press. [Google Scholar] [CrossRef] [PubMed]
  38. Brinca, L.; Batista, A.P.; Tavares, A.I.; Pinto, P.N.; Araújo, L. The Effect of Anchors and Training on the Reliability of Voice Quality Ratings for Different Types of Speech Stimuli. J. Voice 2015, 29, 776.e7–776.e14. [Google Scholar] [CrossRef] [PubMed]
  39. dos Santos, P.C.M.; Vieira, M.N.; Sansão, J.P.H.; Gama, A.C.C. Effect of Auditory-Perceptual Training With Natural Voice Anchors on Vocal Quality Evaluation. J. Voice 2019, 33, 220–225. [Google Scholar] [CrossRef]
  40. Landis, J.R.; Koch, G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef]
  41. Dejonckere, P.H.; Lebacq, J. Acoustic, Perceptual, Aero-Dynamic and Anatomical Correlations Invoice Pathology. J. Oto-Rhino-Laryngol. Relat. Specialt. 1996, 58, 326–332. [Google Scholar] [CrossRef]
  42. Södersten, M.; Lindestad, P.-Å. Glottal Closure and Perceived Breathiness During Phonation in Normally Speaking Subjects. J. Speech Lang. Hear Res. 1990, 33, 601–611. [Google Scholar] [CrossRef]
  43. Liu, X.; Xu, Y. What Makes a Female Voice Attractive? In Proceedings of the Proceedings of ICPhS XVII, Hong Kong, China, 17–21 August 2011; pp. 1274–1277. [Google Scholar]
  44. Šebesta, P.; Kleisner, K.; Tureček, P.; Kočnar, T.; Akoko, R.M.; Třebický, V.; Havlíček, J. Voices of Africa: Acoustic Predictors of Human Male Vocal Attractiveness. Anim. Behav. 2017, 127, 205–211. [Google Scholar] [CrossRef]
  45. Giles, H. Ethnicity Markers in Speech. In Book Social Markers in Speech; Scherer, K.R., Giles, H., Eds.; Cambridge University Press: Cambridge, UK, 1979. [Google Scholar]
  46. Waaramaa, T.; Lukkarila, P.; Järvinen, K.; Geneid, A.; Laukkanen, A.-M. Impressions of Personality from Intentional Voice Quality in Arabic-Speaking and Native Finnish-Speaking Listeners. J. Voice 2021, 35, 326.e21–326.e28. [Google Scholar] [CrossRef]
  47. Ladefoged, P.; Maddieson, I. Vowels of the World’s Languages. J. Phon. 1990, 18, 93–122. [Google Scholar] [CrossRef]
  48. Lukkarila, P.; Laukkanen, A.M.; Palo, P. Influence of the Intentional Voice Quality on the Impression of Female Speaker. Logoped. Phoniatr. Vocol. 2012, 37, 158–166. [Google Scholar] [CrossRef]
  49. De Bodt, M.; Van den Steen, L.; Mertens, F.; Raes, J.; Van Bel, L.; Heylen, L.; Pattyn, J.; Gordts, F.; van de Heyning, P. Characteristics of a Dysphonic Population Referred for Voice Assessment and/or Voice Therapy. Folia Phoniatr. Logop. 2016, 67, 178–186. [Google Scholar] [CrossRef]
  50. Mozzanica, F.; Ginocchio, D.; Barillari, R.; Barozzi, S.; Maruzzi, P.; Ottaviani, F.; Schindler, A. Prevalence and Voice Characteristics of Laryngeal Pathology in an Italian Voice Therapy-Seeking Population. J. Voice 2016, 30, 774.e13–774.e21. [Google Scholar] [CrossRef] [PubMed]
  51. Remacle, A.; Petitfils, C.; Flinck, C.; Morsomme, D. Description of Patients Consulting the Voice Clinic Regarding Gender, Age, Occupational Status, and Diagnosis. Eur. Arch. Otorhinolaryngol. 2017, 274, 1567–1576. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Frequency distribution of the mean breathiness by eight raters. Green colour on the graph indicates those participants who did not have breathiness (mean 0–0.49) in their voices and blue indicates those who were rated to have breathy voice quality (mean 0.50–3).
Figure 1. Frequency distribution of the mean breathiness by eight raters. Green colour on the graph indicates those participants who did not have breathiness (mean 0–0.49) in their voices and blue indicates those who were rated to have breathy voice quality (mean 0.50–3).
Jcm 12 07607 g001
Figure 2. Scatter plot and the linear regression line between auditory–perceptual rating and ABI results.
Figure 2. Scatter plot and the linear regression line between auditory–perceptual rating and ABI results.
Jcm 12 07607 g002
Figure 3. ROC curve analysis illustrating the diagnostic accuracy of ABI, area under ROC curve = 88.6% (AROC line blue, reference line red).
Figure 3. ROC curve analysis illustrating the diagnostic accuracy of ABI, area under ROC curve = 88.6% (AROC line blue, reference line red).
Jcm 12 07607 g003
Table 1. Number of dysphonic participants and their diagnoses.
Table 1. Number of dysphonic participants and their diagnoses.
DiagnosisNumber
Functional dysphonia30
Paralysis/paresis of vocal fold23
Laryngeal dystonia23
Other diseases of vocal fold or larynx/other undefined dysphonia10
Chronic laryngitis9
Nodules5
Larynx irritable with voice symptoms2
Cough with voice symptoms1
Polyp of vocal cord1
Laryngeal spasm1
Dysphagia with voice symptoms1
Ehlers–Danlos syndrome1
Larynx trauma1
Total108
Table 2. Mean values of ABI and perceptual rating of breathiness in non-dysphonic (participants with no diagnosis of dysphonia) and dysphonic groups (participants with diagnosis of dysphonia). Comparison of the groups using Mann–Whitney U test.
Table 2. Mean values of ABI and perceptual rating of breathiness in non-dysphonic (participants with no diagnosis of dysphonia) and dysphonic groups (participants with diagnosis of dysphonia). Comparison of the groups using Mann–Whitney U test.
NMeanSDMinMaxp-Value
ABI
Non-dysphonic
Dysphonic
872.261.15−0.704.58<0.001
1083.071.75−0.919.38
Perceptually
assessed BNon-dysphonic
Dysphonic
870.37 0.3501.25<0.001
1081.260.740 3
Table 3. The results of nine ABI-validated languages showing ABI thresholds, sensitivities, specificities, likelihood ratios, and correlation values rs between the ABI and perceptual evaluation of breathiness (B).
Table 3. The results of nine ABI-validated languages showing ABI thresholds, sensitivities, specificities, likelihood ratios, and correlation values rs between the ABI and perceptual evaluation of breathiness (B).
LanguageABI
Cut-Off Value
Sensitivity %Specificity %LR+LR−Correlation rs between
ABI and B
Dutch3.44829311.630.190.84
German3.4272907.400.310.86
Japanese3.4476948.090.130.89
Korean3.6988866.470.140.87
Brazilian Portuguese3.138891−0.030.130.87
Spanish3.40749516.020.270.83
South Indian3.66629512.192.480.76
Persian2.9770875.440.350.74
Finnish2.68 80 80 4.000.250.82
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kankare, E.; Laukkanen, A.-M. Validation of the Acoustic Breathiness Index in Speakers of Finnish Language. J. Clin. Med. 2023, 12, 7607. https://doi.org/10.3390/jcm12247607

AMA Style

Kankare E, Laukkanen A-M. Validation of the Acoustic Breathiness Index in Speakers of Finnish Language. Journal of Clinical Medicine. 2023; 12(24):7607. https://doi.org/10.3390/jcm12247607

Chicago/Turabian Style

Kankare, Elina, and Anne-Maria Laukkanen. 2023. "Validation of the Acoustic Breathiness Index in Speakers of Finnish Language" Journal of Clinical Medicine 12, no. 24: 7607. https://doi.org/10.3390/jcm12247607

APA Style

Kankare, E., & Laukkanen, A. -M. (2023). Validation of the Acoustic Breathiness Index in Speakers of Finnish Language. Journal of Clinical Medicine, 12(24), 7607. https://doi.org/10.3390/jcm12247607

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop