A First Step toward the Clinical Application of Landmark-Based Acoustic Analysis in Child Mandarin
Abstract
:1. Introduction
2. Methods
2.1. Participants
2.2. Equipment, Procedures and Materials
2.3. Data Analysis
2.3.1. Landmark-Based Acoustic Analysis
2.3.2. Intelligibility Scores
2.3.3. Statistical Analysis
3. Results
3.1. Descriptive Results
3.2. Inferential Results
4. Discussion
5. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
No. | Chinese Characters | Transliteration (Pinyin) | Gloss |
---|---|---|---|
1. | 鳳梨 | fèng lí | Pineapple |
2. | 飛機 | fēi jī | Airplane |
3. | 火車 | huǒ chē | Train |
4. | 漢堡 | hàn bǎo | Hamburger |
5. | 蝦子 | xiā zi | Shrimp |
6. | 小鳥 | xiǎo niǎo | Bird |
7. | 森林 | sēn lín | forest |
8. | 松鼠 | sōng shǔ | squirrel |
9. | 薯條 | shǔ tiáo | french fries |
10. | 手錶 | shǒu biǎo | watch |
References
- Chen, L.-M.; Oller, D.K.; Lee, C.C.; Liu, C.-T. LENA: Computerized Automatic Analysis of Speech Development from Birth to Three. In Proceedings of the 30th Conference on Computational Linguistics and Speech Processing (ROCLING 2018), Hsinchu, Taiwan, 4–5 October 2018. [Google Scholar]
- Holmgren, K.; Lindblom, B.; Aurelius, G.; Jailing, B.; Zetterström, R. On the phonetics of infant vocalization. In Precursors of Early Speech; Lindblom, B., Zetterström, R., Eds.; Palgrave Macmillan: London, UK, 1986; pp. 51–63. [Google Scholar]
- Jones, G.; Nadjibzadeh, N.; Károly, L.; Mohammadpour, M. An integrated dialect analysis tool using phonetics and acoustics. Lingua 2019, 221, 37–48. [Google Scholar] [CrossRef] [Green Version]
- Khan, A.; Steiner, I.; Sugano, Y.; Bulling, A.; Macdonald, R. A multimodal corpus of expert gaze and behavior during phonetic segmentation tasks. In Proceedings of the Language Resources and Evaluation Conference (LREC), Miyazaki, Japan, 7–12 May 2018. [Google Scholar]
- Oller, D.K.; Niyogi, P.; Gray, S.; Richards, J.A.; Gilkerson, J.; Xu, D.; Yapanel, U.; Warren, S.F. Automated Vocal Analysis of Naturalistic Recordings from Children with Autism, Language Delay, and Typical Development. Proc. Nat. Acad. Sci. USA 2010, 107, 13354–13359. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Xu, D.; Richards, J.A.; Gilkerson, J. Automated analysis of child phonetic production using naturalistic recordings. J. Speech Lang. Hear. Res. 2014, 57, 1638–1650. [Google Scholar] [CrossRef]
- Liu, C.-T.; Chen, L.-M.; Lin, Y.-C.; Cheng, C.-Y.; Lin, Y.-C. Fricative productions of Mandarin-speaking children with cerebral palsy: The case of five-year-olds. Clin. Linguist. Phonet. 2020, 34, 256–270. [Google Scholar] [CrossRef] [PubMed]
- Boyce, S.; Fell, H.J.; McAuslan, J. SpeechMark: Landmark detection tool for speech analysis. In Proceedings of the Interspeech 2012, Portland, OR, USA, 9–13 September 2012. [Google Scholar]
- Liu, S.A. Landmark detection for distinctive feature-based speech recognition. J. Acoust. Soc. Am. 1996, 100, 3417–3430. [Google Scholar] [CrossRef] [Green Version]
- Howitt, A.W. Automatic Syllable Detection for Vowel Landmarks. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2000. [Google Scholar]
- Stevens, K.N. Evidence or the role of acoustic boundaries in the perception of speech sounds. In Phonetic Linguistics: Essays in Honor of Peter Ladefoged; Fromkin, V.A., Ed.; Academic Press: London, UK, 1985; pp. 243–255. [Google Scholar]
- Stevens, K.N. On the quantal nature of speech. J. Phon. 1989, 17, 3–46. [Google Scholar] [CrossRef]
- Stevens, K.N. Diverse acoustic cues at consonantal landmarks. Phonetica 2000, 57, 139–151. [Google Scholar] [CrossRef] [PubMed]
- Stevens, K.N. From Acoustic Cues to Segments, Features and Words. In Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China, 16–20 October 2000. [Google Scholar]
- Stevens, K.N. Toward a model for lexical access based on acoustic landmarks and distinctive features. J. Acoust. Soc. Am. 2002, 111, 1872–1891. [Google Scholar] [CrossRef] [Green Version]
- Chomsky, N.; Halle, M. The Sound Pattern of English; Harper and Row: New York, NY, USA, 1968. [Google Scholar]
- Keyser, S.J.; Stevens, K.N. Feature geometry and the vocal tract. Phonol 1994, 11, 207–236. [Google Scholar] [CrossRef]
- Boyce, S.; Krause, J.; Hamilton, S.; Smilijanic, R.; Bradlow, A.R.; Rivera-Campos, A.; MacAuslan, J. Using Landmark Detection to Measure Effective Clear Speech. In Proceedings of the Meetings on Acoustics, Montreal, QC, Canada, 2–7 June 2013. [Google Scholar]
- Ishikawa, K.; MacAuslan, J.; Boyce, S. Toward clinical application of landmark-based speech analysis: Landmark expression in normal adult speech. J. Acoust. Soc. Am. 2017, 142, EL441–EL447. [Google Scholar] [CrossRef] [Green Version]
- Kalita, S.; Mahadeva Prasanna, S.R.; Dandapat, S. Importance of glottis landmarks for the assessment of cleft lip and palate speech intelligibility. J. Acoust. Soc. Am. 2018, 144, 2656–2661. [Google Scholar] [CrossRef]
- Atkins, M.S.; Boyce, S.E.; MacAuslan, J.; Silbert, N. Computer-assisted Syllable Complexity Analysis of Continuous Speech as a Measure of Child Speech Disorders. In Proceedings of the 19th International Congress of Phonetic Sciences, (ICPhS 2019), Melbourne, Australia, 4–10 August 2019. [Google Scholar]
- Ishikawa, K.; Rao, M.B.; MacAuslan, J.; Boyce, S. Application of a landmark-based method for acoustic analysis of dysphonic speech. J. Voice 2020, 34, 645.e11–645.e18. [Google Scholar] [CrossRef] [PubMed]
- DiCicco, T.; Patel, R. Automatic landmark analysis of dysarthric speech. J. Med. Speech-Lang. Path. 2008, 16, 213–219. [Google Scholar]
- MacAuslan, J. What are Acoustic Landmarks, and What do They Describe? Available online: https://speechmrk.com/wp-content/uploads/2016/08/Landmark-Descriptions.pdf (accessed on 17 January 2021).
- Huang, Z.; Epps, J.; Joachim, D. Investigation of Speech Landmark Patterns for Depression Detection. In Proceedings of the 2020 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP 2020), Virtual Conference, Barcelona, Spain, 4–8 May 2020. [Google Scholar]
- Atkins, M.S.; Washington, K.; Silbert, N.; MacAuslan, J.; Tuhohy, S.; Blades, R.; Donaldson, M.; Ungruhe, J.; Swanson, K. Towards Automated Detection of Similarities and Differences in Bilingual Speakers. In Proceedings of the 174th Meeting of the Acoustical Society of America, Louisiana, LA, USA, 4–8 December 2017. [Google Scholar]
- Kent, R. The biology of phonological development. In Phonological Development: Models, Research, Implications; Ferguson, C.A., Menn, L., Stoel-Gammon, C., Eds.; York Press: Maryland, MD, USA, 1992; pp. 65–90. [Google Scholar]
- Nathani, S.; Oller, D.K. Beyond ba-ba and gu-gu: Challenges and strategies in coding infant vocalizations. Behav. Res. Methods Insr. Cmp. 2001, 33, 321–330. [Google Scholar] [CrossRef] [Green Version]
- Oller, D.K.; Ramsdell, H.L. A weighted reliability measure for phonetic transcription. J. Speech Lang. Hear. Res. 2006, 49, 1391–1411. [Google Scholar] [CrossRef]
- Liberman, A.M.; Harris, K.S.; Kinney, J.A.; Lane, H. The discrimination of relative onset-time of the components of certain speech and nonspeech patterns. J. Exp. Psychol. 1961, 61, 379–388. [Google Scholar] [CrossRef] [PubMed]
- Zhu, H.; Dodd, B. The phonological acquisition of Putonghua (modern standard Chinese). J. Child Lang. 2000, 27, 3–42. [Google Scholar]
- Zhu, H. Phonological Development in Specific Contexts: Studies of Chinese-Speaking Children; Multilingual Matters: Clevedon, UK, 2002. [Google Scholar]
- Li, F.; Munson, B. The development of voiceless sibilant fricatives in Putonghua-speaking children. J. Speech Lang. Hear. Res. 2016, 59, 699–712. [Google Scholar] [CrossRef]
- Chao, Y.-R. The Cantian idiolect. Univ. Calif. Pub. Semitic Philol. 1951, 2, 27–44. [Google Scholar]
- Jeng, H. The Acquisition of Chinese Phonology in Relation to Jakobson’s Laws of Irreversible Solidarity. In Proceedings of the 9th International Congress of Phonetic Sciences (ICPhS 1979), Copenhagen, Denmark, 6–11 August 1979. [Google Scholar]
- Jeng, H. A developmentalist view of child phonology. Studies Lang. Lit. 1985, 1, 1–29. [Google Scholar]
- Wang, N.-M.; Fei, P.; Huang, H.; Chen, C.-W. The phonetic development of Mandarin-acquiring, three-to-six preschool children. J. Speech-Lang.-Hear. Assoc. 1984, 1, 12–15. (In Chinese) [Google Scholar]
- Chang, C.-F.; Chung, Y.-M. Revision and application of preschool children language development scale. Bull. Spec. Edu. 1986, 2, 37–52. (In Chinese) [Google Scholar]
- Hsu, J. A Study of the Various Stages of Development and Acquisition of Mandarin Chinese by Children in Taiwan Milieu. Master’s Thesis, Fu Jen Catholic University, New Taipei, Taiwan, 1987. [Google Scholar]
- Shiu, H. The Phonological Acquisition by Mandarin-Speaking Children: A Longitudinal Case Study on Children from 9 Months through Three Years Old. Master’s Thesis, National Taiwan Normal University, Taipei, Taiwan, 1990. [Google Scholar]
- Cheung, H. Three to four-years old children’s perception and production of Mandarin consonants. Lang. Linguist. 2000, 1, 19–38. (In Chinese) [Google Scholar]
- Cheung, H.; Hsu, P.-H. Chinese children’s production and perception of consonants: A developmental study. J. Chin. Hear. Speech 2000, 15, 1–10. (In Chinese) [Google Scholar]
- Cho, S.-C. The Phonological Development of 3 to 6 Year-Old Preschool Children in Taiwan. Master’s Thesis, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan, 2008. (In Chinese). [Google Scholar]
- Jeng, J.-Y. Manual of Mandarin Speech Test for Children. Available online: http://giast.nknu.edu.tw/UploadFile/TeaFiles/giast_t_103205415.pdf (accessed on 17 January 2021). (In Chinese).
- Jeng, J.-Y. The speech acquisition of Mandarin-speaking preschool children. J. Chin. Lang. Teach. 2017, 14, 109–136. (In Chinese) [Google Scholar]
- Li, X.X.; To, C.K. A review of phonological development of Mandarin-speaking children. Amer. J. Speech-Lang. Pathol. 2017, 26, 1262–1278. [Google Scholar] [CrossRef] [PubMed]
- Locke, J. Phonological Acquisition and Change; Academic Press: New York, NY, USA, 1983. [Google Scholar]
- Boersma, P.; Weenink, D. Praat: Doing Phonetics by Computer [Computer program]. Available online: http://www.praat.org/ (accessed on 17 January 2021).
- Ansel, B.M.; Kent, R.D. Acoustic-phonetic contrasts and intelligibility in the dysarthria associated with mixed cerebral palsy. J. Speech Lang. Hear. Res. 1992, 35, 296–308. [Google Scholar] [CrossRef]
- Liu, H.-M.; Tseng, C.-H.; Tsao, F.-M. Perceptual and acoustic analysis of speech intelligibility in Mandarin-speaking young adults with cerebral palsy. Clin. Linguist. Phonet. 2000, 14, 447–464. [Google Scholar]
- Liu, C.-T.J.; Chen, L.-M.; Lin, Y.-C.; Cheng, C.-F.A.; Chang, H.-C.J. Speech Intelligibility and the Production of Fricative and Affricate among Mandarin-Speaking Children with Cerebral Palsy. In Proceedings of the 2016 Conference on Computational Linguistics and Speech Processing (ROCLING 2016), Tainan, Taiwan, 6–7 October 2016. [Google Scholar]
- Liu, C.-T. Acoustic Landmark Analysis of Adults’ Consonants in Mandarin Chinese: The Case of Disyllabic Words. In Proceedings of the Paper Presented at The 56th Linguistics Colloquium, Virtual Conference, 26–28 November 2020. [Google Scholar]
Symbol | Mnemonic | Acoustic Rule 1 | Articulatory Interpretation |
---|---|---|---|
±g | Glottal | Beginning/end of sustained laryngeal vibration/motion | Onset/offset of vocal folds’ free vibration |
±p | Periodicity | Beginning/end of sustained periodicity (syllabicity) lasting for at least 32 milliseconds | The presence of ±p reflects the speaker’s ability to properly control the subglottal pressure and cricothyroid muscle. |
±b | Burst | At least three of five frequency bands show simultaneous power increases/decreases of at least 6 dB in both the finely smoothed and the coarsely smoothed contours in an unvoiced segment (not between +g and the next −g) | Presence of a fricative, affricate or aspirated stop burst consonant (i.e., +b) or cessation of frication or aspiration noise (i.e., −b) |
±s | Syllabic | At least three of five frequency bands show simultaneous power increases/decreases of at least 6 dB in both the finely smoothed and the coarsely smoothed contours in a voiced segment (between +g and the next −g) | Closure or release of a nasal or /l/ |
±f | Unvoiced frication | At least three of five frequency bands show simultaneous 6 dB power increases/decreases at high frequencies and decreases/increases at low frequencies (unvoiced segment) | Onset/offset of an unvoiced fricative |
±v | Voiced frication | At least three of five frequency bands show simultaneous 6 dB power increases/decreases at high frequencies and decreases/increases at low frequencies (voiced segment) | Onset/offset of a voiced fricative |
Participants | Total Participants (Number of Girls) | Mean Age in Month (SD) 1 |
---|---|---|
4-year-olds | 20 (10) | 52.25 (2.552) |
5-year-olds | 20 (10) | 64.20 (2.821) |
6-year-olds | 20 (10) | 76.20 (2.353) |
7-year-olds | 20 (10) | 88.20 (2.238) |
Landmark Features | Age 4 (n = 20) | Age 5 (n = 20) | Age 6 (n = 20) | Age 7 (n = 20) |
---|---|---|---|---|
+g | 19.25 (2.79) | 18.55 (2.54) | 18.4 (1.7) | 18.9 (3.49) |
−g | 19.25 (2.79) | 18.5 (2.48) | 18.45 (1.73) | 18.85 (3.5) |
+p | 26.9 (7.82) | 23.95 (4.71) | 24.8 (5.03) | 25.55 (6.33) |
−p | 24.6 (5.753) | 22.35 (3.56) | 23.3 (4.14) | 24 (5.54) |
+b | 10.05 (3.33) | 9.75 (3.73) | 10.5 (3.17) | 7 (3.1) |
−b | 3.45 (2.31) | 2.75 (1.68) | 2.8 (1.58) | 2.8 (2.09) |
+s | 5.55 (3.09) | 5.2 (3.3) | 4.7 (3.23) | 4.95 (3.4) |
−s | 5.3 (2.96) | 5.55 (3.1) | 5.4 (2.8) | 4.6 (4.31) |
+f | 0.15 (0.49) | 0 (0) | 0 (0) | 0 (0) |
−f | 0.1 (0.31) | 0.05 (0.224) | 0.15 (0.366) | 0 (0) |
+v | 0.05 (0.22) | 0 (0) | 0.1 (0.31) | 0.05 (0.22) |
−v | 0.45 (0.83) | 0.25 (0.55) | 0.05 (0.22) | 0.15 (0.366) |
Total | 115.1 (17.47) | 106.9 (11.35) | 108.65 (12.87) | 106.85 (16.6) |
Total without ±f & ±v | 114.35 (17.37) | 106.6 (11.39) | 108.35 (12.93) | 106.65 (16.69) |
Age 4 (n = 20) | Age 5 (n = 20) | Age 6 (n = 20) | Age 7 (n = 20) |
---|---|---|---|
4.825 (0.259) | 4.905 (0.267) | 4.855 (0.305) | 4.975 (0.079) |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, C.-T. A First Step toward the Clinical Application of Landmark-Based Acoustic Analysis in Child Mandarin. Children 2021, 8, 159. https://doi.org/10.3390/children8020159
Liu C-T. A First Step toward the Clinical Application of Landmark-Based Acoustic Analysis in Child Mandarin. Children. 2021; 8(2):159. https://doi.org/10.3390/children8020159
Chicago/Turabian StyleLiu, Chin-Ting. 2021. "A First Step toward the Clinical Application of Landmark-Based Acoustic Analysis in Child Mandarin" Children 8, no. 2: 159. https://doi.org/10.3390/children8020159