Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria

Loakes, Debbie; Gregory, Adele

doi:10.3390/languages9090299

Open AccessArticle

Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria

by

Debbie Loakes

^*

and

Adele Gregory

School of Languages and Linguistics, The University of Melbourne, Parkville 3010, Australia

^*

Author to whom correspondence should be addressed.

Languages 2024, 9(9), 299; https://doi.org/10.3390/languages9090299

Submission received: 19 February 2024 / Revised: 4 September 2024 / Accepted: 5 September 2024 / Published: 12 September 2024

(This article belongs to the Special Issue An Acoustic Analysis of Vowels)

Download

Browse Figures

Versions Notes

Abstract

:

(1) Background: Australian Aboriginal English (AAE) is a variety known to differ in various ways from the mainstream, but to date very little phonetic analysis has been carried out. This study is a description of L1 Aboriginal English in southern Australia, aiming to comprehensively describe the acoustics of vowels, focusing in particular on vowels known to be undergoing change in Mainstream Australian English. Previous work has focused on static measures of F1/F2, and here we expand on this by adding duration analyses, as well as dynamic F1/F2 measures. (2) Methods: This paper uses acoustic-phonetic analyses to describe the vowels produced by speakers of Aboriginal Australian English from two communities in southern Australia (Mildura and Warrnambool). The focus is vowels undergoing change in the mainstream variety–the short vowels in KIT, DRESS, TRAP, STRUT, LOT, and the long vowel GOOSE; focusing on duration, and static and dynamic F1/F2. As part of this description, we analyse the data using the sociophonetic variables gender, region, and age, and also compare the Aboriginal Australian English vowels to those of Mainstream Australian English. (3) Results: On the whole, for duration, few sociophonetic differences were observed. For static F1/F2, we saw that L1 Aboriginal English vowel spaces tend to be similar to Mainstream Australian English but can be analysed as more conservative (having undergone less change) as has also been observed for L2 Aboriginal English, in particular for KIT, DRESS, and TRAP. The Aboriginal English speakers had a less peripheral vowel space than Mainstream Australian English speakers. Dynamic analyses also highlighted dialectal differences between Aboriginal and Mainstream Australian English speakers, with greater F1/F2 movement in the trajectories of vowels overall for AAE speakers, which was more evident for some vowels (TRAP, STRUT, LOT, and GOOSE). Regional differences in vowel quality between the two locations were minimal, and more evident in the dynamic analyses. (4) Conclusions: This paper further highlights how Aboriginal Australian English is uniquely different from Mainstream Australian English with respect to certain vowel differences, and it also highlights some ways in which the varieties align. The differences, i.e., a more compressed vowel space, and greater F1/F2 movement in the trajectories of short vowels for AAE speakers, are specific ways that Aboriginal Australian English and Mainstream Australian English accents are different in these communities in the southern Australian state of Victoria.

Keywords:

vowels; acoustics; Australian English; Aboriginal English

1. Introduction and Background

Australian English is reported to have three main varieties: Mainstream Australian English (the institutional standard), Ethnocultural Australian English (which encompasses the Englishes used by speakers with different non-Anglo-Celtic cultural heritage backgrounds), and ‘Aboriginal English’ (Cox and Fletcher 2017, pp. 12–13). Aboriginal English is spoken by Indigenous Australians, who are First Nations people that experienced colonisation and language loss after the arrival of the British in 1788 (e.g., Mailhammer 2021). This variety has been described as “an English-lexified contact-based variety and the first and only language for a sizeable number of Indigenous people in Australia” (Louro and Collard 2021, p. 2). There are multiple varieties of Aboriginal English spoken in Australia, and Aboriginal Englishes, plural, is often considered a more appropriate term to use to reflect this (Eades 2013). While L1 and L2 varieties are spoken, L1 varieties are more common in the south of the continent, where our research is carried out; and there are greater numbers of L2 speakers in the northern regions of Australia, which largely reflects colonisation practices (Mailhammer 2021). Aboriginal Englishes may be phonetically and/or phonologically different from Mainstream Australian English (Butcher 2008) and can also be structurally different (Eades 2013; Malcolm 2013) depending on the variety in question. Communicative practices are also known to be culturally different from the mainstream variety (Louro and Collard 2021), and this extends to the use of disfluencies such as unfilled pauses (McDougall et al. forthcoming) and filled pauses (Blackwell and McDougall forthcoming) in L1 Aboriginal English. There are very few acoustic studies focusing on the vowels produced by Aboriginal English speakers in Australia, despite a clear difference in the sound system(s) between these speakers and Mainstream Australian English speakers, and this will be the focus of the current paper.

In acoustic descriptions of English vowels in Australia, most attention tends to be paid to the mainstream variety and mostly from Sydney (i.e., Harrington et al. 1997; Watson and Harrington 1999; Cox 1999; Cox and Palethorpe 2008, 2019; Elvin et al. 2016; Grama et al. 2019; Cox et al. 2024). The limited work on Aboriginal English vowel spaces has been based on static measures of formant steady states. These studies have shown in particular that Aboriginal English vowel spaces are more compressed than the mainstream variety and have more “conservative” features, whether L2 Aboriginal English (Butcher and Anderson 2008) or L1 Aboriginal English (Loakes et al. 2016). In these production studies focusing on Aboriginal English vowels, and in the perception studies which consider vowel categorisation behaviour by Aboriginal English listeners (Loakes et al. 2024a, 2024b), varietal differences are analysed as due to Aboriginal English not having undergone the same rates of change as the mainstream variety (also see Butcher 2008). In particular, front vowel lowering has been especially rapid for MAE speakers (i.e., Cox and Palethorpe 2008). This means that age is often a highly significant factor in how Mainstream Australian English listeners produce vowel variants (e.g., Cox 1999; Cox and Palethorpe 2008) and respond to vowel categorisation tasks (Mannell 2004; Loakes et al. 2024a, 2024b). Perception studies focusing on short front vowels with the same speaker-listeners and communities as the current study (Loakes et al. 2024a, 2024b) have also shown that Aboriginal Australian English participants respond differently to various vowel contrasts compared to Mainstream Australian English listeners in the same region. This is consistent with research discussed above, which shows a more compressed vowel space for Aboriginal English speakers. One example is that Aboriginal English listeners in Mildura have a significantly earlier crossover between KIT-DRESS compared to Mainstream Australian English listeners (Loakes et al. 2024a); this is consistent with the idea that Aboriginal English speakers produce vowels which are phonetically less open, and whether this is true for the Mildura speakers will be empirically tested in the current study. What is as yet unknown is the exact acoustic realisation of vowels in Aboriginal English in Victoria, including duration and dynamic features, and their relationship to vowels produced by Mainstream Australian English speakers.

Aims

The aim of the current study is to provide an acoustic analysis of the short vowels KIT, DRESS, TRAP, STRUT, and LOT in L1 Aboriginal English, as well as the long vowel GOOSE. These vowels are chosen because they are among the monophthongs that have experienced rapid change in mainstream Australian English (Cox et al. 2024 for production; Mannell 2004; Loakes et al. 2024a, 2024b for perception). Additionally, because Aboriginal English is said to have changed less rapidly (Butcher 2008), vowels in general are an important focus for accent differences between varieties. Recent work by Cox et al. (2024) summarises the main diachronic changes in Mainstream Australian English and incorporates both static and dynamic analyses. They note that static measures are important for showing the relationship between vowels in the vowel space, while dynamic measures “provide tools for assessing the changes in a vowel’s time-varying spectral detail” (2024, p. 17). As such, we use a mix of tools to analyse vowel acoustics (as described in Section 2.3).

With a focus on data from two regions (Warrnambool and Mildura), the paper will also explore potential regional variation within L1 Aboriginal English. Regional variation within Aboriginal English in Australia has not been widely analysed, but such variation is acknowledged. As mentioned above, Eades (2013) advocates for the term Aboriginal Englishes”, and there has also been regional variation observed in the phonetics of L1 Aboriginal English for the communities analysed in this study, in both voice quality (Loakes and Gregory 2022, for male speakers) and in the realisation of /t/ (Loakes et al. 2022). We also look at age as a variable, because as described above there has been rapid diachronic change reported in Australian English vowels for MAE speakers (i.e., Cox et al. 2024). While Aboriginal English speakers have been reported to have more conservative vowel spaces (Butcher 2008; Butcher and Anderson 2008; Loakes et al. 2016), there are nevertheless differences between older and younger groups in how they process vowels, yet the differences are less marked (e.g., Loakes et al. 2024a, 2024b). Some age differences in AAE are thus likely and should be considered. Finally, gender is also analysed, because depending on the vowel, gender is a known factor driving vowel realisation in Australian English (see e.g., Cox and Palethorpe 2019).

As mentioned, KIT, DRESS, and TRAP have been the focus of recent perceptual studies showing various differences in the ways Aboriginal and Mainstream Australian English listeners respond to the contrasts (Loakes et al. 2024a, 2024b), and STRUT and LOT are also of interest having been included in a study by Mannell (2004) focusing on change in the Australian English perceptual space. As such, there is a relatively comprehensive understanding of how these vowels have changed in Mainstream Australian English communities, as well as in the communities in the current study (Loakes et al. 2024a, 2024b). In the current study, we also decided to include the long vowel GOOSE (phonetically /ʉː/ in Australian English) because this vowel has undergone a substantial fronting change in the mainstream variety (Cox et al. 2024; Cox 1999; also see Elvin et al. 2016), with less fronting observed for both L1 and L2 Aboriginal English speakers (Butcher and Anderson 2008; Loakes et al. 2016), which also makes for another interesting comparator.

Variables analysed in the current paper aim to give an overview of the production of vowels for L1 Aboriginal English speakers in Victoria. This includes static and dynamic measures for formants (including the F1/F2 vowel space), and vowel duration. Previous studies on Aboriginal English vowels have only focused on static measurements for F1/F2 (Butcher and Anderson 2008; Loakes et al. 2016) and have not included dynamic analyses or duration analyses, so this study will provide a more comprehensive and nuanced understanding of the acoustics of vowels in this variety. Monophthongal vowels, which are the primary focus of our study, are said to be sufficiently distinguished by duration and F1/F2 measurements at the target rather than by dynamic features (e.g., Watson and Harrington 1999), and trajectory movement in short vowels has also been shown to be dependent on coarticulatory factors in Australian English (Elvin et al. 2016). More recently, however, research on Australian English has also shown trajectory movement can have a bearing on sociophonetic differences in Australian English in diphthongs (Penney et al. 2023) and even in short monophthongs (e.g., Docherty et al. 2015; Cox and Palethorpe 2019), so we include dynamic F1/F2 measures in our analysis bearing this in mind.

2. Materials and Methods

Analysing the speech of 33 Aboriginal English speakers, we focus on acoustic measures to describe the short vowel system of Aboriginal Englishes spoken as an L1 in two locations in the southeast of Australia. We measure duration, static F1/F2, and dynamic F1/F2 trajectories of KIT, DRESS, TRAP, STRUT, LOT, and GOOSE using current methodology as will be described. Along with region, the sociolinguistic variables of age and gender are also considered in the analysis. Speakers can be grouped into two distinct age groups, >40 and <40, enabling investigation into diachronic changes. This will enable a more complete picture of the acoustic qualities of the vowels of AAE. We compare the AAE data with a “baseline” sample of 28 Mainstream Australian English speakers from the same regions. This MAE sample will not be the primary focus of the paper but gives a sense of the variation occurring within Mainstream Australian English, and highlights the differences and/or similarities, as the case may be, between AAE and MAE.

2.1. Speakers

The speech sample was collected by the first author as part of a more extensive study on sociophonetic variation in AAE (see more detail in, e.g., Loakes et al. 2024b). Participants are monolingual speakers of Australian English from rural Victoria who live in Warrnambool in the southwest of the state of Victoria, and Mildura in the northwest of the state. Warrnambool and Mildura are small towns, both with approximately 35,000 people, located 159 miles (256 km) and 335 miles (540 km), respectively, from the capital Melbourne, and the towns are located 328 miles from one another. These locations are shown in the map below in Figure 1.

The AAE speakers self-identified as members of this speech community, specifically signing up for a study on “Aboriginal English”. They described their variety of English as “Aboriginal”, “Aboriginal English”, or “Koori English” (Koori is a term used by Aboriginal people in Victoria). Mainstream Australian English speakers all spoke the institutional standard (Cox and Fletcher 2017). All speakers self-identified as male or female. Participants also fell into two distinct age ranges: a younger group under 40, and an older group over 40. A breakdown of the location, gender, and age of AAE speakers is presented in Table 1, along with the breakdown of the MAE group.

2.2. Materials and Recording Procedures

Recordings took place in fieldwork conditions in Warrnambool and Mildura, so the recording setting differed depending on the speakers’ preferences. In Warrnambool, AAE speakers were recorded both in their homes and in two different Aboriginal co-operatives (culturally appropriate community service centres), which had meeting rooms: one in Warrnambool, one in a location close to Warrnambool. MAE speakers were recorded in their homes. In Mildura, almost no participants were recorded in their own homes. AAE speakers were recorded in the Aboriginal co-operative there, while MAE speakers were recorded in a public space—often the foyer in the town library, which is a central community space. Speech data were recorded using a portable Zoom (Tokyo, Japan) Handy Recorder H4n at a sampling rate of 44,100 Hz. Participants were presented with the target vowels in isolated /hVt/ contexts as a control and because of a specific interest in sociophonetic variation in /t/ (i.e., Loakes et al. 2022). They were instructed to read the words aloud at a regular conversational rate, but rates differed depending on the speaker. Each vowel was produced six times, although some iterations were later discarded, primarily due to background noise or intelligibility issues. In total, 2168 vowel tokens were available for analysis with the total number of tokens per vowel shown in Table 2.

Participants also took part in a forced-choice perception (vowel categorisation) task and a sociolinguistic interview, but those results are not analysed here (see Loakes et al. 2024a, 2024b) for results of the vowel-categorisation task, and (McDougall et al. forthcoming) and (Blackwell and McDougall forthcoming) for disfluency analysis in the spontaneous speech). We note that the advantage of focusing on controlled speech in the first instance means we will produce a baseline sample for later comparison with spontaneously produced speech. While controlled speech can arguably be seen as less natural, the focus on citation forms means we can produce vowel spaces comparable with other research (especially the L2 spaces in Butcher and Anderson 2008). We note that there is also evidence that controlled speech may be sufficient for analysing speakers’ usual production behaviour in some non-mainstream speaker groups. For example, King et al. (2020) found in a study of Māori speakers that viewing the speakers through a typical sociolinguistic lens was actually not appropriate for that group, and that the idea of prestige norms in citation speech did not apply. This may also be the case for the Aboriginal English speakers in the current study, although this is not the main focus here.

2.3. Analysis

The start and end boundaries of each vowel were first determined automatically in WebMaus (Kisler et al. 2017) and then hand corrected as needed. Duration measures were automatically detected from these boundaries using EmuR (Jochim et al. 2023). Formants were automatically extracted using the ‘on the fly’ extract trackdata function in EmuR. These were then visualised and hand corrected when necessary. Monophthongs were subject to a static analysis with the peak F1/F2 taken depending on the vowel (e.g., Cox 1999). Vowel targets for monophthongs were calculated from the point at which F1 or F2 was highest or lowest, then matched to the other formant’s data at that point. The process was limited to the first half of each segment in line with previous work (Harrington et al. 1997; Cox 2006) to reduce effects of the following consonant. The following list shows at what point the target was calculated for each vowel.

KIT—minimum F1
DRESS—maximum F2
TRAP—maximum F1
STRUT—maximum F1
LOT—mimimum F2
GOOSE—minimum F1

Linear mixed effects models were chosen to statistically investigate the sociolinguistic elements of region, gender, and age for static measures (duration, F1(Hz), F2(Hz)). These enable the random effect of speaker to be included in the analysis, and all fixed factors (region, gender, age, and vowel) and their interactions to be included in the modelling. Post-hoc pairwise comparisons are calculated using the emmeans package (Lenth 2023).

Dynamic formant analyses were conducted with measurements taken at five points across the vowel: 20%, 35%, 50%, 65%, and 80%. Trajectories were calculated across the normalised length of the vowel (21 points) for F1 and F2. Generalised Additive Mixed Models were fitted for formant (F1 and F2) for each vowel. For the GAMM analysis, we fitted generalised additive mixed models using the mgcv (Wood 2006, version 1.8–31) and itsadug (van Rij et al. 2020) packages in R (R Core Team 2020). As the inclusion of interactions of multiple predictors (such as dialect and vowel) is not straightforward in GAMMs, separate models were fitted for F1 and F2 of each of the vowels to enable interpretation of potential changes in each vowel for each sociolinguistic variable. Separate models were fitted using the factors of dialect, location, gender, and age as ordered parameteric terms. In all models a smooth over-normalised vowel duration, a smooth over-normalised vowel duration by parametric factor (dialect, location, age or gender), and a (random) factor smooth over-normalised vowel duration by speaker was included.

3. Results

Results are presented below and include both inferential and descriptive statistics. Appendix A contains mean static measures broken down by dialect, location, gender, and age for reference in Table A1.

3.1. Duration in AAE

Duration was measured for each AAE vowel, and the mean duration was calculated. Figure 2 shows AAE vowel durations broken down according to location and gender. Table A1 in the Appendix A provides mean duration values broken down by vowel, dialect, location, gender, and age. A linear mixed effects model was built by using the fixed factors of vowel, location, gender, and age, and a random intercept for participant for the AAE speakers. Interactions between the fixed factors were included in the model. It was fit by REML, and the t-tests use Satterthwaite’s method as determined by the package lme4, lmertest (Bates et al. 2015). Post-hoc pairwise comparisons were calculated using the emmeans package (Lenth 2023). A full statistical summary can be found in Table A2.

Similar to what we expect from Australian English vowels in the Mainstream variety (see e.g., Elvin et al. 2016), TRAP, LOT, and GOOSE have significantly longer duration than the reference vowel KIT for AAE speakers. There was a significant interaction of Location and LOT (t(1233) = −3.73, p < 0.001) and also for Gender and LOT (t(1233) = −3.49, p < 0.001). These three factors (Location * Gender * LOT then also show a significant interaction (t(1233) =3.11, p < 0.01). The effect of Location * Gender * GOOSE was statistically significant and negative (beta = −16.38, t(1233) = −2.10, p < 0.05) and the effect of Gender * Age * TRAP was also statistically significant and negative (beta = −17.40, t(1233) =−2.16, p = < 0.05). Post hoc tests did not show any significant differences in duration between locations, age groups, or gender except as seen in Figure 2, where Female MI speakers have significantly longer LOT vowel than the male speakers in the same location (t(29.6) = 2.3, p < 0.05). While it seems in Figure 2 that there is a trend for female speakers in Mildura to have longer vowels overall, this is only significant for LOT.

DRESS and TRAP were examined in more detail due to duration being used by some speakers of Australian English to distinguish these vowels in prenasal contexts (Cox and Palethorpe 2014), as well as the fact these vowels are involved in a prelateral merger and are variably produced (Cox et al. 2024) and perceived (Loakes et al. 2024a, 2024b). In this citation form data, post-hoc results showed that DRESS and TRAP differed significantly in length for AAE speakers in the following contexts:

Females: WN:	<40 t(1210) = −3.57 p < 0.01,
	>40 t(1209) = −4.2, p < 0.01
Males: MI,	<40, t(1209) = −3.1, p < 0.05
WN,	<40, t(1209) = −4.1, p < 0.01

Female speakers in WN regardless of age produce TRAP with a significantly longer vowel duration than DRESS, whilst in MI these vowels are produced with a similar duration. In contrast, young male speakers, irrespective of location, produce TRAP with a significantly longer duration than DRESS. Taken with the above results, this description of duration in AAE vowels shows that there is some sociophonetic variation employed by speakers, dependent on location, gender, and age.

3.2. Duration Comparison of AAE and MAE

Whilst the acoustic description of AAE is the main focus of this paper, it is helpful to compare with a MAE sample and thus highlight the differences and/or similarities, as the case may be, between AAE and MAE. When we focus on the comparison of AAE speakers with MAE speakers, we see that there are some differences in duration depending on the vowel and the dialect (see Figure 3).

A linear mixed effects model was built by using the fixed factors of vowel, gender, and dialect and a random intercept for participant for all speakers. Interactions between the fixed factors were included in the model. All vowels showed significant (p < 0.01) differences in duration from the reference vowel (KIT). There was also significant interaction between DRESS and dialect (p < 0.01) and LOT and dialect (p < 0.05). Gender showed significant interaction with the vowel GOOSE (p < 0.001), whilst the interaction between dialect, LOT, and gender was also statistically significant (p < 0.05). A Type III Analysis of Variance was conducted to examine the effects of vowel, gender, and dialect on duration. The analysis used Satterthwaite’s method for estimating degrees of freedom. As expected, the factor vowel was found to have a highly significant effect on duration, F(5, 2088.06) = 200.99, p < 0.001. The interaction between dialect and vowel was also highly significant, F(5, 2088.06) = 7.09, p < 0.001, indicating that the effect of vowel varied depending on dialect. The interaction between vowel and gender was significant, F(5, 2088.06) = 8.60, p < 0.001. No significant effects were observed for dialect or gender alone, nor for the three-way interaction between dialect, vowel, and gender.

3.3. AAE Static F1/F2

Static measures of the short vowels were taken at the vowel target as described in the methodology. A full list of mean F1 and F2 broken down by vowel, dialect, location, gender, and age may be found in Table A1. AAE vowel means were plotted in Figure 4 by sociophonetic variable (location, gender, and age) after being bark normalized for ease of comparison in the plots. As noted, statistical comparisons have been made on the raw values.

In terms of location, AAE speakers from both locations have very similar mean F1/F2 values. WN speakers in general have a slightly more expansive vowel space; however, LOT for MI speakers is phonetically more back. Even after Bark normalisation, female AAE speakers have more fronted mean values for KIT and GOOSE and higher production of DRESS. Younger speakers have a more retracted vowel space as well as an expanded F1 space in comparison to older speakers.

A linear mixed effects model was built separately for both F1 and F2 using non-normalised Hz values. Fixed factors of vowel, location, gender, and age, and a random intercept for participant, were included along with interactions for each of the fixed factors. It was fit by REML, and the t-tests use Satterthwaite’s method as determined by the package lmerTest, lme4 (Bates et al. 2015). A full summary is found in Appendix A, Table A4. As would be expected, there were significant differences in F1 for all vowels (except GOOSE) in comparison to the reference vowel KIT. There were a number of significant interactions:

Location * DRESS p < 0.05
Location * Gender * DRESS p < 0.01
Location * Gender * TRAP p < 0.05
Location * Age * DRESS p < 0.05
Gender * Age * DRESS p < 0.001

These interactions all involve the F1 of one of the non-high front vowels and involve all the sociolinguistic variables (Location, Age, and Gender), though in different combinations.

In terms of F2, there were significant difference for all vowels in comparison to the reference vowel KIT. In addition, Gender was also a significant factor (t(1233) = −3.06, p < 0.01). There were a number of significant interactions:

Gender * Age p < 0.01
Gender * STRUT p < 0.001
Age * TRAP p < 0.05
Location * Gender * TRAP p < 0.05
Location * Gender * LOT p < 0.001
Location * Age * STRUT p < 0.05
Location * Age * LOT p < 0.05
Gender * Age * DRESS p < 0.001
Gender * Age * TRAP p < 0.001
Gender * Age * LOT p < 0.01
Gender * Age * GOOSE p < 0.05

F2 interactions also involve all the sociolinguistic variables (in various configurations), and incorporate more of the vowels (including DRESS, TRAP, STRUT, LOT, and GOOSE).

In order to better understand the interactions, post-hoc tests were carried out. Results are presented in Table 3; significant post-hoc results (p < 0.05) for each of the major fixed factors (location, gender, and age) are presented by vowel. Arrows are used to show the direction of the difference, with ↑ showing the second value is higher, or ↓, lower than the first for each pair of sociolinguistic variables; Location (MI/WN), Gender (F/M), and Age (<40/>40).

When all the variables were controlled for, we see that location differences were minimal in F1 with the only significant differences between MI and WN occurring for Male < 40 speakers in DRESS and STRUT; these speakers in WN have higher F1 values and therefore a phonetically lower position in the vowel space. Female > 40 speakers in WN show significant differences from their Mildura counterparts with lower F2 values for KIT, DRESS, and TRAP, indicating these vowels are significantly retracted for this group.

Gender differences (F/M) are concentrated in F2 where across locations, <40 male speakers exhibit significant retraction for KIT and GOOSE. Age (<40/>40) is an important factor for female Mildura speakers with F1 in TRAP being significantly lower for >40 and F2 significantly higher. These speakers also have a significantly higher F2 in DRESS than their younger counterpart. Older male WN speakers show a similar pattern but in different vowels; DRESS (F1) and KIT (F2).

Post-hoc results for significant interactions of vowels are not shown due to there being significant differences in F1 and F2 for most vowels (as expected). However, of note are the female speakers in MI, >40 and the male <40 speakers in WN who do not have a significant difference in vowel height for DRESS and TRAP (p = 0.9, p =< 0.9, respectively).

3.4. Static F1/F2 Comparison of AAE and MAE

When the two dialects are plotted together (Figure 5), we can see that many of the vowels are in general overlapping between MAE and AAE. MAE speakers have more variability in the F1 dimension for STRUT and LOT, whilst AAE speakers vary more along the F2 axis for these vowels. MAE speakers also have even larger ellipses for the vowels DRESS and TRAP than the AAE speakers, which likely reflects the phonetically more open productions by younger MAE speakers.

Linear mixed effects models were calculated individually for F1 and F2 with the fixed interacting factors of vowel, gender, and dialect and a random intercept for participant for all speakers. A full summary is provided in Appendix A, Table A5. For F1 there were significant differences in vowel height in comparison to the reference vowel KIT for all vowels except GOOSE. Significant interactions were Dialect * STRUT (p < 0.05) and Gender * DRESS (p < 0.001). A Type III Analysis of Variance was used to examine the effects of vowel, gender, and dialect on F1 and found the interaction between dialect and vowel and gender and vowel were statistically significant (F(5, 2104.75) = 5.30, p < 0.0001, F (5, 2104.75) = 12.95, p < 0.001, respectively). Post-hoc tests confirmed that MAE in comparison to AAE has significantly lower STRUT vowels in the vowel space.

Results from the LMER for F2 showed a significant effect for Dialect, t(99.0) = 2.5, p < 0.05. Gender was also statistically significant, t(−287.3) = −6.7, p < 0.001. All vowels differed significantly in F2. Dialect and vowel showed a significant interaction for STRUT (p < 0.01), whilst Gender and Vowel showed a number of significant interactions for the vowels DRESS (p < 0.05), TRAP (p < 0.001), STRUT (p < 0.001), and LOT (p < 0.001). A Type III Analysis of Variance was used to examine the effects of vowel, gender, and dialect on F2, and significant main effects were observed for Dialect (F(1, 56.45) = 10.14, p < 0.01), Gender (F(1, 56.45) = 32.18, p < 0.001), and Vowel (F(5, 2101.58) = 1260.06, p < 0.001, indicating that these factors independently influence F2. There are significant two-way interactions between Dialect and Vowel (F(5, 2101.58) = 4.98, p < 0.001) and between Gender and Vowel (F(5, 2101.58) = 14.20, p < 0.001) showing that the effect of dialect and of gender varies significantly across different vowels. However, the interaction between Dialect and Gender and the three-way interaction between Dialect, Gender, and Vowel were not significant, indicating that the combined influence of these factors does not significantly affect the outcome.

Post-hoc pairwise comparisons were calculated using the emmeans package (Lenth 2023). This showed that for female speakers KIT (p < 0.05), DRESS (p < 0.001), and TRAP (p < 0.05) were significantly retracted when produced by AAE speakers compared to MAE speakers. KIT (p < 0.05) and DRESS (p < 0.001) were also significantly retracted for male AAE speakers in comparison to male MAE speakers.

3.5. AAE Dynamic F1/F2

Whilst static formant measurements form an important part of the description of the vowel system of a language (especially for comparison with existing descriptions), it is also important to investigate what is occurring across the entire duration of a vowel, including for the short vowels (also see Elvin et al. 2016; Penney et al. 2018). As shown in Figure 6, when it comes to the AAE speakers, we see relatively substantial movement in the trajectories of F1 and F2.

AAE speakers show some differences for location, with more trajectory movement in MI than in WN. The AAE female speakers in general show more movement across the vowels than the male speakers, but especially between 65% and 80% of the vowel length. The < 40 AAE group shows extensive movement throughout the vowel, particularly in GOOSE, DRESS, and TRAP. Again, most of this movement occurs during the latter part of the vowel.

To look more at the vowel trajectory, a GAMM analysis was conducted separately for each formant of each vowel and for each sociophonetic factor; region, age, and gender. Separate models were fitted using the factors of location, gender, and age as ordered parametric terms. In all models a smooth over-normalised vowel duration, a smooth over-normalised vowel duration by parametric factor (location, age, or gender), and a (random) factor smooth over-normalised vowel duration by speaker was included. Formant values were included as non-normalised Hz values (Cox et al. 2024). A summary for the parametric and non-linear analyses for the comparsion between location (MI vs. WN), age (<40 vs. >40), and gender (F vs. M) are presented in Table 4. Full results appear in Appendix A, Table A6, Table A7 and Table A8.

Location shows significant differences between MI and WN for the KIT and DRESS vowels with these differences occuring in both F1 and F2 in the non-linear effects. Age showed significant effect both parametic and non-linear across F1 and F2 for DRESS. Significant parametric effects were also present for TRAP and LOT. Gender showed the greatest number of significant effects, with parametic or non-linear effects being significant for each vowel. In KIT and DRESS, this is true across both F1 and F2.

3.6. Dynamic F1/F2 Comparison of AAE and MAE

Turning to compare AAE with MAE (Figure 7), the vowel trajectories particularly of STRUT and LOT show relatively more movement for AAE speakers. The directionality of this movement is also different in most cases, with AAE speakers starting further towards the back of the vowel space and then moving forward.

To look more at the vowel trajectories, a GAMM analysis was conducted separately for F1 (Hz) and F2(Hz) of each vowel, and a smooth over-normalised vowel duration, a smooth over-normalised vowel duration by parametric factor (dialect), and a (random) factor smooth over-normalised vowel duration by speaker was included. A summary of significant effects is shown in Table 5 with a full statistical summary shown in Appendix A, Table A9.

There are significant differences for Dialect between AAE and MAE across all vowels except GOOSE. These differences are concentrated in the parametric effects, and primarily F1 parametric effects. The DRESS vowel also has significant F2 parametric effects.

4. Discussion

The overall purpose of this paper was to give a detailed acoustic description of vowels in L1 Aboriginal Australian English. By focusing on vowel duration, as well as static and dynamic measures for formants (F1/F2), the aim was to comprehensively describe acoustic features of vowels known to be undergoing change in Australia. Sociophonetic variables were considered (age, gender, and region), and by comparing Aboriginal Australian English with a sample of Mainstream Australian English, varietal differences in Australia were also considered, though they were not the primary aim of the paper.

For duration, we did not see large differences across the data set, and regardless, these results should be treated with caution because duration can interact with speech rate. Nevertheless, we saw some limited sociophonetic variability in the data with respect to this variable. Among the short vowels, TRAP and LOT are phonetically longest, and in our data, GOOSE was also significantly longer, which can be expected given that this vowel is phonemically long. We saw that female AAE speakers in Mildura had somewhat longer vowels than their male counterparts, but this was only significant for LOT. This vowel also patterned differently according to location and gender. Additionally, the duration analysis focused on DRESS and TRAP due to their being involved in various sound changes in Australian English. For AAE, we saw that speakers in WN had a longer TRAP vowel compared to DRESS (this was not observed for MI), and we also saw that male speakers tended to produce a phonetically longer TRAP vowel as well. When comparing the AAE and MAE speakers in this study, we saw that all vowels showed significant differences in duration from the reference vowel (KIT), and we also saw a significant interaction between DRESS and dialect and TRAP and dialect. Post-hoc pairwise comparisons confirmed AAE speakers have a significantly shorter DRESS vowel compared to the MAE speakers, but this was not significant for TRAP. Given sound change in DRESS and TRAP, it is perhaps not surprising that differences are observed for this vowel pair, although length is not typically the overarching feature mentioned with respect to change (see e.g., Loakes et al. 2024a, 2024b; Cox et al. 2024). It is also worth noting that the actual duration measurements in this study are consistent with other work reporting the length of short vowels in Australian English in wordlist contexts, for example, Elvin et al. (2016) and Penney et al. (2018), who show similar ranges for duration of these short vowels. While some variability is observed, in this study differences between L1 AAE and MAE vowels are not particularly evident durationally; rather, we find differences in other acoustic dimensions.

As far as static F1/F2 is concerned, we described the vowel spaces of the L1 Aboriginal English speakers and found they were very similar to the Mainstream Australian English speakers’ vowel spaces in their general shape. However, the Aboriginal English speakers had a less expanded space than the Mainstream Australian English speakers. In terms of regional difference, AAE speakers from both locations had very similar vowel acoustics (i.e., there was limited regional variation between MI and WN), but overall, the WN speakers had somewhat more peripheral vowels than the MI speakers. We also saw that for AAE speakers, gender differences occurred and were primarily concentrated in F2. For the dynamic analyses, we focused on movement across the duration of the vowel and saw a relatively large amount of trajectory movement for AAE speakers, and especially in the area around 65–80% of the vowel’s length. Some small age, location, and gender differences were also described for AAE. Significant varietal differences between AAE and MAE were observed and were largely concentrated in the F1 parametric effects.

When considering the findings overall, we can speak to a number of matters referred to in the introduction. The first is whether L1 Aboriginal English is potentially more conservative than the Mainstream variety, having undergone less change. This was noted by Butcher and Anderson (2008) for L2 speakers, and was also observed in perception (i.e., earlier category crossovers) for AAE listeners who form the speaker groups for this study (i.e., Loakes et al. 2024a, 2024b). While we found that the vowel spaces of AAE and MAE in this study were similar in shape, which is not surprising given that all participants are speakers of English who were born and still living in the same regions, the findings also nevertheless align with previous research showing that the L1 AAE speakers produce phonetically less open and more retracted vowels than the MAE speakers (i.e., the AAE vowel space is less peripheral). This is especially evident in Figure 5 and Figure 6 and was borne out in the statistical results for the KIT, DRESS, and TRAP vowels, which are the vowels often implicated in sound changes in Australian English (c.f. Mannell 2004; Cox and Palethorpe 2008; Loakes et al. 2024a, 2024b). MAE speakers also had larger ellipses, which we hypothesis would be due to greater differences between older and younger speakers, reflecting the rapid change occurring in the MAE vowel system. While we did not specifically test for age in the MAE group, this would align with other research on age-graded variation in the mainstream variety (i.e., Cox and Palethorpe 2008 for production; Loakes et al. 2024b for perception).

Looking at the results more closely, while we did not see great amounts of difference between the two regions within AAE, we can also say that WN speakers have slightly less conservative vowels than the MI speakers, and this may be due to the proximity of WN to Melbourne. This finding about the Mildura speakers being the most conservative is also consistent with findings in a perception study (Loakes et al. 2024a) that showed more limited perceptual variability among the older and younger Mildura speakers, compared to findings for WN speakers (Loakes et al. 2024b). In that study, the issue of geographic isolation for Mildura was noted as a likely reason for less rapid change there. Additionally, in the current study we found younger speakers in AAE were exhibiting more open vowels than older speakers, moving towards the changes observed in MAE, but at a less rapid pace.

In terms of the variables that were important for driving speaker behaviour in the AAE data, age, gender, and region all played some role in the distribution of patterns. As mentioned, regional variability was relatively limited. Gender differences were evident, and age differences were more noticeable in Mildura as compared to Warrnambool, and there were also various interactions as described. Previous research has talked about Aboriginal Englishes plural (i.e., Eades 2013), and in the communities studied here that plurality has been seen previously in the realisation of voice quality (Loakes and Gregory 2022) and in more fine-grained differences in production of /t/ (Loakes et al. 2022). Variation within AAE is known to be generally greater than for MAE (e.g., Mailhammer 2021; Butcher 2008; Loakes et al. 2022), although that was not specifically observed for the vowels in this study. In the current study, regional differences are very limited and do not support the notion of different varieties of AAE based on the vowel system alone. This highlights the importance of focusing on various acoustic (and other linguistic) measures in the description of a language variety. Given what we know from our previous work on voice quality (Loakes and Gregory 2022), an interesting area for further research would be to triangulate formant, duration, and voice quality measurements in descriptive and sound change research, to consider overall perceptual effects for listeners.

5. Conclusions

Australian Aboriginal English is known to be a distinct variety of Australian English, with structurally different linguistic features compared to Mainstream Australian English (Malcolm 2013; Eades 2013; Mailhammer 2021) as well as socially conditioned differences (Louro and Collard 2021; Loakes and Gregory 2022; Loakes et al. 2022, 2024a, 2024b). As far as phonetic analyses of vowels are concerned, previous research has included an analysis of static measures in F1/F2 (Butcher and Anderson 2008; Loakes et al. 2016) and static measures of voice quality (Loakes and Gregory 2022), and we build on that knowledge in the current research by bringing in more data points (duration and dynamic F1/F2 analyses). The results in the current paper thus more comprehensively describe how the acoustics of vowels in Aboriginal English align with or are different from the mainstream variety along various acoustic dimensions.

The study has highlighted varietal differences among Mainstream and Aboriginal Australian Englishes spoken in Victoria where the acoustics of vowels are concerned, in particular for formant behaviour as well as showing some sociophonetic variability in terms of gender, regional, and aged-based variability across the samples. This study therefore gives a more precise understanding of how AAE is uniquely different from MAE, even in this clearly acrolectal L1 variety. Having pointed out differences, it is also important to note that another of our findings was that similarities exist as well in vowels produced by AAE and MAE speakers, especially in duration and in the shapes of the overall vowel space and vowel trajectories. This speaks to the fact that both groups are using L1 versions of Australian English that have the same phonologies, but which have experienced different rates of change and internal phonetic variability.

Now that we have an analysis of vowel quality in controlled speech, future work will focus on spontaneous conversational speech by AAE speakers to determine whether and how the patterns observed here are used by speakers in more interactive contexts. Work has begun on the spontaneous speech with respect to how AAE and MAE speakers use disfluencies, with some small but significant differences between the varieties in the way that speakers use unfilled pauses (McDougall et al. forthcoming) and filled pauses (Blackwell and McDougall forthcoming). Given previous phonetic work on these communities, and impressionistic observations, we predict further small but significant differences in vowel production in spontaneous speech in the AAE and MAE groups, but this is still to be empirically tested.

Author Contributions

Conceptualization, D.L. and A.G.; methodology, D.L. and A.G.; software, A.G.; validation, D.L. and A.G.; formal analysis, A.G.; investigation, D.L. and A.G.; resources, D.L. and A.G.; data curation, A.G.; writing—original draft preparation, D.L. and A.G.; writing—review and editing, D.L. and A.G.; visualization, A.G.; supervision, D.L.; project administration, D.L.; funding acquisition, D.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by The ARC Centre of Excellence for the Dynamics of Language (Project ID: CE140100041).

Institutional Review Board Statement

Data collection for this project received ethics approval from the University of Melbourne (HREC number 1544298.3).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Results will be made available by contacting the first author. Voice data is not available for re-analysis due to ethical considerations, but data can be shared in its synthesised form (measurements).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Summary table of mean measures (duration (ms), F1 (Hz), and F2 (Hz)).

Vowel	Dialect	Location	Gender	Age	Duration (ms)	Mean F1 (Hz)	Mean F2 (Hz)
KIT	AAE	MI	F	<40	112	379.8	2691.1
KIT	AAE	MI	F	>40	85	339.8	2836.7
KIT	AAE	MI	M	<40	97	301.1	2426.1
KIT	AAE	MI	M	>40	-	-	-
KIT	AAE	WN	F	<40	112	330.9	2634.3
KIT	AAE	WN	F	>40	110	377.2	2549
KIT	AAE	WN	M	<40	114	252.8	2271.5
KIT	AAE	WN	M	>40	102	319.4	2546.1
KIT	MAE	MI	F	<40	78	363.6	2793.4
KIT	MAE	MI	F	>40	99	350	2730.6
KIT	MAE	MI	M	<40	101	374.6	2457.1
KIT	MAE	MI	M	>40	107	328.8	2661.8
KIT	MAE	WN	F	<40	124	311.7	2793.5
KIT	MAE	WN	F	>40	146	278.6	2724.1
KIT	MAE	WN	M	<40	119	263.5	2486.5
KIT	MAE	WN	M	>40	110	292.1	2455.9

DRESS	AAE	MI	F	<40	125	619	2444.3
DRESS	AAE	MI	F	>40	102	654.6	2726.5
DRESS	AAE	MI	M	<40	104	657.1	2309.2
DRESS	AAE	MI	M	>40	-	-	-
DRESS	AAE	WN	F	<40	115	707.3	2485.5
DRESS	AAE	WN	F	>40	115	606.5	2474.4
DRESS	AAE	WN	M	<40	120	1040	2395.7
DRESS	AAE	WN	M	>40	108	471.1	2220.7
DRESS	MAE	MI	F	<40	93	757.3	2654.2
DRESS	MAE	MI	F	>40	120	513.8	2631.3
DRESS	MAE	MI	M	<40	120	655.1	2365.9
DRESS	MAE	MI	M	>40	128	1402	2843.5
DRESS	MAE	WN	F	<40	146	662.2	2627.2
DRESS	MAE	WN	F	>40	134	966.3	2832.8
DRESS	MAE	WN	M	<40	135	933.9	2528.1
DRESS	MAE	WN	M	>40	121	1377	2935.5

TRAP	AAE	MI	F	<40	132	995.8	2063
TRAP	AAE	MI	F	>40	126	810.7	2501.7
TRAP	AAE	MI	M	<40	118	858.2	1936.9
TRAP	AAE	MI	M	>40	-	-	-
TRAP	AAE	WN	F	<40	126	954	1994
TRAP	AAE	WN	F	>40	134	872.1	2132.8
TRAP	AAE	WN	M	<40	135	985.2	1983.8
TRAP	AAE	WN	M	>40	119	841.6	1852.1
TRAP	MAE	MI	F	<40	112	1127	2032.3
TRAP	MAE	MI	F	>40	131	893.3	2152.5
TRAP	MAE	MI	M	<40	141	1060	1895.6
TRAP	MAE	MI	M	>40	158	802.8	2218.8
TRAP	MAE	WN	F	<40	121	1088	2036.1
TRAP	MAE	WN	F	>40	150	1091	2332.2
TRAP	MAE	WN	M	<40	140	1192	2025.4
TRAP	MAE	WN	M	>40	145	1102	2241.3

STRUT	AAE	MI	F	<40	121	982	1673
STRUT	AAE	MI	F	>40	111	810	1728.2
STRUT	AAE	MI	M	<40	101	835	1722
STRUT	AAE	MI	M	>40	-	-	-
STRUT	AAE	WN	F	<40	120	980	1684
STRUT	AAE	WN	F	>40	112	970	1866
STRUT	AAE	WN	M	<40	125	1021	1614
STRUT	AAE	WN	M	>40	129	920	1899
STRUT	MAE	MI	F	<40	86	1147	1678
STRUT	MAE	MI	F	>40	107	980	1597
STRUT	MAE	MI	M	<40	96	990	1636
STRUT	MAE	MI	M	>40	114	1046	1926
STRUT	MAE	WN	F	<40	113	1296	2090
STRUT	MAE	WN	F	>40	141	1206	1757
STRUT	MAE	WN	M	<40	130	1169	1669
STRUT	MAE	WN	M	>40	120	1163	1632

LOT	AAE	MI	F	<40	144	766	1138
LOT	AAE	MI	F	>40	103	770	1124
LOT	AAE	MI	M	<40	104	685	986
LOT	AAE	MI	M	>40	-	-	-
LOT	AAE	WN	F	<40	120	777	1047
LOT	AAE	WN	F	>40	128	806	1190
LOT	AAE	WN	M	<40	124	747	1178
LOT	AAE	WN	M	>40	145	755	1143
LOT	MAE	MI	F	<40	98	901	1117
LOT	MAE	MI	F	>40	108	765	1150
LOT	MAE	MI	M	<40	120	859	1200
LOT	MAE	MI	M	>40	117	889	1239
LOT	MAE	WN	F	<40	129	881	1196
LOT	MAE	WN	F	>40	136	896	1212
LOT	MAE	WN	M	<40	131	809	1167
LOT	MAE	WN	M	>40	119	698	1046

GOOSE	AAE	MI	F	<40	146	375	2113.7
GOOSE	AAE	MI	F	>40	135	398.2	2227.2
GOOSE	AAE	MI	M	<40	140	373	1879.5
GOOSE	AAE	MI	M	>40	-	-	-
GOOSE	AAE	WN	F	<40	158	312.5	2077.7
GOOSE	AAE	WN	F	>40	165	410.2	2131.8
GOOSE	AAE	WN	M	<40	140	353.5	1870.8
GOOSE	AAE	WN	M	>40	160	320	1909
GOOSE	MAE	MI	F	<40	139	351.5	2138.4
GOOSE	MAE	MI	F	>40	150	354.6	2128
GOOSE	MAE	MI	M	<40	138	293.5	1795.8
GOOSE	MAE	MI	M	>40	131	453.2	1660.5
GOOSE	MAE	WN	F	<40	174	275.2	2207.7
GOOSE	MAE	WN	F	>40	168	232.3	2221.5
GOOSE	MAE	WN	M	<40	149	295.6	1956.1
GOOSE	MAE	WN	M	>40	158	269.3	1916.5

Table A2. Summary table of AAE LMER results for duration using the formula duration~location * age * gender * vowel + (1|person).

Parameter	Coefficient	t	df	p
(Intercept)	114.85	10.96	28	<0.001
location [WN]	−5.24	−0.40	28	0.69
gender [M]	−16.78	−1.07	27	0.28
labslexical [DRESS]	4.55	1.15	1210	0.25
labslexical [TRAP]	14.92	4.00	1211	<0.001
labslexical [STRUT]	7.38	1.82	1211	0.07
labslexical [LOT]	26.57	6.51	1211	<0.001
labslexical [GOOSE]	44.96	10.00	1210	<0.001
age [>40]	−14.82	−0.90	32	0.37
location [WN] × gender [M]	19.97	1.00	27	0.32
location [WN] × labslexical [DRESS]	0.85	0.17	1210	0.86
location [WN] × labslexical [TRAP]	1.01	0.23	1210	0.82
location [WN] × labslexical [STRUT]	−0.68	−0.13	1211	0.89
location [WN] × labslexical [LOT]	−19.37	−3.73	1210	<0.001
location [WN] × labslexical [GOOSE]	1.57	0.29	1210	0.77
gender [M] × labslexical [DRESS]	0.91	0.16	1210	0.88
gender [M] × labslexical [TRAP]	5.70	1.01	1210	0.31
gender [M] × labslexical [STRUT]	−4.05	−0.70	1210	0.49
gender [M] × labslexical [LOT]	−20.29	−3.49	1210	<0.001
gender [M] × labslexical [GOOSE]	−3.50	−0.57	1210	0.57
location [WN] × age [>40]	14.95	0.66	30	0.51
gender [M] × age [>40]	−1.76	−0.07	28	0.94
labslexical [DRESS] × age [>40]	13.20	1.51	1211	0.13
labslexical [TRAP] × age [>40]	9.23	0.98	1168	0.33
labslexical [STRUT] × age [>40]	−22.43	−1.61	1040	0.11
labslexical [LOT] × age [>40]	−0.26	−0.03	1226	0.98
labslexical [GOOSE] × age [>40]	12.89	1.35	1224	0.18
(location [WN] × gender [M]) × labslexical [DRESS]	0.90	0.12	1210	0.90
(location [WN] × gender [M]) × labslexical [TRAP]	0.20	0.03	1210	0.98
(location [WN] × gender [M]) × labslexical [STRUT]	9.27	1.22	1210	0.22
(location [WN] × gender [M]) × labslexical [LOT]	23.71	3.11	1210	<0.01
(location [WN] × gender [M]) × labslexical [GOOSE]	−16.38	−2.10	1210	<0.05
(location [WN] × labslexical [DRESS]) × age [>40]	−12.38	−1.21	1230	0.23
(location [WN] × labslexical [TRAP]) × age [>40]	−0.60	−0.06	1200	0.95
(location [WN] × labslexical [STRUT]) × age [>40]	18.13	1.21	1098	0.23
(location [WN] × labslexical [LOT]) × age [>40]	9.62	0.90	1221	0.37
(location [WN] × labslexical [GOOSE]) × age [>40]	−4.46	−0.40	1211	0.69
(gender [M] × labslexical [DRESS]) × age [>40]	−6.90	−0.71	1211	0.48
(gender [M] × labslexical [TRAP]) × age [>40]	−17.40	−2.16	1210	<0.05
(gender [M] × labslexical [STRUT]) × age [>40]	−1.51	−0.10	1219	0.92
(gender [M] × labslexical [LOT]) × age [>40]	1.82	0.13	1221	0.90
(gender [M] × labslexical [GOOSE]) × age [>40]	2.30	0.15	1219	0.88

Table A3. Summary table of combined LMER results for duration using the formula duration~dialect * gender * vowel + (1|person).

Parameter	Coefficient	t	df	p
(Intercept)	109.34	19.81	62	<0.001
dialect [MAE]	2.23	0.27	62	0.79
labslexical [DRESS]	6.03	2.81	2101	<0.01
labslexical [TRAP]	18.46	9.96	2093	<0.001
labslexical [STRUT]	5.32	2.29	2095	<0.01
labslexica [LOT]	15.88	6.85	2090	<0.001
labslexical [GOOSE]	48.75	21.24	2089	<0.001
gender [M]	−1.67	−0.18	62	0.86
dialect [MAE] × labslexical [DRESS]	10.83	3.14	2092	<0.01
dialect [MAE] × labslexical [TRAP]	5.87	1.91	2090	0.06
dialect [MAE] × labslexical [STRUT]	−1.46	−0.42	2090	0.67
dialect [MAE] × labslexical [LOT]	−8.86	−2.53	2088	<0.05
dialect [MAE] × labslexical [GOOSE]	−2.58	−0.74	2088	0.46
dialect [MAE] × gender [M]	1.56	0.12	62	0.91
labslexical [lDRESS] × gender [M]	−0.31	−0.09	2093	0.93
labslexical [TRAP] × gender [M]	1.64	0.53	2091	0.59
labslexical [STRUT] × gender [M]	2.31	0.62	2090	0.54
labslexical [LOT] × gender [M]	−6.91	−1.85	2089	0.06
labslexical [GOOSE] x gender [M]	−15.77	−4.20	2088	<0.001
dialect [MAE] × labslexical [DRESS] × gender [M]	0.36	0.07	2089	0.95
dialect [MAE] × labslexical [TRAP] × gender [M]	5.28	1.08	2089	0.28
dialect [MAE] × labslexical [STRUT] × gender [M]	0.40	0.07	2088	0.94
dialect [MAE] × labslexical [LOT] × gender [M]	13.83	2.47	2087	<0.05
dialect [MAE] × labslexical [GOOSE] × gender [M]	3.40	0.61	2087	0.54

Table A4. Summary table of AAAE LMER results for (a) F1 and (b) F2 using the formula: formant~location * age * gender * vowel + (1|person).

Parameter	Coefficient	t	df	p
4a—F1
(Intercept)	371.13	8.24	38	<0.001
location [WN]	−35.81	−0.65	37	0.52
gender [M]	−70.11	−1.07	35	0.29
labslexical [DRESS]	232.93	4.41	1215	<0.001
labslexical [TRAP]	623.78	12.50	1222	<0.001
labslexical [STRUT]	609.33	11.24	1225	<0.001
labslexical [LOT]	393.57	7.20	1216	<0.001
labslexical [GOOSE]	−2.58	−0.04	1216	0.97
age [>40]	−92.70	−1.09	66	0.27
location [WN] × gender [M]	−28.73	−0.34	36	0.73
location [WN] × labslexical [DRESS]	141.91	2.17	1211	<0.05
location [WN] × labslexical [TRAP]	−6.23	−0.11	1217	0.92
location [WN] × labslexical [STRUT]	57.23	0.83	1222	0.41
location [WN] × labslexical [LOT]	70.92	1.02	1216	0.31
location [WN] × labslexical [GOOSE]	−5.51	−0.08	1214	0.94
gender [M] × labslexical [DRESS]	123.14	1.59	1207	0.11
gender [M] × labslexical [TRAP]	−67.00	−0.89	1210	0.37
gender [M] × labslexical [STRUT]	−75.43	−0.97	1213	0.33
gender [M] × labslexical [LOT]	−9.23	−0.12	1208	0.91
gender [M] × labslexical [GOOSE]	76.27	0.92	1209	0.36
location [WN] × age [>40]	135.78	1.28	49	0.20
gender [M] × age [>40]	111.07	1.00	45	0.32
labslexical [DRESS] × age [>40]	148.00	1.41	795	0.16
labslexical [TRAP] × age [>40]	−135.16	−1.23	702	0.22
labslexical [STRUT] × age [>40]	−84.99	−0.58	278	0.56
labslexical [LOT] × age [>40]	17.94	0.15	1231	0.88
labslexical [GOOSE] × age [>40]	42.25	0.34	1234	0.74
(location [WN] × gender [M]) × labslexical [DRESS]	305.70	3.09	1211	<0.01
(location [WN] × gender [M]) × labslexical [TRAP]	198.97	2.18	1212	<0.05
(location [WN] × gender [M]) × labslexical [STRUT]	172.75	1.70	1211	0.09
(location [WN] × gender [M]) × labslexical [LOT]	34.78	0.34	1207	0.73
(location [WN] × gender [M]) × labslexical [GOOSE]	27.76	0.26	1207	0.79
(location [WN] × labslexical [DRESS]) × age [>40]	−292.59	−2.31	1020	<0.05
(location [WN] × labslexical [TRAP]) × age [>40]	12.56	0.10	879	0.92
(location [WN] × labslexical [STRUT]) × age [>40]	11.32	0.07	399	0.95
(location [WN] × labslexical [LOT]) × age [>40]	−55.35	−0.39	1235	0.69
(location [WN] × labslexical [GOOSE]) × age [>40]	−1.33	−0.01	1234	0.99
(gender [M] × labslexical [DRESS]) × age [>40]	−545.34	−4.19	1217	<0.001
(gender [M] × labslexical [TRAP]) × age [>40]	−128.71	−1.19	1211	0.23
(gender [M] × labslexical [STRUT]) × age [>40]	−261.10	−1.30	1198	0.20
(gender [M] × labslexical [LOT]) × age [>40]	−188.03	−1.05	1152	0.30
(gender [M] × labslexical [GOOSE]) × age [>40]	−307.25	−1.53	1198	0.13
4b—F2
(Intercept)	2676.01	49.19	48	<0.001
location [WN]	−35.08	−0.52	47	0.60
gender [M]	−242.94	−3.06	44	<0.01
labslexical [DRESS]	−247.03	−4.07	1219	<0.001
labslexical [TRAP]	−612.19	−10.69	1224	<0.001
labslexical [STRUT]	−1007.33	−16.20	1226	<0.001
labslexical [LOT]	−1536.79	−24.52	1220	<0.001
labslexical [GOOSE]	−585.81	−8.48	1220	<0.001
age [>40]	141.64	1.40	83	0.16
location [WN] × gender [M]	−132.11	−1.30	46	0.19
location [WN] × labslexical [DRESS]	89.74	1.20	1217	0.23
location [WN] × labslexical [TRAP]	−33.45	−0.49	1221	0.62
location [WN] × labslexical [STRUT]	57.09	0.72	1224	0.47
location [WN] × labslexical [LOT]	−50.94	−0.64	1220	0.52
location [WN] × labslexical [GOOSE]	30.57	0.37	1219	0.71
gender [M] × labslexical [DRESS]	123.12	1.39	1214	0.17
gender [M] × labslexical [TRAP]	126.74	1.47	1216	0.14
gender [M] × labslexical [STRUT]	303.71	3.41	1218	<0.001
gender [M] × labslexical [LOT]	96.73	1.08	1215	0.28
gender [M] × labslexical [GOOSE]	37.55	0.40	1215	0.69
location [WN] × age [>40]	−234.30	−1.85	62	0.07
gender [M] × age [>40]	428.98	3.20	56	<0.01
labslexical [DRESS] × age [>40]	192.81	1.59	869	0.11
labslexical [TRAP] × age [>40]	274.38	2.15	775	<0.05
labslexical [STRUT] × age [>40]	−117.85	−0.69	352	0.49
labslexical [LOT] × age [>40]	−164.58	−1.22	1233	0.22
labslexical [GOOSE] × age [>40]	−12.68	−0.09	1234	0.93
(location [WN] × gender [M]) × labslexical [DRESS]	165.07	1.45	1217	0.15
(location [WN] × gender [M]) × labslexical [TRAP]	232.06	2.21	1218	<0.05
(location [WN] × gender [M]) × labslexical [STRUT]	−15.58	−0.13	1217	0.89
(location [WN] × gender [M]) × labslexical [LOT]	392.85	3.35	1217	<0.001
(location [WN] × gender [M]) × labslexical [GOOSE]	113.49	0.94	1214	0.35
(location [WN] × labslexical [DRESS]) × age [>40]	−109.07	−0.75	1061	0.45
(location [WN] × labslexical [TRAP]) × age [>40]	−44.93	−0.31	935	0.76
(location [WN] × labslexical [STRUT]) × age [>40]	385.60	2.01	484	<0.05
(location [WN] × labslexical [LOT]) × age [>40]	392.97	2.44	1235	<0.05
(location [WN] × labslexical [GOOSE]) × age [>40]	150.49	0.90	1234	0.37
(gender [M] × labslexical [DRESS]) × age [>40]	−569.99	−3.82	1221	<0.001
(gender [M] × labslexical [TRAP]) × age [>40]	−655.48	−5.29	1217	<0.001
(gender [M] × labslexical [STRUT]) × age [>40]	−387.45	−1.67	1217	0.09
(gender [M] × labslexical [LOT]) × age [>40]	−668.39	−3.23	1191	<0.01
(gender [M] × labslexical [GOOSE]) × age [>40]	−505.42	−2.18	1217	<0.05

Table A5. Summary table of combined LMER results for (a) F1 and (b) F2 using the formula: formant~dialect * gender * vowel + (1|person).

Parameter	Coefficient	t	df	p
5a—F1
(Intercept)	346.16	15.84	152	<0.001
dialect [MAE]	−3.06	−0.09	171	0.93
gender [M]	−66.34	−1.85	154	0.07
labslexical [DRESS]	309.98	10.76	2132	<0.001
labslexical [TRAP]	585.17	23.38	2121	<0.001
labslexical [STRUT]	628.75	20.14	2138	<0.001
labslexical [LOT]	433.94	13.83	2115	<0.001
labslexical [GOOSE]	4.61	0.15	2107	0.88
dialect [MAE] × gender [M]	43.91	0.81	180	0.42
dialect [MAE] × labslexical [DRESS]	5.60	0.12	2111	0.90
dialect [MAE] × labslexical [TRAP]	79.49	1.91	2114	0.06
dialect [MAE] × labslexical [STRUT]	119.99	2.55	2116	<0.05
dialect [MAE] × labslexical [LOT]	52.03	1.09	2100	0.27
dialect [MAE] × labslexical [GOOSE]	−15.99	−0.34	2096	0.73
gender [M] × labslexical [DRESS]	241.66	5.09	2119	<0.001
gender [M] × labslexical [TRAP]	66.97	1.61	2126	0.11
gender [M] × labslexical [STRUT]	15.71	0.31	2120	0.76
gender [M] × labslexical [LOT]	−5.74	−0.11	2110	0.91
gender [M] × labslexical [GOOSE]	63.43	1.25	2105	0.21
dialect [MAE] × gender [M] × labslexical [DRESS]	69.05	0.93	2102	0.35
dialect [MAE] × gender [M] × labslexical [TRAP]	59.09	0.90	2114	0.37
dialect [MAE] × gender [M] × labslexical [STRUT]	18.95	0.25	2109	0.81
dialect [MAE] × gender [M] × labslexical [LOT]	26.69	0.35	2096	0.73
dialect [MAE] × gender [M] × labslexical [GOOSE]	−57.71	−0.76	2094	0.45
5b—F2
(Intercept)	2657.27	102.19	123	<0.001
dialect [MAE]	98.97	2.47	132	<0.05
gender [M]	−287.31	−6.72	124	<0.001
labslexical [DRESS]	−152.41	−4.98	2132	<0.001
labslexical [TRAP]	−579.17	−21.80	2118	<0.001
labslexical [STRUT]	−919.80	−27.72	2133	<0.001
labslexical [LOT]	−1538.40	−46.21	2111	<0.001
labslexical [GOOSE]	−541.31	−16.41	2104	<0.001
dialect [MAE] × gender [M]	29.45	0.46	139	0.64
dialect [MAE] × labslexical [DRESS]	77.38	1.57	2110	0.12
dialect [MAE] × labslexical [TRAP]	−4.85	−0.11	2109	0.91
dialect [MAE] × labslexical [STRUT]	−130.85	−2.62	2113	<0.01
dialect [MAE] × labslexical [LOT]	−45.32	−0.90	2098	0.37
dialect [MAE] × labslexical [GOOSE]	−47.52	−0.95	2095	0.34
gender [M] × labslexical [DRESS]	127.97	2.54	2117	<0.05
gender [M] × labslexical [TRAP]	165.13	3.74	2120	<0.001
gender [M] × labslexical [STRUT]	219.67	4.09	2115	<0.001
gender [M] × labslexical [LOT]	261.32	4.88	2105	<0.001
gender [M] × labslexical [GOOSE]	46.28	0.86	2102	0.39
(dialect [MAE] × gender [M]) × labslexical [DRESS]	11.13	0.14	2101	0.89
(dialect [MAE] × gender [M]) × labslexical [TRAP]	−34.42	−0.49	2109	0.62
(dialect [MAE] × gender [M]) × labslexical [STRUT]	23.91	0.29	2104	0.77
(dialect [MAE] × gender [M]) × labslexical [LOT]	2.18	0.03	2095	0.98
(dialect [MAE] × gender [M]) × labslexical [GOOSE]	−96.88	−1.20	2093	0.23

Table A6. Summary table of AAE GAMM results for location.

Vowel	Term	F1				F2
Vowel	Term	Estimate/edf	Std. Error/Ref.df	t/F	p	Estimate/edf	Std. Error/Ref.df	t/F	p
KIT	Intercept	421.88	11.22	37.6	<0.001	2563	58	44.191	<0.001
	location[WN]	−13.05	13.96	−0.935	0.35	−107.2	72.2	−1.484	0.138
	s(times_norm)	1.089	1.11	2.605	0.101	7.871	8.527	8.728	<0.001
	s(times_norm): WN	8.092	8.554	6.892	<0.001	156.611	306	49.42	<0.001
	s(times_norm,person)	179.19	306	4.569	<0.001	110.962	296	20.428	<0.001
	Intercept	607.37	17.2	35.321	<0.001	271.04	51.42	44.168	<0.001
	location[WN]	−32.92	21.08	−1.562	0.118	−64.46	62.98	−1.023	0.306
	s(times_norm)	1.026	1.033	0.001	0.996	7.837	8.565	6.141	<0.001
	s(times_norm): WN	4.675	5.563	2.426	<0.05	6.804	7.732	2.788	<0.001
	s(times_norm,person)	136.976	296	4.411	<0.001	110.962	296	20.428	<0.001
TRAP	Intercept	792.809	22.944	34.554	<0.001	1981.57	41.45	47.811	<0.001
	location[WN]	3.022	28.53	0.106	0.916	−39.23	51.55	−0.761	0.447
	s(times_norm)	2.099	2.4	1.646	0.1577	7.655	8.522	10.06	<0.001
	s(times_norm:WN)	1.004	1.006	1.92	0.165	102.084	306	13.91	<0.001
	s(times_norm,person)	142.349	306	12.179	<0.001	102.084	306	13.91	<0.001
STRUT	Intercept	825.19	28.07	29.396	<0.001	1698.61	58.32	29.128	<0.001
	locationWN	62.1	35.81	1.734	0.083	−91.4	74.38	−1.229	0.219
	s(times_norm)	2.283	2.664	0.726	0.492	7.296	8.286	7.656	<0.001
	s(times_norm):WN	4.389	5.272	0.818	0.417	1.308	1.444	0.629	0.599
	s(times_norm,person)	104.32	256	12.501	<0.001	71.099	256	13.196	<0.001
LOT	Intercept	764.22	15.003	50.939	<0.001	1301.5	33.68	38.644	<0.001
	locationWN	7.935	19.127	0.415	0.678	−21.33	42.94	−0.497	0.619
	s(times_norm): WN	1.025	1.034	0.141	0.719	8.353	8.833	11.765	<0.001
	s(times_norm)	1.055	1.076	0.046	0.905	1.053	1.069	0.908	0.336
	s(times_norm,person)	96.202	256	6.67	<0.001	114.962	256	4.778	<0.001
GOOSE	Intercept	435.51	16.2	26.883	<0.001	2084.917	43.655	47.758	<0.001
	locationWN	−15.4	19.68	−0.782	0.434	3.121	52.955	0.059	0.953
	s(times_norm)	7.019	8.053	5.213	<0.001	6.872	7.942	8.421	<0.001
	s(times_norm): WN	1.004	1.005	0.02	0.895	1.034	1.047	0.201	0.687
	s(times_norm,person)	84.362	246	2.074	<0.001	81.869	246	11.613	<0.001

Table A7. Summary table of AAE GAMM results for gender.

Vowel	Term	F1				F2
Vowel	Term	Estimate/edf	Std. Error/Ref.df	t/F	p	Estimate/edf	Std. Error/Ref.df	t/F	p
KIT	(Intercept)	422.48	8.13	51.968	<0.001	2600.89	32.77	79.36	<0.001
	genderM	−23.92	13.19	−1.814	0.0698	−276.68	52.7	−5.25	<0.001
	s(times_norm)	1.054	1.068	0.776	0.381	5.506	6.412	6.128	<0.001
	s(times_norm): M	8.349	8.747	11.779	<0.001	7.613	8.313	4.212	<0.001
	s(times_norm,person)	168.852	306	4.063	<0.001	148.915	306	26.165	<0.001
DRESS	(Intercept)	605.95	11.91	50.898	<0.001	2308.51	30.71	75.173	<0.001
	genderM	−51.38	18.87	−2.723	<0.01	−201.18	48.57	−4.142	<0.001
	s(times_norm)	1.998	2.296	0.016	0.998	7.142	8.132	8.663	<0.001
	s(times_norm): M	7.881	8.593	7.125	<0.001	1.531	1.713	0.166	0.821
	s(times_norm,person)	131.445	296	3.963	<0.001	113.1	296	13.333	<0.001
TRAP	(Intercept)	813.93	16.44	49.499	<0.001	1984.65	30.62	64.815	<0.001
	genderM	−49.54	26.44	−1.874	0.061	−73.51	49.23	−1.493	0.135
	s(times_norm)	4.433	5.268	2.831	<0.05	7.654	8.521	10.09	<0.001
	s(times_norm): M	3.74	4.452	1.041	0.4772	1.065	1.091	5.351	0.019
	s(times_norm,person)	142.442	306	10.949	<0.001	101.772	306	13.087	<0.001
STRUT	(Intercept)	897.84	20.68	43.417	<0.001	1646.01	47.55	34.615	<0.001
	genderM	−89.56	33.33	−2.687	<0.01	−9.32	76.64	−0.122	0.903
	s(times_norm)	2.211	2.597	0.525	0.5921	7.314	8.303	9.033	<0.001
	s(times_norm): M	1.01	1.013	3.249	0.0715	1.015	1.022	0.926	0.338
	s(times_norm,person)	106.324	256	10.423	<0.001	71.393	256	13.771	<0.001
LOT	(Intercept)	801.437	8.776	91.317	<0.001	1315.42	25.74	51.105	<0.001
	genderM	−84.087	14.145	−5.945	<0.001	−70.29	41.49	−1.694	0.090
	s(times_norm)	1.008	1.011	1.023	0.311	8.354	8.834	12.613	<0.001
	s(times_norm): M	1.014	1.019	0.01	0.954	1.049	1.064	0.303	0.588
	s(times_norm,person)	94.698	256	3.703	<0.001	114.981	256	4.571	<0.001
GOOSE	(Intercept)	433.4	11.6	37.363	<0.001	2174.6	21.34	101.889	<0.001
	genderM	−21.03	18.46	−1.139	0.255	−219.81	33.81	−6.501	<0.001
	s(times_norm)	7.018	8.052	5.291	<0.001	6.874	7.944	9.794	<0.001
	s(times_norm): M	1.033	1.045	0.655	0.422	1.007	1.01	2.469	0.115
	s(times_norm,person)	84.204	246	2.034	<0.001	81.226	246	5.713	<0.001

Table A8. Summary table of AAE GAMM results for age.

Vowel	Term	F1				F2
Vowel	Term	Estimate/edf	Std. Error/Ref.df	t/F	p	Estimate/edf	Std. Error/Ref.df	t/F	p
KIT	Parametric (Intercept)	414.232	8.146	50.851	<0.001	2525.56	41.49	60.873	<0.001
	Parametric (age > 40)	1.945	11.52	0.169	0.866	81.78	58.67	1.394	0.16
	s(times_norm)	7.615	8.208	4.23	<0.001	7.876	8.529	13.427	<0.001
	s(times_norm): age > 40	1.013	1.016	0.625	0.429	1.261	1.336	1.707	0.21
	s(times_norm,person)	185.001	306	4.967	<0.001	156.154	306	48.472	<0.001
DRESS	Parametric (Intercept)	597.54	10.87	54.986	<0.001	2185.92	30.36	71.995	<0.001
	Parametric (age > 40)	−51.69	22.5	−2.297	<0.05	180.75	62.86	2.875	<0.01
	s(times_norm)	7.689	8.45	5.401	<0.001	7.164	8.155	10.357	<0.001
	s(times_norm): age > 40	2.932	3.469	0.494	0.595	1.009	1.012	3.897	<0.05
	s(times_norm,person)	137.095	296	4.423	<0.001	113.221	296	16.412	<0.001
TRAP	Parametric (Intercept)	806.01	14.86	54.234	<0.001	1912.21	22.46	85.121	<0.001
	Parametric (age > 40)	−49.82	31.29	−1.592	0.111	194.74	47.29	4.118	<0.001
	s(times_norm)	4.828	5.75	2.182	<0.05	7.655	8.522	10.102	<0.001
	s(times_norm): age > 40	1.031	1.041	0.275	0.6126	1.019	1.026	2.455	0.115
	s(times_norm,person)	145.778	306	11.404	<0.001	101.92	306	10.273	<0.001
STRUT	Parametric (Intercept)	855.3	20.23	42.273	<0.001	1621.76	40.37	40.17	<0.001
	Parametric (age>40)	41.98	46.15	0.909	0.363	107.49	92.09	1.167	0.243
	s(times_norm)	2.22	2.609	2.465	0.0625	7.314	8.303	9.046	<0.001
	s(times_norm): age>40	1.002	1.003	2.141	0.1434	1.007	1.01	0.793	0.374
	s(times_norm,person)	106.499	256	13.274	<0.001	71.411	256	13.219	<0.001
LOT	Parametric (Intercept)	760.314	9.799	77.592	<0.001	1286.29	23.36	55.068	<0.001
	Parametric (age > 40)	45.729	22.366	2.045	<0.05	10.9	53.34	0.204	0.838
	s(times_norm)	1.012	1.016	1.172	0.277	8.353	8.833	12.443	<0.001
	s(times_norm): age > 40	1.015	1.02	0.35	0.558	1.389	1.511	0.034	0.953
	s(times_norm,person)	96.122	256	5.975	<0.001	114.662	256	4.817	<0.001
GOOSE	Parametric (Intercept)	417.325	9.933	42.013	<0.001	2071.99	27.18	76.244	<0.001
	Parametric (age > 40)	39.902	22.509	1.773	0.0764	75.57	60.89	1.241	0.215
	s(times_norm)	7.019	8.053	5.804	<0.001	6.874	7.944	8.892	<0.001
	s(times_norm): age > 40	1.014	1.02	0.006	0.975	1.012	1.016	0.076	0.792
	s(times_norm,person)	84.09	246	1.997	<0.001	81.841	246	11.111	<0.001

Table A9. Summary table of combined GAMM results for dialect.

Vowel	Term	F1				F2
Vowel	Term	Estimate/edf	Std. Error/Ref.df	t/F	p	Estimate/edf	Std. Error/Ref.df	t/F	p
KIT	Intercept	413.418	7.597	54.421	<0.001	2493.84	36.32	68.671	<0.001
	dialectMAE	23.388	11.075	2.112	<0.05	96.41	52.72	1.829	0.0675
	s(times_norm)	8.23	8.68	9.538	<0.001	8.026	8.591	12.797	<0.001
	s(times_norm): MAE	1.217	1.274	0.474	0.572	4.515	5.353	1.192	0.421
	s(times_norm,person)	326.567	586	4.214	<0.001	282.495	586	41.625	<0.001
DRESS	Intercept	585.37	13.97	41.893	<0.001	2228.07	31.41	70.928	<0.001
	dialectMAE	19.71	20.23	0.975	0.33	143.8	45.25	3.178	<0.01
	s(times_norm)	8.119	8.732	7.321	<0.001	7.654	8.483	10.497	<0.001
	s(times_norm): MAE	1.035	1.046	0.078	0.813	1.029	1.039	0.216	0.645
	s(times_norm,person)	254.317	576	4.295	<0.001	241.277	576	17.541	<0.001
TRAP	Intercept	794.76	18.44	43.097	<0.001	1956.18	27.38	71.434	<0.001
	dialectMAE	56.83	26.79	2.121	<0.05	73.9	39.78	1.858	0.0632
	s(times_norm)	5.444	6.422	2.542	<0.05	7.64	8.504	13.664	<0.001
	s(times_norm): MAE	3.799	4.551	1.268	0.2213	1.077	1.106	0.911	0.313
	s(times_norm,person)	258.696	586	13.331	<0.001	213.721	586	14.717	<0.001
STRUT	Intercept	863.38	22.51	38.35	<0.001	1642.43	31.28	52.511	<0.001
	dialectMAE	121.65	31.55	3.855	<0.001	43.18	43.85	0.985	0.325
	s(times_norm)	2.531	2.985	1.68	0.166	7.599	8.493	9.888	<0.001
	s(times_norm): MAE	4.87	5.842	1.724	0.121	1.02	1.029	1.581	0.209
	s(times_norm,person)	216.987	526	15.647	<0.001	165.322	526	10.418	<0.001
LOT	Intercept	769.11	14.8	51.978	<0.001	1288.34	23.57	54.662	<0.001
	dialectMAE	81.36	20.57	3.954	<0.001	45.24	32.84	1.378	0.168
	s(times_norm)	1.818	2.106	1.358	0.2574	8.508	8.858	19.068	<0.001
	s(times_norm): MAE	1.005	1.007	3.476	0.0623	3.467	4.139	0.861	0.468
	s(times_norm,person)	194.954	536	8.877	<0.001	242.934	536	4.714	<0.001
GOOSE	Intercept	425.07	9.965	42.655	<0.001	2087.134	32.511	64.197	<0.001
	dialectMAE	−20.275	13.834	−1.466	0.143	7.259	44.772	0.162	0.871
	s(times_norm)	7.912	8.534	7.24	<0.001	7.999	8.656	12.19	<0.001
	s(times_norm): MAE	2.793	3.271	0.06	0.949	4.121	4.983	1.228	0.3
	s(times_norm,person)	257.879	526	3.101	<0.001	198.845	526	10.554	<0.001

References

Bates, Douglas, Martin Maechler, Ben Bolker, and Steve Walker. 2015. Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software 67: 1–48. [Google Scholar] [CrossRef]
Blackwell, Liz, and Kirsty McDougall. forthcoming. Sociophonetic variation of filled pauses in Victoria, Australia. Paper presented at the 19th Australasian International Conference on Speech Science and Technology, Melbourne, Australia, December 3–5; Melbourne: Australasian Speech Science and Technology Association.
Butcher, Andrew. 2008. Linguistic aspects of Australian Aboriginal English. Clinical Linguistics & Phonetics 22: 63542. [Google Scholar] [CrossRef]
Butcher, Andrew, and Victoria Anderson. 2008. The vowels of Australian Aboriginal English. Paper presented at the Interspeech 2008 Incorporating SST 2008, Brisbane, Australia, September 22–26; Edited by Janet Fletcher, Debbie Loakes, Roland Göcke, Denis Burnham and Michael Wagner. Bonn: ISCA, p. 34750. [Google Scholar]
Cox, Felicity. 1999. Vowel change in Australian English. Phonetica 56: 1–27. [Google Scholar] [CrossRef] [PubMed]
Cox, Felicity. 2006. The acoustic characteristics of /hVd/ vowels in the speech of some Australian teenagers. Australian Journal of Linguistics 26: 147–79. [Google Scholar] [CrossRef]
Cox, Felicity, and Janet Fletcher. 2017. Australian English: Pronunciation and Transcription, 2nd ed. Melbourne: Cambridge University Press. [Google Scholar]
Cox, Felicity, and Sallyanne Palethorpe. 2008. Reversal of short front vowel raising in Australian English. Paper presented at the Interspeech 2008 Incorporating SST 2008, Brisbane, Australia, September 22–26; Edited by Janet Fletcher, Debbie Loakes, Roland Göcke, Denis Burnham and Michael Wagner. Bonn: ISCA, p. 34245. [Google Scholar]
Cox, Felicity, and Sallyanne Palethorpe. 2014. Phonologisation of vowel duration and nasalised /æ/ in Australian English. Paper presented at the 15th Australasian International Conference on Speech Science and Technology, Christchurch, New Zealand, December 2–5; pp. 33–36. [Google Scholar]
Cox, Felicity, and Sallyanne Palethorpe. 2019. Vowel variation in a standard context across four major Australian cities. Paper presented at the 19th International Congress of Phonetic Sciences, Melbourne, Australia, August 5–9; Edited by Sasha Calhoun, Paola Escudero, Maria Tabain and Paul Warren. Melbourne: Australasian Speech Science and Technology Association Inc., p. 57781. [Google Scholar]
Cox, Felicity, Joshua Penney, and Sallyanne Palethorpe. 2024. Australian English Monophthong Change across 50 Years: Static versus Dynamic Measures. Languages 9: 99. [Google Scholar] [CrossRef]
Docherty, Gerard, Simón Gonzalez, and Nathaniel Mitchell. 2015. Static vs Dynamic Perspectives on the Realization of Vowel Nucleii in West Australian English. Paper presented at the 18th International Congress of Phonetic Sciences, Glasgow, UK, August 10–14; Edited by the Scottish Consortium for ICPhS. Glasgow: University of Glasgow. [Google Scholar]
Eades, Diana. 2013. Aboriginal Ways of Using English. Canberra: Aboriginal Studies Press. [Google Scholar]
Elvin, Jaydene, Daniel Williams, and Paola Escudero. 2016. Dynamic acoustic properties of monophthongs and diphthongs in Western Sydney Australian English. Journal of the Acoustical Society of America 140: 57681. [Google Scholar] [CrossRef] [PubMed]
Grama, James, Catherine E. Travis, and Simon González. 2019. Initiation, progression and conditioning of the short-front vowel shift in Australian English. Paper presented at the 19th International Congress of Phonetic Sciences (X1X), Melbourne, Australia, August 5–9; Edited by Sasha Calhoun, Paola Escudero, Marija Tabain and Paul Warren. Canberra: Australasian Speech Science and Technology Association Inc., pp. 1769–73. [Google Scholar]
Harrington, Jonathan, Felicity Cox, and Zoë Evans. 1997. An acoustic phonetic study of broad, general, and cultivated Australian English vowels. Australian Journal of Linguistics 17: 155–84. [Google Scholar] [CrossRef]
Jochim, Raphael Winkelmann Markus, Klaus Jaensch, Steve Cassidy, and Jonathan Harrington. 2023. emuR: Main Package of the EMU Speech Database Management System. R Package Version 2.4.2. Available online: https://CRAN.R-project.org/package=emuR (accessed on 15 July 2024).
King, Jeanette, Margaret Maclagan, Ray Harlow, Peter Keegan, and Catherine Watson. 2020. Prestige norms and sound change in Māori. Language Ecology 4: 95–114. [Google Scholar] [CrossRef]
Kisler, Thomas, Uwe Reichel, and Florian Schiel. 2017. Multilingual processing of speech via web services. Computer Speech & Language 45: 32647. [Google Scholar]
Lenth, Russell V. 2023. Emmeans: Estimated Marginal Means, aka Least-Squares Means. R Package Version 1.8.7. Available online: https://CRAN.R-project.org/package=emmeans (accessed on 15 July 2024).
Loakes, Debbie, and Adele Gregory. 2022. Voice quality in Australian English. Journal of the Acoustical Society of America—Express Letters 2: 085201. [Google Scholar] [CrossRef] [PubMed]
Loakes, Debbie, Janet Fletcher, and Josh Clothier. 2024a. One place, two speech communities: Differing responses to sound change in Mainstream and Aboriginal Australian English in a small rural town. In Speech Dynamics: Synchronic Variation and Diachronic Change. Edited by Felicitas Kleber and Tamara Rathcke. Berlin: De Gruyter Mouton. [Google Scholar]
Loakes, Debbie, Janet Fletcher, John Hajek, Josh Clothier, and Ben Volchok. 2016. Short vowels in L1 Aboriginal English spoken in Western Victoria. Paper presented at the 16th Australasian International Conference on Speech Science and Technology, Parramatta, Australia, December 6–9; Edited by Christopher Carignan and Michael Tyler. Melbourne: Australasian Speech Science and Technology Association, pp. 33–36. [Google Scholar]
Loakes, Debbie, Josh Clothier, John Hajek, and Janet Fletcher. 2024b. Sociophonetic variation in vowel categorization of Australian English. Language and Speech 67: 870–906. [Google Scholar] [CrossRef] [PubMed]
Loakes, Debbie, Kirsty McDougall, and Adele Gregory. 2022. Variation in /t/ in Aboriginal and Mainstream Australian Englishes. Paper presented at the Eighteenth Australasian International Conference on Speech Science and Technology, Canberra, Australia, December 13–16; Edited by Rosey Billington. Melbourne: Australasian Speech Science and Technology Association, pp. 61–65. [Google Scholar]
Louro, Celeste, and Glenys Collard. 2021. Australian Aboriginal English: Linguistic and sociolinguistic perspectives. Language and Linguistics Compass 15: e12415. [Google Scholar] [CrossRef]
Mailhammer, Robert. 2021. English on Croker Island: The Synchronic and Diachronic Dynamics of Contact and Variation. Berlin: De Gruyter Mouton. [Google Scholar]
Malcolm, Ian. 2013. Aboriginal English: Some grammatical features and their implications. Australian Review of Applied Linguistics 36: 26794. Available online: https://www.jbe-platform.com/content/journals/10.1075/aral.36.3.03mal (accessed on 15 July 2024). [CrossRef]
Mannell, Robert. 2004. Perceptual vowel space for Australian English lax vowels: 1998 and 2004. Paper presented at the 10th Australian International Conference on Speech Science and Technology, Melbourne, Australia, December 2–5; Edited by Steve Cassidy, Felicity Cox, Robert Mannell and Sallyanne Palethorpe. Melbourne: Australian Speech Science and Technology Association, p. 22126. [Google Scholar]
McDougall, Kirsty, Alice Paver, Martin Duckworth, Liz Blackwell, and Debbie Loakes. forthcoming. Patterns of silent pausing in Aboriginal and Mainstream Australian Englishes spoken in Warrnambool. Paper presented at the 19th International Congress of Phonetic Sciences, Melbourne, Australia, August 5–9; Melbourne: Australasian Speech Science and Technology Association.
Penney, Joshua, Felicity Cox, and Andy Gibson. 2023. Variation in FACE and FLEECE trajectories in Australian English adolescents according to community language diversity. Paper presented at the 20th International Congress of Phonetic Sciences,ICPhS 2023, Prague, Czech Republic, August 7–11; pp. 3547–51. [Google Scholar]
Penney, Joshua, Felicity Cox, Kelly Miles, and Sallyanne Palethorpe. 2018. Glottalisation as a cue to coda consonant voicing in Australian English. Journal of Phonetics 66: 16184. [Google Scholar] [CrossRef]
R Core Team. 2020. R: A Language and Environment for Statistical Computing. Vienna: Foundation for Statistical Computing. [Google Scholar]
van Rij, Jacolien, Martijn Wieling, R. Harald Baayen, and Hedderik van Rijn. 2020. itsadug: Interpreting Time Series and Autocorrelated Data Using Gamms (R package Version 2.4). Available online: https://cran.r-project.org/package=itsadug (accessed on 26 August 2024).
Watson, Catherine, and Jonathan Harrington. 1999. Acoustic evidence for dynamic formant trajectories in Australian English vowels. Journal of the Acoustical Society of America 106: 45868. [Google Scholar] [CrossRef] [PubMed]
Wood, Simon N. 2006. Low-rank scale-invariant tensor product smooths for generalized additive mixed models. Biometrics 62: 102536. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Map of Australia (small) showing Victoria (large)—locations of the communities can be seen in relation to the capital Melbourne.

Figure 2. Duration (ms) of AAE vowels plotted according to location (MI/WN) and gender (F/M).

Figure 3. Duration (ms) of AAE and MAE vowels.

Figure 4. Mean vowel values plotted in Bark according to (top) location (MI/WN), (middle) gender (F/M), and (bottom) age (<40/>40).

Figure 5. Vowel ellipses plotted in Bark according to dialect (AAE—solid line /MAE—dotted line) with labels at the mean value.

Figure 6. AAE vowel formant trajectories taken at five points across the vowel (bark normalized) for location, gender, and age.

Figure 7. Vowel formant trajectories taken at five points across the vowel (bark normalized) for MAE and AAE speakers.

Table 1. Breakdown of speakers by dialect, location, gender, and age.

Dialect	Region	Female		Male
Dialect	Region	<40	>40	<40	>40
AAE	Mildura	5	4	4	0
AAE	Warrnambool	9	3	6	2
MAE	Mildura	2	7	3	1
MAE	Warrnambool	2	5	6	2

Table 2. Number of tokens for each vowel for speakers in this study.

Vowel	AAE	MAE
KIT	344	242
DRESS	186	117
TRAP	296	167
STRUT	152	119
LOT	150	122
GOOSE	149	124

Table 3. Significant results from post-hoc pairwise t-test for F1 and F2 for each vowel (p < 0.05) for location, gender, and age. Increase in formant value ↑ or decrease in formant value ↓, of second factor in comparison to first factor.

Location (MI/WN)	F1	F2
KIT		F, >40 ↓
		M, <40 ↓
DRESS		F, >40 ↓
	M, <40 ↑
TRAP		F, >40 ↓
STRUT	M, <40 ↑
Gender (F/M)	F1	F2
KIT		MI, <40 ↓
		WN, <40 ↓
DRESS	WN, <40 ↑
TRAP		WN, >40 ↓
GOOSE		MI, <40 ↓
		WN, <40 ↓
Age <40/>40	F1	F2
KIT		M, WN ↑
DRESS		F, MI ↑
	M, WN ↓
TRAP	F, MI ↓	F, MI ↑

Table 4. Summary of parametric and non-linear differences for each monophthong in the location, age, and gender GAMMs analyses. Asterisks represent significant differences: * ≤ 0.05, ** ≤ 0.01, *** ≤ 0.001.

Location
Vowel	F1 Parametric	F1 Non-Linear	F2 Parametric	F2 Non-Linear
KIT		***
DRESS		***		**
TRAP
STRUT
LOT
GOOSE
Age
Vowel	F1 Parametric	F1 Non-Linear	F2 Parametric	F2 Non-Linear
KIT
DRESS	*		**	*
TRAP			***
STRUT
LOT	*
GOOSE
Gender
Vowel	F1 Parametric	F1 Non-Linear	F2 Parametric	F2 Non-Linear
KIT		***	***	***
DRESS	**	***	***
TRAP				*
STRUT	**
LOT	***
GOOSE			***

Table 5. Summary of parametric and non-linear differences for each monophthong in the dialect GAMMs analyses. Asterisks represent significant differences: * ≤ 0.05, ** ≤ 0.01, *** ≤ 0.001.

Dialect
Vowel	F1 Parametric	F1 Non-Linear	F2 Parametric	F2 Non-Linear
KIT	*
DRESS			***
TRAP	*
STRUT	***
LOT	***
GOOSE

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Loakes, D.; Gregory, A. Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria. Languages 2024, 9, 299. https://doi.org/10.3390/languages9090299

AMA Style

Loakes D, Gregory A. Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria. Languages. 2024; 9(9):299. https://doi.org/10.3390/languages9090299

Chicago/Turabian Style

Loakes, Debbie, and Adele Gregory. 2024. "Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria" Languages 9, no. 9: 299. https://doi.org/10.3390/languages9090299

APA Style

Loakes, D., & Gregory, A. (2024). Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria. Languages, 9(9), 299. https://doi.org/10.3390/languages9090299

Article Menu

Acoustic Analysis of Vowels in Australian Aboriginal English Spoken in Victoria

Abstract

1. Introduction and Background

Aims

2. Materials and Methods

2.1. Speakers

2.2. Materials and Recording Procedures

2.3. Analysis

3. Results

3.1. Duration in AAE

3.2. Duration Comparison of AAE and MAE

3.3. AAE Static F1/F2

3.4. Static F1/F2 Comparison of AAE and MAE

3.5. AAE Dynamic F1/F2

3.6. Dynamic F1/F2 Comparison of AAE and MAE

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI