1. Introduction
Imagine you just got a phone call from one of your neighbors, who lives upstairs. The other day your dog bit him, and he called to let you know that his scratch is getting much better and you should not worry. When your husband asks you “Who called?” you could, in principle, respond with any of the options in (1a–c):
1. | a. | The neighbor who our dog bit. |
| b. | The neighbor who was bitten by our dog. |
| c. | The neighbor who was bitten. |
| d. | #The neighbor that bit our dog. |
The responses (1a–c) correctly describe the person who called. Assuming that there are several neighbors in your building but only one whom your dog bit, these responses succeed in unambiguously referring to this individual. That is, the meaning “our dog bit one of our neighbors and this is the neighbor who just called” can be expressed via different means, all involving modification of the noun “neighbor”. (1a) illustrates modification via a relative clause (RC), in which the modified noun is the direct object of the transitive RC verb “bite”; following common terminology, these RCs are referred to as object relative clauses (ORCs). (1b) as well as (1c) illustrate the modification of the noun via a RC with a passive verbal form, with the meaning of the passive RC being equivalent to that of an ORC (see
Contemori and Belletti 2014). The two variants in (1b,c) only differ regarding whether the agent PP is present or not. Crucially, a non-passivized relative clause, such as (1d), would lead to referential failure. Following common terminology, we refer to this type of RC, in which the head of the RC is the subject of the RC predicate, as subject relative clause (SRC).
In German, speakers may answer the question just as in (1a–c), translated in (2a–c), or also with (2d) or (2e), depending on their dialect. In these cases, the RC is introduced by the uninflected complementizer
wo “where” and not by the relative pronoun
der/die/das “who”, similar to the generalized use of “that” in English.
2. | a. | Der Nachbar, den unser Hund gebissen hat |
| | the.NOM neighbor who.ACC our.NOM dog bitten has |
| | “The neighbor who our dog has bitten.” |
| b. | Der Nachbar, der von unser-em Hund gebissen wurde |
| | the.NOM neighbor who.NOM by our-DAT dog bitten was |
| | “The neighbor who was bitten by our dog.” |
| c. | Der Nachbar, der gebissen wurde |
| | the.NOM neighbor who.NOM bitten was |
| | “The neighbor who was bitten.” |
| d. | Der Nachbar, wo der Hund gebissen hat |
| | the.NOM neighbor where who.NOM dog bitten has |
| | “The neighbor that our dog has bitten.” |
| e. | Der Nachbar, wo (vo-m Hund) gebissen wurde |
| | the.NOM neighbor where (by-the.DAT dog) bitten was |
| | “The neighbor that was bitten (by the dog).” |
The meaning conveyed by the grammatical variants mentioned in (1a–c) and (2a–e) is the same: they are instances of the many-to-one mapping between form and meaning. Accordingly, we can ask whether speakers prefer any of the clausal variants and what may drive their choices.
Take the example of ORCs: when prompted by discourse to modify a patient noun phrase (DP), as in our example (1) above, speakers rarely opt for ORCs (e.g.,
Gordon et al. 2002;
Friedmann et al. 2009). In particular, ORCs, such as (1a), which involve a transitive predicate, with the agent and the patient denoting animate referents expressed by a lexical phrase, seem to be almost absent in adults’ natural speech and are rarely produced by children in experimental production studies across many languages (
Guasti and Cardinaletti 2003;
Diessel and Tomasello 2005;
Novogrodsky and Friedmann 2006;
Adani et al. 2010;
Contemori and Belletti 2014;
Jensen de Lopez et al. 2014). To convey the intended meaning, speakers choose alternative structures, and these may not necessarily be the same across languages. Experimental findings revealed that children and adults opt for passive RCs in several but not all languages: the production of passive RCs has been reported for Italian (
Contemori and Belletti 2014), Indonesian (
Tjung 2006), Korean (
Kim and O’Grady 2016), Danish (
Jensen de Lopez et al. 2014), Swedish (
Håkansson and Hansson 2000), and German (
Adani et al. 2012), but not for Hebrew (
Friedmann et al. 2009), Catalan (
Gavarrò et al. 2012), and Portuguese (
Belletti and Costa 2015). Speakers of the latter languages often produced ORCs introduced by an uninflected complementizer and containing a resumptive pronoun. Likewise, German-speaking children and adults have been found to prefer wo-RCs (2a,b) over RCs introduced by a relative pronoun (
Adani et al. 2012).
Although variation across languages exists, the choice of the specific grammatical variants is likely to not be arbitrary. It is still unresolved, however, whether speakers’ choices reflect frequencies in the ambient language or language-internal factors. That is, do the preferred grammatical variants have in common that they are frequent or that they are grammatically less complex than full ORCs, independent of frequency (see
Adani et al. 2012;
Contemori and Belletti 2014)? In the current study, we investigate to what extent corpus frequencies determine the types of grammatical variants used in nominal modification contexts. Starting from the working assumption that frequency guides language use and language acquisition, we also ask whether the preferred options change in the course of acquisition. We address these issues in two studies: Study 1 investigates the grammatical variants that German-speaking 3- to 6-year-olds and adults use when asked to modify agent and patient DPs. Taking as a starting point the main result of Study 1–the surprisingly frequent use of passive RCs-Study 2 examines the frequency and the properties of passives and passive RCs in child-directed speech and compares this data to the characteristics of passive RCs in our experimental data of Study 1.
The structure of the paper is as follows:
Section 2 provides an overview of how agent and patient DPs can be modified in German and delineates the frequency account.
Section 3 reviews previous findings regarding the use of SRCs and ORCs and their variants in adult and child language.
Section 4 describes the elicited production experiment (Study 1), and
Section 5 describes the analysis of child-directed speech (Study 2).
Section 6 offers a discussion of our findings and points to some limitations and suggestions for further research,
Section 7 concludes the paper.
4. Study 1: Elicited Production Task
This study investigates which structures children and adults use when modifying agent and patient DPs, in order to examine whether corpus frequencies determine their choice of specific grammatical variants. We developed an elicited production task, which prompted participants to identify a referent among a set of alternative referents. Half of the situations required the modification of the agent and half modification of the patient, and all contexts displayed transitive events involving two animate participants. This design allows us to calculate and compare frequencies in a controlled setting. This way, we arrive at a measure of frequency that is distinct from “everyday language use”, which unavoidably reflects conversational needs that may favor or disfavor production of one structure over another for language-unrelated reasons. Two research questions guided our analysis:
- (Q1)
Which grammatical variants of nominal modification do German children of different ages and adults prefer within a controlled experimental production task?
- (Q2)
Do the preferred variants change in the course of acquisition?
As for (Q1), we identified the range of structures produced by children aged 3, 4, 5 and 6 and by adults and then determine for each age group which ones are preferred, with the preference being measured in terms of production rates. As for (Q2), we compare the data across the age groups to characterize the developmental path of the grammatical variants of nominal modification, starting at age 3, the age at which production of complex sentences first emerges.
Our working assumption is that frequency shapes both language use and language acquisition (see
Section 3.2). In our agent-modifying contexts, SRCs need an animate head and an animate RC internal direct object DP. Given the findings from corpus studies, it is unknown whether child and adult speakers would opt for this type of SRC: SRCs were reported to be the most frequent type of RC in German speakers, but they mainly consisted of intransitive or copula predicates. SRCs with transitive predicates were not frequently found in both adults’ and children’s spontaneous speech. In our patient-modifying contexts, ORCs need an animate head and an animate RC internal subject. Since this type of ORC was only rarely found in corpus studies, we did not expect participants to opt for ORCs. RCs with uninflected complementizers should occur only in speakers whose dialect allows this structure. Furthermore, we expected participial structures to be generally avoided in both contexts and to be restricted to individual preference. No clear predictions can be derived regarding passive RCs because detailed results on the frequency of passive RCs in adults’ speech as well as on their acquisition to date are lacking. Finally, in the case of semantically inappropriate RCs it is impossible to derive frequency-based predictions: if the frequency of structures in the ambient language alone determined language acquisition, these constructions should not occur at all. Nevertheless, they have often been found in acquisition studies.
4.1. Participants
A total of 133 children between the ages of 3 and 6 were tested. Children were recruited in daycares in the metropolitan XXX area, and all parents gave written consent for their child’s participation in the study. A parental questionnaire ensured that none of the children was bilingual, had signs of language impairment, language delay, or hearing problems. Moreover, all children were assessed with a standardized language test (SETK for ages 3 to 5,
Grimm 2001; TROG-D for age 6,
Fox 2006). Eleven children did not perform within the age-appropriate norms and were excluded from the analysis, and another eight children were excluded due to missing 8 or more items (out of 48 test items of our main test). In the following section, we report results on the remaining 114 typically developing monolingual German-speaking children. In addition, 21 adults with no background in linguistics were tested. The participants’ details are summarized in
Table 1.
4.2. Material and Procedure
We developed a novel elicited production task involving colored pictures, which made the task suitable for young children. Nominal modification structures were elicited in a question-answer context similar to the design by
Novogrodsky and Friedmann (
2006). 24 test items prompted the production of structures in which the agent of an action was modified, and 24 test items prompted the production of structures in which the patient of an action was modified. Two warm-up items were included to familiarize participants with the experimental procedure. Each item was presented with a picture: (6) exemplifies the agent-modifying condition paired with
Figure 3 and (7) the patient-modifying condition paired with
Figure 4.
6. | Test item in the modifying-agent condition |
| Context of elicitation |
| Hier sind zwei Bären, ein Hund, und ein Igel. Ein Bär wäscht den Hund und ein Bär wäscht den Igel. Welcher Bär hat den Hut auf? |
| “Here there are two bears, a dog, and a hedgehog. A bear is washing the dog and a bear is washing the hedgehog. Which bear is wearing the hat?” |
| Possible answer |
| Der Bär, der den Hund wäscht |
| “The bear that is washing the dog.” |
7. | Test item in the modifying-patient condition |
| Context of elicitation |
| Hier sind zwei Affen und ein Hase. Der Hase streichelt einen Affen und er beißt einen Affen. Welcher Affe hat den Hut auf? |
| “Here, there are two monkeys and one hare. The hare is stroking one monkey and he is biting one monkey. Which monkey is wearing the hat?” |
| Possible answer |
| Der Affe, den der Hase beißt. |
| “The monkey that the hare is biting.” |
Parallel to the design by
Novogrodsky and Friedmann (
2006), the scenes showed either two contrasting actions (see
Figure 4) or two contrasting participants (see
Figure 2), all involving transitive events, expressed by transitive verbs. In order to minimize differences between the contexts modifying the agent and the patient, we selected only transitive verbs denoting reversible actions. The participants were all animal characters, so both animals were equally likely to perform these silly actions. All nouns used were animate referring to names of animals like bear, bird, etc. The test items contained only lexical DPs marked for masculine singular, providing unambiguous case-marking on the relative pronoun and on the article of the embedded DP. All nouns used in the test items have been attested to be used before age 3, according to our search of the German CHILDES corpora (
MacWhinney 2000).
Participants were tested individually by a trained research assistant in a quiet room in their daycare. The test session started with a familiarization with the experimenter and the material. During this familiarization phase, the child was introduced to the frog-puppet Caru, who interacted with the child during the experimental session. The experimenter, the child, and the puppet sat on one side of a table on which a picture book and a box containing various hat-stickers were laid out in front of the child. The experiment was video-taped and audio-recorded for further analysis. Caru showed the child his favorite picture book of animals performing different actions, which he knew by heart (examples (6–7)). As part of a game, the experimenter then blindfolded Caru. This was done in order to reduce pointing responses and the use of the left/right one. The experimenter asked the child to choose one of the hats available as stickers and to place it on one of the two animals, which were the patients or the agents depending on the test condition: participants could choose to place the hat on one of the two agents in the agent-modifying condition and on one of the two patients in the patient-modifying condition. The child would put a hat on one of the animals, while Caru was already blindfolded. This was done to provide a natural context for asking the test question. Caru would describe by heart the scene in the book and ask the child “Which X is wearing the hat?” If the child did not react, Caru repeated the question. If necessary, the experimenter repeated the two warm-up items to ensure that the child understood the task. When the children gave responses such as “this one” or “the one on the left/right”, the experimenter repeated the question, emphasizing that Caru could not see the pictures. After two prompts, the experimenter moved to the next test item. No response-contingent feedback was given. Adults were tested in one session; children were tested in two sessions, which lasted between 25 and 40 min each.
4.3. Results of Study 1
A range of structures was produced by children and adults; here we focus on five types. (i)
full RCs: grammatically and contextually appropriate SRCs or ORCs, describing the event in the picture correctly, (ii)
semantically inappropriate RCs: RCs that do not match the depicted event or did not answer the question, (iii)
participial structures, (iv) wo-rcs, (v) passive rcs: SRCs introduced by a relative pronoun and with the predicate in passive voice. The remaining structures, labelled
others, comprised main clauses, fragments, pointing, null reactions, productions of other types of subordinate clauses, and un-analyzable reactions. They were not included in any statistical analysis. For expository purposes, we report our findings in two tables.
Table 2 reports the raw scores, percentages (%) and standard deviations (N), of all the structures produced by our participants divided by age group in the agent-modifying condition. The results for the patient-modifying condition are illustrated in
Table 3.
Descriptively,
full RCs were produced more frequently in the agent-modifying than in the patient-modifying condition by both children and adults; the proportion of
full RCs in
Table 2, i.e., SRCs, increased with age.
semantically inappropriate RCs occurred more frequently in the patient- than in the agent-modifying condition; they were almost exclusively produced by children.
Participial structures were produced very rarely in either condition and only by adults. Similarly,
wo-RCS were not frequently produced with no discernable distinctions between conditions and age groups. Finally,
passive-RCS were produced only in the patient-modifying context, as expected, and their production increased with age.
Data were analyzed using R (
R Core Team 2016), in combination with the
lme4 package (
Bates et al. 2015), and
lsmeans (
Lenth 2016) to perform a series of analyses. In a first step we asked whether participants’ production rates differed across structures, conditions, and age. We fitted participants’ responses to a generalized mixed effects logistic regression model. Participants’ productions were our dependent variable. As fixed factors, we posited the following variables: structure type (i.e.,
full rcs, semantically inappropriate RCs, participial structures, wo-rcs, passive rcs), condition (agent-modifying vs. patient-modifying context), age (3-, 4-, 5-, 6-year-olds, adults), and four interactions, i.e., structure type*condition, structure type*age, age*condition, structure type*age*condition. Patient-modifying context was the reference category for condition, 3-year-olds for the age factor, RCs for structure. The model was fitted using restricted maximum likelihood.
The model revealed significant effects of structure type and age, as well as of three interactions, structure type*condition, structure type*age, structure type*age*condition. Degrees of freedom,
F-values and associated
p-values for the model fits are reported in
Table 4.
On the basis of the effects revealed by the model, we proceeded with the additional analyses of our findings isolating structures and development. In
Section 4.3.1, we focus on the structure type, addressing Q1. In
Section 4.3.2, we zoom into the age factor and address Q2.
4.3.1. Assessing Preference: Frequency of Structures within Each Age Group
In order to determine which grammatical variants of nominal modification German children of different ages and adults prefer within a controlled experimental production task (Q1), we compared the production rates of the five structures within each age group. We focus on the factors “structure” and “condition”.
Our model revealed a main effect of condition for full RCs (Estimate = 4.15, SE = 0.34, Wald Z = 42.00, p < 0.0001), passive RCs (Estimate = −4.6, SE = 0.99, Wald Z = −4.6, p < 0.0001), and semantically inappropriate RCs (Estimate = −2.09, SE = 0.33, Wald Z = −6.24, p < 0.001) in each age group (all coefficients for age fixed effects were significant in each group at p < 0.02). While full RCs were more frequently produced in the agent-modifying context, both passive RCs and semantically inappropriate RCs occurred significantly more frequent in the patient-modifying context. The effect of condition did not yield a significant effect for WO-RCs (Estimate = 0.5, SE = 0.55, Wald Z = 0.91, p = 0.365) and participial structures (Estimate = 0.06, SE = 0.44, Wald Z = −0.02, p = 0.419).
To investigate differences between the amounts of structures produced, we applied linear contrasts with Tukey correction within each age group. In the agent-modifying condition, the analyses revealed that full RCs were produced more frequently than semantically inappropriate RCs in all age groups (Age 3: Estimate = −0.05, SE = 0.013, t = −3.9, p < 0.001; Age 4: Estimate = −0.13, SE = 0.01, t = −9.14, p < 0.001; Age 5: Estimate = −0.16, SE = 0.01, t = −14.8, p < 0.001; Age 6: Estimate = −0.2, SE = 0.01, t = −15.2, p < 0.001; Adults: Estimate = −0.2, SE = 0.006, t = −28.9, p < 0.001). Full RCs were also more frequent than wo-RCs (Age 3: Estimate = −0.05, SE = 0.013, t = −4.1, p < 0.001; Age 4: Estimate = −0.13, SE = 0.014, t = −9.15, p < 0.001; Age 5: Estimate = −0.2, SE = 0.01, t = −15.2, p < 0.001; Age 6: Estimate = −0.17, SE = 0.01, t = −15.3, p < 0.001; Adults: Estimate = −0.17, SE = 0.03, t = −14.8, p < 0.001).
In the patient-modifying condition a different picture emerged. Post hoc comparisons with a Tukey correction demonstrated that at age 3 and 4 semantically inappropriate RCs were produced significantly more frequently than full RCs (Age 3: Estimate = −0.05; SE = 0.01; t = −3.9, p < 0.001; Age 4: Estimate = −0.13; SE = 0.014, t = −8.9, p < 0.001) as well as than passive RCs (Age 3: Estimate = 0.13; SE = 0.04, t = 2.7, p = 0.01; Age 4: Estimate = 0.3, SE = 0.06, t = 5.6, p < 0.001). At ages 3 and 4, no significant differences were found between the production rates of full RCs, passive RCs, and wo-RCs (all p’s > 0.1). At age 5, 6, and in the adult group, passive RCs were produced significantly more frequently than full RCs (Age 5: Estimate = −0.12 SE = 0.01, t = −13.3, p < 0.001; Age 6: Estimate = −0.17, SE = 0.01, t = −14.5, p = 0.001; Adults: Estimate = −0.14; SE = 0.035, t = −4.06, p < 0.001). Passive RCs also occurred significantly more often than semantically inappropriate RCs in the adult group (Estimate = −0.17; SE = 0.02, t = −8.45, p < 0.001). No significant differences were found between the production rates of Passive RCs and semantically inappropriate RCs at ages 5 and 6 (both p’s > 0.06).
In short, our analyses show that, in the agent-modifying condition, full RCs were preferred by all age groups. In the patient-modifying condition, semantically inappropriate RCs were preferred at age 3 and 4, while older children and adults preferred passive RCs.
Next, we conducted an individual analysis to detect to what extent the preferred structure within each age-group was also present at the individual level. We asked how many children within each age group aligned with the observed group preference and how many deviated from it. We define a structure as being preferred if it occured in more than half of the child’s productions, namely in more than 12 items. The preference for full RCs found at the group level in the agent-modifying condition was also observed at the individual level, with the proportion of particpants showing this preference increasing with age: 14/27 at age 3, 25/31 at age 4, 29/31 at age 5, 22/25 at age 6, and 20/21 adults preferred full rcs. In contrast, 25 out of the 135 participants, i.e., 24 children and 1 adult, exhibited a preference that differed from the group preference in the Agent-modifying condition. Several participants preferred responses coded as others (Age 3: 13/27 children; Age 4: 4/31; Age 6: 1/25), semantically inappropriate RCs (Age 4: 1/31; Age 5: 1/31; Age 6: 1/25), wo-RCs (Age 4: 1/31; Age 6: 1/25; Adults: 1/21). In the production of 1/31 5-year-old child, the amount of full RCs was identical to the amount of semantically inappropriate RCs.
In the patient-modifying condition, 10/27 three-year-olds and 20/31 four-year-olds preferred semantically inappropriate RCs, thereby aligning with the respective trends at group-level. In total, 19/31 five-year-old children, 14/25 children aged 6, and 19/21 adults aligned with their age group trend preferring passive RCs. Overall 53 out of the 135 participants exhibited a preference that differed from the group preference in the patient-modifying condition. While semantically inappropriate RCs were preferred at the group level at age 3 and 4, some children preferred others (Age 3: 14/27 children; Age 4: 4/31) passive RCs (Age 3: 2/27; Age 4: 4/31), full RCs (Age 4: 2/31), and wo-RCs (Age 3: 1/27; Age 4: 1/31). passive RCs were preferred by the 5-year-olds, the 6-year-olds and the adults at the group level; some speakers deviated from this group preference and preferred different structures: others (Age 6: 1/25), semantically inappropriate RCs (Age 5: 10/31; Age 6: 8/25), full RCs (Age 5: 2/31; Age 6: 1/25), and wo-RCs (Age 6: 1/25; Adults: 1/21), and participial structures (Adults: 1/21).
In short, the preference trends detected at the group level were generally present at the individual level as well. However, our data also revealed a certain degree of variation between speakers within each age group, especially detectable in the younger children and more noticable in the patient- than in the agent-modifying condition.
4.3.2. Assessing the Developmental Path: Frequency of Structures Across Age Groups
In the previous section, we established the structural preferences within each age group. Now we turn to the second research question (Q2) of whether the preferred variants change in the course of acquisition. We examined each of the five structures produced by our participants across the five age groups. We focus on the effect of the “age” factor in the mixed effects logistic regression model.
Full RCs
Note that, in the agent-modifying condition, a
full RCs refers to an SRC and, in the patient-modifying condition, to an ORC. In order to investigate the differences between age groups regarding the rate of
full RCs, we collapsed the agent-modifying and the patient-modifying conditions and compared the age groups to each other, applying linear contrasts with Tukey correction.
Table 5 reports a summary of the results of the post hoc comparisons.
The performance of 3-year-olds on full RCs significantly differed from that of all other age groups. No significant differences in the rate of full RCs were found between all other age groups.
To see whether there were other differences regarding
full ORCs that went undetected by this analysis, we explored which type of
full ORCs children and adults produced. Two types were attested in our data:
full ORCs with both the head noun and the subject being lexical nouns, as in (8a), and
full ORCs with a pronominal RC head and a lexical subject (8b).
8. | a. | Der Affe den Der Hase beißt |
| | the.NOM donkey who.ACC the.NOM hare bites |
| | “The monkey which the hare is biting.” |
| b. | Der den der Hase beißt |
| | the.NOM who.ACC the.NOM hare bites |
| | “The one that the hare is biting.” |
Accordingly, we calculated the number of
full ORCs with two nouns (8a) and with a pronoun and a noun (8b).
Table 6 summarizes the results.
Table 6 shows that the great majority of
full ORCs produced by children contains a pronominal head and a lexical DP subject, while adults’ production is more balanced between the two types of ORCs. Because of the low overall frequencies, all child groups were collapsed for the statistical analysis. An ANOVA revealed that
full ORCs occurred significantly more often with Pronoun–Noun than with Noun–Noun in the children groups (SE = 1.1,
t = −4.7,
p < 0.001), but not in the adult group (SE = 4.9,
t = 0.3,
p = 0.99).
Semantically inappropriate productions: modification of the wrong noun
To investigate the differences between the groups again we applied linear contrasts with Tukey correction. Children in all age groups produced significantly more
semantically inappropriate RCs than adults, while the child groups did not differ significantly from each other.
Table 7 reports a summary of the results of the post hoc comparisons.
Wo-RCs
There was no significant difference depending on age (all coefficients for age fixed effects p > 0.6). The amount of wo-RCs production was very low and remained stable across all ages. An individual analysis revealed that overall only 6/133 children (Age 3: 1; Age 4: 1; Age 5: 2; Age 6: 2) and 1/21 adults produced wo-RCs. Resumption in wo-RCs was not attested in our data.
Participial structures
The number of
participial structures in both conditions, agent- and patient-modifying, was very low overall. They were produced by adults only: 6 responses in the agent condition (1.2%) and 21 responses in the patient condition (5.4%) (see
Table 2 and
Table 3), produced by 2 adults, who used them quite consistently.
Passive RCs
To investigate the differences in the rate of
passive RC production between the age groups, we compared the age groups to each other, again applying linear contrasts with Tukey correction.
Table 8 summarizes the results of the statistical analysis.
Children in all age groups significantly differed from adults in the likelihood of producing passive RCs. Among the child groups, there was no significant difference between 3- and 4-year-olds and no significant difference between 5- and 6-year-olds. 3-year-olds and 4-year-olds differed from both 5- and 6-year-olds, indicating that use of passive RCs was higher in the two older groups than in the younger two groups.
passive RCs constituted the most frequent structure from age 5 onwards and the second frequent one at age 4 (see
Table 3). Passive RCs were used, once or repeatedly, by 6/27 children at age 3, by 8/31 children at age 4, by 18/31 children at age 5, by 17/25 children at age 6, and by all 21 adults. In a next step, we analyzed the formal properties defining passive in German: (i) the type of auxiliary; (ii) the lexicalization of the agent; (iii) the position of the agent-phrase in the RC (see also
Section 5.1).
As for (i), children at all ages used almost exclusively passive RCs with the auxiliary werden “to become”, expressing eventive passive. One 5-year-old used two passive RCs with the auxiliary sein “to be”, signaling stative passive (Der der von dem Löwen gebissen ist “the one that is bitten by the lion”; Der der von dem Tiiiger gewaschen ist “the one that is washed by the tiger”). Adults nearly always used the werden passive, except for two cases of bekommen “to get” passive. These results suggest that, already at age 3, children are adult-like in using the eventive passive in these RCS.
We turn to (ii), the lexicalization of the agent in the RC, next. Recall that our design depicted two different scenarios (see
Section 4.2): a “contrasting action” scenario with only one agent performing two different actions (half of the test items) and an “agent contrasting” scenario with two different agents performing the same action (the other half of the test items) (see
Table A1 in
Appendix A for the scores of
passive RCs in the two contexts). In the “contrasting actions” scenario, the agent does not need to be lexicalized, since the referent can be identified by the type of action he performs, but in the “contrasting participants” scenario, it is mandatory: the referent can only be identified if the agent performing the action is mentioned. Accordingly, we calculated how many
long-passives, i.e.,
passive RCs with a lexicalized agent, were produced in the two contexts in each age group. Note that the lexicalized agent was always expressed by a
von + DP.
Table 9 summarizes the rate of
long-passives produced in the two contexts in each age group.
Adults produced long-passives in both contexts. Children produced short- and some long-passives in the “contrasting actions” context, but almost exclusively long-passives in the “contrasting agents” context. Only in eight instances in the “contrasting agents” context did the passive RC contain no agent phrase; an individual analysis revealed that these deviant productions were produced by one 4-year-old, one 5-year-old, and one 6-year-old each.
As a third aspect (iii), we considered the position of the von-phrase within the RC. After all, it may be that, despite the high number of long-passives, children’s von-phrases were not in target-like position, i.e., properly integrated within the clause. Three different word-orders were found, based on the position of the von-phrase (PP) in relation to the participle (Part) and the auxiliary (Aux). The target word order PP-Part-Aux (der von dem Hasen gebissen wird “who is bitten by the hare”) was by far the most frequent one across all age groups (Age 3: 23/25, Age 4: 82/93, Age 5: 252/270, Age 6: 263/273, Adults: 424/430). The order Part-Aux-PP, where the PP is extraposed—maybe as a kind of afterthought—(der gebissen wird von dem Hasen “who is bitten by the hare”) was found rarely in both children and adults (Age 3: 2/25, Age 4: 11/93, Age 5: 15/270, Age 6: 10/273, Adults: 6/430). The order Part-PP-Aux (der gebissen von dem Hasen wird) was used only in three passive RCs by one 5-year-old. In summary, children and adults pattern alike with respect to all three formal properties of passives in passive RCs: use of the auxiliary werden, lexicalization of the agent if required, and word order of von-PP, participle and auxiliary.
4.4. Discussion
Preference of the grammatical variants within age groups
Concerning Question 1, asking which grammatical variants of nominal modification German children of different ages and adults prefer within a controlled experimental production task, our elicited production data revealed that adults and children across all ages opted for different structures, depending on whether the modified participant was the agent or the patient of the action. When prompted to modify the agent of the action, adults and children at all ages most frequently produced
full RCs. This result is in line corpus studies who report a high rate of SRCs (
Fabricius-Hansen 2010;
Lübbe and Rapp 2011;
Bader and Koukoulioti 2018). When prompted to modify the patient of the action,
passive RCs and, in the child groups,
semantically inappropriate RCs were produced most frequently
; both findings are clearly not in line with the expectations derived from the frequency account. At the same time, the low frequency of
full ORCs attested in our experimental setting involving two animate participants confirms corpus studies, which found
full ORCs to be infrequent, and supports a frequency-based explanation.
In contrast to the previous findings (
Adani et al. 2012;
Yatsushiro and Sauerland 2019),
wo-RCs were not a frequent response in either the agent- or patient-modifying condition. This difference may be a result of the regional variant of German spoken in the area where participants were recruited: Northern Germany in their case, where
wo-RCs have been documented, and XX in our case, where
wo-RCs are not common (
Fleischer 2004;
Brandner and Bräuning 2013). Accordingly, our finding is in line with the frequency reported in the relevant corpus studies (
Hirschberg et al. 2014) and is expected under the frequency account, given that, in the area where we recruited our participants,
wo-RCs are indeed rarely found.
In line with the frequencies reported in corpus studies, participial structures were almost absent, with only two adults producing them, albeit quite consistently. This individual variation would also be consistent with the frequency account.
Finally, our individual analysis revealed that the clear preferences for a grammatical variant at the group level was supplemented with a certain degree of variation between speakers in each age group, which was different for different ages and condition. Adults behaved very consistently, all but one preferring full SRCs in the agent context and all but two opting for passive SRCs in the patient-modifying context. Individual variation was especially noticeable in the younger groups and for the patient-modifying context. Between the ages of 3 and 6, children are still in the process of discovering and mastering the several grammatical variants that exist in German. So, they may weigh different aspects of this acquisition puzzle differently and develop different strategies, when in need of deciding on a grammatical variant. Some may give up on the meaning side, opting for
semantically inappropriate RCs, other may give up on subordination, opting for main clauses (e.g., see,
Fritzenschaft et al. 1990).
5 Note that this account begs the question of whether these choices are driven by the same factor or by different ones and whether corpus frequency can account for this individual variation. We return to this point in the final discussion.
Frequency of structures across age
Turning to Question 2, of whether the preferred options change in the course of acquisition, we found that children and adults showed pronounced differences as well as similarities. Note first that wo-Rcs and participial structures were very infrequent in both conditions (agent- and patient-modifying) in all age groups, so we will not address their development here.
In the
agent-modifying condition, children and adults overall preferred
full SRCs, with statistical analyses revealing a developmental step from age 3 to age 4: three-year-old children produced fewer
full SRCs than all the other age groups, whose production rates did not differ anymore from that of the adults. This finding agrees with previous acquisition studies on German and other languages showing that children master SRCs at the age of 4 (
Rothweiler 1993;
Diessel and Tomasello 2005, a.o.).
In the patient-modifying condition, our data revealed several differences across age groups. First, children at all ages, as well as adults, produced very few full ORCs, with the type of ORC being markedly different: whereas adults produced full ORCs with two lexical nouns, RC head and subject DP, children at all ages most frequently produced full ORCs with a pronominal RC and a lexical subject DP. Assuming that frequency shapes language acquisition, this difference is difficult to explain under the frequency account.
Second, while adults preferred
passive RCs in the patient-modifying context
, children’s responses differed according to age: at ages 3 and 4, the most frequent structures were
semantically inappropriate RCs, whereas at ages 5 and 6, they were
passive RCs, just as in the adult group. In addition, the rate of
passive RCs increased significantly with age, with children at age 6 still producing significantly fewer
passive RCs than the adults.
passive RCs were used already by the three-year-olds, which confirms previous studies reporting early production of passives (
Abbot-Smith and Behrens 2006). No qualitative differences between adults and children emerged regarding the formal properties of
passive RCs: the passive structures contained an eventive passive, the agent-phrase was lexicalized when required, and the word order of participle and agent-phrase signaled syntactic integration. These data confirm the findings on long passives in 5-year-old German children by
Yatsushiro and Sauerland (
2019) for younger children. Our results contrast with the finding in
Mills (
1985), however, according to which children up to the age of 5 produce stative passives and short passives. Our detailed analysis of
passive RCs demonstrated that the syntactic properties of
passive RCs were in place from the age of 3 onwards. This result does not follow from the corpus frequencies, in particular from frequencies derived from child-directed speech, which showed passives to be infrequent.
Summary
Study 1 showed that children’s and adults’ patterns were alike in several respects: they most frequently produced full SRCs in the agent-modifying condition, and they frequently produced passive RCs in the patient-modifying condition. In addition, the formal proprieties of the passive RCs were the same across all age groups. While the production of full SRCs was expected on the basis of the previous acquisition studies, the fact that children produced so many passive RCs is surprising given that studies on child-directed speech found passive to be very rare. Accordingly, in Study 2, we examine whether the child passive RCs elicited in the experimental task reflect the frequency of passive RCs in children’s ambient language; this has, to our knowledge, not been examined before.
6. General Discussion and Implications
Speakers typically have more than one way of getting their message across. As a case in point, when speakers want to identify the agent or the patient of an action, various structures will fulfill this conversational goal to the same degree. The essential question is which principle(s) guide the adult speaker’s choice and whether the same principle(s) determine children’s acquisition path. This paper contributes to this issue by examining whether frequency may be such a principle.
Frequency shapes language use and acquisition: our operationalization
Several studies have argued, specifically, that frequency affects language by shaping adults’ language use and language acquisition. The more frequently a structure occurs, the more easily it will be activated or processed in an adult speaker’s mental grammar. Likewise, more frequent constructions are acquired earlier and are produced at a higher rate by children than less frequent constructions. Given that young children, and even newborns, are sensitive to distributional regularities they are exposed to, it seems very likely that they are attuned to frequency effects.
At the same time, children are well-known for producing structures that are not consistent with the target-grammar, arguably constrained by universal grammar (
Thornton 1990;
Schulz and Tracy 2018), for quite some time; this is difficult to reconcile with a pure frequency account. Similarly, adults have been reported to produce specific structures in experimentally controlled contexts that are very infrequent in spontaneous speech (
Belletti and Chesi 2014), which raises the question of how and why these infrequent structures were activated.
A reliable estimate of the frequency of the relevant structures is central for probing its role. Following
Roland et al. (
2007), we took the estimates of actual language use derived from corpus studies as a good proxy for the frequency of a given structure in adult speech. In addition, we took the estimates derived from corpus studies on child-directed speech to be a reliable proxy for the frequency of a structure in children’s input. As discussed in
Section 2.3, different estimates of the same structure have been proposed in corpus studies, varying with the type of corpus and the frequency measure employed. We compared the (sometimes contradictory) corpora findings with our findings. Study 1 addressed the aspects of adult use and child acquisition with an elicited production task, which provided equal opportunities to modify the agent and the patient of an action. Based on a longitudinal corpus of child-directed speech, Study 2 looked more closely into the frequency and the formal properties of passive structures and passive RCs, which turned out to be very frequently used in Study 1. In the following section, we discuss three general observations drawn from our studies.
Frequency of noun modifying structures in corpora and (the problem of) how to count
When looking at the frequency of structures across the two modification contexts, the problem arises of determining what to compare, i.e., to establish which frequency the speakers and language learners are sensitive to, as argued in
Section 3.3. It may be total frequency of structures A and B (e.g., determined on the basis of all structures A, B, C, D, and E, or all structures modifying a noun A, B, C) and/or relative frequency of A and B (i.e., compared to the total of A and B) that play a role. In our view, both notions of frequency leave unaddressed the fact that specific discourse situations may per se occur at very different rates. More specifically, we reasoned that the opportunity to produce SRCs and ORCs may differ, because the need to modify the agent and the need to modify the patient of an action may not occur at the same rate; this would result in different frequencies of SRCs and ORCs for conversational needs. The experimental design of Study 1 avoided this potential artefact by providing equal opportunities to modify agents and patients of an action. Our results did not single out one structure that was most frequently produced in absolute terms, for either children or adults. The most frequent structure was different for agent- and patient-modifying contexts, suggesting that the absolute frequency of a given structure may not be decisive in determining adult speakers’ choices and language acquisition order. Speakers seem to be sensitive to subtler measures of frequency, which are calculated according to situational contexts (agent- vs. patient-modifying context) (see also
Roland et al. 2007). Accordingly, we suggest that, when modeling the frequency of a given structure, any probabilistic theory of language production and acquisition needs to take into consideration that the likelihood to use particular structures is dictated by discourse needs.
Frequency of structures and the problem of what counts as a structure
When prompted to modify the agent of an action, adults and children at all ages most frequently produced SRCs. This result confirms German corpus studies, which reported SRCs to be, overall, the most frequent type of RC, and acquisition studies showing SRCs to be the first clausal nominal modification structure produced by children. From this, we may conclude that the frequency of SRCs determines its repeated use in a specific context and its early acquisition. However, we may also probe further and ask whether this conclusion only holds if the type of SRC found in corpora is the same as the SRC type used in Study 1. This question turns out to be crucial: the SRCs found in the corpora mostly contain an intransitive predicate or a copula, whereas the RCs in Study 1, due to our experimental design involving transitive events with two animate participants, all contained a transitive predicate. Put differently, how can the frequency of SRCs containing intransitive or copular predicates prompt the early and frequent use of SRCs with transitive predicates? In our view, we have to assume that adult speakers and children are sensitive to frequency measures computed based on structures as abstract objects, detached from the actual realizations, if we wish to maintain the role of frequency.
When prompted to modify the patient of an action, adults and children very rarely produced ORCs. As in the case of agent modification, an evaluation of these findings in terms of corpus frequency is not straight-forward. Overall, German corpus studies found ORCs to be rare but, among all RCs with a transitive predicate, to be more frequent than transitive SRCs. The ORCs attested in corpora are most often embedded by a lexical inanimate RC head and contain a pronominal animate subject, which is in contrast to the two animate nouns required by our experimental design in Study 1. Applying the same reasoning as in the case of SRCs, we would expect the relatively frequent type of ORCs in corpora such as the house that she bought to promote use of ORCs such as the monkey that the rabbit/it strokes, but clearly, this is not what we found. If we, nevertheless, wish to maintain a role of frequency for ORCs, we need to stipulate that adult speakers and children are sensitive to frequency measures computed based on subtypes of ORCs considering properties such as animacy. Accordingly, the notion of frequency that could account for our findings on ORCs is different from the one at play for SRCs.
As for the use of lexical nouns or pronouns in ORCs, adults produced both variants, two nouns or a pronoun and a noun, whereas children used mostly a pronominal RC head and a lexical noun subject. These results, together with previous corpus acquisition findings, suggest that children’s first ORCs differ from the adult variants in a principled fashion. These differences are not predicted by frequency but can be explained with language-internal factors: syntactic considerations dictate that the RC head and the subject DP in the child’s grammar avoid the same lexical specification. Independent of their position, one DP needs to be a pronoun and the other a lexical noun, which can be expressed in terms of different syntactic features (e.g., via relativized minimality, see
Friedmann et al. 2009;
Belletti et al. 2012).
Given our findings on SRCs and ORCs, an important implication emerges for a definition of “structure” and its application in language acquisition. Frequency accounts propose that linguistic structures are organized in a hierarchically ordered network ranging from lexicalized constructions, i.e., concrete tokens of experience that are memorized (
Bybee 2006), to highly abstract representations at the top (
Diessel 2009 2019). Under these approaches, children are assumed to first acquire the structures that match the concrete input-tokens and to gradually learn more schematic representations. In contrast, our results suggest that lexicalized constructions may not have an explanatory role in children’s acquisition of RCs. Both in the case of SRCs and of the few elicited ORCs, we see that the children’s acquisition order of structures relies on the notion of “structure” as an abstract object associated with (sometimes refined) syntactic considerations. Our conclusions can also be extended to adults’ language use of structures in experimental settings (see
Roland et al. 2007) for a similar conclusion on adults and their use of structures in corpora).
Frequency of passive and passive RCs: the role of language-internal factors
Passive RCs were already produced by the 3-year-old children and constituted the most frequent structure from age 5 onwards, with no differences regarding its properties across ages. All passive RCs were eventive and, having been lexicalized by a von-PP, were syntactically integrated into the host clause. These long passive variants were produced when required by discourse. While these results are in line with corpus studies reporting passive RCs to be preferred to ORCs, the predominance of passive RCs is still surprising, given that passive is reported to be very rare in adult child-directed speech. Study 2 revealed that passive RCs were almost absent in child-directed speech as well, and that their rate did not increase with the child’s age. Moreover, in contrast to the long passives favored by adults and children in Study 1, most passives in child-directed speech were short. Accordingly, neither the high production rate nor the formal properties of the passive RCs elicited in Study 1 can be easily determined by corpus frequencies.
Here, we suggest that participants’ choices and the developmental path may be explained by drawing on the notion of syntactic complexity (see
de Vincenzi 1991;
Friedmann et al. 2009;
Contemori and Belletti 2014; a.o.). Informally speaking, whenever there are two (or more) alternative structures, A and B, to express roughly the same meaning, a speaker will prefer the syntactically simpler structure A, requiring fewer ingredients in the syntactic derivation (e.g.,
Jakubowicz 2011) to B, and children will acquire structure A before structure B.
8 Crucially, the structures that are, in principle, available in the patient-modifying context are all reported to be infrequent in natural speech data. Accordingly, it is difficult to see how sensitivity to frequency—no matter which flavor—could be of help here in determining language use and acquisition. Focusing on passive RCs and ORCs, we maintain that the former structure is syntactically simpler than the latter and that this leads to the preferred use and early emergence of passive RCs in German. The argument is as follows: whereas in ORCs the patient DP is assumed to cross the agent, resulting in increased difficulty (
Friedmann et al. 2009), the patient DP in passive RCs does not cross the agent. Depending on the syntactic analysis, crossing does not occur because of the lack of an agent DP in passives (
Baker et al. 1989;
Bruening 2013;
Alexiadou 2005;
Alexiadou and Schäfer 2013, a.o.), or because of smuggling (
Collins 2005;
Contemori and Belletti 2014). Accordingly, the preference for passive RCs over ORCs may be the reflex of the, possibly universal, principle of relativized minimality according to which ORCs are avoided in favor of SRCs as proposed in
Friedmann et al. (
2009). We note that the preference for passive RCs may have been promoted by our experimental setting, in which both the agent and the patient were animate. Animate DPs tend to be structurally the subject of a sentence (e.g.,
Aissen 2003), which may have favored the production of SRCs, which in a patient-modifying context means passive RCs as well as of
semantically inappropriate RCs. That is, when prompted to modify the patient as in
Figure 2 “Which donkey is wearing the hat?”, children may tend to treat the modified DP, i.e., the donkey, as the subject of the RC, due to its animacy. This results in the production of a RC with the patient being promoted to the subject of the RC, namely, a passive RC and a semantically inappropriate RC.
9We propose that syntactic complexity may also account for the frequency and the acquisition path of
semantically inappropriate RCs, i.e., SRCs with either the head or the theta role reversed, which were produced by many three- and four-year-olds.
semantically inappropriate RCs cannot be explained by corpus frequency and, specifically, not by the input children are exposed to. They could serve as a precursor of passive RCs: children who have not yet mastered the relevant ingredients to derive a passive predicate (e.g., see,
de Villiers and de Villiers 1985;
Fritzenschaft 1994;
Snyder and Hyams 2015;
Contemori and Belletti 2014) may produce SRCs on the patient with active verbal morphology.
In a nutshell, our finding on passive RCs and semantically inappropriate RCs attests that there are at least some cases where the most frequent and the less complex grammatical variants do not overlap. The relation between frequency in natural speech and use of a given variant in a specific context is indirect: speakers may opt for the grammatically less complex computation rather than for the most frequent one.
For future research, we may ask whether the principles guiding the adults’ choice of a grammatical variant and the principles determining the order of acquisition necessarily have to be the same, as stated by classical frequentist accounts. As a case in point, children’s choice of
semantically inappropriate RCs and pronoun–noun ORCs was not reflected by the adult productions, indicating that adult use and children’s acquisition order of structures may be driven by (partially) different principles. This line of research may also have further implications for the long-standing debate between frequentist and generative accounts and more recent accounts integrating both views (e.g.,
Yang 2015).