1. Introduction
Idioms, such as “to bury the hatchet” (meaning to end a conflict), form an important part of the native speaker’s phrasal vocabulary, which is frequently used in everyday communication (
Pawley and Syder 1983). One of the key questions in psycholinguistic research on idioms concerns how this knowledge is represented and accessed by the language user. Hybrid theories of idiom representation and processing (e.g.,
Cacciari and Tabossi 1988;
Titone and Connine 1999;
Sprenger et al. 2006) assume that idioms are represented both in terms of their lexical elements and in terms of an overarching idiom representation that serves to connect these elements with the phrasal meaning (e.g., a
superlemma). These theories therefore predict that lexical co-occurrence within a fixed expression has a lasting effect on the structure of the mental lexicon: words that share aspects of neither their meaning nor their form will be “wired together”, because they are processed as part of the same fixed linguistic structure. While the main argument for such a structure is parsimony (i.e., idioms make use of existing lexical representations with their own literal meanings), it also can explain how idioms contribute to fluent speech (
Dechert 1983;
Pawley and Syder 1983;
Kuiper 1995) and more effortless comprehension: whenever one element of an idiom is activated, it is predicted to spread activation to the idiom’s remaining lexical elements, making the retrieval of those words from long-term memory faster and less error-prone.
In line with this hypothesis, the facilitatory effects of idioms in their phrasal contexts have consistently been observed in language comprehension research (see
Conklin and Schmitt 2012 for a review). For example, in an eye-tracking study,
Siyanova-Chanturia et al. (
2011) showed that native speakers of English read idiomatic sequences
(left a bad taste in my mouth) significantly faster than matched control phrases
(the bad taste left in his mouth). Likewise,
Carrol and Conklin (
2020) tested reading times for idioms
(spill the beans) but also for binomials
(bread and butter) and collocations (defined by the authors as “combinations of words that are entirely compositional and semantically ‘free’, but which co-occur in conventional and recurrent patterns” (p. 97); e.g.,
classic example). They found significant processing advantages for all three types of formulaic language, with the phrase frequency being a particularly strong predictor of the response-to-idiom-reading time, and with greater idiom familiarity leading to a higher rate of final-word skipping. The latter finding implies that readers could predict the final word because they had already recognized the idiom.
Taken together, experiments on idiomatic processing advantages in phrasal contexts support hybrid models of idiom representation and processing, suggesting tight links between the words of an idiom that are mediated by a common idiom representation. Here, we want to take this research a step further by studying the processing of lexical elements of idioms in isolation. More specifically, we ask whether the long-term storage of idiomatic expressions affects the organization of single lexical items in the mental lexicon. If idiom representations indeed tie their lexical elements together, enabling spreading activation from one element to another, the facilitatory effects should not depend on idiom recognition in a phrasal context but should independently occur for lexical elements. In other words, bury should prime hatchet, even if presented in isolation. To this aim, we studied the representation and processing of idiom words without a phrasal context.
We tested our hypothesis that idiom words can prime each other when presented in isolation by means of a primed lexical decision task with idiomatic prime–target pairs mixed with pairs that are
semantically related, are
unrelated, or contain
nonwords. We report the results of two experiments in which we varied the Stimulus Onset Asynchrony (SOA) between the prime and target. If idiom knowledge results in the creation of connections between the idiom’s lexical elements in the mental lexicon, we should see facilitatory priming effects between idiom words in a primed lexical decision task. An additional question concerns the nature of these effects. In our view, an idiomatic priming effect should be qualitatively different from the effect of semantic priming, as it originates at a different level of processing, involving different types of representations. This assumption is based on
Collins and Loftus’s (
1975) highly influential model of semantic memory, which is also reflected in
Levelt et al.’s (
1999) model of word production and, accordingly, the superlemma model (
Sprenger et al. 2006). Collins and Loftus clearly distinguish between a semantic (conceptual) network and a lexical network (i.e., what we refer to as the
mental lexicon). In the case of idiomatic priming, the connection between the two words involved is located within the lexicon, mediated by a phrasal representation, and therefore independent of possible semantic links between the underlying concepts. In contrast, in the case of semantic priming, the representations involved are abstract concepts that represent the meaning of a word. If these concepts share semantic features, they are expected to prime each other (
Lucas 2000). If the nature of the connection between idiom words differs from that of an associative semantic connection because it is mediated by a common idiom representation, rather than semantic feature overlap, the time course of idiom word priming should differ from the time course of semantic priming. Whether semantic or idiomatic priming should be faster cannot easily be predicted, though: while activation spreading via a common idiom representation might be somewhat slower than the activation of a semantic network due to the additional node, the level of lexical processing precedes that of semantic analysis during comprehension. We therefore predict priming effects for both types of relationships, with a possible time course difference in either direction.
Importantly, research on semantic memory (which typically uses verbal materials) has looked into prime–target pairs that are quite similar to those under consideration in our study (
Roelke et al. 2018;
Hofmann et al. 2022), based on the idea that the frequency with which words co-occur in a language plays an important role in the organization of their long-term storage. Specifically, compound-cue theory (
McKoon and Ratcliff 1989,
1992;
Ratcliff and McKoon 1988,
1995) states that priming effects result from the familiarity of the prime and target as a
compound (not in the linguistic sense), and that such a compound “is formed by the simultaneous presence of the prime and target in short-term memory as a test item” (
McKoon and Ratcliff 1992, p. 1155). We will return to the similarities and differences between an approach based on models of idiom representation and processing and the compound-cue theory of semantic priming in the discussion. It should be noted that both approaches have one thing in common: they consider the extent of priming that can be observed for non-associatively related word pairs to be affected by their familiarity, which
McKoon and Ratcliff (
1992) operationalized as the frequency of co-occurrence in a linguistic corpus. However, while the compound-cue theory locates the familiarity effect at the conceptual level, theories of idiom representation and processing locate it in the mental lexicon (i.e., at the level of linguistic representations).
4. Discussion
In the current study, we tested the prediction that prime words that are related to a target word via an idiomatic representation show facilitatory priming, even if the prime and target are presented without any phrasal context and the words themselves are not semantically related. That is, we expected to see, for example, the word “bury” prime the word “hatchet”, as they are both part of the familiar idiom “to bury the hatchet”. We derived this prediction from psycholinguistic models of idiom representation and processing that assume connections between idiom words at the lexical level of processing (e.g., the superlemma model,
Sprenger et al. 2006). We tested our prediction, with native speakers of English, in two visually primed lexical decision experiments that differed with respect to the timing of the prime and target words: in Experiment 1, the prime preceded the target by 500 ms (with an ISI of 150), and in Experiment 2, by 350 ms (with an ISI of 0). In both experiments, we included a semantic priming condition of the type DOCTOR–NURSE for comparison. A common analysis of the results of both experiments showed reliable facilitatory priming effects for both idiomatically and semantically related prime–target pairs. In addition, we observed two effects that point to possible differences in the underlying mechanisms responsible for the priming effects: first, the idiomatic priming effect that we found is, on average, only half as strong as the semantic priming effect. Second, the semantic priming effect, but not the idiomatic priming effect, was modulated by our SOA manipulation, with stronger effects in Experiment 2 (shorter SOA). We will discuss these findings and their implications in turn.
First, the finding that idiom word pairs that are not semantically related show facilitatory priming confirms our prediction and supports hybrid models of idiom representation and processing (e.g.,
Sprenger et al. 2006). According to this
lexical view, native speakers of American English have linked the words
bury and
hatchet by means of a phrasal representation in the mental lexicon, because the two words appear together in the same idiom. In addition, the idiom word priming effect is consistent with the literature on processing advantages for fixed phrases, such as faster reading times (e.g.,
Siyanova-Chanturia et al. 2011) and more fluent production (e.g.,
Pawley and Syder 1983;
Kuiper 1995).
Second, the observed difference in effect size between semantic and idiomatic priming is in line with the idea of a possible difference in the underlying priming mechanisms: direct connections between conceptual representations in the case of semantic priming, and indirect connections between lexical representations that are mediated by a phrasal representation in the case of idiomatic priming. At the same time, however, other factors may contribute to the difference as well. For example, semantic associations of the type DOCTOR–NURSE are presumably more frequent than the idioms of our idiomatic word pairs, and they most probably have an earlier age of acquisition than most idioms (
Sprenger et al. 2019;
Carrol 2023). We did not control for frequency, and—due to the differences in the nature of the linguistic contexts in which these items can be expected to appear and the need to control for idiomaticity—doing so would not be straightforward, but we agree with an anonymous reviewer that this could be a valuable addition to future research. In contrast, controlling for age-of-acquisition effects is unfortunately impossible, as early idiom acquisition data are virtually nonexistent.
Third, similar to the difference in effect size, the effect of our SOA manipulation suggests a possible difference in the time course of semantic and idiomatic priming and, therefore, a difference in the underlying mechanisms as well. While we find that the semantic priming effect is stronger at the shorter SOA (and has been shown to be even faster in lab-based studies, e.g.,
Neely 1991), the idiomatic priming effect overall is still comparatively small at the SOAs that we tested. When analyzing the results of the experiments separately, we found the effect to be more reliable at the shorter SOA, but this was not reflected in a significant interaction with Experiment in the common model. Thus, while the effects of the two types of priming are not the same, they are also not clearly dissimilar in terms of their temporal distribution. We therefore interpret these differences with the necessary caution, leaving the question about a time course difference between the two types of priming open. Future research will have to show whether the idiom priming effect can be further optimized at longer SOAs. This will be informative for models of idiom representation and processing, as such studies on idiom words without context can provide us with an upper boundary for the speed with which one word of a phrase can activate its remaining elements. More importantly, such data could help us to understand the nature of the priming processes involved. As we mentioned in the introduction, the assumption of an additional phrasal representation that mediates idiomatic priming effects may lead to the prediction that idiomatic priming is inherently
slower than associative semantic priming due to the additional computational cost that is related to the idiom node’s activation. In contrast, the opposite effect may be predicted as well: as the level of lexical processing precedes that of semantic analysis during comprehension, any effect of a lexical association within the lexicon itself (i.e., non-conceptual) may be faster than the effects of semantic relatedness. As a third option, both processes may act in concert, with idioms losing some speed to the extra node on the one side while gaining some speed, due to the “early” nature of the processes, on the other. Follow-up studies could help us to reduce the number of options, as could computational modeling.
While our study was motivated by theoretical models of idiom representation and processing, our findings are also consistent with the compound-cue theory of semantic priming: we cannot exclude that the fact that
bury and
hatchet frequently co-occur may by itself be enough reason for their representations to become linked together in memory. Whether such associations are conceptual or lexical in nature does not follow clearly from the theory. In the words of
Ratcliff and McKoon (
1988), “
the automatic component of facilitation would be neither pre-nor postlexical [i.e., before or after lexical access], as those terms are usually used, but a product of the joint association of prime and target”. Most importantly, however, no additional phrasal storage seems required within their framework. If we commit to the principle that simpler explanations are to be preferred above more complex ones (“Occam’s razor”), one might therefore conclude that the odds seem to favor the statistical explanation over a phrasal memory explanation at this point.
However, before we throw in the towel, it is worthwhile to take a closer look at those statistics. More recent studies on semantic memory (
Roelke et al. 2018;
Hofmann et al. 2022) have compared the effects of “direct association” (i.e., association without semantic overlap) to
semantic only or
associative and semantic relations between prime and target words. In their approach, Roelke et al. follow the strategy employed by
McKoon and Ratcliff (
1992) to base the association on statistical measures of co-occurrence, but based on a larger database. That is, they extracted “directly associated” word pairs from a German 43-million-sentence corpus that consists of more than 7.5 million word types (
Quasthoff et al. 2006) by calculating the likelihood of all possible word pairs and subsequently selecting the top associates for their “pure associative” condition (i.e., word pairs that do not show semantic feature overlap but nevertheless are associates). Lexical decision data with SOA = 200 ms and SOA = 1000 ms show that “pure associative” (high co-occurrence, but no feature overlap) and “semantic” (high semantic feature overlap) priming were equally effective at the short SOA. However, the effect changed with the SOA, with associative priming being significantly stronger than semantic priming at the 1000 ms SOA. The authors conclude that associative and semantic priming can be dissociated from each other. In other words, they seem to demonstrate that statistical co-occurrence is the driving force behind facilitatory priming between words that do not have any common associates (and thus appear to be unrelated), and that this effect follows a different time course than actual semantic priming does.
Yet, an inspection of their stimulus set shows that for the large majority of their
associative word pairs (at least 36 out of 50), the words form constituents of well-known German multi-word expressions. For example,
Kuh and
Eis (
cow and
ice) are part of the idiom
die Kuh vom Eis holen (to get the cow off the ice, to save the situation),
Flut and
Ebbe (flow and
ebb) are part of the binomial
Ebbe und Flut (ebb and flow), and
Tafel and
Kreide (chalkboard and chalk) form the compound
Tafelkreide (
chalkboard chalk). In other words, the statistical approach to word association reveals that the most common “pure” associates are predicted by the language’s phrasal vocabulary. This includes not only figurative language, such as idioms, but also, for example, common literal expressions (allen
Bedenken zum
Trotz, in spite of all considerations), literary movements (
Sturm und Drang, storm and stress), and movie titles (
Der Schuh des Manitu,
The Shoe of Manitou)
2. Put differently, idiomatic—or rather
phrasal—associations are the best explanation for the way in which strong associations between words that are not semantically related come about. In addition, they are also a fairly good explanation for the association strength between words that are both associatively
and semantically related. In Roelke et al.’s list of stimuli, many pairs in the
Associative+Semantic category (at least 12 out of 50) seem to have phrasal origins as well. For example,
Kaffee and
Tasse (coffee and cup) form the compound
Kaffeetasse (coffee cup),
Banane and
Schale (banana and peel) form the compound
Bananenschale (banana peel), and
Hopfen and
Malz (hop and malt) are part of the idiom
da ist Hopfen und Malz verloren (hop and malt are lost there, that situation cannot be saved).
With respect to the effect that these associations have in a primed lexical decision task, our findings converge with those of
Roelke et al. (
2018): both semantic and associative pairs show facilitation, with semantic effects being strongest at a short SOA and associative effects profiting from a longer SOA. More importantly, however, our approaches diverge with respect to the processing level at which we locate the observed effects. While Roelke et al. aimed at investigating direct associations in semantic memory (as questioned by
Lucas 2000;
Hutchison 2003;
McNamara 2005), we locate the source of our priming effects in the mental lexicon. In (Psycho-)Linguistics, it has long been acknowledged that our linguistic long-term memory comprises not simply words and rules but also vast collections of fixed phrases (e.g.,
Pawley and Syder 1983;
Jackendoff 1995;
Wray 2003). Theories of idiom processing focus on figurative expressions, but the effects of phrasal storage can be observed for all kinds of chunks and at all kinds of ages. For example,
Bannard and Matthews (
2008) found frequency effects on the repetition of four-word chunks (
a drink of tea) already in 2-year-olds. Yet, while idioms take much longer to learn (e.g.,
Sprenger et al. 2019;
Carrol 2023), the ambiguity between their literal and figurative meanings has so far drawn the majority of the research. As idioms cannot be taken literally and, at the same time, depend on a specific configuration of words and grammar, language users must be able to access their phrasal representations fast and effortlessly during both comprehension and production. In comprehension, that includes quickly discarding the literal word meanings in favor of the idiomatic interpretation, once the idiom has been recognized (
Rommers et al. 2013). In other words, idiom word processing within an idiomatic context is not driven by lexical semantics. Accordingly, the relationships between an idiom’s constituent words in semantic memory alone cannot explain these processes. Instead, we need to acknowledge the linguistic nature of these relationships. More generally, our findings therefore contribute to current discussions about the way in which lexical and conceptual representations are connected in the human mind (e.g.,
Eviatar et al. 2023). In future work, it would be interesting to combine the approach by
Roelke et al. (
2018), who sought to separate the effects of co-occurrence and semantic feature overlap, with an approach that explicitly takes the phrasal vocabulary into account.
Here, we have shown that—in line with current theories of idiom processing—one idiom word facilitates the processing of another, even if they are not presented within a phrasal context, and without the words being semantically related. While statistical co-occurrence is probably an important factor for the acquisition of such sequences, we argue that the mechanism that is responsible for the observed effects can be found in the fact that these words are bound together by a common phrasal representation in the mental lexicon.