Sound symbolism is the phenomenon by which people associate the sound of an utterance with its meaning (
Hinton et al. 1994, pp. 1–2). Sound symbolic words are often referred to as ‘ideophones’, which are commonly defined as ‘marked words that depict sensory images’ (
Dingemanse 2012, p. 654). Examples of these include
boom to describe an explosion or
sniff sniff to describe the action of sniffing. These are words that often accompany the image in multimodal works such as comics, picture books and (at times) cartoons and try to bring the action described by the picture to life but, at the same time, they also become problematic when it comes to transposing them into other languages. The translation of comics and, by extension, the translation of sound symbolism, remain significantly under-investigated topics within translation studies (
Borodo 2015, p. 22). The need for more research into this area means that recent papers on the matter have often been focused on the multimodality of the translation process for this particular medium. This is particularly relevant in the case of expressive and sound symbolic language in translation. The key concept in this case is that ‘meaning is not only communicated by language but also many other modes […] that in concrete circumstances possess equal meaning-making potential’ (
Borodo 2015, p. 23). The present study thus takes up the suggestion of both
Lathey (
2006, p. 11) and
Oittinen (
2008a, p. 3) that areas of research in translation studies should include analysis of the relationship between the verbal and the visual in illustration and picture books. As pointed out by
Oittinen (
2005), ‘almost nothing visual can be altered [and] translators need to pay attention to the visual aspect and design their texts so that they conform to the visual information’ (p. 47). For this paper, the examples will be taken from Jeff Kinney’s young adult series
Diary of a Wimpy Kid (2007–), named
Diario di una schiappa in Italian, which portrays the life of an eleven-year-old, undersized middle school boy named Greg. The illustrations portraying Greg would not have the same humorous effect without the words that accompany them. Ideophones, typical of comics’ balloons (
Kaindl 1999, p. 274), are especially dependent on both the visual and the textual. They not only refer to the actual action as portrayed in the picture but also aim to explain and enhance the text’s expressivity through their shape, size, and colour, requiring a high level of creativity (
Oittinen 2008b) and linguistic knowledge on the translator’s part in order to create an even stronger bond between text and pictures.
1.1. Sound Symbolism, Ideophones, Onomatopoeias, Interjections
An excerpt from
Casas-Tost (
2014, p. 40) defines ideophones as ‘phonetically driven words […] with enormous expressive capacity’. In layman’s terms, ideophones are expressive words that try to represent a certain sense, be it a sound or a feeling, often via phonetic means. As such, they do not only depict sonic experiences but often also cover other sensorial realms (motion, smell, touch and temperature, for instance) and emotions. ‘Onomatopoeia’ is the historical term that has been used at times to refer to this type of words, although the term is partially limiting as it only refers to strings that imitate sonic events (
ribbet for a frog or
bow-wow for a dog’s bark for instance). ‘Interjections’, instead, are defined as a ‘typical case of emotional language’ (
Poggi 2009, p. 1). These are words that are uttered by speakers via single-word phrases and are used to express certain feelings openly.
Ouch is for instance an interjection used in English to express pain, whilst
hoorah is uttered to express happiness. The main difference between ideophones and interjections lies in the fact that interjections are meant to be uttered by the speaker out loud and host a completely arbitrary relationship between sign and meaning. The phonetic substance of ideophones is instead often directly linked to their meaning (
Lyons 1977, p. 75), a phenomenon that is formally defined as ‘phonaesthesia’, whereby certain vocalic or consonantal strings seem to consistently be used to refer to specific senses, with
Reid (
1967) being one of the pioneers in this type of scholarly inquiries. ‘Phonaestheme’ is also an important term that was coined by
Firth (
1930) and refers to these specific clusters of sounds that are recurrently used to refer to specific meanings across words—see, for instance, the use of the cluster <gl-> in words that refer to shininess such as ‘glow’, ‘glitter’ or ‘gloss’ (topic also later analysed by
Bolinger 1965). Still, interjections can and do sometimes include certain sound symbolic features so the boundary between the two concepts can at times be blurred and can also depend on personal perspective. Indeed, the marked nature of sound symbolic words goes hand in hand with the classificatory difficulties within this area of scholarship. ‘Not only is the use of the term ‘ideophone’ not always completely agreed upon, but there is also no completely agreed-upon set of features characterising ideophones’ (
Smoll 2012, p. 2), as the term is often used to describe forms that vary across languages in terms of linguistic features. It follows that, as ideophone inventories are elaborated in different ways across languages, any categorisations will never (and should not) claim to be strict or all-inclusive. For instance, in certain African languages ideophones are usually adverbial and not onomatopoeic, as they are not trying to evoke sounds but, more frequently, whole situations and modes that other languages can only paraphrase by using several words. On the other hand, in studies that focus on Asian languages, they are usually called ‘mimetics’, ‘expressives’ or ‘descriptives’ (
Smoll 2012, p. 2). For the purpose of this paper, the term ‘ideophone’ will be used as an epithet to refer to any iconic words that are embedded in the pictures included in the literary works under scrutiny. These are sometimes actual English words that one might not actually instinctively see as being ideophonic, see the use of
crawl crawl to signal a person crawling on the floor. Nevertheless, considering the context in which these words are used makes them highly symbolic and expressive, referring to them as ‘ideophones’ seemed like the most appropriate terminology to use in this case. ‘Phonaesthesia’ will be used to refer to the specific phenomenon of certain phonemes and graphemes carrying recurrent meanings.
Another important distinction that is made throughout the analysis is the one between lexicalised and non-lexicalised ideophones. While lexicalised ideophones can be found in dictionaries, being official words in the language, non-lexicalised ideophones are instead originally nonsensical and they do not have official status as words in the language. In the latter case, their meaning comes from both phonaesthesic properties (where the use of certain phonemes in the word elicits certain senses) and the picture they are embedded in. When judging whether a form belonged to either of the two categories, specific online dictionaries have been used: ‘Il Nuovo De Mauro Dictionary of Language in Use’, 2022 (
De Mauro 2022), for Italian and ‘Merriam Webster Online’, 2022 (
Merriam-Webster 2022), for English. If a form was found in the relevant dictionary, it was considered as being ‘lexicalised’.
As highly flexible and adaptable devices, ideophones are studied within various disciplines: ‘semiotics, psycholinguistics, semantic typology, corpus linguistics, conversation analysis, and the ethnography of speaking’ (
Dingemanse 2012, p. 654) to name but a few. Considering that every language has its own phonological system, ideophonic coining ‘is largely dependent on the phonemes and phonemic combinations of the language’ (
Marchand 1969, p. 402). Kilian-Hatz’s definition of ideophones seems to reiterate this same concept, emphasizing their highly ‘dramaturgic function’ (cited in
Voeltz et al. 2001, p. 2):
Ideophones do not simply describe a state or event, but rather they
simulate it, allowing the speaker to perform the event and raising the
illusion that it is occurring at the moment of the utterance (
Kilian-Hatz 2001, p. 155; emphasis added).
The ideophone
bang does not describe or allude to the explosion (as the word ‘explosion’ does) so much as it attempts to
perform the explosion itself. Regarding the imitativeness of ideophones,
Assaneo et al. (
2011) point out that the actual imitative value of ideophones is the subject of a challenging debate. The real acoustic properties of ‘collisions, bursts and strikes are in fact very different from the string of consonants and vowels forming their onomatopoeic written forms’ (
Assaneo et al. 2011, p. 1), which are limited to a variable number of phonetic categories (
Tsur 1992, p. 18). For this reason,
Assaneo et al. (
2011) suggest that a better way to define the ideophone is to describe it as ‘a transformation of a sound into
the best possible speech element’. By defining it thus, they are not intending to wholly reject the phonaesthesic nature of languages (a property that is sometimes referred to as ‘iconicity’) but rather remind us of its limits, as ‘every language has its own phonological system […] and is thus dependent on a particular set of phonemes and phonemic combinations’ (
Marchand 1969, p. 402). When environmental sounds are transformed into speech elements, not everything can be acoustically preserved. ‘The choice of speech sounds is to certain extent accidental’ (
Marchand 1969, p. 401). Iconicity comes also from tacit conventions among the speakers of a language and is closely dependent on the way natural sounds have been converted in order to fit the language’s alphabet and phonological classes (
Bueno Pérez 1994, p. 18;
Catricala n.d.). Therefore, despite the fact that English cats might not exactly pronounce the word ‘meow’ when they are meowing, it is an accepted convention that an ideophone is the closest written representation of the sound emitted by cats. The same logic applies to the Italian
miao or the French
miaou. It follows that different degrees of iconicity exist, such that certain representations are undoubtedly closer to their real acoustic output than others (
Beccaria 1994, p. 380).
Indeed, the ease with which ideophonic neologisms are created and accepted into a language (
Chapman 1984, p. 156) has not only bestowed the device with a rare linguistic flexibility, but has also created interest among writers, who have learnt to exploit it in order to ‘give free reign to their imagination’ (
Chapman 1984, p. 133). Poems, stories, lullabies and traditional songs were in fact the first testing grounds for sound symbolic experimentations (
Mphande 1992, p. 118). Used to convey ‘dramatic enhancement in narrative texts’ (
Mphande 1992, p. 118), this linguistic device has led writers to a ‘highly evocative onomatopoeic coinage and several ambitious attempts’ (
Chapman 1984, p. 137). In the first case, ideophones are mainly used as mnemonic devices to make a product more memorable for customers: ‘Plop, plop, fizz, fizz, oh what a relief it is’ (slogan of Alka Seltzer); ‘Snap, crackle, pop’ (slogan of Rice Krispies cereals); ‘Bubble Yum. It’s so much yum, yum, yum’ (slogan of Yum Bubble Gum) and so on (examples mentioned in
Crystal 2007, p. 252). In the case of comics and picture books, the genres’ propensity to mix different modes of expressions, particularly verbal and visual signs, has given editors and cartoonists the chance to flaunt ‘a vigorous and bold invention of onomatopoeia beyond that of most literary writers’ (
Chapman 1984, p. 139), with the final result of successfully renewing some of the traditional forms.
1.2. Italian v. English Sound Symbolism
Italian and English are linguistically very distinct languages, and this is reflected in the way they treat and deal with ideophones. The Italian language, due to Latin influences and its inflectional nature (
Crystal 2005, p. 368) (that is, often a word needs to be modified according to gender and number), possesses predominantly multi-syllabic words. The English language, in contrast, due to its Germanic roots, possesses a greater number of monosyllabic words, which makes the English language naturally inclined to create sound symbolism and use it with ease in everyday speech (
Garcia De Diego 1968, p. 18;
Allott 1995). In order to create ideophones that sound and look ‘naturally’ Italian, one would need to use words ending in vowels, as is demonstrated in the long, vowel-ending expressions that have been devised by the few Italian poets, writers and cartoonists who have attempted to invent ideophones (
Arcangeli 2009).
The ease with which English speakers can create ideophones and the way these naturally fit in the language implies that they can be easily used when speaking. This issue takes us to another linguistic characteristic of the English language, the possibility of converting word classes (
Newmark 1996). The noun ‘slap’, for example, readily functions as a verb with minimal grammatical modification that does not reduce its iconic effect (
to slap,
slaps,
slapped). The Italian language, again due to its inflectional characteristics, needs to inflect the term when changing class, so the noun
schiaffo (‘slap’) becomes
schiaffeggiare (‘to slap’). This characteristic of the English language implies that English speakers can easily import sound symbolic words into the language and use them in everyday speech, by converting them without adding any suffix or endings. This implies that, when reading a comic or a picture book, an English reader is visualising sound symbolism commonly used in speech, consequently making the whole reading experience more familiar.
Scholars (
Dingemanse 2012, p. 664;
Smoll 2012, p. 24;
De La Cruz Cabanillas and Tejedor Martínez 2009, p. 50;
Beccaria 1994, p. 317) have come to the conclusion that generating a typological classification of languages according to their use of ideophones is a difficult (and perhaps not even useful) task, due not only to cross-linguistic discrepancies in terms of phonology, morphology, syntax and semantics, but also, and more importantly, because of the high degree of variability of forms and functions within each individual language (
Smoll 2012, p. 22). A more fruitful approach to the study of sound symbolism would involve an investigation of the common (or divergent) cross-linguistic properties, especially by focusing on the way ideophones are actually used or translated and how they affect and shape languages. ‘Naturalistic corpus data is therefore likely to play an essential role in this enterprise’ (
Dingemanse 2012, p. 657).
Linguistic properties are not the sole source of this cross- and intra-linguistic flexibility. A few scholars (
Beccaria 1994, p. 380;
Oszmiańska 2001;
Crystal 2007, p. 250;
Casas-Tost 2014, p. 41) draw attention to the cultural reasons behind the use of ideophones, stressing the importance of social norms and historical facts in determining the success or partial failure of the way sound symbolism is used for communication purposes. This is also generally true in relation to verbal and visual elements in interaction, as ‘meanings belong to culture, rather than specific semiotic modes’, hence ‘the way some things can be said either visually or verbally is also culturally and historically specific’ (
Kress and Van Leeuwen 2006, p. 2). The Japanese language, for example, has an extended array of sound symbolic forms that are employed not only in colloquial contexts, as happens in many languages, but also in formal situations. Scholars agree that this is partly due to the ‘kinaesthetic’ nature of Japanese culture, which tends to favour ‘silence and visual forms of communication’ (
Maynard 1997, p. 180)—a characteristic that might also explain the notable success of comic art forms such as manga and anime in Japan. Oszmiańska adds that, since the structure of ideophones is determined by linguistic rules, language structure will always ‘overshadow the iconic principle’ (
Oszmiańska 2001, p. 154).
Various foreign languages have incorporated Anglophonic sound symbolism into their array of iconic expressions, Italian being one of them. Japanese is possibly one of the few languages ‘so far largely uncolonised by Anglo-American sound-words’ (
Thomas 2004), probably due to the already intricate system of sound symbolic form present in the language.
Iconic forms indirectly affect the literary, cultural and historical contexts of a language and its speakers. As highlighted by
Assaneo et al. (
2011, p. 8), ideophones, ‘as any other word with a long tradition […], contain elements accumulated across history, elements beyond pure acoustic imitation’. Cross-modal relationships are therefore highly useful when attempting to explain the nature of verbal (but also vocal) sound symbolism (
Assaneo et al. 2011, p. 8). In general, and particularly for this paper, research on ideophone systems will also impact on more widely relevant fields such as language imitation and evolution, verbal art and cultural studies. But, more importantly, it will assist the everlasting inquiry into how sensory images are interpreted/translated in vernaculars (
Dingemanse 2012, p. 664) and how environment may shape languages in particular ways (
Smoll 2012, p. 24).
1.3. The Challenge of Translating Sound Symbolism
Historically, most of the existing Italian literature regarding the translation of children’s books has focused on the characteristics of children’s literature rather than the strategies involved during the translating process. One of the first Italian scholars to draw attention to the issues surrounding the translation of Italian comics was
Umberto Eco (
2003,
2008), who has always shown an interest in the translation of children’s literature and has dedicated some reflections to the translation of Italian Disney comics. Key to the advancement of the field as a fully fledged academic subject was, instead, the early work of
Federico Zanettin (
2008,
2013,
2014)—which then inspired later work from the author of this article (
Pischedda 2017,
2020). Historically, the approach to children’s literature in translation has gone from a prescriptive one (how should you translate?) to a more descriptive one (how do texts present themselves as translations?) (
Van Coillie 2006, p. vi). The strategy of ‘cultural context adaptation’, first suggested by
Klingberg (
1986, p. 86), is still believed to be useful (
Frimmelova 2010, p. 30) and it is taken as an umbrella term for a series of strategies aimed at ‘moving the source text towards the child reader’ (
Lathey 2006, p. 7). Since young readers are not expected to have the same world knowledge as an adult, the translator can be more flexible (
Bell 1985, p. 7) and help the child reader by adapting and localising references coming directly from the source culture. The translation of ideophones adds another level of difficulty to the translator’s task, involving the age-old linguistic and philosophical debate surrounding the relationship between sound and meaning. It also offers food for thought in relation to research on translation for young readers, which has included empirical analysis of the reactions of the ‘interpreting’ young reader to the translated text (
Oittinen 1993, p. 37) and the creativity and inventiveness involved in the translating process (
Lathey 2010, pp. 199–200).
The linguistic barriers to creating sound symbolism in Italian have led translators to often leave the English ideophone in place without localising it (
Semprini 2006, p. 42;
Eco 1994, p. 147). As a consequence, Italian readers of comics are presented with ideophones that are not included in their linguistic repertoire and, very often, are not even understandable. Adult readers of comics had to learn English ideophones in order to easily recognise their meaning. Children who had just entered the world of comic and picture books were faced with foreign sound symbolic words which they could not understand and which did not sound and look familiar or as belonging to their mother tongue, and hence their comprehension of the target text and their reading experience were irreparably jeopardised (Abòs in
Castillo Cañellas 1997). For all these reasons, we can see how problematic the matter is from a translating, but also educational, point of view.
However, there are also non-linguistic reasons for the frequent usage of English ideophones—the American monopoly of the market of young readers’ literature (
Thomas 2004). American comic books together with their ideophones became famous all over the world, allowing foreign readers to get used to English sound symbolism (
Valero Garcés 2008, p. 247) and accept it as a standard way of expressing the sounds contained in balloons. Note that in Italy, before the arrival of American comic books in the 1930s, comics did not contain any sound effects to accompany the image (
Eco 2008).
The struggle experienced by the Italian language to successfully employ sound symbolism in everyday speech has led many translators to leave English sound symbolism intact in Italian translations. Child readers, due to their limited knowledge of the world, should be guided through the translated text and often the translator is expected to localise the source text material more than when translating for adults (
Tortoriello 2006;
Frimmelova 2010). This prevailing strategy thus seems frequently neglected when translating ideophones in comic and picture books. The following research issues and questions have been the focus of my examination of the strategies used to translate ideophones in the third and fourth books of the
Diary of a Wimpy Kid series into Italian: which type of sound symbolic word tends to be adapted more frequently, non-lexicalised ideophones which invoke the illocutionary force of the utterance more than its minimal semantic content, or lexicalised ideophones such as
groan? Why? In which cases has the ideophone been left intact? In which cases has it been localised? The results will be used to formulate a series of hypotheses that will prove fruitful for future research on similar topics.