Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant

Bartnicki, Krzysztof; Drożdż, Stanisław; Kwapień, Jarosław; Stanisz, Tomasz

doi:10.3390/e27020177

Open AccessArticle

Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant

by

Krzysztof Bartnicki

¹,

Stanisław Drożdż

^2,3,*

,

Jarosław Kwapień

²

and

Tomasz Stanisz

²

¹

Independent Researcher, 53-201 Wrocław, Poland

²

Complex Systems Theory Department, Institute of Nuclear Physics, Polish Academy of Sciences, 31-342 Kraków, Poland

³

Faculty of Computer Science and Telecommunications, Cracow University of Technology, 31-155 Kraków, Poland

^*

Author to whom correspondence should be addressed.

Entropy 2025, 27(2), 177; https://doi.org/10.3390/e27020177

Submission received: 13 January 2025 / Revised: 3 February 2025 / Accepted: 5 February 2025 / Published: 7 February 2025

(This article belongs to the Special Issue Complexity Characteristics of Natural Language)

Download

Browse Figures

Versions Notes

Abstract

The complexity characteristics of texts written in natural languages are significantly related to the rules of punctuation. In particular, the distances between punctuation marks measured by the number of words quite universally follow the family of Weibull distributions known from survival analyses. However, the values of two parameters marking specific forms of these distributions distinguish specific languages. This is such a strong constraint that the punctuation distributions of texts translated from the original language into another adopt quantitative characteristics of the target language. All these changes take place within Weibull distributions such that the corresponding hazard functions are always increasing. Recent previous research shows that James Joyce’s famous novel Finnegans Wake is subject to such an extreme distribution from the Weibull family that the corresponding hazard function is clearly decreasing. At the same time, the distances of sentence-ending punctuation marks, determining the sentence length variability, have an almost perfect multifractal organization to an extent found nowhere else in the literature thus far. In the present contribution, based on several available translations (Dutch, French, German, Polish, and Russian) of Finnegans Wake, it is shown that the punctuation characteristics of this work remain largely translation-invariant, contrary to the common cases. These observations may constitute further evidence that Finnegans Wake is a translinguistic work in this respect as well, in line with Joyce’s original intention.

Keywords:

complex systems; natural language; multiscaling; punctuation; sentence length variability; discrete Weibull distribution

1. Introduction

A complexity science perspective provides valuable insights into the study of natural language as multiple traits identify it as a complex system [1,2]. Methods widely employed in analyzing complex systems—rooted in concepts from information theory [3,4,5], time series analysis [6,7,8,9,10], network science [11,12,13,14,15,16,17,18,19], and power-law probability distributions [20,21,22,23]—have been utilized to explore the quantitative characteristics of natural language. Understanding the mechanisms underlying these characteristics can significantly enhance natural language processing (NLP) and generation techniques, a particularly pertinent goal in the era of large language models (LLMs) [24,25], which are foundational to generative AI systems such as OpenAI’s ChatGPT, Microsoft’s Bing Chat, and Google’s Bard.

One language property recently analyzed using statistical approaches is punctuation use in written texts. Research demonstrates that the distribution of word counts between consecutive punctuation marks in literary texts generally follows a discrete Weibull distribution [26,27,28]. Remarkably, the two parameters of this Weibull distribution are largely language-specific, reflecting the distinctive punctuation characteristics of different languages. Moreover, translations of texts into another language in general retain the Weibull distribution but with parameter values corresponding to the target language [26,27]. Sentence-ending punctuation, such as periods, deviates from this rigidity, however. The intervals thus defined, representing sentence lengths, exhibit greater variability and are not as strictly bound by the Weibull distribution, enabling a more diverse range of patterns.

In this context, Finnegans Wake by James Joyce is a challenging object for quantitative scientific analyses as it constitutes one of the most complex and enigmatic works of literature in the English language. It is known for its experimental style, inventive language that convolutes English with dozens of other languages, and it thus induces intricate, layered meanings. Many words therein have multiple meanings, inviting readers to find several interpretations for each word, phrase, and sentence. Finnegans Wake is structured to reflect the logic of a dream rather than a conventional narrative. The book begins and ends mid-sentence, creating a circular structure where the last line connects back to the first, symbolizing an endless cycle. There is no clear plot or sequence; instead, the narrative spirals, looping back on itself with characters, symbols, and themes that morph and shift. It is supposed to explore how language mirrors human consciousness and the complexity of memory, experience, and culture. Already, from this perspective, it is natural to anticipate the appearance of various types of long-range correlations in the text of this book, which do not exist in traditional written texts.

Indeed, Finnegans Wake’s punctuation patterns, which reflect mutual arrangement in the organization of phrases and sentences, contain correlations incomparably more amazing than in any other work that has ever been subjected to this type of analysis. The most spectacular dimension of Finnegans Wake manifests in its full multifractality glory through the patterns of sentence length variability, largely paralleling model mathematical cascades. The second unusual property of this work is that, the longer the sequence of words uninterrupted by punctuation, the less likely it is that a punctuation mark will appear after the next word, which is a unique property among books. In the formal language of probabilistic survival analysis, this means that the hazard function expressing the need for a punctuation mark to appear if it has not been present in the sequence of words is decreasing. It is as if for a living organism the probability of surviving subsequent years increases with the age of that organism even at its age of maturity.

2. Materials and Methods

2.1. Finnegans Wake

James Joyce’s Finnegans Wake is a unique and challenging work of modernist literature, known for its complex structure, innovative use of language, and unconventional narrative techniques. The opening sentence is a continuation of the final sentence, creating a loop that reflects the cyclical themes in the book. Influenced by Giambattista Vico’s theory of historical cycles, the book explores patterns of rise, fall, and renewal across human history. The narrative mimics the structure of a dream, delving into subconscious associations and symbolic imagery. Central to the story is the Earwicker family, representing archetypes of father, mother, and children. The work draws on mythological, biblical, and literary sources, embedding archetypes like the trickster, the hero, and the everyman. The text eschews a traditional plot for a fragmented cyclical approach, mirroring the logic of dreams.

The book is divided into four parts. Part I (8 chapters) introduces the central characters and themes. It sets the stage for the dreamlike narrative, blending personal and universal histories. Part II (4 chapters) focuses on the Earwicker family, including explorations of their interpersonal dynamics and symbolic representations of societal archetypes. Part III (4 chapters) contrasts rationality and structure versus creativity and chaos of the two brothers. Finally, Part IV (1 chapter) acts as a coda, bringing together the cyclical motifs of renewal and closure as ALP’s (Anna Livia Plurabelle) monologue leads back to the beginning.

Finnegans Wake is often described as one of the most difficult works of literature due to its linguistic density and abstract narrative. It has been interpreted as a “universal dream”, capturing the collective unconscious and human experience across time and space. The text is rich with references to world literature, history, philosophy, and folklore. Sentences often break grammatical rules, defying conventional readability but inviting interpretive exploration. For all these reasons, Finnegans Wake is often described as one of the most difficult works of literature due to its linguistic density and abstract narrative. Therefore, the multiplicity of interpretations and the diversity of opinions—often contradictory even on the same issues—expressed by experts on Finnegans Wake in the existing extensive literature on the subject are inevitable. The reader interested in exploring the current state of views on Finnegans Wake is referred to the doctoral thesis of Krzysztof Bartnicki [29], co-author of the present work, who is also the author of the Polish translation of Finnegans Wake [30].

Translating James Joyce’s Finnegans Wake into other languages thus presents an array of unique challenges due to its experimental nature and narrative full of obscurities, ambiguities, and polyglot infusions. The text often prioritizes sound and rhythm over conventional grammar, creating a musicality that is integral to its meaning. Translating this musical quality into another language, while maintaining the sense of the text, is a delicate balancing act. Translators face the challenge of balancing fidelity to Joyce’s complexity with readability for their audience. While Finnegans Wake is intentionally opaque, readers in the target language may lack the cultural or linguistic tools that English-speaking readers can rely on. Joyce’s narrative incorporates multiple voices, tones, and registers, often blending them seamlessly. Translating this heteroglossia is difficult, especially in languages with less flexibility in tonal or dialectal variation. These are the reasons why available complete translations of this book into other languages are not very numerous but exist for several European languages. In addition to the original, five such widely recognized translations are used in the quantitative analysis presented here. They include translations into Dutch [31], French [32], German [33], Polish [30], and Russian [34].

2.2. Discrete Weibull Distribution

Punctuation can be understood as a mechanism for interrupting the continuous flow of words, thus enhancing the clarity of the message and providing necessary pauses for the reader. These functions align well with the framework of survival analysis [35]. Prior studies have established that distances between punctuation marks follow distributions that can be modeled using the discrete Weibull distribution [26,27]. This distribution is defined by the following probability mass function [36]:

f (k) = {(1 - p)}^{k^{β}} - {(1 - p)}^{{(k + 1)}^{β}}, p \in (0, 1), β > 0

(1)

and its cumulative distribution function:

F (k) = 1 - {(1 - p)}^{k^{β}} .

(2)

The cumulative distribution function represents the probability that the random variable exceeds the value k. The discrete Weibull distribution generalizes the geometric distribution, which is recovered when

β = 1

, resulting in a constant probability

F (k)

over time. When

β > 1

, the probability increases with time, while, for

β < 1

, it decreases. This distribution has broad applications in fields such as survival analysis, weather prediction, and textual data analysis [35,37,38]. In the context of punctuation,

f (k)

denotes the probability that a punctuation mark will appear precisely after k words.

A complementary approach to the above generalization can be readily and transparently formulated in terms of the hazard function

λ (k)

expressing the conditional probability that the kth trial will result in a success provided that no success has occurred in the preceding

k - 1

trials:

λ (k) = \frac{f (k)}{1 - F (k - 1)} .

(3)

In the case of the discrete Weibull distribution, it becomes [39]

λ (k) = 1 - {(1 - p)}^{k^{β} - {(k - 1)}^{β}} .

(4)

For data that exactly follow a Weibull distribution,

β > 1

corresponds to

λ (k)

, which is an increasing function of k. In other words, the probability of success increases with the number of preceding unsuccessful trials. The opposite applies to

β < 1

. In the memoryless case of

β = 1

, the hazard function is constant. Since

p = λ (1)

, the parameter p reflects the probability of putting a punctuation mark right after the first word following the last punctuation mark.

2.3. Multifractal Detrended Fluctuation Analysis (MFDFA)

Self-similarity, or the absence of a characteristic scale, is a defining feature of natural complex systems. Empirically, this property often manifests as a non-trivial temporal organization in measurement outcomes, represented as a time series. Specifically, complexity is frequently linked to a cascade-like hierarchy of data points exhibiting multiscaling, underscoring the importance of practical methods for identifying such structures in complex systems research [40]. Drawing on current research, multifractal detrended fluctuation analysis (MFDFA) has proven to be a highly reliable method for analyzing multiscaling structures [41,42]. This technique builds upon the widely adopted detrended fluctuation analysis (DFA) [43], offering a multiscale approach to studying hierarchical and multiscaling patterns. Below, we provide a concise outline of the key steps involved in the MFDFA algorithm.

Consider a time series

U = {u_{i}}_{i = 1}^{T}

, consisting of T consecutive measurements of an observable u. This series is divided into

M_{s}

non-overlapping windows, each of length s, starting from both ends of U, resulting in a total of

2 M_{s}

windows. To address potential non-stationarity in the signal, a detrending procedure is applied within each window to an integrated signal (referred to as the profile)

X = {x_{i}}_{i = 1}^{s}

, with elements defined as

x_{i} = \sum_{j = 1}^{i} u_{j} .

(5)

The detrending process involves fitting a polynomial

P^{(m)}

of order m (with

m = 2

used throughout this study) to the data X within each window

ν = 0, \dots, 2 M_{s} - 1

. The variance of the detrended signal obtained after subtracting the fitted polynomial is then computed as follows:

f^{2} (ν, s) = \frac{1}{s} \sum_{i = 1}^{s} {(x_{i} - P^{(m)} (i))}^{2} .

(6)

In the next step, a family of fluctuation functions of order q is defined using the average variance computed across all windows:

F_{q} (s) = {\{\frac{1}{2 M_{s}} \sum_{ν = 0}^{2 M_{s} - 1} {[f^{2} (ν, s)]}^{q / 2}\}}^{1 / q} .

(7)

Here, q is a real number. The fluctuation functions

F_{q} (s)

are computed for various values of the scale s and the index q. Typically, the minimum s is selected to exceed the length of the longest sequence of constant values in U, while the maximum ss is set to

T / 5

. Unlike s, there is no standard range for q. Since q is associated with the moments of the signal, extreme values should be avoided for time series with heavy tails to ensure meaningful results. If the fluctuation functions depend on s as power laws

F_{q} (s) \sim s^{h (q)}

(8)

for a number of different choices of q, it indicates that the time series under study is either monofractal (when

h (q)

is constant in q) or multifractal otherwise. The function

h (q)

is called the generalized Hurst exponent because, for

q = 2

,

h (q) = H

, where H is the standard Hurst exponent [44,45]. From a visual perspective, fractal

F_{q} (s)

organizations result in straight lines on double logarithmic plots.

A practical representation of the multifractal characteristics of data is the singularity spectrum

f (α)

, which can be derived from

h (q)

by the Legendre transform:

\begin{matrix} α = h (q) + q h^{'} (q), \\ f (α) = q [α - h (q)] + 1, \end{matrix}

(9)

Here,

α

represents the measure of data-point singularity, equivalent to the Hölder exponent. Geometrically,

f (α)

can be interpreted as the fractal dimension of the subset of the data characterized by a specific Hölder exponent

α

[46]. For a monofractal time series, the pair

(α, f (α))

reduces to a single point. In contrast, for a multifractal time series, it typically forms a downward-pointing parabola. The broader the singularity spectrum

f (α)

, the greater the richness of multifractality in the time series, which serves as an indicator of its complexity content. In some cases, the

f (α)

parabola may appear distorted or asymmetric, suggesting that data points of varying amplitudes exhibit different scaling behaviors [47,48,49,50,51].

3. Results and Discussion

3.1. Inter-Punctuation Intervals (IPIs)

The results illustrating the characteristics of the distributions of distances between all the consecutive punctuation marks for Finnegans Wake in the original and its five above-mentioned translations are shown in Figure 1. The panels in the left column show the corresponding normalized counts for increasing values of the distance k measured by the number of words between consecutive punctuation marks. These empirical distributions are then fitted by the formula of Equation (1), and parameters p and

β

of the best fits are explicitly provided in the corresponding panels. For Finnegans Wake in the original, it is a good fit, which was already presented in Ref. [52], but, remarkably, parameter

β

here is less than 1, in contrast to the multiplicity of all the other literary texts studied [26], written in the major European languages, even those classified as belonging to experimental literature [52]. In general, the Weibull functional form describes the distributions of distances between consecutive punctuation marks well, but the corresponding parameters are somewhat different and on average are language-specific, so they even undergo appropriate transformations in translations. The special case of Finnegans Wake analyzed here, in its uniqueness, turns out to be much less susceptible to changing values of these parameters. For the translations into Dutch and French, they even remain almost exactly unchanged. For the translation into Polish,

β

approaches the limit of 1 but does not exceed it, while, for typical texts in this language,

〈 β 〉 \approx 1.4

. For the two remaining translations, German and Russian,

β

exceeds the value of 1, especially clear for the Russian translation considered here, although, in both cases, it remains below the average values characteristic of these languages [26]. In this context, it should also be noted that the quality of the fit is slightly worse in these two cases, especially for small values of k.

A complementary insight into the specificity of the punctuation distribution is obtained through the hazard function

λ (k)

defined by Equation (3). It provides a better perspective on the asymptotics of the distribution at larger values of k. These functions, calculated directly from empirical data, (not from the p and

β

fit parameters using the Weibull distribution) are presented in parallel on the right side of Figure 1. As indicated, from this perspective, all the translations demonstrate similar decreasing asymptotics, with the Dutch and French translations having almost the same course as the English original. Apart from an initial slightly larger increase, the Polish translation behaves similarly. Such initial increases in

λ (k)

are more pronounced for the German and Russian translations, but, for larger k, the decreases start to prevail even in these cases.

3.2. Long-Range Correlations in IPIs

The distributions of fluctuations are some of the important characteristics of time series. With the same distributions, however, such series may differ in the arrangement of successive values relative to each other. The MFDFA formalism presented above is well suited to quantifying possible long-range correlations of this type and is therefore applied to the time series of successive IPIs analyzed here, proceeding through the entire text of the book.

The series of distances measured in word counts between all the consecutive punctuation marks for the original Finnegans Wake and its five translations are visualized in the left panels of Figure 2. The same scale is used on the vertical axis to make the relative values directly visible. As immediately evident, these series for the Dutch and French translations have strongly similar values and courses to those for the English original. The Polish translation is also quite consistent with these values. For the German translation, some correspondence is also noticeable, but there are clearly more small numbers, so the series is longer than in the above cases. An extreme case is the Russian translation, where the values are on average even significantly smaller, so their number in the series is even greater than for the German translation.

The nature of the long-range correlations encoded in these series is quantified within the MFDFA in terms of the fluctuation functions

F_{q} (s)

expressed in Equation (7). These functions, calculated for

- 4 \leq q \leq 4

and plotted in the right-hand panels of Figure 2, respectively, show clear scaling (straight line on the log–log scale), but this is not a convincing instance of multifractal scaling because the dependence of these functions on q is weak. These dependencies are rather close to monofractal. The slope of

F_{q} (s)

lines is, however, clearly larger than

0.5

, in particular the Hurst exponents (

H = h (q = 2)

, red line in this Figure) determined by Equation (8), found in the range

0.65 \leq H \leq 0.75

. This indicates clear long-range correlations of a persistent nature. The Polish translation, which is at the upper end of the H-value range, is the most extreme in this respect.

The illustration that complements this section, shown in Figure 3, explicitly demonstrates one of the extreme examples of differences in the punctuation distribution in some translations of Finnegans Wake. The plot presents the relative arrangement of punctuation marks in a sample two-paragraph excerpt from the book. The discussed excerpt starts at the beginning of the last paragraph on page 579 and ends at the end of page 580 in the original English version. Clearly, consistent with the previous observations, the first three cases (English, Dutch, and French) differ the least, while the Russian translation differs the most by generating many much shorter IPIs.

3.3. Multifractal Sentence Length Variability (SLV)

The distribution of distances between all the punctuation marks clearly follows certain fairly universal rigors, as demonstrated by previous recent studies and confirmed to a large extent even for such an exceptional work as the one examined here, Finnegans Wake. At the same time, however, the distributions of the punctuation marks ending sentences (such as periods, question marks, and exclamation marks) have much more freedom in this respect. The distances between them are precisely the lengths of sentences. It has already been shown [53] that Finnegans Wake develops a self-similar cascade of SLV with exceptionally richly developed multifractal properties. Such patterns appear only in literary works written using the narrative technique of stream of consciousness, and Finnegans Wake is the most spectacular in this respect, both in terms of the almost ideal symmetry of the resulting singularity spectrum (Equation (9)) as well as in the sense of the width of this spectrum. The main goal of the present study is to examine to what extent these properties are preserved in the available translations.

The results collected in Figure 4 clearly indicate that, in this respect, the translations largely faithfully preserve the multifractal characteristics of the original, and, in the two cases of French and Polish, this mapping is almost perfect. The patterns of variation in the successive sentence lengths are strikingly similar here, and this applies even to their absolute values, as can be seen on the vertical scales of the upper panels of this figure. Consequently, the total number of sentences is also similar, as is evident from the horizontal scales of these panels. This applies even to the Russian translation, which showed greater discrepancies in the case of full punctuation. The functions

F_{q} (s)

show very good scaling that is strongly dependent on q, comparable in all cases. As a consequence, the multifractal spectra

f (α)

are alike, broad (

Δ α \approx 1

), and vary only slightly. Minor asymmetries in the spectra can also be observed. For Dutch and Russian, they are right-sided, and for German they are left-sided. This indicates [49] that, in the first two cases, the hierarchy of multifractal correlations is somewhat more developed towards variability in shorter sentence lengths than longer ones, while, in the third case (German), it is the other way around. The multifractal spectra of the French and Polish translations are almost exactly symmetrical, similarly to the English original. All these spectra are also significantly shifted to the right with respect to

α = 0.5

, which signals a clear persistent trend in sentence length variability. This property is also confirmed in terms of the corresponding Hurst exponents H, which are calculated as scaling exponents for

F_{q = 2} (s)

, marked in red in these figures. Their corresponding values turn out to be even slightly higher than those when considering the distance variations between all the successive punctuation marks, as indicated in Figure 2.

4. Summary and Conclusions

This study explored the intricate punctuation patterns and sentence length variability within James Joyce’s Finnegans Wake and its translations into five languages: Dutch, French, German, Polish, and Russian. Through a combination of statistical methods, including the discrete Weibull distribution and multifractal detrended fluctuation analysis (MFDFA), it is identified that the punctuation use in Finnegans Wake defies conventional linguistic norms. Unlike typical texts, its punctuation patterns remain largely translation-invariant, reflecting Joyce’s intentional crafting of a translinguistic narrative as one likely factor.

The findings highlight two distinct characteristics. The text exhibits a rare decreasing hazard function in its punctuation intervals, a trait only observed in Finnegans Wake among the analyzed works. This uniqueness persists across most translations, with minor variations reflecting linguistic differences, or, which cannot be excluded, some bias related to the translation fidelity. As previously demonstrated [52,53] by the comparison of large corpora of literary texts, the multifractal properties of the sentence lengths in Finnegans Wake demonstrate an extraordinary degree of self-similarity and complexity. This trait is preserved remarkably well in translations, particularly in the French and Polish versions, underscoring the robustness of the text’s structural organization. These results contribute to a deeper understanding of the interplay between linguistic structure and translation fidelity, affirming the universal complexity embedded in Joyce’s work.

The study confirms that Finnegans Wake exemplifies a rare literary phenomenon where structural properties transcend linguistic boundaries, maintaining coherence in complexity across translations. This underscores the text’s suitability for cross-disciplinary analysis, superimposing linguistics, literature, and complexity science. By demonstrating that punctuation and sentence organization are integral to the text’s identity, this research highlights the intricate balance between translation and preservation of artistic intent.

Future research could extend these methods to other experimental literary works to further explore the universality of such translinguistic traits. Additionally, integrating advanced computational techniques and larger datasets might provide new insights into the quantitative characteristics of experimental literature, which explores the deeper layers and possibilities of natural language.

Author Contributions

Conceptualization, K.B., S.D., J.K. and T.S.; Methodology, S.D., J.K. and T.S.; Software, J.K. and T.S.; Validation, K.B., S.D., J.K. and T.S.; Formal analysis, S.D., J.K. and T.S.; Investigation, S.D., J.K. and T.S.; Resources, K.B. and T.S.; Data curation, K.B. and T.S.; Writing—original draft, S.D.; Writing—review and editing, K.B., S.D., J.K. and T.S.; Visualization, J.K. and T.S.; Supervision, S.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to copyright.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hébert-Dufresne, L.; Allard, A.; Garland, J.; Hobson, E.A.; Zaman, L. The path of complexity. Npj Complex. 2024, 1, 4. [Google Scholar] [CrossRef]
Kwapień, J.; Drożdż, S. Physical approach to complex systems. Phys. Rep. 2012, 515, 115–226. [Google Scholar] [CrossRef]
Dębowski, L. Information Theory Meets Power Laws: Stochastic Processes and Language Models; Wiley: Hoboken, NJ, USA, 2020. [Google Scholar] [CrossRef]
Takahira, R.; Tanaka-Ishii, K.; Dębowski, L. Entropy Rate Estimates for Natural Language—A New Extrapolation of Compressed Large-Scale Corpora. Entropy 2016, 18, 364. [Google Scholar] [CrossRef]
Montemurro, M.A.; Zanette, D.H. Universal Entropy of Word Ordering Across Linguistic Families. PLoS ONE 2011, 6, e19875. [Google Scholar] [CrossRef]
Alvarez-Lacalle, E.; Dorow, B.; Eckmann, J.P.; Moses, E. Hierarchical structures induce long-range dynamical correlations in written texts. PNAS 2006, 103, 7956–7961. [Google Scholar] [CrossRef] [PubMed]
Liu, J.; Gunn, E.; Youssef, F.; Tharayil, J.; Lansford, W.; Zeng, Y. Fractality in Chinese prose. Digit. Scholarsh. Humanit. 2023, 38, 604–620. [Google Scholar] [CrossRef]
Sánchez, D.; Zunino, L.; Gregorio, J.D.; Toral, R.; Mirasso, C. Ordinal analysis of lexical patterns. Chaos 2023, 33, 033121. [Google Scholar] [CrossRef]
Pawłowski, A. Time-Series analysis in linguistics: Application of the ARIMA method to cases of spoken Polish. J. Quant. Linguist. 1997, 4, 203–221. [Google Scholar] [CrossRef]
Kosmidis, K.; Kalampokis, A.; Argyrakis, P. Language time series analysis. Phys. A 2006, 370, 808–816. [Google Scholar] [CrossRef]
Cancho, R.F.i.; Solé, R.V. The small world of human language. Proc. R. Soc. Lond. Ser. B Biol. Sci. 2001, 268, 2261–2265. [Google Scholar] [CrossRef]
Amancio, D.R.; Antiqueira, L.; Pardo, T.A.S.; Da, L.; Costa, F.; Oliveira, O.N.; Nunes, M.G.V. Complex networks analysis of manual and machine translations. Int. J. Mod. Phys. C 2008, 19, 583–598. [Google Scholar] [CrossRef]
Liu, H.; Xu, C. Can syntactic networks indicate morphological complexity of a language? Europhys. Lett. 2011, 93, 28005. [Google Scholar] [CrossRef]
Cong, J.; Liu, H. Approaching human language with complex networks. Phys. Life Rev. 2014, 11, 598–618. [Google Scholar] [CrossRef]
Wachs-Lopes, G.A.; Rodrigues, P.S. Analyzing natural human language from the point of view of dynamic of a complex network. Expert Syst. Appl. 2016, 45, 8–22. [Google Scholar] [CrossRef]
Kulig, A.; Kwapień, J.; Stanisz, T.; Drożdż, S. In narrative texts punctuation marks obey the same statistics as words. Inf. Sci. 2017, 375, 98–113. [Google Scholar] [CrossRef]
Akimushkin, C.; Amancio, D.R.; Oliveira, O.N. Text authorship identified using the dynamics of word co-occurrence networks. PLoS ONE 2017, 12, 0170527. [Google Scholar] [CrossRef]
Stanisz, T.; Kwapień, J.; Drożdż, S. Linguistic data mining with complex networks: A stylometric-oriented approach. Inf. Sci. 2019, 482, 301–320. [Google Scholar] [CrossRef]
Raducha, T.; Gubiec, T. Predicting language diversity with complex networks. PLoS ONE 2018, 13, e0196593. [Google Scholar] [CrossRef] [PubMed]
Naranan, S.; Balasubrahmanyan, V. Models for power law relations in linguistics and information science. J. Quant. Linguist. 1998, 5, 35–61. [Google Scholar] [CrossRef]
Newman, M. Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 2005, 46, 323–351. [Google Scholar] [CrossRef]
Ausloos, M. Punctuation effects in english and esperanto texts. Phys. A 2010, 389, 2835–2840. [Google Scholar] [CrossRef]
Piantadosi, S.T. Zipf’s word frequency law in natural language: A critical review and future directions. Psychon. Bull. Rev. 2014, 21, 1112–1130. [Google Scholar] [CrossRef]
Shanahan, M.; McDonell, K.; Reynolds, L. Role play with large language models. Nature 2023, 623, 493–498. [Google Scholar] [CrossRef]
Zhao, W.X.; Zhou, K.; Li, J.; Tang, T.; Wang, X.; Hou, Y.; Min, Y.; Zhang, B.; Zhang, J.; Dong, Z.; et al. A Survey of Large Language Models. arXiv 2023, arXiv:2303.18223v13. [Google Scholar]
Stanisz, T.; Drożdż, S.; Kwapień, J. Universal versus system-specific features of punctuation usage patterns in major Western languages. Chaos Solitons Fractals 2023, 168, 113183. [Google Scholar] [CrossRef]
Stanisz, T.; Drożdż, S.; Kwapień, J. Complex systems approach to natural language. Phys. Rep. 2024, 1053, 1–84. [Google Scholar] [CrossRef]
Dec, J.; Dolina, M.; Drożdż, S.; Kwapień, J.; Stanisz, T. Multifractal Hopscotch in Hopscotch Julio Cortázar. Entropy 2024, 26, 716. [Google Scholar] [CrossRef] [PubMed]
Bartnicki, K. Finnegans Wake as a System of Knowledge Without Primitive Terms: A Proposal Against the Paradigm of Competence in the So-Called Joyce Industry. Ph.D. Thesis, Friedrich Schiller University Jena, Jena, Germay, 2021. [Google Scholar] [CrossRef]
Joyce, J. Finnegans Wake [Polish: Finneganów Tren]; Ha!art: Cracow, Poland, 2012. [Google Scholar]
Joyce, J. Finnegans Wake [Dutch: Finnegans Wake]; Athenaeum-Polak & Van Gennep: Amsterdam, The Netherlands, 2002. [Google Scholar]
Joyce, J. Finnegans Wake [French: Veillée Pinouilles]. Available online: https://archive.org/details/veillee-pinouilles-18-juin-2020-pdf/mode/2up (accessed on 4 February 2025).
Joyce, J. Finnegans Wake [German: Finnegans Wehg]; Zweitausendeins: Frankfurt am Main, Germany, 1993. [Google Scholar]
Joyce, J. Finnegans Wake [Russian: Na Pomine Finneganov]; Rideró: Yekaterinburg, Russia; Available online: http://samlib.ru/r/rene_a/ (accessed on 4 February 2025).
Miller, R. Survival Analysis; John Wiley & Sons: Hoboken, NJ, USA, 1997. [Google Scholar]
Nakagawa, T.; Osaki, S. The discrete Weibull distribution. IEEE Trans. Reliab. 1975, R-24, 300–301. [Google Scholar] [CrossRef]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; Wiley-Interscience: Hoboken, NJ, USA, 1994. [Google Scholar]
Altmann, E.G.; Pierrehumbert, J.B.; Motter, A.E. Beyond Word Frequency: Bursts, Lulls, and Scaling in the Temporal Distributions of Words. PLoS ONE 2009, 4, e7678. [Google Scholar] [CrossRef]
Padgett, W.J.; Spurrier, J.D. On discrete failure models. IEEE Trans. Reliab. 1985, 34, 253–256. [Google Scholar] [CrossRef]
Jimenez, J. Intermittency and cascades. J. Fluid Mech. 2000, 409, 99–120. [Google Scholar] [CrossRef]
Kantelhardt, J.W.; Zschiegner, S.A.; Koscielny-Bunde, E.; Havlin, S.; Bunde, A.; Stanley, H. Multifractal detrended fluctuation analysis of nonstationary time series. Phys. A 2002, 316, 87–114. [Google Scholar] [CrossRef]
Oświęcimka, P.; Kwapień, J.; Drożdż, S. Wavelet versus detrended fluctuation analysis of multifractal structures. Phys. Rev. E 2006, 74, 016103. [Google Scholar] [CrossRef] [PubMed]
Peng, C.K.; Buldyrev, S.V.; Havlin, S.; Simons, M.; Stanley, H.E.; Goldberger, A.L. Mosaic organization of DNA nucleotides. Phys. Rev. E 1994, 49, 1685. [Google Scholar] [CrossRef]
Hurst, H.E. The long-term storage capacity of reservoir. Trans. Am. Soc. Civ. Eng. 1951, 116, 2447. [Google Scholar] [CrossRef]
Heneghan, C.; McDarby, G. Establishing the relation between detrended fluctuation analysis and power spectral density analysis for stochastic processes. Phys. Rev. E 2000, 62, 6103–6110. [Google Scholar] [CrossRef]
Halsey, T.C.; Jensen, M.H.; Kadanoff, L.P.; Procaccia, I.; Shraimant, B.I. Fractal measures and their singularities: The characterization of strange sets. Phys. Rev. A 1986, 33, 1141–1151. [Google Scholar] [CrossRef]
Ohashi, K.; Amaral, L.A.; Natelson, B.H.; Yamamoto, Y. Asymmetrical singularities in real-world signals. Phys. Rev. E 2003, 68, 065204. [Google Scholar] [CrossRef]
Cao, G.; Cao, J.; Xu, L. Asymmetric multifractal scaling behavior in the Chinese stock market: Based on asymmetric MF-DFA. Phys. A 2013, 392, 797–807. [Google Scholar] [CrossRef]
Drożdż, S.; Oświęcimka, P. Detecting and interpreting distortions in hierarchical organization of complex time series. Phys. Rev. E 2015, 91, 030902. [Google Scholar] [CrossRef]
Gómez-Gómez, J.; Carmona-Cabezas, R.; Ariza-Villaverde, A.B.; Gutiérrez de Ravé, E.; Jiménez-Hornero, F.J. Multifractal detrended fluctuation analysis of temperature in Spain (1960–2019). Phys. A 2021, 578, 126118. [Google Scholar] [CrossRef]
Kwapień, J.; Watorek, M.; Bezbradica, M.; Crane, M.; Mai, T.T.; Drozdz, S. Analysis of inter-transaction time fluctuations in the cryptocurrency market. Chaos 2022, 32, 083142. [Google Scholar] [CrossRef]
Stanisz, T.; Drożdż, S.; Kwapień, J. Statistics of punctuation in experimental literature—The remarkable case of Finnegans Wake by James Joyce. Chaos Interdiscip. J. Nonlinear Sci. 2024, 34, 083124. [Google Scholar] [CrossRef] [PubMed]
Drożdż, S.; Oświęcimka, P.; Kulig, A.; Kwapień, J.; Bazarnik, K.; Grabska-Gradzińska, I.; Rybicki, J.; Stanuszek, M. Quantifying origin and character of long-range correlations in narrative texts. Inf. Sci. 2016, 331, 32–44. [Google Scholar] [CrossRef]

Figure 1. The distributions of the distances between consecutive punctuation marks (left column) and the corresponding hazard functions (right column) in Finnegans Wake and its translations.

Figure 2. Time series representing the distances between consecutive punctuation marks (left column) and the corresponding fluctuation functions (right column) in Finnegans Wake and its translations. In each fluctuation function plot, the function for

q = 2

is marked in red, and the Hurst exponent H is provided in the bottom-right corner.

Figure 2. Time series representing the distances between consecutive punctuation marks (left column) and the corresponding fluctuation functions (right column) in Finnegans Wake and its translations. In each fluctuation function plot, the function for

q = 2

is marked in red, and the Hurst exponent H is provided in the bottom-right corner.

Figure 3. The relative arrangement of punctuation marks corresponding to the six considered cases of the original Finnegans Wake and its Dutch, French, German, Polish, and Russian translations in a two-paragraph excerpt from the book (starting at the beginning of the last paragraph on page 579 and ending at the end of page 580 in the original English version).

Figure 4. Time series representing sentence lengths, the corresponding fluctuation functions

F_{q} (s)

, and singularity spectra

f (α)

in Finnegans Wake and its translations. In each of the fluctuation function plots,

F_{q} (s)

for

q = 2

is marked in red, and the Hurst exponent H value is provided in the bottom-right corner.

Figure 4. Time series representing sentence lengths, the corresponding fluctuation functions

F_{q} (s)

, and singularity spectra

f (α)

in Finnegans Wake and its translations. In each of the fluctuation function plots,

F_{q} (s)

for

q = 2

is marked in red, and the Hurst exponent H value is provided in the bottom-right corner.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bartnicki, K.; Drożdż, S.; Kwapień, J.; Stanisz, T. Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant. Entropy 2025, 27, 177. https://doi.org/10.3390/e27020177

AMA Style

Bartnicki K, Drożdż S, Kwapień J, Stanisz T. Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant. Entropy. 2025; 27(2):177. https://doi.org/10.3390/e27020177

Chicago/Turabian Style

Bartnicki, Krzysztof, Stanisław Drożdż, Jarosław Kwapień, and Tomasz Stanisz. 2025. "Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant" Entropy 27, no. 2: 177. https://doi.org/10.3390/e27020177

APA Style

Bartnicki, K., Drożdż, S., Kwapień, J., & Stanisz, T. (2025). Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant. Entropy, 27(2), 177. https://doi.org/10.3390/e27020177

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Punctuation Patterns in Finnegans Wake by James Joyce Are Largely Translation-Invariant

Abstract

1. Introduction

2. Materials and Methods

2.1. Finnegans Wake

2.2. Discrete Weibull Distribution

2.3. Multifractal Detrended Fluctuation Analysis (MFDFA)

3. Results and Discussion

3.1. Inter-Punctuation Intervals (IPIs)

3.2. Long-Range Correlations in IPIs

3.3. Multifractal Sentence Length Variability (SLV)

4. Summary and Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI