Next Article in Journal
Spatial Statistical Models: An Overview under the Bayesian Approach
Next Article in Special Issue
P Systems with Evolutional Communication and Division Rules
Previous Article in Journal
On the Monoid of Unital Endomorphisms of a Boolean Ring
Previous Article in Special Issue
On Turing Machines Deciding According to the Shortest Computations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Maximal Complexity of Quasiperiodic Infinite Words

Institut für Informatik, Martin-Luther-Universität Halle-Wittenberg, D-06099 Halle (Saale), Germany
Axioms 2021, 10(4), 306; https://doi.org/10.3390/axioms10040306
Submission received: 27 September 2021 / Revised: 9 November 2021 / Accepted: 11 November 2021 / Published: 17 November 2021
(This article belongs to the Special Issue In Memoriam, Solomon Marcus)

Abstract

:
A quasiperiod of a finite or infinite string is a word whose occurrences cover every part of the string. An infinite string is referred to as quasiperiodic if it has a quasiperiod. We present a characterisation of the set of infinite strings having a certain word q as quasiperiod via a finite language P q consisting of prefixes of the quasiperiod q. It turns out its star root P q * is a suffix code having a bounded delay of decipherability. This allows us to calculate the maximal subword (or factor) complexity of quasiperiodic infinite strings having quasiperiod q and further to derive that maximally complex quasiperiodic infinite strings have quasiperiods a b a or a a b a a . It is shown that, for every length l 3 , a word of the form a n b a n (or a n b b a n if l is even) generates the most complex infinite string having this word as quasiperiod. We give the exact ordering of the lengths l with respect to the achievable complexity among all words of length l.

1. Introduction

In his tutorials [1,2,3] Solomon Marcus dealt with several properties of infinite words. Among them he considered quasiperiodicity and its influence on measures of symmetry like complexity, recurrence or entropy. One topic of interest was their subword complexity (or factor complexity [4]). Besides the asymptotic behaviour of the factor complexity, also known as their topological entropy ([4], Section 4.2.2) or [5] Marcus was also interested in the behaviour of the complexity function f ( ξ , n ) assigning to a natural number n N the number of subwords of the infinite word ( ω -word) ξ . Here he was also concerned with recurrences in ω -words and their influence to subword complexity. A well-known fact established by Grillenberger is that the asymptotic subword complexity (or topological entropy) of an almost periodic (or uniformly recurrent) ω -word can be arbitrarily close (but not equal) to the maximal subword complexity (see [4], Theorem 4.4.4).
The present paper summarises results on the subword complexity of infinite words obtained in [6,7,8]. We study in detail the structure of the set of infinite words having a certain word q as quasiperiod and how this is connected with the set of finite words with the same quasiperiod. Moreover, we address a question raised in [9] about the maximally achievable subword complexity of a quasiperiodic infinite word.
A first result shows that for every word q there is a value λ q , 1 λ q < 2 , such that, for every infinite word ξ with quasiperiod q, the complexity function f ( ξ , n ) is bounded by O ( 1 ) · λ q n , and this bound is achieved for certain infinite words having quasiperiod q. The maximally possible value for λ q is λ q = t P 1.324718 , where t P is the smallest Pisot-Vijayaraghavan number, that is, the unique real root t P of the cubic polynomial x 3 x 1 .
As a generalisation of the above-mentioned questions [2,9] we estimate, for every length n 3 , the values λ n = max { λ q : | q | = n } , their ordering and the words q , | q | = n , for which λ q = λ n . It appears that a two letter alphabet is sufficient for achieving the maximal complexity λ n .
In order to prove these properties we start with a general investigation of quasiperiodicity of words (as e.g., in [10,11,12]) and infinite words.
The paper is organised as follows. After introducing some notation we derive in Section 3 a characterisation of quasiperiodic words and ω -words having a certain quasiperiod q. Moreover, we use the finite basis sets P q and its dual R q ( L ( q ) and R ( q ) in [12]) from which the sets of quasiperiodic words or ω -words having quasiperiod q can be constructed. In Section 4 it is then proved that the star root of P q is a suffix code having a bounded delay of decipherability and, dually, the star root of R q is a prefix code.
This much prerequisites allow us, in Section 5, to estimate the number of subwords of the language Q q of all quasiperiodic words having quasiperiod q. It turns out that c q , 1 · λ q n f ( Q q , n ) c q , 2 · λ q n where f ( Q q , n ) is the number of subwords of length n of words in Q q and 1 λ q t P depends on q. We construct, for every quasiperiod q, a quasiperiodic ω -word ξ q with quasiperiod q whose subword complexity f ( ξ q , n ) is maximal.
The values λ q turn out to be maximal positive roots of polynomials associated with the star root P q * . Section 6 deals with the properties of those polynomials. This allows to compare the roots λ q .
The following Section 7 and Section 8 deal with the proof of the above mentioned results on the values λ q and λ n = max { λ q : | q | = n } . Here we derive also the complete ordering of the values λ n .

2. Notation and Preliminaries

In this section we introduce the notation used throughout the paper. By N = { 0 , 1 , 2 , } we denote the set of natural numbers. Let X be an alphabet of cardinality | X | = r 2 , and let throughout the paper a , b X , a b , be two different letters. By X * we denote the set of finite words on X, including the empty word e, and X ω is the set of infinite strings ( ω -words) over X. Subsets of X * will be referred to as languages and subsets of X ω as ω-languages.
For w X * and η X * X ω let w · η be their concatenation. This concatenation product extends in an obvious way to subsets L X * and B X * X ω . For a language L let L * : = i N L i , and by L ω : = { w 1 w i : w i L \ { e } } we denote the set of infinite strings formed by concatenating words in L. The smallest subset of a language L which generates L * is called its star root  L * [13]. It holds
L * = ( L \ { e } ) \ ( L \ { e } ) 2 · L * .
Furthermore | w | is the length of the word w X * and pref ( B ) is the set of all finite prefixes of the strings in B X * X ω . We shall abbreviate w pref ( η ) ( η X * X ω ) by w η .
We denote by B / w : = { η : w · η B } the left derivative of the set B X * X ω . As usual, a language L X * is regular provided it is accepted by a finite automaton. An equivalent condition is that its set of left derivatives { L / w : w X * } is finite.
The sets of infixes of B or η are infix ( B ) : = w X * pref ( B / w ) and infix ( η ) : = w X * pref ( { η } / w ) , respectively. In the sequel we assume the reader to be familiar with basic facts of language theory.
We call a word w X * \ { e } primitive if w = v n implies n = 1 , that is, w is not the power of a shorter word, and we call w X * \ { e }  overlap-free if none of its proper prefixes is a suffix of w. The following facts are known (e.g., [14,15]).
Fact 1.
Every word w X * \ { e } has a unique representation w = v n where v is primitive.
Fact 2.
Let q , v , w X * , 0 < | v | < | q | . If v · q = q · w then v = u · u , q = ( u · u ) κ · u and w = u · u for some u , u X * , u e , and κ N . In particular, q is not overlap-free.
Fact 3.
If w · v = v · w , w , v X * then w , v are powers of a common (primitive) word.
As usual a language L X * is called a code provided w 1 w l = v 1 v k for w 1 , , w l , v 1 , , v k L implies l = k and w i = v i . A code L is said to be a prefix code (suffix code) provided no codeword is a prefix (suffix) of another codeword.

3. Quasiperiodicity

3.1. General Properties

The notion of quasiperiodicity can be formalised in the following manner. A finite or infinite word η X * X ω is referred to as quasiperiodic with quasiperiod q X * \ { e } provided that for every j < | η | N { } there is a prefix u j η of length j | q | < | u j | j such that u j · q η , that is, for every w η the relation u | w | w u | w | · q is valid. Informally, η has quasiperiod q if every position of η occurs within some occurrence of q in η [11,12].
Let for q X * \ { e } , Q q be the set of quasiperiodic words with quasiperiod q. Then { q } * Q q = Q q * and Q q \ { e } X * · q q · X * . In order to describe the set of quasiperiodic strings having a certain quasiperiod q X * \ { e } the following definition is helpful.
Definition 1.
A family w i i = 1 , N { } , of words w i X * · q is referred to as a q-chain provided w 1 = q , w i w i + 1 and | w i + 1 | | w i | | q | .
It holds the following.
Lemma 1.
1. 
w Q q \ { e } if and only if there is a q-chain w i i = 1 such that w = w .
2. 
An ω-word ξ X ω is quasiperiodic with quasiperiod q if and only if there is a q-chain w i i = 1 such that w i ξ .
Proof. 
It suffices to show how a family u j j = 0 | η | 1 can be converted to a q-chain w i i = 1 and vice versa.
Consider η X * X ω and let u j j = 0 | η | 1 be a family such that u j · q η and j | q | < | u j | j for j < | η | .
Define w 1 : = q and w i + 1 : = u | w i | · q as long as | w i | < | η | . Then w i η and | w i | < | w i + 1 | = | u | w i | · q | | w i | + | q | . Thus w i i = 1 is a q-chain with w i η .
Conversely, let w i i = 1 be a q-chain such that w i η and set
u j : = max w : i ( w · q = w i | w | j ) , for j < | η | .
By definition, u j · q η and | u j | j . Assume | u j | j | q | and u j · q = w i . Then | w i | j < | η | . Consequently, in the q-chain there is a successor w i + 1 , | w i + 1 | | w i | + | q | j + | q | . Let w i + 1 = w · q . Then u j w and | w | j which contradicts the maximality of u j . □
Lemma 1 yields the following consequences.
Corollary 1.
Let u pref ( Q q ) . Then there are words w , w Q q such that w u w and | u | | w | , | w | | u | | q | .
Corollary 2.
Let ξ X ω . Then the following are equivalent.
1. 
ξ is quasiperiodic with quasiperiod q.
2. 
pref ( ξ ) Q q is infinite.
3. 
pref ( ξ ) pref ( Q q ) .

3.2. Finite Generators for Quasiperiodic Words

In this part we consider the finite languages P q and R q ( L ( q ) and R ( q ) in [12]) which generate the set of quasiperiodic words as well as the set of quasiperiodic ω -words having quasiperiod q.
We set
P q : = { v : e v q v · q } = { v : v ( v q v · v = q ) } .
Then we have the following properties.
Proposition 1.
1. 
q P q and P q = { q } if and only if q is overlap-free.
2. 
Q q = P q * · q { e } P q *
3. 
pref ( Q q ) = pref ( P q * ) = P q * · pref ( q )
Proof. 
1. q P q is obvious and and the equivalence follows immediately from the definition of P q .
2. In order to prove Q q P q * · q { e } we show that w i P q * · q for every q-chain w i i = 1 . This is certainly true for w 1 = q . Now proceed by induction on i. Let w i = w i · q P q * · q and w i + 1 = w i + 1 · q . Then w i · v i = w i + 1 . Now from w i w i + 1 we obtain e v i q v i · q , that is, v i P q .
Conversely, let v i P q and consider v 1 v · q . Since q v i · q the family ( v 1 v j · q ) j = 0 is a q-chain. This shows P q * · q { e } Q q .
3. is an immediate consequence of 2. □
Proposition 1 and Corollary 2 imply the following characterisation of ω -words having quasiperiod q.
{ ξ : ξ X ω ξ has   quasiperiod   q } = P q ω
Proof. 
Since P q is finite, P q ω = { ξ : ξ X ω pref ( ξ ) pref ( P q * ) } . □
A dual generator of Q q is obtained by the right-to-left duality of reading words using the suffix relation s instead of the prefix relation ⊑.
R q : = { v : e < s v s q < s q · v } = { v : v ( v < s q v · v = q ) } .
Analogously to Proposition 1 we obtain
Proposition 2.
1. 
q R q and R q = { q } if and only if q is overlap-free.
2. 
Q q = q · R q * { e } R q * , and
3. 
pref ( Q q ) = pref ( q ) q · pref ( R q * ) .
The proof of Items 1 and 2 is similar to the proof of Proposition 1 using the reversed version of q-chain, and Item 3 then follows from Item 2. A slight difference appears with an analogy to Equation (2).
{ ξ : ξ X ω ξ has   quasiperiod   q } = q · R q ω R q ω
Here the last inclusion might be proper, e.g., for q = a b a where R a b a ω = { b a , a b a } ω a b a · R a b a ω .
An alternative derivation of the languages P q and R q can be found in Definition 2 of [12]. Here the borders, that is, prefixes which are simultaneously suffixes of the quasiperiod q, are used:
P q = { v : w ( w q w < s q q = v · w ) } , and R q = { v : w ( w q w < s q q = w · v ) } .
In the subsequent sections we focus on the investigation of P q due to the left-to-right direction of ω -words.

3.3. Combinatorial Properties of P q

We investigate basic properties of P q using simple facts from combinatorics on words (see e.g., [14,15,16]).
Proposition 3.
v P q if and only if | v | | q | and there is a prefix v ¯ v such that q = v k · v ¯ for k = | q | / | v | .
This is an immediate consequence of Fact 2.
Corollary 3.
v P q if and only if | v | | q | and there is a k N such that q v k .
Now set q 0 : = min P q . Then in view of Proposition 3 and Corollary 3 we have the following canonical representation.
q = q 0 k · q ¯   where   k = | q | / | q 0 |   and   q ¯ q 0 .
We will refer to q 0 as the repeated prefix and to k as the repetition factor. If | q 0 | > | q | / 2 , that is, if k = 1 we will refer to q as irreducible. (Reducible words are also known as periodic words [10,11].)
Corollary 4.
Every word v P q * is primitive.
Proof. 
Assume v = v 1 l for some v P q * and l > 1 . Then q v k = v 1 l · k , and, according to Corollary 3 v 1 P q contradicting v P q * . □
Proposition 4.
Let q X * , q e , q 0 = min P q , q = q 0 k · q ¯ and v P q * \ { e } .
1. 
If w q then v · w q or q v · w .
2. 
If w · v q then w { q 0 } * .
Proof. 
From Proposition 1.2 we know v · q P q * · q Q q q · X * . Consequently, q v · q . Then v · w v · q implies v · w q or q v · w according to whether | w · w | | q | or not.
Since q 0 v , it suffices to prove the second assertion for q 0 . First one observes that, w q and | w | | q | | q 0 | . Thus w q 0 k 1 · q ¯ . Therefore, we have w · q 0 q and q 0 · w q which implies w · q 0 = q 0 · w and, according to Fact 3, w and q 0 are powers of a common word. The assertion follows because q 0 is primitive. □
Next we derive a lower bound on the lengths of words in P q \ { q 0 } * .
To this end, we use the Theorem of Fine and Wilf.
Theorem 1
([17]). Let v , w X * . Suppose v m and w n , for some m , n N , have a common prefix of length | v | + | w | gcd ( | v | , | w | ) . Then v and w are powers of a common word u X * of length | u | = gcd ( | v | , | w | ) . (Here gcd ( k , l ) denotes the greatest common divisor of two numbers k , l N .)
Proposition 5.
Let q X * , q e , q 0 = min P q , q = q 0 k · q ¯ and v P q \ { q 0 } * . Then | v | > | q | | q 0 | + gcd ( | v | , | q 0 | ) .
Proof. 
If q 0 , v P q Corollary 3 and Equation (5) imply that q is a common prefix of q 0 k + 1 and v k for some k N . If | v | | q | | q 0 | + gcd ( | v | , | q 0 | ) then by Theorem 1 q 0 and v are powers of a common word, that is, v is a power of the primitive word q 0 . □
Corollary 5.
P q * = P q \ q 0 2 · { q 0 } *
Proof. 
It suffices to show P q P q 2 · P q * { q 0 } * . To this end observe that in view of Proposition 5 | v · v | > | q | whenever v P q \ { q 0 } * or v P q \ { q 0 } * . □
As an immediate consequence we obtain that P q * = P q if and only if q is an irreducible quasiperiod. Moreover, Proposition 5 shows that
P q * { q 0 } { v : v q | v | > | q | | q 0 | + gcd ( | v | , | q 0 | ) } .

3.4. The Reduced Quasiperiod q ^

Next we investigate the relation between a quasiperiod q = q 0 k · q ¯ where q 0 = min P q and q ¯ q 0 and its reduced quasiperiod  q ^ : = q 0 · q ¯ . Since q Q q ^ , we have Q q ^ Q q .
We continue with a relation between P q and P q ^ . It is obvious that q 0 i P q for every i = 1 , , k and
P q ^ { v : q ^ 0 v q ^ } .
Lemma 2
([7], Lemma 2.2). Let q X * , q e , q 0 = min P q , q = q 0 k · q ¯ and q ^ = q 0 · q ¯ the reduced quasiperiod of q. Then
P q = { q 0 i : i = 1 , , k 1 } { q 0 k 1 · v : v P q ^ } .
Proof. 
Consider v P q ^ . Then v q 0 q ¯ v · q 0 q ¯ , and, consequently, q 0 k 1 · v q 0 k · q ¯ q 0 k 1 · v · q 0 q ¯ q 0 k 1 · v · q 0 k · q ¯ , that is, q 0 k 1 · v P q .
Conversely, let v P q and v { q 0 i : i = 1 , , k 1 } . Then, according to Proposition 5 there is a unique v e such that v = q 0 k 1 · v . Now v = q 0 k 1 · v q = q 0 k · q ¯ v · q = q 0 k 1 · v · q 0 k · q ¯ implies v q 0 · q ¯ v · q 0 k · q ¯ . Since | v | | q 0 · q ¯ | and q 0 · q ¯ q 0 k · q ¯ , we have v q 0 · q ¯ v · q 0 · q ¯ . □
Together with Corollary 5 this implies
P q \ { q 0 } * = P q * \ { q 0 } * = q 0 k 1 · ( P q ^ \ { q 0 } ) .
Moreover, we have the following.
Corollary 6.
| P q * | = 1 if and only if q { q 0 } * and q 0 is overlap-free.
Proof. 
Since q 0 P q * , | P q * | = 1 is equivalent with P q * = { q 0 } or, according to Equation (8), with P q ^ = { q 0 } . This amounts to q ^ = q 0 and, following Proposition 1.1 q ^ = q 0 has to be overlap-free. □
For the repeated prefix q ^ 0 of q ^ we have the obvious relation | q ^ 0 | > | q ¯ | . In case q ^ 0 q 0 we can improve this.
Lemma 3.
Let q = q 0 k · q ¯ with k 2 , q ¯ q 0 and q ^ = q 0 · q ¯ . If q ^ 0 q 0 then
q ¯ q ^ 0 q 0 a n d | q ^ 0 | > | q ¯ | + gcd ( | q 0 | , | q ^ 0 | ) ,
and there is a nonempty suffix v e of q 0 such that v q ^ 0 and v · q ¯ q ^ 0 2 .
Proof. 
We have q ¯ q 0 and, since q 0 P q ^ , also q ^ 0 q 0 . Moreover, q ^ q 0 2 and q ^ q ^ 0 k for some k N . Since q 0 q ^ 0 and both prefixes are primitive words, in view of Theorem 1 as a common prefix of q 0 2 and q ^ 0 | q 0 | the word q ^ = q 0 · q ¯ has to satisfy | q ^ | < | q 0 | + | q ^ 0 | gcd ( | q 0 | , | q ^ 0 | ) , that is, | q ^ 0 | > | q ¯ | + gcd ( | q 0 | , | q ^ 0 | ) . The assertion q ¯ q ^ 0 q 0 now follows from a comparison of the lengths of q ¯ , q ^ 0 q 0 .
Now, let v be the suffix of q 0 defined by q ^ 0 k · v = q 0 q ^ 0 k + 1 . Then v q ^ 0 and v · q ¯ ( q ^ 0 ) 2 . □

3.5. Primitivity and Superprimitivity

In this section we consider the inclusion relations between the languages Q q , q e . Analogously to the primitivity of words in [10,11,12] a word was referred to as superprimitive if it is not covered by a shorter one. This leads to the following definition.
Definition 2
(superprimitive). A non-empty word q X * \ { e } is superprimitive if and only if Q q is maximal w.r.t. “⊆” in the family { Q q : q X * \ { e } } .
The next proposition relates the irreducibility of quasiperiods to superprimitivity.
Proposition 6
([12], Remark 4). If q X * \ { e } is superprimitive then | min P q | > | q | / 2 , and if | min P q | > | q | / 2 then q is primitive.
Proof. 
If q 0 = min P q and | q 0 | | q | / 2 then q = q 0 k · q ¯ for some q ¯ q 0 . Thus q Q q 0 q ¯ and q 0 q ¯ Q q .
As q = q m with m > 1 implies | q 0 | | q | | q | / 2 , the other assertion follows. □
The converse of Proposition 6 is not valid.
Example 1.
Let q = abaabaababaab . Then P q = { abaabaab , abaabaababa , q } , and | min P q | = 8 > 13 / 2 but as abaabaababaab Q abaab the word q is not superprimitive.
The word q = ababa is primitive but q 0 = ab has | q 0 | | q | / 2 .
In contrast to the fact that the word q 0 = min P q is always primitive, it need not satisfy | min P q 0 | > | q 0 | / 2 let alone be superprimitive..
Example 2.
q = aabaaabaaaa has q 0 = aabaaabaa which, in turn has P q 0 = { aaba , aabaaaba , q 0 } with | aaba | = 4 < | q 0 | / 2 .
It turns out that every language Q v is contained in a unique maximal Q q . To this end we derive the following lemma (cf. also [10,11]).
Lemma 4.
Let v Q q and u infix ( v ) q · X * X * · q . Then u Q q .
For the sake of completeness we give a proof.
Proof. 
We use a maximal q-chain ( w i ) i = 1 n with w n = v . Assume v = u 1 · u · u 2 . Since u has q as prefix and suffix, there are 1 j l n such that w j = u 1 · q and w l = u 1 · u . Let, for 1 i l j + 1 , the words w i be defined by w i + j 1 = u 1 · w i . Then ( w i ) i = 1 l j + 1 is a q-chain with w l j + 1 = u , that is, u Q q . □
Corollary 7.
If v Q q Q u and | q | < | u | then Q u Q q .
The corollary shows that every language Q v is contained in a unique maximal Q q and that two languages Q u , Q q are either disjoint or compatible w.r.t. set inclusion. The latter is not true for ω -languages.
Example 3.
Let q = a a b a a and u = a a b a a a . Then q ω P u ω , u ω P q ω but P u ω P q ω a a · { b a a a , b a a a a } ω .

4. P q and R q as Codes

In this section we investigate in more detail the properties of the star root of P q . It turns out that P q * is a suffix code which, additionally, has a bounded delay of decipherability. This delay is closely related to the largest power of q 0 being a prefix of q.
According to [14,18,19,20] a subset C X * is a code of a delay of decipherability m N if and only if for all v , v , w 1 , , w m C and u C * the relation v · w 1 w m v · u implies v = v . Observe that C X * is a prefix code if and only if C has delay 0.
First we show that P q * is a suffix code. This generalises Proposition 7 of [12].
Proposition 7.
P q * is a suffix code, and R q * is a prefix code.
Proof. 
Assume u = w · v for some u , v P q * , u v . Then u q and Proposition 4 (2) proves w { q 0 } * \ { e } . Consequently, | v | | q | | q 0 | . Now Proposition 5 implies v { q 0 } * and hence u { q 0 } * . Since u , v P q * , we obtain u = v = q 0 contradicting u v .
Using the duality of P q and R q one shows in an analogous manner that R q * is a prefix code. □
An easy consequence of Proposition 7 is the Left and Right Normal Form of a quasiperiodic string ([12], Proposition 8).
Corollary 8
(Normal Form). Every word w Q q has a unique factorisation w = v 1 · v 2 v n into words v i P q * ( R q * , r e s p e c t i v e l y ) .
Since R q * is a prefix code while the words v P q are prefixes of each other, we obtain | P q * R q * | = 1 generalising Remark 5 of [12]. In fact P q * R q * = { q } or P q * R q * = { q 0 } depending on whether q q 0 k or not.
We continue this part by investigating the delay of decipherability of P q * . We prove that the delay depends on the repetition factor k.
Theorem 2.
Let q X * \ { e } , q 0 = min P q , and | P q * | > 1 . Then P q * is a code having a delay of decipherability of k or k + 1 .
Proof. 
If | P q * | > 1 then in view of Proposition 5 there is a q P q * with | q | > | q | | q 0 | . Since q P q , we have q q · q 0 q · q . Consequently, q 0 · q 0 k 1 q q · q 0 , that is, the delay of decipherability is at least k.
To prove the converse we show that for q q 0 m the delay cannot exceed m.
Assume the contrary, that is, v · w 1 w m + 1 v · u for some words v , v , w 1 , , w m + 1 P q * , v v , and u P q * . From Proposition 4 (1) we obtain u q or q u and, since | w i | | q 0 | , also q w 1 w m + 1 .
If v v , in view of the inequality | v | + | q | | v | + | q 0 | our assumption yields v · q 0 v · q . Therefore, w · q 0 q for the word w e with v · w = v and, according to Proposition 4 (2) w { q 0 } * . This contradicts the fact that P q * is a suffix code.
If v v , then | u | > | w 1 w m + 1 | | q | , and via | v | + | q | | v | + | q 0 | we obtain v · q 0 v · q from our assumption. This yields the same contradiction as in the case v v .
The observation q q 0 k + 1 finishes the proof. □
For q = q 0 k the preceding proof shows the following.
Corollary 9.
If q = q 0 k and | P q * | > 1 then P q * has a delay of decipherability of exactly k.
Thus, if | P q * | > 1 and q q 0 k the code P q * may have a minimum delay of decipherability of k or k + 1 . We provide examples that both cases are possible.
Example 4.
Let q : = aabaaaaba . Then q 0 = aabaa , k = 1 and P q * = P q = { q 0 , aabaaaab , q } which is a code having a delay of decipherability 2.
Indeed aabaaaabaa = q 0 · q 0 q · q 0 or aabaaaabaa = q 0 · q 0 aabaaaab · q 0 .
Moreover, in Example 4, q · q 0 Q q . Thus our example shows also that q · P q * need not be contained in Q q .
Example 5.
Let q : = aba . Then k = 1 and P q = { ab , aba } is a code having a delay of decipherability 1.
Since R q * is a prefix code, every ω -word ξ R q ω has a unique factorisation into words w R q * . For suffix codes the situation is, in general, different. Consider e.g., the suffix code { b , b a , a a } . Property 4 (ii) of [20] (see also ([21], Proposition 1.9)) shows that codes of bounded delay of decipherability also admit a unique factorisation of ω -words. Thus we obtain from Theorem 2.
Lemma 5
(Normal Form for quasiperiodic ω -words). Every ω-word ξ P q ω has a unique factorisation ξ = v 1 · v 2 v i into words v i P q * .

5. Subword Complexity

In this section we investigate upper bounds on the the subword complexity function f ( ξ , n ) for quasiperiodic ω -words. If ξ X ω is quasiperiodic with quasiperiod q then Proposition 3 and Corollary 3 show infix ( ξ ) infix ( P q * ) . Thus
f ( ξ , n ) | infix ( P q * ) X n |   for   ξ P q ω .
Similar to ([22], Proposition 5.5) let ξ q : = v P q * \ { e } v . This implies infix ( ξ q ) = infix ( P q * ) . Consequently, the tight upper bound on the subword complexity of quasiperiodic ω -words having a certain quasiperiod q is f q ( n ) : = f ( ξ q , n ) = | infix ( P q * ) X n | . Observe that in view of Propositions 1 and 2 the identity
infix ( P q * ) = infix ( R q * ) = infix ( Q q )
holds.
The asymptotic upper bound on the subword complexity f q ( n ) is obtained from
λ q = lim sup n | infix ( P q * ) X n | n ,
that is, for large n, f q ( n ) λ ^ n whenever λ ^ > λ q .
The following facts are known from the theory of formal power series (cf. [23,24]). As infix ( P q * ) is a regular language the power series n N f q ( n ) · t n is a rational series and, therefore, f q satisfies a recurrence relation
f q ( n + k ) = i = 0 k 1 a i · f q ( n + i )
with integer coefficients a i Z . Thus f q ( n ) = i = 0 k 1 g i ( n ) · θ i n where k k , θ i are pairwise distinct roots of the polynomial t n i = 0 k 1 a i · t i and g i are polynomials of degree not larger than k.
In the subsequent parts we estimate values characterising the exponential growth of the family | infix ( P q * ) X n | n N . This growth mainly depends on the root of largest modulus among the θ i and the corresponding polynomial g i .
First we show that, independently of the quasiperiod q, the root θ i of largest modulus is always positive and the corresponding polynomial g i is constant.
In the remainder of this section we use, without explicit reference, known results from the theory of formal power series, in particular about generating functions of languages and codes which can be found in the literature, e.g., in [14,23,24].

5.1. The Subword Complexity of a Regular Star Language

The language P q * is a regular star-language of special shape. Here we show that, generally, the number of subwords of regular star-languages grows only exponentially without a polynomial factor. We start with some easily derived relations between the number of words in a regular language and the number of its subwords.
Lemma 6.
If L X * is a regular language then there is an m N such that
| L X n | | infix ( L ) X n | m · i = 0 2 m | L X n + i |
If the finite automaton accepting L has m states then for every w infix ( L ) there are words u , v of length m such that u · w · v L . Thus as a suitable m one may choose the number of states of an automaton accepting the language L X * .
A first consequence of Lemma 6 is that the identity
lim sup n | L X n | n = lim sup n | infix ( L ) X n | n
holds for regular languages L X * .
In order to derive the announced exponential growth we use Corollary 4 of [25] which shows that for every regular language L X * there are constants c 1 , c 2 > 0 and a λ 1 such that
c 1 · λ n | pref ( L * ) X n | c 2 · λ n .
A consequence of Lemma 6 is that Equation (14) holds also (with a different constant c 2 ) for infix ( L * ) .

5.2. The Subword Complexity of Q q

In this part we estimate the value λ q of Equation (11). In view of Equations (10) and (14) the value λ q satisfies the inequality c 1 · λ q n | infix ( P q * ) X n | c 2 · λ q n .
As P q * is a regular language Equations (11) and (13) show that
λ q = lim sup n | P q * X n | n
which is the inverse of the convergence radius rad s q * of the power series s q * ( t ) : = n N | P q * X n | · t n . The series s q * is also known as the structure generating function of the language P q * .
Since P q * is a code, we have s q * ( t ) = 1 1 s q ( t ) where s q ( t ) : = v P q * t | v | is the structure generating function of the finite language P q * . As s q * has non-negative coefficients Pringsheim’s theorem shows that rad s q * = λ q 1 is a singular point of s q * . Thus λ q 1 is the smallest root of 1 s q ( t ) . Hence λ q is the largest positive root of the polynomial p q ( t ) : = t | q | v P q * t | q | | v | .
Remark 1.
If the length of q 0 = min P q does not divide | q | then p q ( t ) is the reversed polynomial of 1 s q ( t ) , that is, has as roots exactly the the inverses of the roots of 1 s q ( t ) .
If | q 0 | divides | q | then q P q * (cf. Corollary 5) and p q ( t ) has additionally the root 0 with multiplicity | q | | q | where q is the longest word in P q * .
Summarising our observations we obtain the following.
Lemma 7.
Let q X * \ { e } . Then there are constants c q , 1 , c q , 2 > 0 such that the structure function of the language infix ( P q * ) satisfies
c q , 1 · λ q n | infix ( P q * ) X n | c q , 2 · λ q n
where λ q is the largest (positive) root of the polynomial p q ( t ) .
Remark 2.
One could prove Lemma 7 by showing that, for each polynomial p q ( t ) , its largest (positive) root has multiplicity 1. Referring to Corollary 4 of [25] (see Equation (14)) we avoided these more detailed considerations of a particular class of polynomials.
Now we are able to formulate our main theorem.
As quasiperiods q , | q | 2 , have trivially P q * = { q 0 } * , that is, λ q = 1 , in the sequel we confine our considerations to quasiperiods q of length | q | 3 , and we will always assume that the first letter of a quasiperiod q is a X .
Define Q max : = { a n b a n : n 1 } { a n w a n : | w | = 2 , w a a , n 1 } .
Theorem 3
(Main theorem). Let q a · X * , | q | 3 , q Q max , be a quasiperiod and n = | q | 1 2 . Then λ q < λ a n b a n or λ q < λ a n b b a n according to whether | q | is odd or even.
Moreover, λ w < λ a b a = λ a a b a a if w a · X * \ { a b a , a a b a a } .

6. Polynomials

Before proceeding to the proof of our main theorem we derive some properties of polynomials of the form p ( t ) = t n i M t i , where M { i : i N i < n } . This class of polynomials includes the polynomials p q ( t ) whose maximal roots λ q characterise the growth of infix ( P q * ) as described in Lemma 7. We focus in results which are useful for comparing their maximal roots.
The polynomials p ( t ) P ^ : = t n i M t i : M { 0 , , n 1 } have the following easily verified properties.
p ( 0 ) 0 , p ( 1 ) 0 , p ( 2 ) 1   and   p ( t ) < 0   for   0 < t < 1 .
If   ε > 0   and   p ( t ) 0   for   some   t > 0   then   p ( 1 + ε ) · t > 0 .
Since p ( 1 ) 0 and p ( 2 ) 1 for p ( t ) P ^ , Equation (16) shows that once p ( t ) 0 , t 1 , the polynomial p ( t ) has no further root in the interval ( t , ) and p ( t ) P ^ has exactly one root in the interval [ 1 , 2 ) . This yields the following fundamental property.
Property 1.
If t 0 is the positive root of the polynomial p ( t ) P ^ in [ 1 , 2 ) and 1 t < 2 then p ( t ) 0 if and only if t t 0 .
For the roots of maximal modulus we have the following theorem.
Theorem 4
(Cauchy). Let p ( t ) = i = 0 n a i · t i be a complex polynomial. Then every root t of p ( t ) satisfies | t | t 0 where t 0 is the maximal root of the polynomial | a n | · t n i = 0 n 1 | a i | · t i .
This implies the following property of polynomials p ( t ) P ^ .
If   p ( t ) = 0   then   | t | t 0 .
From Property 1 we derive the following criterion to compare the maximal roots of polynomials in P ^ .
Criterion 1.
Let p 1 ( t ) , p 2 ( t ) P ^ have maximal roots t 1 and t 2 , respectively. Then p 2 ( t 1 ) > 0 if and only if t 1 > t 2 .
We conclude this section with a bound on the maximal root of certain polynomials in P ^ .
Lemma 8.
Let p ( t ) = t n i = 0 m t i , n > m 1 . Then p ( t ) < 0 for 1 t ( m + 1 ) 2 2 n m and p ( t ) > 0 for m + 1 n m t .
Proof. 
The assertion follows from the inequality t n ( m + 1 ) · t m < p ( t ) < t n ( m + 1 ) · t m / 2 when t > 1 . The part p ( t ) < t n ( m + 1 ) · t m / 2 uses the arithmetic-geometric-means inequality i = 0 m t i > ( m + 1 ) · i = 0 m t i m + 1 = ( m + 1 ) · t m / 2 , and the other part is obvious. □
The following special case is needed below in Lemma 12.
Corollary 10.
If p ( t ) = t n i = 0 n 3 t i , n 4 , then p ( t ) < 0 for 1 t ( n 2 ) 2 n + 3 .
The subsequent sections are devoted to the proof of our main theorem.

7. Irreducible Quasiperiods

We start with irreducible quasiperiods.

7.1. Extremal Polynomials

The polynomials p q ( t ) of irreducible quasiperiods have non-zero coefficients only for | q | and i < | q | 2 . Therefore we investigate the set
P : = t n i M t i : n 2 M { i : i n 1 2 } .
Let p n ( t ) : = t n i = 0 n 1 2 t i P .
Property 2.
Let p ( t ) P a polynomial of degree n 3 . Then p n ( t ) p ( t ) for t [ 1 , 2 ] , and p n ( t ) has the largest positive root among all polynomials of degree n in P .
Proof. 
This follows from t n i = 0 n 1 2 t i < p ( t ) for p ( t ) P \ { p n ( t ) : n 3 } when 1 < t 2 and Criterion 1. □
Observe that, for n 1 ,
p 2 n + 1 ( t ) = t 2 n + 1 i = 0 n t i   and   p 2 n + 2 ( t ) = t 2 n + 2 i = 0 n t i .
Moreover, the words a n b a n Q max and a n w a n Q max , w { x b , b x } , x X are the quasiperiods corresponding to the extremal polynomials p 2 n + 1 ( t ) P and p 2 n + 2 ( t ) P , respectively.
Lemma 9.
Q max : = { q : q a · X * | q | 3 p q ( t ) = p | q | ( t ) }
Proof. 
If q Q max then obviously p q ( t ) = p | q | ( t ) . Conversely, if p q ( t ) = t | q | v P q * t | q | | v | = p | q | ( t ) then P q * = { v : v q | v | > | q | 2 } . Then, in view of q v · q , every prefix w q of length | w | < | q | 2 is also a suffix of q. This is possible only for q Q max or q { a } * . □
In the sequel the positive root of p n ( t ) is denoted by λ n . From Criterion 1 we obtain immediately.
Property 3.
Let t 1 . We have t < λ n if and only if p n ( t ) < 0 .
Then Property 2 implies the following.
Theorem 5.
If q a · X * , | q | 3 , is an irreducible quasiperiod then λ q λ | q | , and λ q = λ | q | if and only if q Q max .

7.2. The Ordering of the Maximal Roots λ n

Before we proceed to the case of reducible quasiperiods we determine the ordering of the maximal roots λ n . This will not only be interesting for itself but also useful for proving λ q < λ | q | when q is reducible (see Equation (28) below).
The extremal polynomials p n ( t ) , n 2 , satisfy the following general relations (By convention, i = k m a i = 0 if k > m ).
t · p 2 n ( t ) 1 = p 2 n + 1 ( t ) ,
p 2 n + 2 ( t ) t 2 · p 2 n ( t ) = t n + 1 t 1 ,
t n 2 · p 2 n + 1 ( t ) ( t n + 1 ) · p 2 n 1 ( t ) = i = 0 n 3 t i ,   and
t n 2 · p 2 n + 3 ( t ) ( t n + 1 + 1 ) · p 2 n ( t ) = t n + i = 0 n 3 t i .
Lemma 10.
The polynomials t 3 t 1 and t 5 t 2 t 1 = ( t 2 + 1 ) · ( t 3 t 1 ) have largest positive roots λ 3 = λ 5 among all polynomials in P , λ 5 > λ 4 and λ 2 n 1 > λ 2 n + 1 > λ 2 n for n 3 .
Proof. 
From Equation (18) we have p 2 n + 1 ( λ 2 n ) = 1 < 0 and, therefore, λ 2 n < λ 2 n + 1 when n 1 .
Similarly, Equation (20) yields p 2 n + 1 ( λ 2 n 1 ) = λ 2 n 1 ( n 2 ) · i = 0 n 3 λ 2 n 1 i > 0 which implies λ 2 n + 1 < λ 2 n 1 for n 3 and λ 3 = λ 5 when n = 2 . □
The largest (positive) root λ 3 of the polynomial t 3 t 1 is also known as the smallest Pisot-Vijayaraghavan number.
So far we have ordered the ‘odd’ roots: λ 3 = λ 5 > λ 7 > λ 9 > . Next we are going to investigate the ordering of the ‘even’ roots λ 2 n , n 2 .
To this end we derive the following bounds.
Lemma 11.
1. 
n 2 3 n + 1 λ 2 n n n + 1 and n 2 3 n 1 λ 2 n 1 n n for n 2 .
2. 
Let n 5 . Then λ 2 n 2 n 1 .
Proof. 
1. follows from Lemma 8.
2. We calculate p 2 n ( 2 n 1 ) = 4 · 4 n 1 i = 0 n 1 2 i n 1 4 · 4 4 ( 2 + ( n 1 ) ) = 4 · 2 ( n + 1 ) < 0 if n 5 and the assertion follows with Property 1. □
Remark 3.
The lower bound of Lemma 11.2 does not exceed the lower bound in Lemma 11.1. However, the latter is more convenient for the purposes of Lemma 12.
Lemma 12.
If n 5 then λ 2 n 2 > λ 2 n and λ 2 n > λ 2 n + 3 .
Proof. 
If t 2 n 1 then t n t 1 t 1 > 0 . Consequently, Equation (19) and Lemma 11.2 imply p 2 n 2 ( λ 2 n ) < 0 whence λ 2 n < λ 2 n 2 .
If n 5 we have n n + 1 ( n 2 ) 2 n + 3 and, following Lemma 11.1 λ 2 n ( n 2 ) 2 n + 3 . Then Equation (21) yields λ 2 n · p 2 n + 3 ( λ 2 n ) = λ 2 n n i = 0 n 3 λ 2 n i , and Corollary 10 shows p 2 n + 3 ( λ 2 n ) > 0 whence λ 2 n > λ 2 n + 3 . □
Since p 8 ( 2 3 ) > 0 , the proof of Lemma 12 cannot be applied to lower values of n. Thus it remains to establish the order of the λ i for i 13 . To this end, we consider some special identities and use Criterion 3 and Lemma 12.
p 12 ( t ) ( t 8 + t 5 + t 4 + t 2 + t ) · p 4 ( t ) = t 2 1 , and
p 13 ( t ) t · ( t 8 + t 5 + t 4 + t 2 + t ) · p 4 ( t ) = t 3 t 1 = p 3 ( t ) .
Lemma 13.
λ 8 > λ 10 > λ 13 > λ 4 > λ 12
Proof. 
Lemma 12 shows λ 8 > λ 10 > λ 13 . Equation (22) yields p 12 ( λ 4 ) = λ 4 2 1 > 0 whence λ 4 > λ 12 , and Equation (23) yields p 13 ( λ 4 ) = p 3 ( λ 4 ) < 0 , that is, λ 13 > λ 4 . This shows our assertion. □
For the remaining part we consider the identities
t 2 · p 11 ( t ) ( t 5 + 1 ) · p 8 ( t ) = t 4 + t + 1 = p 4 ( t ) ,
p 11 ( t ) ( t 5 + 1 ) · p 6 ( t ) = t 3 · p 4 ( t ) , and
t · p 9 ( t ) ( t 4 + 1 ) · p 6 ( t ) = t 3 + 1 .
Lemma 14.
λ 9 > λ 6 > λ 11 > λ 8
Proof. 
We use Equations (24)–(26). Then p 11 ( λ 8 ) = p 4 ( λ 8 ) < 0 implies λ 11 > λ 8 , p 11 ( λ 6 ) = λ 6 3 · p 4 ( λ 6 ) > 0 implies λ 6 > λ 11 , and, finally, λ 6 · p 9 ( λ 6 ) = λ 6 3 + 1 < 0 implies λ 9 > λ 6 . □
Now Lemma 10, 12–14 yield the complete ordering of the values λ n .
Theorem 6.
Let λ n , n 3 , be the maximal root of the polynomial p n ( t ) . Then the overall ordering of the values λ n starts with
λ 3 = λ 5 > λ 7 > λ 9 > λ 6 > λ 11 > λ 8 > λ 10 > λ 13 > λ 4 > λ 12
and continues as follows λ 2 n + 1 > λ 2 n > λ 2 n + 3 , n 7 .
In connection with Proposition 6 and Corollary 7 we obtain that the Pisot-Vijayaraghavan number λ 3 = λ 5 is an overall upper bound on the values λ q .
Corollary 11.
If q X * , | q | 3 , then λ q λ 3 = λ 5 .
From Lemma 11.1 we obtain immediately.
Corollary 12.
Let M N \ { 0 , 1 , 2 } be infinite. Then inf { λ i : i M } = 1 .

8. Reducible Quasiperiods

Reducible quasiperiods q have a repeated prefix q 0 = min P q with | q 0 | | q | / 2 and a repetition factor k 2 such that q = q 0 k · q ¯ where q ¯ q 0 . Moreover | q ¯ | < | q 0 | | q | / 2 . Observe that q 0 is primitive.
We shall consider three cases depending on the relation between the lengths n = | q | , = | q 0 | , the length of the suffix | q ¯ | < | q 0 | and the repetition factor k 2 .
IN the first case | q 0 | + | q ¯ | 2 , in view of q ¯ q 0 , we have necessarily q ¯ = e and q a * { a b } * , a , b X , a b and, therefore, Q q = { q 0 } * and λ q = 1 .
Let now | q 0 | + | q ¯ | 3 . We divide the remaining cases according to the additional requirement | q | 2 | q 0 | 3 and its complementary one | q | 2 | q 0 | 2 . In the latter case we have necessarily k = 2 and | q ¯ | 2 .

8.1. The Case | q 0 | + | q ¯ | 3 | q | 2 | q 0 | 3

Thus, the preceding consideration shows that we have | q ¯ | 3 (in particular, if q = q 0 2 · q ¯ ) or the repetition factor k 3 . This implies | q | = 7 (where q = ( a b ) 3 a ) or | q | 9 .
From Equation (6) we have
P q * { q 0 } { v : v q | v | > | q | | q 0 | + 1 }
This implies that for | q 0 | | q | / 2 the polynomials p q ( t ) have non-zero coefficients only for | q | = n , | q | | q 0 | = n and i < | q 0 | 1 , that is, are of the form p q ( t ) = t n t n i M q t i where M q { i : i < 1 } . Therefore, in the sequel we consider the positive roots of polynomials in
P red : = t n t n i M t i : n 1 n 2 M { i : i < 1 }
Let p n , ( t ) : = t n t n i = 0 2 t i P red and λ n , be its maximal root. (In the preceding paper [8] we used a slightly different definition of P red , and, therefore, of p n , ( t ) and λ n , .) Similar to Property 2, Criterion 3 and Theorem 5 we have the following.
Property 4.
Let n 3 , n 2 and p ( t ) P red . Then p ( t ) p n , ( t ) for t [ 1 , 2 ] , and p n , ( t ) has the largest positive root among all polynomials of degree n and parameter ℓ in P red .
Lemma 15.
If q , | q | = n , is a quasiperiod with | q 0 | = n / 2 then p q ( t ) p n , ( t ) for t 1 , in particular, λ q λ n , .
Remark 4.
In contrast to Property 2 not for every polynomial p n , ( t ) there is a quasiperiod q such that p n , ( t ) = p q ( t ) , see Remark 5 below.
We have the following relation between the polynomials p n ( t ) and p n , ( t ) .
p n ( t ) t · p n 2 ( t ) = p n , ( t ) t 1 ,   for   n 2 3
This yields
Corollary 13.
Let n 2 · 3 . If λ n < λ n 2 then λ n , < λ n .
Proof. 
If λ n < λ n 2 then p n 2 ( λ n ) < p n 2 ( λ n 2 ) = 0 . Thus p n , ( λ n ) = λ n · p n 2 ( λ n ) + λ n 1 > 0 , that is, λ n > λ n , . □
Next we show the relation λ q < λ | q | for all quasiperiods q having | q 0 | | q | / 2 and | q 0 | + | q ¯ | 3 .
Lemma 16.
Let | q | 2 | q 0 | 3 and | q 0 | + | q ¯ | 3 . Then λ q < λ | q | .
Proof. 
Above we have shown that | q | 2 | q 0 | 3 and | q 0 | + | q ¯ | 3 imply | q | 7 or | q | 10 according to whether | q | is odd or even.
The ordering of Theorem 6 and Corollary 13 show λ n > λ n , for all odd values n 7 and for all even values n 12 .
It remains to consider the exceptional case when n = | q | = 10 . Here | q | 2 | q 0 | 3 and | q 0 | + | q ¯ | 3 imply = | q 0 | = 3 . Consider p 10 , 3 ( t ) = t 10 t 7 t 1 = p 10 ( t ) t 2 · p 5 ( t ) .
From λ 5 > λ 10 and p 10 ( λ 10 ) = 0 we have p 10 , 3 ( λ 10 ) = λ 10 2 · p 5 ( λ 10 ) > 0 , that is, λ 10 , 3 < λ 10 . □
Remark 5.
Equation (6) shows that for n = | q | = 10 and = | q 0 | = 3 we have P q * = { q 0 , q } , that is, p q ( t ) = t 10 t 7 1 . Thus there is no quasiperiod q such that p q ( t ) = p 10 , 3 ( t ) = t 10 t 7 t 1 .

8.2. The Case | q 0 | + | q ¯ | 3 | q | 2 | q 0 | 2

This amounts to | q | = 2 · | q 0 | + | q ¯ | where | q ¯ | { 0 , 1 , 2 } .
Here we have to go into more detail and to take into consideration also the reduced quasiperiod q ^ = q 0 · q ¯ of q and its repeated prefix q ^ 0 = min P q ^ . Observe that both repeated prefixes q 0 , q ^ 0 are primitive.
For q = q 0 k · q ¯ , k 2 , we have from Equations (7) and (8)
p q ( t ) t | q | t | q | | q 0 | i M t i : M { 0 , , | q ^ | | q ^ 0 | } .
Observe that | q ^ 0 | > | q ¯ | (in view of Lemma 3 even | q ^ 0 | > | q ¯ | + 1 if q ^ 0 q 0 ) and thus | q ^ | | q ^ 0 | = | q 0 | ( | q ^ 0 | | q ¯ | ) < | q 0 | .
Let P red : = t n t i M t i : n > > j M { 0 , , j } and p n , , j ( t ) = t n t i = 0 j t i . Here the parameter j corresponds to the value | q ^ 0 | | q ¯ | . Then similar to Property 4 and Lemma 15 we have
Property 5.
Let n , 3 , n 2 , > j , and p ( t ) P red . Then p ( t ) p n , , j ( t ) for t [ 1 , 2 ] , and p n , , j ( t ) has the largest positive root among all polynomials of degree n and parameters ℓ and j in P red .
Lemma 17.
If q , | q | = n , is a quasiperiod with | q 0 | = n / 2 and | q ^ 0 | | q ¯ | j then p q ( t ) p n , , j ( t ) for t 1 , in particular, λ q λ n , , j .
We consider the cases | q ¯ | { 0 , 1 , 2 } separately. In the sequel we shall make use of the relation
t 3 t 2 1 t 2 t 1 < 0   for   1 t λ 3 = max { λ n : n N } .

8.2.1. The Case q = q 0 2 | q ¯ | = 0

As shown above the case | q 0 | 2 and | q ¯ | = 0 amounts to λ q = 1 . Thus we may consider only the case when | q 0 | 3 . Here we have the following relation between p 2 ( t ) and p 2 , , 3 ( t ) .
p 2 ( t ) p 2 , , 3 ( t ) = t 2 ( t 2 t 1 )
Lemma 18.
If q = q 0 2 and | q 0 | = 3 then λ q < λ | q | .
Proof. 
First we suppose | q ^ 0 | 3 . Then | q ^ 0 | | q ¯ | 3 , and Property 5 and Lemma 17 yield p q ( t ) p 2 , , 3 ( t ) for t [ 1 , 2 ] . Now Equations (29) and (30) show p q ( λ 2 ) p 2 , , 3 ( λ 2 ) = λ 2 2 ( λ 2 2 λ 2 1 ) > 0 , that is λ q < λ 2 .
It remains to consider 1 | q ^ 0 | 2 . If q ^ 0 a * then q 0 = a which is not primitive. Thus q ^ 0 = a b and, since q 0 is primitive, q 0 = ( a b ) m a , m 1 whence q = q 0 2 = ( a b ) m a · ( a b ) m a .
We obtain P q * = { ( a b ) m a · ( a b ) i : i = 0 , , m } and, consequently, p q ( t ) = t 4 m + 2 + i = 0 m t 2 i + 1 . Then (Observe again i = k m a i = 0 if k > m ).
p q ( t ) p 4 m + 2 ( t ) = t 2 m + 1 + i = 0 m t 2 i = t 2 m + 1 + t 2 m + t 2 m 2 + i = 0 m 2 t 2 i = t 2 m 2 · ( t 3 t 2 1 ) + i = 0 m 2 t 2 i ,
and from Equation (29) we obtain p q ( λ 4 m + 2 ) λ 4 m + 2 2 m 2 ( λ 4 m + 2 3 λ 4 m + 2 2 1 ) > 0 . □

8.2.2. The Case q = q 0 2 · q ¯ | q ¯ | = 1

Here we have the following relation between p 2 + 1 ( t ) and p 2 + 1 , , 2 ( t ) .
p 2 + 1 ( t ) p 2 + 1 , , 2 ( t ) = t 1 ( t 2 t 1 )
Lemma 19.
If q = q 0 2 · a , a X , then λ q < λ | q | .
Proof. 
First we suppose | q ^ 0 | | q ¯ | 2 . Then = | q 0 | | q ^ 0 | 3 , and Property 5 and Equation (31) yield p q ( λ 2 + 1 ) p 2 + 1 , , 2 ( λ 2 + 1 ) = p 2 + 1 ( λ 2 + 1 ) λ 2 + 1 1 ( λ 2 + 1 2 λ 2 + 1 1 ) . The assertion p q ( λ 2 + 1 ) > 0 , that is λ q < λ 2 + 1 follows from Equation (29).
It remains to consider | q ^ 0 | = 2 . By Lemma 3 q ^ 0 = q 0 implies | q ^ 0 | > | q ¯ | + 1 = 2 . Hence q ^ 0 = q 0 = a b , q = a b a b a and p q ( t ) = t 5 t 3 1 = t 2 · p 3 ( t ) + t 2 1 . Then λ a b a b a < λ 5 follows from λ 5 = λ 3 and p q ( λ 5 ) = λ 5 2 1 > 0 . □

8.2.3. The Case q = q 0 2 · q ¯ | q ¯ | = 2

Here we have the following relation between p 2 + 2 ( t ) and p 2 + 2 , , 2 ( t ) .
p 2 + 2 ( t ) p 2 + 2 , , 2 ( t ) = t 1 ( t 3 t 1 ) = t 1 · p 3 ( t )
Lemma 20.
If q = q 0 2 · q ¯ with | q ¯ | = 2 then λ q < λ | q | .
Proof. 
First we suppose | q ^ 0 | 4 . Then Property 5, Equation (32) and λ 2 + 2 < λ 3 yield p q ( λ 2 + 2 ) p 2 + 2 , , 2 ( λ 2 + 2 ) = λ 2 + 2 1 · p 3 ( λ 2 + 2 ) > 0 , that is, λ q < λ 2 + 2 .
It remains to consider | q ^ 0 | = 3 . If q ^ 0 q 0 Lemma 3 implies | q ^ 0 | > | q ¯ | + 1 . Consequently, q ^ 0 = q 0 . Then | q 0 | = 3 and | q | = 8 , and Equation (6) yields P q * { q 0 , v , q } where v q and | v | = | q | 1 = 7 . Thus p q ( t ) t 8 t 5 t 1 = p 8 ( t ) t 2 · p 3 ( t ) for 1 t λ 3 .
This shows p q ( λ 8 ) λ 8 2 · p 3 ( λ 8 ) > 0 , that is, λ q < λ 8 . □
Summarising, the results of Section 8 yield.
Theorem 7.
If q X * , | q | 3 , is a reducible quasiperiod then λ q < λ | q | .
Our main theorem (Theorem 3) then follows from Theorems 5 and 7.
Together with Corollary 12 our theorem yields a new proof of a theorem of [5] which shows that multi-scale quasiperiodic infinite words have zero topological entropy. In [5] a multi-scale quasiperiodic infinite word is a quasiperiodic infinite word which admits infinitely many quasiperiods.

9. Concluding Remark

In this paper we dealt with the function f ( ξ , n ) = | infix ( ξ ) X n | for quasiperiodic ω -words. Their factor complexity (or topological entropy) is defined as τ ( ξ ) : = lim n log | X | | infix ( ξ ) X n | n (e.g., [4], Section 4.2.2 or [5,22]). Thus the upper bound for ξ P q ω is log | X | λ q log | X | t P which is bounded away from the value 1 for almost periodic ω -words.
Along with the subword complexity in [5] the Kolmogorov complexity of quasiperiodic ω -words was addressed. Obviously, subword complexity upper bounds Kolmogorov complexity (e.g., [22]). Since the ω -languages P q ω are regular ones, the results of [22] show that there are ω -words ξ P q ω whose Kolmogorov complexity achieves their subword complexity. Moreover, as P q ω = q · R q ω where R q ω is a finite prefix code, the results of [22,26,27] give more detailed bounds for most complex quasiperiodic ω -words w.r.t. several notions of Kolmogorov complexity [28].

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Marcus, S. Bridging two hierarchies of infinite words. J. UCS 2002, 8, 292–296. [Google Scholar]
  2. Marcus, S. Quasiperiodic infinite words (column: Formal language theory). Bull. EATCS 2004, 82, 170–174. [Google Scholar]
  3. Marcus, S.; Păun, G. Infinite (almost periodic) words, formal languages and dynamical systems. Bull. EATCS 1994, 54, 224–231. [Google Scholar]
  4. Cassaigne, J.; Nicolas, F. Factor complexity. Combinatorics, automata and number theory. In Encyclopedia of Mathematics and Its Applications; Cambridge University Press: Cambridge, UK, 2010; Volume 135, pp. 163–247. [Google Scholar]
  5. Monteil, T.; Marcus, S. Quasiperiodic infinite words: Multi-scale case and dynamical properties. arXiv 2006, arXiv:math/0603354. [Google Scholar]
  6. Polley, R.; Staiger, L. The maximal subword complexity of quasiperiodic infinite words. In Proceedings of the Twelfth Annual Workshop on Descriptional Complexity of Formal Systems, Saskatoon, SK, Canada, 8–10 August 2010. [Google Scholar]
  7. Staiger, L. Quasiperiods of infinite words. In Mathematics almost Everywhere. In Memory of Solomon Marcus; Bellow, A., Calude, C.S., Zamfirescu, T., Eds.; World Scientific: Hackensack, NJ, USA, 2018; pp. 17–36. [Google Scholar]
  8. Staiger, L. On the generative power of quasiperiods. In Lecture Notes in Computer Science, Proceedings of the 22nd International Conference Descriptional Complexity of Formal Systems, Vienna, Austria, 24–26 August 2020; Jirásková, G., Pighizzini, G., Eds.; Springer: Cham, Switzerland, 2020; Volume 12442, pp. 219–230. [Google Scholar]
  9. Levé, F.; Richomme, G. Quasiperiodic infinite words: Some answers (column: Formal language theory). Bull. EATCS 2004, 84, 128–138. [Google Scholar]
  10. Apostolico, A.; Ehrenfeucht, A. Efficient detection of quasiperiodicities in strings. Theor. Comput. Sci. 1993, 119, 247–265. [Google Scholar] [CrossRef] [Green Version]
  11. Apostolico, A.; Farach, M.; Iliopoulos, C.S. Optimal superprimitivity testing for strings. Inform. Process. Lett. 1991, 39, 17–20. [Google Scholar] [CrossRef] [Green Version]
  12. Mouchard, L. Normal forms of quasiperiodic strings. Theoret. Comput. Sci. 2000, 249, 313–324. [Google Scholar] [CrossRef] [Green Version]
  13. Brzozowski, J.A. Roots of star events. J. ACM 1967, 14, 466–477. [Google Scholar] [CrossRef]
  14. Berstel, J.; Perrin, D. Theory of codes. In Pure and Applied Mathematics; Academic Press Inc.: Orlando, FL, USA, 1985; Volume 117. [Google Scholar]
  15. Shyr, H.-J. Free Monoids and Languages, 3rd ed.; Hon Min Book Company: Taichung, Taiwan, 2001. [Google Scholar]
  16. Lothaire, M. Combinatorics on Words, 2nd ed.; Cambridge University Press: Cambridge, UK, 1997; Volume 17. [Google Scholar]
  17. Fine, N.J.; Wilf, H.S. Uniqueness theorems for periodic functions. Proc. Am. Math. Soc. 1965, 16, 109–114. [Google Scholar] [CrossRef]
  18. Bruyère, V.; Wang, L.M.; Zhang, L. On completion of codes with finite deciphering delay. Eur. J. Combin. 1990, 11, 513–521. [Google Scholar] [CrossRef] [Green Version]
  19. Fernau, H.; Reinhardt, K.; Staiger, L. Decidability of code properties. Theor. Inform. Appl. 2007, 41, 243–259. [Google Scholar] [CrossRef] [Green Version]
  20. Staiger, L. On infinitary finite length codes. RAIRO Inform. Théor. Appl. 1986, 20, 483–494. [Google Scholar] [CrossRef]
  21. Devolder, J.; Latteux, M.; Litovsky, I.; Staiger, L. Codes and infinite words. Acta Cybernet. 1994, 11, 241–256. [Google Scholar]
  22. Staiger, L. Kolmogorov complexity and Hausdorff dimension. Inform. Comput. 1993, 103, 159–194. [Google Scholar] [CrossRef] [Green Version]
  23. Berstel, J.; Reutenauer, C. Rational series and their languages. In EATCS Monographs on Theoretical Computer Science; Springer: Berlin/Heidelberg, Germany, 1988; Volume 12. [Google Scholar]
  24. Salomaa, A.; Soittol, M. Automata-Theoretic Aspects of Formal Power Series; Springer: New York, NY, USA, 1978. [Google Scholar]
  25. Staiger, L. The entropy of finite-state ω-languages. Probl. Control Inform. Theory/Probl. Upravlen. Teor. Inform. 1985, 14, 383–392. [Google Scholar]
  26. Mielke, J.; Staiger, L. On oscillation-free ε-random sequences II. In Computability and Complexity in Analysis; Dagstuhl Seminar Proceedings; Bauer, A., Hertling, P., Ko, K.-I., Eds.; Schloss Dagstuhl-Leibniz-Zentrum für Informatik: Wadern, Germany, 2009; Volume 09003. [Google Scholar]
  27. Staiger, L. Bounds on the Kolmogorov complexity function for infinite words. In Information and Complexity; World Scientific Series in Information Studies; Chapter 8; Burgin, M., Calude, C.S., Eds.; World Scientific: Hackensack, NJ, USA, 2017; Volume 6, pp. 200–224. [Google Scholar]
  28. Uspensky, V.A.; Shen, A. Relations between varieties of Kolmogorov complexities. Math. Syst. Theory 1996, 29, 271–292. [Google Scholar] [CrossRef] [Green Version]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Staiger, L. The Maximal Complexity of Quasiperiodic Infinite Words. Axioms 2021, 10, 306. https://doi.org/10.3390/axioms10040306

AMA Style

Staiger L. The Maximal Complexity of Quasiperiodic Infinite Words. Axioms. 2021; 10(4):306. https://doi.org/10.3390/axioms10040306

Chicago/Turabian Style

Staiger, Ludwig. 2021. "The Maximal Complexity of Quasiperiodic Infinite Words" Axioms 10, no. 4: 306. https://doi.org/10.3390/axioms10040306

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop