The Maximal Complexity of Quasiperiodic Infinite Words

Staiger, Ludwig

doi:10.3390/axioms10040306


by

Ludwig Staiger

Institut für Informatik, Martin-Luther-Universität Halle-Wittenberg, D-06099 Halle (Saale), Germany

Axioms 2021, 10(4), 306; https://doi.org/10.3390/axioms10040306

Submission received: 27 September 2021 / Revised: 9 November 2021 / Accepted: 11 November 2021 / Published: 17 November 2021

(This article belongs to the Special Issue In Memoriam, Solomon Marcus)


Abstract

A quasiperiod of a finite or infinite string is a word whose occurrences cover every part of the string. An infinite string is referred to as quasiperiodic if it has a quasiperiod. We present a characterisation of the set of infinite strings having a certain word q as quasiperiod via a finite language

P_{q}

consisting of prefixes of the quasiperiod q. It turns out its star root

\sqrt[*]{P_{q}}

is a suffix code having a bounded delay of decipherability. This allows us to calculate the maximal subword (or factor) complexity of quasiperiodic infinite strings having quasiperiod q and further to derive that maximally complex quasiperiodic infinite strings have quasiperiods

a b a

or

a a b a a

. It is shown that, for every length

l \geq 3

, a word of the form

a^{n} b a^{n}

(or

a^{n} b b a^{n}

if l is even) generates the most complex infinite string having this word as quasiperiod. We give the exact ordering of the lengths l with respect to the achievable complexity among all words of length l.

Keywords:

quasiperiod; formal language; asymptotic growth; polynomial

MSC:

68Q45

1. Introduction

In his tutorials [1,2,3] Solomon Marcus dealt with several properties of infinite words. Among them he considered quasiperiodicity and its influence on measures of symmetry like complexity, recurrence or entropy. One topic of interest was their subword complexity (or factor complexity [4]). Besides the asymptotic behaviour of the factor complexity, also known as topological entropy ([4], Section 4.2.2, or [5]), Marcus was also interested in the behaviour of the complexity function

f (ξ, n)

assigning to a natural number

n \in N

the number of subwords of length n of the infinite word (

ω

-word)

ξ

. Here he was also concerned with recurrences in

ω

-words and their influence on subword complexity. A well-known fact established by Grillenberger is that the asymptotic subword complexity (or topological entropy) of an almost periodic (or uniformly recurrent)

ω

-word can be arbitrarily close (but not equal) to the maximal subword complexity (see [4], Theorem 4.4.4).

The present paper summarises results on the subword complexity of infinite words obtained in [6,7,8]. We study in detail the structure of the set of infinite words having a certain word q as quasiperiod and how this is connected with the set of finite words with the same quasiperiod. Moreover, we address a question raised in [9] about the maximally achievable subword complexity of a quasiperiodic infinite word.

A first result shows that for every word q there is a value

λ_{q}, 1 \leq λ_{q} < 2,

such that, for every infinite word

ξ

with quasiperiod q, the complexity function

f (ξ, n)

is bounded by

O (1) \cdot λ_{q}^{n}

, and this bound is achieved for certain infinite words having quasiperiod q. The maximally possible value for

λ_{q}

is

λ_{q} = t_{P} \approx 1.324718

, where

t_{P}

is the smallest Pisot-Vijayaraghavan number, that is, the unique real root

t_{P}

of the cubic polynomial

x^{3} - x - 1

.
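As a quick numerical check of the constant quoted above, the following minimal sketch approximates the real root of the cubic by bisection. The function name `plastic_root`, the tolerance and the bracketing interval [1, 2] are choices made here for illustration only; they are not taken from the paper.

```python
def plastic_root(tol=1e-12):
    """Approximate the unique real root of x**3 - x - 1 by bisection."""
    f = lambda x: x**3 - x - 1
    lo, hi = 1.0, 2.0              # f(1) = -1 < 0 < 5 = f(2), so the root lies in between
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) < 0:
            lo = mid               # root is to the right of mid
        else:
            hi = mid               # root is at mid or to its left
    return (lo + hi) / 2

t_P = plastic_root()
print(round(t_P, 6))
```

Bisection is used only because it needs no derivative and no external library; any root finder would serve equally well.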

As a generalisation of the above-mentioned questions [2,9] we estimate, for every length

n \geq 3

, the values

λ_{n} = max {λ_{q} : | q | = n}

, their ordering and the words

q, | q | = n,

for which

λ_{q} = λ_{n}

. It appears that a two letter alphabet is sufficient for achieving the maximal complexity

λ_{n}

.

In order to prove these properties we start with a general investigation of quasiperiodicity of words (as e.g., in [10,11,12]) and infinite words.

The paper is organised as follows. After introducing some notation we derive in Section 3 a characterisation of quasiperiodic words and

ω

-words having a certain quasiperiod q. Moreover, we use the finite basis sets

P_{q}

and its dual

R_{q}

(

L (q)

and

R (q)

in [12]) from which the sets of quasiperiodic words or

ω

-words having quasiperiod q can be constructed. In Section 4 it is then proved that the star root of

P_{q}

is a suffix code having a bounded delay of decipherability and, dually, the star root of

R_{q}

is a prefix code.

These prerequisites allow us, in Section 5, to estimate the number of subwords of the language

Q_{q}

of all quasiperiodic words having quasiperiod q. It turns out that

c_{q, 1} \cdot λ_{q}^{n} \leq f (Q_{q}, n) \leq c_{q, 2} \cdot λ_{q}^{n}

where

f (Q_{q}, n)

is the number of subwords of length n of words in

Q_{q}

and

1 \leq λ_{q} \leq t_{P}

depends on q. We construct, for every quasiperiod q, a quasiperiodic

ω

-word

ξ_{q}

with quasiperiod q whose subword complexity

f (ξ_{q}, n)

is maximal.

The values

λ_{q}

turn out to be maximal positive roots of polynomials associated with the star root

\sqrt[*]{P_{q}}

. Section 6 deals with the properties of those polynomials. This allows us to compare the roots

λ_{q}

.

Sections 7 and 8 then prove the above-mentioned results on the values

λ_{q}

and

λ_{n} = max {λ_{q} : | q | = n}

. Here we also derive the complete ordering of the values

λ_{n}

.

2. Notation and Preliminaries

In this section we introduce the notation used throughout the paper. By

N = {0, 1, 2, \dots}

we denote the set of natural numbers. Let X be an alphabet of cardinality

| X | = r \geq 2

, and let throughout the paper

a, b \in X, a \neq b,

be two different letters. By

X^{*}

we denote the set of finite words on X, including the empty word e, and

X^{ω}

is the set of infinite strings (

ω

-words) over X. Subsets of

X^{*}

will be referred to as languages and subsets of

X^{ω}

as ω-languages.

For

w \in X^{*}

and

η \in X^{*} \cup X^{ω}

let

w \cdot η

be their concatenation. This concatenation product extends in an obvious way to subsets

L \subseteq X^{*}

and

B \subseteq X^{*} \cup X^{ω}

. For a language L let

L^{*} : = ⋃_{i \in N} L^{i}

, and by

L^{ω} : = {w_{1} \dots w_{i} \dots : w_{i} \in L \ {e}}

we denote the set of infinite strings formed by concatenating words in L. The smallest subset of a language L which generates

L^{*}

is called its star root

\sqrt[*]{L}

[13]. It satisfies

\sqrt[*]{L} = (L \ {e}) \ {(L \ {e})}^{2} \cdot L^{*} .

Furthermore

| w |

is the length of the word

w \in X^{*}

and

pref (B)

is the set of all finite prefixes of the strings in

B \subseteq X^{*} \cup X^{ω}

. We shall abbreviate

w \in pref (η) (η \in X^{*} \cup X^{ω})

by

w ⊑ η

.

We denote by

B / w : = {η : w \cdot η \in B}

the left derivative of the set

B \subseteq X^{*} \cup X^{ω}

. As usual, a language

L \subseteq X^{*}

is regular provided it is accepted by a finite automaton. An equivalent condition is that its set of left derivatives

{L / w : w \in X^{*}}

is finite.

The sets of infixes of B or

η

are

infix (B) : = ⋃_{w \in X^{*}} pref (B / w)

and

infix (η) : = ⋃_{w \in X^{*}} pref ({η} / w)

, respectively. In the sequel we assume the reader to be familiar with basic facts of language theory.

We call a word

w \in X^{*} \ {e}

primitive if

w = v^{n}

implies

n = 1

, that is, w is not the power of a shorter word, and we call

w \in X^{*} \ {e}

overlap-free if none of its proper prefixes is a suffix of w. The following facts are known (e.g., [14,15]).

Fact 1.

Every word

w \in X^{*} \ {e}

has a unique representation

w = v^{n}

where v is primitive.

Fact 2.

Let

q, v, w \in X^{*}, 0 < | v | < | q |

. If

v \cdot q = q \cdot w

then

v = u \cdot u^{'}

,

q = {(u \cdot u^{'})}^{κ} \cdot u

and

w = u^{'} \cdot u

for some

u, u^{'} \in X^{*}, u \neq e,

and

κ \in N

. In particular, q is not overlap-free.

Fact 3.

If

w \cdot v = v \cdot w, w, v \in X^{*}

then

w, v

are powers of a common (primitive) word.

As usual a language

L \subseteq X^{*}

is called a code provided

w_{1} \dots w_{l} = v_{1} \dots v_{k}

for

w_{1}, \dots, w_{l},

v_{1}, \dots, v_{k} \in L

implies

l = k

and

w_{i} = v_{i}

. A code L is said to be a prefix code (suffix code) provided no codeword is a prefix (suffix) of another codeword.

3. Quasiperiodicity

3.1. General Properties

The notion of quasiperiodicity can be formalised in the following manner. A finite or infinite word

η \in X^{*} \cup X^{ω}

is referred to as quasiperiodic with quasiperiod

q \in X^{*} \ {e}

provided that for every

j < | η | \in N \cup {\infty}

there is a prefix

u_{j} ⊑ η

of length

j - | q | < | u_{j} | \leq j

such that

u_{j} \cdot q ⊑ η

, that is, for every

w ⊑ η

the relation

u_{| w |} ⊏ w ⊑ u_{| w |} \cdot q

is valid. Informally,

η

has quasiperiod q if every position of

η

occurs within some occurrence of q in

η

[11,12].
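For finite words the informal description can be tested directly. The sketch below marks every position that lies inside some occurrence of q and reports whether all positions are marked. The names `occurrences` and `has_quasiperiod` are hypothetical helpers introduced here, and the check only applies to finite words.

```python
def occurrences(word, q):
    """Start positions of all occurrences of q in word."""
    return [i for i in range(len(word) - len(q) + 1) if word.startswith(q, i)]

def has_quasiperiod(word, q):
    """True if every position of the finite word lies inside an occurrence of q."""
    if not q:
        return False               # a quasiperiod must be nonempty
    covered = [False] * len(word)
    for start in occurrences(word, q):
        for pos in range(start, start + len(q)):
            covered[pos] = True
    return all(covered)            # vacuously True for the empty word

print(has_quasiperiod("abaababa", "aba"))
print(has_quasiperiod("abab", "aba"))
```

In the first call the occurrences of aba start at positions 0, 3 and 5 and cover all eight positions; in the second call the final letter b lies in no occurrence.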

Let for

q \in X^{*} \ {e}

,

Q_{q}

be the set of quasiperiodic words with quasiperiod q. Then

{q}^{*} \subseteq Q_{q} = Q_{q}^{*}

and

Q_{q} \ {e} \subseteq X^{*} \cdot q \cap q \cdot X^{*}

. In order to describe the set of quasiperiodic strings having a certain quasiperiod

q \in X^{*} \ {e}

the following definition is helpful.

Definition 1.

A family

{(w_{i})}_{i = 1}^{ℓ}

,

ℓ \in N \cup {\infty}

, of words

w_{i} \in X^{*} \cdot q

is referred to as a q-chain provided

w_{1} = q

,

w_{i} ⊏ w_{i + 1}

and

| w_{i + 1} | - | w_{i} | \leq | q |

.

The following holds.

Lemma 1.

1.: $w \in Q_{q} \ {e}$ if and only if there is a q-chain ${(w_{i})}_{i = 1}^{ℓ}$ such that $w_{ℓ} = w$ .
2.: An ω-word $ξ \in X^{ω}$ is quasiperiodic with quasiperiod q if and only if there is a q-chain ${(w_{i})}_{i = 1}^{\infty}$ such that $w_{i} ⊏ ξ$ .

Proof.

It suffices to show how a family

{(u_{j})}_{j = 0}^{| η | - 1}

can be converted to a q-chain

{(w_{i})}_{i = 1}^{ℓ}

and vice versa.

Consider

η \in X^{*} \cup X^{ω}

and let

{(u_{j})}_{j = 0}^{| η | - 1}

be a family such that

u_{j} \cdot q ⊑ η

and

j - | q | < | u_{j} | \leq j

for

j < | η |

.

Define

w_{1} : = q

and

w_{i + 1} : = u_{| w_{i} |} \cdot q

as long as

| w_{i} | < | η |

. Then

w_{i} ⊑ η

and

| w_{i} | < | w_{i + 1} | = | u_{| w_{i} |} \cdot q | \leq | w_{i} | + | q |

. Thus

{(w_{i})}_{i = 1}^{ℓ}

is a q-chain with

w_{i} ⊑ η

.

Conversely, let

{(w_{i})}_{i = 1}^{ℓ}

be a q-chain such that

w_{i} ⊑ η

and set

u_{j} : = {max}_{⊑} \{w^{'} : \exists i (w^{'} \cdot q = w_{i} \land | w^{'} | \leq j)\}, \quad \text{for } j < | η | .

By definition,

u_{j} \cdot q ⊑ η

and

| u_{j} | \leq j

. Assume

| u_{j} | \leq j - | q |

and

u_{j} \cdot q = w_{i}

. Then

| w_{i} | \leq j < | η |

. Consequently, in the q-chain there is a successor

w_{i + 1}

,

| w_{i + 1} | \leq | w_{i} | + | q | \leq j + | q |

. Let

w_{i + 1} = w^{″} \cdot q

. Then

u_{j} ⊏ w^{″}

and

| w^{″} | \leq j

which contradicts the maximality of

u_{j}

. □

Lemma 1 yields the following consequences.

Corollary 1.

Let

u \in pref (Q_{q})

. Then there are words

w, w^{'} \in Q_{q}

such that

w ⊑ u ⊑ w^{'}

and

| u | - | w |, | w^{'} | - | u | \leq | q |

.

Corollary 2.

Let

ξ \in X^{ω}

. Then the following are equivalent.

1.: ξ is quasiperiodic with quasiperiod q.
2.: $pref (ξ) \cap Q_{q}$ is infinite.
3.: $pref (ξ) \subseteq pref (Q_{q})$ .

3.2. Finite Generators for Quasiperiodic Words

In this part we consider the finite languages

P_{q}

and

R_{q}

(

L (q)

and

R (q)

in [12]) which generate the set of quasiperiodic words as well as the set of quasiperiodic

ω

-words having quasiperiod q.

We set

P_{q} : = {v : e ⊏ v ⊑ q ⊏ v \cdot q} = {v : \exists v^{'} (v^{'} ⊏ q \land v \cdot v^{'} = q)} .

(1)

Then we have the following properties.

Proposition 1.

1.: $q \in P_{q}$ and $P_{q} = {q}$ if and only if q is overlap-free.
2.: $Q_{q} = P_{q}^{*} \cdot q \cup {e} \subseteq P_{q}^{*}$
3.: $pref (Q_{q}) = pref (P_{q}^{*}) = P_{q}^{*} \cdot pref (q)$

Proof.

1.

q \in P_{q}

is obvious, and the equivalence follows immediately from the definition of

P_{q}

.

2. In order to prove

Q_{q} \subseteq P_{q}^{*} \cdot q \cup {e}

we show that

w_{i} \in P_{q}^{*} \cdot q

for every q-chain

{(w_{i})}_{i = 1}^{ℓ}

. This is certainly true for

w_{1} = q

. Now proceed by induction on i. Let

w_{i} = w_{i}^{'} \cdot q \in P_{q}^{*} \cdot q

and

w_{i + 1} = w_{i + 1}^{'} \cdot q

. Then

w_{i}^{'} \cdot v_{i} = w_{i + 1}^{'}

for a suitable word $v_{i} \neq e$. Now from

w_{i} ⊏ w_{i + 1}

we obtain

e ⊏ v_{i} ⊑ q ⊏ v_{i} \cdot q

, that is,

v_{i} \in P_{q}

.

Conversely, let

v_{i} \in P_{q}

and consider

v_{1} \dots v_{ℓ} \cdot q

. Since

q ⊑ v_{i} \cdot q

the family

{(v_{1} \dots v_{j} \cdot q)}_{j = 0}^{ℓ}

is a q-chain. This shows

P_{q}^{*} \cdot q \cup {e} \subseteq Q_{q}

.

3. is an immediate consequence of 2. □
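The definition in Equation (1) and the identity in Proposition 1.2 can be checked experimentally on short words. The following sketch computes P_q straight from the definition and compares the words of P_q^* · q up to a length bound with the words found by a brute-force covering test. All function names and the length bounds are assumptions made for this illustration; the comparison is a finite sanity check, not a proof.

```python
from itertools import product

def prefix_generators(q):
    """P_q: nonempty prefixes v of q such that q is a prefix of v + q."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

def has_quasiperiod(word, q):
    """Covering test: every position of word lies in an occurrence of q."""
    covered = [False] * len(word)
    for i in range(len(word) - len(q) + 1):
        if word.startswith(q, i):
            covered[i:i + len(q)] = [True] * len(q)
    return all(covered)

def generated_words(q, max_len):
    """All words of P_q^* . q of length at most max_len."""
    gens, found = prefix_generators(q), set()
    frontier = {""} if len(q) <= max_len else set()
    while frontier:
        found |= {u + q for u in frontier}
        # extend each product by one more generator while the result still fits
        frontier = {u + v for u in frontier for v in gens
                    if len(u) + len(v) + len(q) <= max_len}
    return found

def covered_words(q, max_len):
    """All nonempty words of length at most max_len having quasiperiod q."""
    letters = sorted(set(q))
    return {"".join(t) for n in range(1, max_len + 1)
            for t in product(letters, repeat=n)
            if has_quasiperiod("".join(t), q)}

print(prefix_generators("aba"))
print(generated_words("aba", 9) == covered_words("aba", 9))
```

The brute-force side enumerates every word over the letters of q, so the bound should stay small.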

Proposition 1 and Corollary 2 imply the following characterisation of

ω

-words having quasiperiod q.

{ξ : ξ \in X^{ω} \land ξ \text{ has quasiperiod } q} = P_{q}^{ω}

(2)

Proof.

Since

P_{q}

is finite,

P_{q}^{ω} = {ξ : ξ \in X^{ω} \land pref (ξ) \subseteq pref (P_{q}^{*})}

. □

A dual generator of

Q_{q}

is obtained by the right-to-left duality of reading words using the suffix relation

\leq_{s}

instead of the prefix relation ⊑.

R_{q} : = {v : e <_{s} v \leq_{s} q <_{s} q \cdot v} = {v : \exists v^{'} (v^{'} <_{s} q \land v^{'} \cdot v = q)} .

(3)

Analogously to Proposition 1 we obtain

Proposition 2.

1.: $q \in R_{q}$ and $R_{q} = {q}$ if and only if q is overlap-free.
2.: $Q_{q} = q \cdot R_{q}^{*} \cup {e} \subseteq R_{q}^{*}$ , and
3.: $pref (Q_{q}) = pref (q) \cup q \cdot pref (R_{q}^{*})$ .

The proof of Items 1 and 2 is similar to the proof of Proposition 1 using the reversed version of q-chain, and Item 3 then follows from Item 2. A slight difference appears with an analogy to Equation (2).

{ξ : ξ \in X^{ω} \land ξ \text{ has quasiperiod } q} = q \cdot R_{q}^{ω} \subseteq R_{q}^{ω}

(4)

Here the last inclusion might be proper, e.g., for

q = a b a

where

R_{a b a}^{ω} = {b a, a b a}^{ω} \neq a b a \cdot R_{a b a}^{ω}

.
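The dual set R_q from Equation (3) can be computed in the same way as P_q, and the example q = aba can be replayed on finite prefixes. The helper names below are hypothetical, and the last lines only inspect a finite prefix of the ω-word (ba)(ba)(ba)…, which is enough to see that it does not start with q.

```python
def suffix_generators(q):
    """R_q: nonempty suffixes v of q such that q is a suffix of q + v."""
    return [q[i:] for i in range(len(q) - 1, -1, -1) if (q + q[i:]).endswith(q)]

def prefix_generators(q):
    """P_q, included to illustrate the left-to-right duality."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

q = "aba"
R = suffix_generators(q)                       # listed by increasing length
mirror = [v[::-1] for v in prefix_generators(q[::-1])]

# (ba)(ba)(ba)... lies in R_q^omega but does not begin with q = aba,
# so it cannot belong to q . R_q^omega.
witness_prefix = "ba" * 3
print(R, mirror, witness_prefix.startswith(q))
```

Reversing every word of P computed for the reversed quasiperiod gives R_q, which is the right-to-left duality used in the text.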

An alternative derivation of the languages

P_{q}

and

R_{q}

can be found in Definition 2 of [12]. Here the borders, that is, prefixes which are simultaneously suffixes of the quasiperiod q, are used:

\begin{matrix} P_{q} & = & {v : \exists w (w ⊏ q \land w <_{s} q \land q = v \cdot w)}, \text{ and} \\ R_{q} & = & {v : \exists w (w ⊏ q \land w <_{s} q \land q = w \cdot v)} . \end{matrix}

In the subsequent sections we focus on the investigation of

P_{q}

due to the left-to-right direction of

ω

-words.

3.3. Combinatorial Properties of $P_{q}$

We investigate basic properties of

P_{q}

using simple facts from combinatorics on words (see e.g., [14,15,16]).

Proposition 3.

v \in P_{q}

if and only if

| v | \leq | q |

and there is a prefix

\bar{v} ⊏ v

such that

q = v^{k} \cdot \bar{v}

for

k = ⌊| q | / | v |⌋

.

This is an immediate consequence of Fact 2.

Corollary 3.

v \in P_{q}

if and only if

| v | \leq | q |

and there is a

k^{'} \in N

such that

q ⊑ v^{k^{'}}

.

Now set

q_{0} : = {min}_{⊑} P_{q}

. Then in view of Proposition 3 and Corollary 3 we have the following canonical representation.

q = q_{0}^{k} \cdot \bar{q} \quad \text{where } k = ⌊| q | / | q_{0} |⌋ \text{ and } \bar{q} ⊏ q_{0} .

(5)

We will refer to

q_{0}

as the repeated prefix and to k as the repetition factor. If

| q_{0} | > | q | / 2

, that is, if

k = 1

we will refer to q as irreducible. (Reducible words are also known as periodic words [10,11].)
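The canonical representation in Equation (5) is easy to compute once P_q is available. The sketch below returns the repeated prefix, the repetition factor and the remainder for a nonempty word q; the names `canonical_form` and `is_irreducible` are chosen here for illustration and are not notation from the paper.

```python
def prefix_generators(q):
    """P_q: nonempty prefixes v of q such that q is a prefix of v + q."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

def canonical_form(q):
    """Return (q0, k, qbar) with q == q0 * k + qbar, following Equation (5)."""
    q0 = prefix_generators(q)[0]       # shortest element, i.e. the minimum w.r.t. the prefix order
    k = len(q) // len(q0)              # repetition factor
    qbar = q[k * len(q0):]             # remainder, a proper prefix of q0
    assert q == q0 * k + qbar and q0.startswith(qbar) and qbar != q0
    return q0, k, qbar

def is_irreducible(q):
    """q is irreducible when the repetition factor equals 1."""
    return canonical_form(q)[1] == 1

print(canonical_form("ababa"))
print(canonical_form("aabaaaaba"))
```

For ababa the repeated prefix is ab with repetition factor 2 and remainder a, so ababa is reducible; for aabaaaaba the repeated prefix aabaa is longer than half of the word, so that word is irreducible.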

Corollary 4.

Every word

v \in \sqrt[*]{P_{q}}

is primitive.

Proof.

Assume

v = v_{1}^{l}

for some

v \in \sqrt[*]{P_{q}}

and

l > 1

. Then

q ⊑ v^{k^{'}} = v_{1}^{l \cdot k^{'}}

, and, according to Corollary 3

v_{1} \in P_{q}

contradicting

v \in \sqrt[*]{P_{q}}

. □

Proposition 4.

Let

q \in X^{*}, q \neq e

,

q_{0} = {min}_{⊑} P_{q}

,

q = q_{0}^{k} \cdot \bar{q}

and

v \in P_{q}^{*} \ {e}

.

1.: If $w ⊑ q$ then $v \cdot w ⊑ q$ or $q ⊑ v \cdot w$ .
2.: If $w \cdot v ⊑ q$ then $w \in {q_{0}}^{*}$ .

Proof.

From Proposition 1.2 we know

v \cdot q \in P_{q}^{*} \cdot q \subseteq Q_{q} \subseteq q \cdot X^{*}

. Consequently,

q ⊑ v \cdot q

. Then

v \cdot w ⊑ v \cdot q

implies

v \cdot w ⊑ q

or

q ⊑ v \cdot w

according to whether

| v \cdot w | \leq | q |

or not.

Since

q_{0} ⊑ v

, it suffices to prove the second assertion for

q_{0}

. First one observes that

w ⊑ q

and

| w | \leq | q | - | q_{0} |

. Thus

w ⊑ q_{0}^{k - 1} \cdot \bar{q}

. Therefore, we have

w \cdot q_{0} ⊑ q

and

q_{0} \cdot w ⊑ q

which implies

w \cdot q_{0} = q_{0} \cdot w

and, according to Fact 3, w and

q_{0}

are powers of a common word. The assertion follows because

q_{0}

is primitive. □

Next we derive a lower bound on the lengths of words in

P_{q} \ {q_{0}}^{*}

.

To this end, we use the Theorem of Fine and Wilf.

Theorem 1

([17]). Let

v, w \in X^{*}

. Suppose

v^{m}

and

w^{n}

, for some

m, n \in N

, have a common prefix of length

| v | + | w | - \gcd (| v |, | w |)

. Then v and w are powers of a common word

u \in X^{*}

of length

| u | = \gcd (| v |, | w |)

. (Here

\gcd (k, l)

denotes the greatest common divisor of two numbers

k, l \in N

.)
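The bound in Theorem 1 can be explored on small examples. The following sketch measures the common prefix of the periodic continuations of two nonempty words and compares it with the bound |v| + |w| − gcd(|v|, |w|). The function names are hypothetical, and the pair abaab, aba is an example chosen here (not taken from the paper) in which the common prefix is exactly one letter shorter than the bound.

```python
from math import gcd

def common_prefix_length(v, w, limit):
    """Length of the longest common prefix of v^omega and w^omega, capped at limit."""
    n = 0
    while n < limit and v[n % len(v)] == w[n % len(w)]:
        n += 1
    return n

def fine_wilf_bound(v, w):
    """The prefix length required in Theorem 1."""
    return len(v) + len(w) - gcd(len(v), len(w))

def commute(v, w):
    """v and w are powers of a common word exactly when v + w == w + v (cf. Fact 3)."""
    return v + w == w + v

v, w = "abaab", "aba"
print(fine_wilf_bound(v, w), common_prefix_length(v, w, 20), commute(v, w))
```

Here the bound is 7 while the common prefix abaaba has length 6, and the two words are not powers of a common word, so the hypothesis of the theorem cannot be weakened for this pair.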

Proposition 5.

Let

q \in X^{*}, q \neq e

,

q_{0} = {min}_{⊑} P_{q}

,

q = q_{0}^{k} \cdot \bar{q}

and

v \in P_{q} \ {q_{0}}^{*}

. Then

| v | > | q | - | q_{0} | + \gcd (| v |, | q_{0} |)

.

Proof.

Since

q_{0}, v \in P_{q}

, Corollary 3 and Equation (5) imply that q is a common prefix of

q_{0}^{k + 1}

and

v^{k^{'}}

for some

k^{'} \in N

. If

| v | \leq | q | - | q_{0} | + \gcd (| v |, | q_{0} |)

then by Theorem 1

q_{0}

and v are powers of a common word, that is, v is a power of the primitive word

q_{0}

. □

Corollary 5.

\sqrt[*]{P_{q}} = P_{q} \ q_{0}^{2} \cdot {q_{0}}^{*}

Proof.

It suffices to show

P_{q} \cap P_{q}^{2} \cdot P_{q}^{*} \subseteq {q_{0}}^{*}

. To this end observe that in view of Proposition 5

| v \cdot v^{'} | > | q |

whenever

v \in P_{q} \ {q_{0}}^{*}

or

v^{'} \in P_{q} \ {q_{0}}^{*}

. □

As an immediate consequence we obtain that

\sqrt[*]{P_{q}} = P_{q}

if and only if q is an irreducible quasiperiod. Moreover, Proposition 5 shows that

\sqrt[*]{P_{q}} \subseteq {q_{0}} \cup {v^{'} : v^{'} ⊑ q \land | v^{'} | > | q | - | q_{0} | + \gcd (| v^{'} |, | q_{0} |)} .

(6)
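Corollary 5 can be compared with the general definition of the star root on concrete quasiperiods. In the sketch below, `star_root` removes every word that factorises into two or more nonempty words of the given finite language, while `star_root_by_corollary` simply deletes the higher powers of the repeated prefix from P_q. Both names are assumptions introduced for this illustration.

```python
def prefix_generators(q):
    """P_q: nonempty prefixes v of q such that q is a prefix of v + q."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

def star_root(language):
    """Words of a finite language that are not products of >= 2 nonempty words of it."""
    words = {w for w in language if w}

    def max_parts(word):
        # best[i]: largest number of factors from `words` whose product is word[:i]; -1 if none
        best = [0] + [-1] * len(word)
        for i in range(1, len(word) + 1):
            for g in words:
                j = i - len(g)
                if j >= 0 and best[j] >= 0 and word[j:i] == g:
                    best[i] = max(best[i], best[j] + 1)
        return best[-1]

    return sorted((w for w in words if max_parts(w) == 1), key=len)

def star_root_by_corollary(q):
    """Corollary 5: remove the powers q0^i with i >= 2 from P_q."""
    gens = prefix_generators(q)
    q0 = gens[0]

    def is_higher_power(v):
        n, r = divmod(len(v), len(q0))
        return r == 0 and n >= 2 and v == q0 * n

    return [v for v in gens if not is_higher_power(v)]

print(star_root(prefix_generators("ababa")), star_root_by_corollary("ababa"))
```

For q = ababa both computations discard abab = (ab)(ab) and keep ab and ababa.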

3.4. The Reduced Quasiperiod $\hat{q}$

Next we investigate the relation between a quasiperiod

q = q_{0}^{k} \cdot \bar{q}

where

q_{0} = {min}_{⊑} P_{q}

and

\bar{q} ⊏ q_{0}

and its reduced quasiperiod

\hat{q} : = q_{0} \cdot \bar{q}

. Since

q \in Q_{\hat{q}}

, we have

Q_{\hat{q}} \supseteq Q_{q}

.

We continue with a relation between

P_{q}

and

P_{\hat{q}}

. It is obvious that

q_{0}^{i} \in P_{q}

for every

i = 1, \dots, k

and

P_{\hat{q}} \subseteq {v : {\hat{q}}_{0} ⊑ v ⊑ \hat{q}} .

(7)

Lemma 2

([7], Lemma 2.2). Let

q \in X^{*}, q \neq e

,

q_{0} = {min}_{⊑} P_{q}

,

q = q_{0}^{k} \cdot \bar{q}

and

\hat{q} = q_{0} \cdot \bar{q}

the reduced quasiperiod of q. Then

P_{q} = {q_{0}^{i} : i = 1, \dots, k - 1} \cup {q_{0}^{k - 1} \cdot v : v \in P_{\hat{q}}} .

Proof.

Consider

v \in P_{\hat{q}}

. Then

v ⊑ q_{0} \bar{q} ⊏ v \cdot q_{0} \bar{q}

, and, consequently,

q_{0}^{k - 1} \cdot v ⊑ q_{0}^{k} \cdot \bar{q} ⊏ q_{0}^{k - 1} \cdot v \cdot q_{0} \bar{q} ⊏ q_{0}^{k - 1} \cdot v \cdot q_{0}^{k} \cdot \bar{q}

, that is,

q_{0}^{k - 1} \cdot v \in P_{q}

.

Conversely, let

v^{'} \in P_{q}

and

v^{'} \notin {q_{0}^{i} : i = 1, \dots, k - 1}

. Then, according to Proposition 5 there is a unique

v \neq e

such that

v^{'} = q_{0}^{k - 1} \cdot v

. Now

v^{'} = q_{0}^{k - 1} \cdot v ⊑ q = q_{0}^{k} \cdot \bar{q} ⊏ v^{'} \cdot q = q_{0}^{k - 1} \cdot v \cdot q_{0}^{k} \cdot \bar{q}

implies

v ⊑ q_{0} \cdot \bar{q} ⊏ v \cdot q_{0}^{k} \cdot \bar{q}

. Since

| v | \leq | q_{0} \cdot \bar{q} |

and

q_{0} \cdot \bar{q} ⊑ q_{0}^{k} \cdot \bar{q}

, we have

v ⊑ q_{0} \cdot \bar{q} ⊏ v \cdot q_{0} \cdot \bar{q}

. □
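The decomposition stated in Lemma 2 can be replayed on examples. The sketch below builds the right-hand side of the lemma from the reduced quasiperiod and compares it with P_q computed directly from Equation (1). The function name `lemma2_rhs` and the sample words are assumptions made here; agreement on a few words is a sanity check only.

```python
def prefix_generators(q):
    """P_q: nonempty prefixes v of q such that q is a prefix of v + q."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

def lemma2_rhs(q):
    """Right-hand side of Lemma 2, built from the reduced quasiperiod q0 . qbar."""
    q0 = prefix_generators(q)[0]
    k = len(q) // len(q0)
    q_hat = q0 + q[k * len(q0):]                   # reduced quasiperiod
    powers = [q0 * i for i in range(1, k)]         # q0^1, ..., q0^(k-1)
    shifted = [q0 * (k - 1) + v for v in prefix_generators(q_hat)]
    return powers + shifted

for word in ["ababa", "aabaabaa", "aaaa"]:
    print(word, lemma2_rhs(word) == prefix_generators(word))
```

For aabaabaa the repeated prefix is aab with k = 2, the reduced quasiperiod is aabaa, and prefixing each element of its generator set by aab recovers the three longer elements of P_q.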

Together with Corollary 5 this implies

P_{q} \ {q_{0}}^{*} = \sqrt[*]{P_{q}} \ {q_{0}}^{*} = q_{0}^{k - 1} \cdot (P_{\hat{q}} \ {q_{0}}) .

(8)

Moreover, we have the following.

Corollary 6.

| \sqrt[*]{P_{q}} | = 1

if and only if

q \in {q_{0}}^{*}

and

q_{0}

is overlap-free.

Proof.

Since

q_{0} \in \sqrt[*]{P_{q}}

,

| \sqrt[*]{P_{q}} | = 1

is equivalent with

\sqrt[*]{P_{q}} = {q_{0}}

or, according to Equation (8), with

P_{\hat{q}} = {q_{0}}

. This amounts to

\hat{q} = q_{0}

and, following Proposition 1.1

\hat{q} = q_{0}

has to be overlap-free. □

For the repeated prefix

{\hat{q}}_{0}

of

\hat{q}

we have the obvious relation

| {\hat{q}}_{0} | > | \bar{q} |

. In case

{\hat{q}}_{0} \neq q_{0}

we can improve this.

Lemma 3.

Let

q = q_{0}^{k} \cdot \bar{q}

with

k \geq 2

,

\bar{q} ⊏ q_{0}

and

\hat{q} = q_{0} \cdot \bar{q}

. If

{\hat{q}}_{0} \neq q_{0}

then

\bar{q} ⊏ {\hat{q}}_{0} ⊏ q_{0} \quad \text{and} \quad | {\hat{q}}_{0} | > | \bar{q} | + \gcd (| q_{0} |, | {\hat{q}}_{0} |),

and there is a nonempty suffix

v \neq e

of

q_{0}

such that

v ⊏ {\hat{q}}_{0}

and

v \cdot \bar{q} ⊏ {\hat{q}}_{0}^{2}

.

Proof.

We have

\bar{q} ⊑ q_{0}

and, since

q_{0} \in P_{\hat{q}}

, also

{\hat{q}}_{0} ⊑ q_{0}

. Moreover,

\hat{q} ⊑ q_{0}^{2}

and

\hat{q} ⊑ {\hat{q}}_{0}^{k^{'}}

for some

k^{'} \in N

. Since

q_{0} \neq {\hat{q}}_{0}

and both prefixes are primitive words, in view of Theorem 1 as a common prefix of

q_{0}^{2}

and

{\hat{q}}_{0}^{| q_{0} |}

the word

\hat{q} = q_{0} \cdot \bar{q}

has to satisfy

| \hat{q} | < | q_{0} | + | {\hat{q}}_{0} | - \gcd (| q_{0} |, | {\hat{q}}_{0} |)

, that is,

| {\hat{q}}_{0} | > | \bar{q} | + \gcd (| q_{0} |, | {\hat{q}}_{0} |)

. The assertion

\bar{q} ⊏ {\hat{q}}_{0} ⊏ q_{0}

now follows from a comparison of the lengths of

\bar{q}, {\hat{q}}_{0} ⊑ q_{0}

.

Now, let v be the suffix of

q_{0}

defined by

{\hat{q}}_{0}^{k^{'}} \cdot v = q_{0} ⊏ {\hat{q}}_{0}^{k^{'} + 1}

. Then

v ⊏ {\hat{q}}_{0}

and

v \cdot \bar{q} ⊏ {({\hat{q}}_{0})}^{2}

. □

3.5. Primitivity and Superprimitivity

In this section we consider the inclusion relations between the languages

Q_{q}, q \neq e

. Analogously to the primitivity of words in [10,11,12] a word was referred to as superprimitive if it is not covered by a shorter one. This leads to the following definition.

Definition 2

(superprimitive). A non-empty word

q \in X^{*} \ {e}

is superprimitive if and only if

Q_{q}

is maximal w.r.t. “⊆” in the family

{Q_{q} : q \in X^{*} \ {e}}

.

The next proposition relates the irreducibility of quasiperiods to superprimitivity.

Proposition 6

([12], Remark 4). If

q \in X^{*} \ {e}

is superprimitive then

| {min}_{⊑} P_{q} | > | q | / 2

, and if

| {min}_{⊑} P_{q} | > | q | / 2

then q is primitive.

Proof.

If

q_{0} = {min}_{⊑} P_{q}

and

| q_{0} | \leq | q | / 2

then

q = q_{0}^{k} \cdot \bar{q}

for some

\bar{q} ⊏ q_{0}

. Thus

q \in Q_{q_{0} \bar{q}}

and

q_{0} \bar{q} \notin Q_{q}

.

As

q = {(q^{'})}^{m}

with

m > 1

implies

| q_{0} | \leq | q^{'} | \leq | q | / 2

, the other assertion follows. □

The converse of Proposition 6 is not valid.

Example 1.

Let

q = abaabaababaab

. Then

P_{q} = {abaabaab, abaabaababa, q}

, and

| {min}_{⊑} P_{q} | = 8 > 13 / 2

but as

abaabaababaab \in Q_{abaab}

the word q is not superprimitive.

The word

q = ababa

is primitive but

q_{0} = ab

has

| q_{0} | \leq | q | / 2

.
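The claims of Example 1 can be verified mechanically. The sketch below assumes the informal reading of superprimitivity given at the start of this section (a word is superprimitive if no shorter word covers it) and uses the fact that any cover of q must be a prefix of q. The helper names are hypothetical.

```python
def prefix_generators(q):
    """P_q: nonempty prefixes v of q such that q is a prefix of v + q."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

def has_quasiperiod(word, q):
    """Covering test: every position of word lies in an occurrence of q."""
    covered = [False] * len(word)
    for i in range(len(word) - len(q) + 1):
        if word.startswith(q, i):
            covered[i:i + len(q)] = [True] * len(q)
    return all(covered)

def is_superprimitive(q):
    """True if no proper nonempty prefix of q covers q."""
    return not any(has_quasiperiod(q, q[:i]) for i in range(1, len(q)))

q = "abaabaababaab"
shortest = prefix_generators(q)[0]
print(len(shortest), 2 * len(shortest) > len(q))   # repeated prefix longer than |q| / 2
print(has_quasiperiod(q, "abaab"), is_superprimitive(q))
```

The word q is irreducible, yet it is covered by abaab and therefore not superprimitive, which is the point of the example.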

In contrast to the fact that the word

q_{0} = {min}_{⊑} P_{q}

is always primitive, it need not satisfy

| {min}_{⊑} P_{q_{0}} | > | q_{0} | / 2

, let alone be superprimitive.

Example 2.

q = aabaaabaaaa

has

q_{0} = aabaaabaa

which, in turn has

P_{q_{0}} = {aaba, aabaaab, aabaaaba, q_{0}}

with

| aaba | = 4 < | q_{0} | / 2

.

It turns out that every language

Q_{v}

is contained in a unique maximal

Q_{q}

. To this end we derive the following lemma (cf. also [10,11]).

Lemma 4.

Let

v \in Q_{q}

and

u \in infix (v) \cap q \cdot X^{*} \cap X^{*} \cdot q

. Then

u \in Q_{q}

.

For the sake of completeness we give a proof.

Proof.

We use a maximal q-chain

{(w_{i})}_{i = 1}^{n}

with

w_{n} = v

. Assume

v = u_{1} \cdot u \cdot u_{2}

. Since u has q as prefix and suffix, there are

1 \leq j \leq l \leq n

such that

w_{j} = u_{1} \cdot q

and

w_{l} = u_{1} \cdot u

. Let, for

1 \leq i \leq l - j + 1

, the words

w_{i}^{'}

be defined by

w_{i + j - 1} = u_{1} \cdot w_{i}^{'}

. Then

{(w_{i}^{'})}_{i = 1}^{l - j + 1}

is a q-chain with

w_{l - j + 1}^{'} = u

, that is,

u \in Q_{q}

. □

Corollary 7.

If

v \in Q_{q} \cap Q_{u}

and

| q | < | u |

then

Q_{u} \subseteq Q_{q}

.

The corollary shows that every language

Q_{v}

is contained in a unique maximal

Q_{q}

and that two languages

Q_{u}, Q_{q}

are either disjoint or compatible w.r.t. set inclusion. The latter is not true for

ω

-languages.

Example 3.

Let

q = a a b a a

and

u = a a b a a a

. Then

q^{ω} \notin P_{u}^{ω}

,

u^{ω} \notin P_{q}^{ω}

but

P_{u}^{ω} \cap P_{q}^{ω} \supseteq a a \cdot {b a a a, b a a a a}^{ω}

.

4. $P_{q}$ and $R_{q}$ as Codes

In this section we investigate in more detail the properties of the star root of

P_{q}

. It turns out that

\sqrt[*]{P_{q}}

is a suffix code which, additionally, has a bounded delay of decipherability. This delay is closely related to the largest power of

q_{0}

being a prefix of q.

According to [14,18,19,20] a subset

C \subseteq X^{*}

is a code of a delay of decipherability

m \in N

if and only if for all

v, v^{'}, w_{1}, \dots, w_{m} \in C

and

u \in C^{*}

the relation

v \cdot w_{1} \dots w_{m} ⊑ v^{'} \cdot u

implies

v = v^{'}

. Observe that

C \subseteq X^{*}

is a prefix code if and only if C has delay 0.
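For a finite set of words the delay condition just quoted can be tested by exhaustive search. The sketch below looks for a violation v · w_1 … w_m ⊑ v' · u with v ≠ v'; it checks only this implication and does not verify that the set is a code in the first place. The function names are assumptions, and the three test sets are the generator sets of aba and aabaaaaba together with the suffix code {b, ba, aa}.

```python
from itertools import product

def in_prefixes_of_star(r, code):
    """True if r is a prefix of some word in code^* (code: nonempty words)."""
    reachable = {0}                           # positions of r reached by whole codewords
    for i in range(len(r) + 1):
        if i not in reachable:
            continue
        if any(c.startswith(r[i:]) for c in code):
            return True                       # the rest of r fits inside one more codeword
        reachable.update(i + len(c) for c in code if r.startswith(c, i))
    return False

def has_delay(code, m):
    """Brute-force test of the delay-m condition for a finite set of words."""
    for v, v2 in product(code, repeat=2):
        if v == v2:
            continue
        for ws in product(code, repeat=m):
            left = v + "".join(ws)
            if v2.startswith(left):
                return False                  # left is already a prefix of v2 (take u = e)
            if left.startswith(v2) and in_prefixes_of_star(left[len(v2):], code):
                return False                  # left is a prefix of v2 . u for some u in code^*
    return True

print(has_delay(["ab", "aba"], 0), has_delay(["ab", "aba"], 1))
print(has_delay(["aabaa", "aabaaaab", "aabaaaaba"], 1),
      has_delay(["aabaa", "aabaaaab", "aabaaaaba"], 2))
```

The search is exponential in m, so it is only meant for very small sets and small delays.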

First we show that

\sqrt[*]{P_{q}}

is a suffix code. This generalises Proposition 7 of [12].

Proposition 7.

\sqrt[*]{P_{q}}

is a suffix code, and

\sqrt[*]{R_{q}}

is a prefix code.

Proof.

Assume

u = w \cdot v

for some

u, v \in \sqrt[*]{P_{q}}, u \neq v

. Then

u ⊑ q

and Proposition 4 (2) proves

w \in {q_{0}}^{*} \ {e}

. Consequently,

| v | \leq | q | - | q_{0} |

. Now Proposition 5 implies

v \in {q_{0}}^{*}

and hence

u \in {q_{0}}^{*}

. Since

u, v \in \sqrt[*]{P_{q}}

, we obtain

u = v = q_{0}

contradicting

u \neq v

.

Using the duality of

P_{q}

and

R_{q}

one shows in an analogous manner that

\sqrt[*]{R_{q}}

is a prefix code. □

An easy consequence of Proposition 7 is the Left and Right Normal Form of a quasiperiodic string ([12], Proposition 8).

Corollary 8

(Normal Form). Every word

w \in Q_{q}

has a unique factorisation

w = v_{1} \cdot v_{2} \dots v_{n}

into words

v_{i} \in \sqrt[*]{P_{q}} (\sqrt[*]{R_{q}}, \text{ respectively})

.

Since

\sqrt[*]{R_{q}}

is a prefix code while the words

v \in P_{q}

are prefixes of each other, we obtain

| \sqrt[*]{P_{q}} \cap \sqrt[*]{R_{q}} | = 1

generalising Remark 5 of [12]. In fact

\sqrt[*]{P_{q}} \cap \sqrt[*]{R_{q}} = {q}

or

\sqrt[*]{P_{q}} \cap \sqrt[*]{R_{q}} = {q_{0}}

depending on whether

q \neq q_{0}^{k}

or not.

We continue this part by investigating the delay of decipherability of

\sqrt[*]{P_{q}}

. We prove that the delay depends on the repetition factor k.

Theorem 2.

Let

q \in X^{*} \ {e}

,

q_{0} = {min}_{⊑} P_{q}

, and

| \sqrt[*]{P_{q}} | > 1

. Then

\sqrt[*]{P_{q}}

is a code having a delay of decipherability of k or

k + 1

.

Proof.

If

| \sqrt[*]{P_{q}} | > 1

then in view of Proposition 5 there is a

q^{'} \in \sqrt[*]{P_{q}}

with

| q^{'} | > | q | - | q_{0} |

. Since

q^{'} \in P_{q}

, we have

q ⊑ q^{'} \cdot q_{0} ⊑ q^{'} \cdot q

. Consequently,

q_{0} \cdot q_{0}^{k - 1} ⊑ q ⊑ q^{'} \cdot q_{0}

, that is, the delay of decipherability is at least k.

To prove the converse we show that for

q ⊑ q_{0}^{m}

the delay cannot exceed m.

Assume the contrary, that is,

v \cdot w_{1} \dots w_{m + 1} ⊑ v^{'} \cdot u

for some words

v, v^{'}, w_{1}, \dots, w_{m + 1} \in

\sqrt[*]{P_{q}}

,

v \neq v^{'}

, and

u \in P_{q}^{*}

. From Proposition 4 (1) we obtain

u ⊑ q

or

q ⊑ u

and, since

| w_{i} | \geq | q_{0} |

, also

q ⊑ w_{1} \dots w_{m + 1}

.

If

v ⊏ v^{'}

, in view of the inequality

| v | + | q | \geq | v^{'} | + | q_{0} |

our assumption yields

v^{'} \cdot q_{0} ⊑ v \cdot q

. Therefore,

w \cdot q_{0} ⊑ q

for the word

w \neq e

with

v \cdot w = v^{'}

and, according to Proposition 4 (2)

w \in {q_{0}}^{*}

. This contradicts the fact that

\sqrt[*]{P_{q}}

is a suffix code.

If

v^{'} ⊏ v

, then

| u | > | w_{1} \dots w_{m + 1} | \geq | q |

, and via

| v^{'} | + | q | \geq | v | + | q_{0} |

we obtain

v \cdot q_{0} ⊑ v^{'} \cdot q

from our assumption. This yields the same contradiction as in the case

v ⊏ v^{'}

.

The observation

q ⊑ q_{0}^{k + 1}

finishes the proof. □

For

q = q_{0}^{k}

the preceding proof shows the following.

Corollary 9.

If

q = q_{0}^{k}

and

| \sqrt[*]{P_{q}} | > 1

then

\sqrt[*]{P_{q}}

has a delay of decipherability of exactly k.

Thus, if

| \sqrt[*]{P_{q}} | > 1

and

q \neq q_{0}^{k}

the code

\sqrt[*]{P_{q}}

may have a minimum delay of decipherability of k or

k + 1

. The following examples show that both cases are possible.

Example 4.

Let

q : = aabaaaaba

. Then

q_{0} = aabaa

,

k = 1

and

\sqrt[*]{P_{q}} = P_{q} = {q_{0}, aabaaaab, q}

which is a code having a delay of decipherability 2.

\begin{matrix} \text{Indeed,} & aabaaaabaa & = & q_{0} \cdot q_{0} & ⊑ & q \cdot q_{0} & \text{or} \\ & aabaaaabaa & = & q_{0} \cdot q_{0} & ⊑ & aabaaaab \cdot q_{0} . & \end{matrix}

Moreover, in Example 4,

q \cdot q_{0} \notin Q_{q}

. Thus our example shows also that

q \cdot P_{q}^{*}

need not be contained in

Q_{q}

.

Example 5.

Let

q : = aba

. Then

k = 1

and

P_{q} = {ab, aba}

is a code having a delay of decipherability 1.

Since

\sqrt[*]{R_{q}}

is a prefix code, every

ω

-word

ξ \in R_{q}^{ω}

has a unique factorisation into words

w \in \sqrt[*]{R_{q}}

. For suffix codes the situation is, in general, different. Consider e.g., the suffix code

{b, b a, a a}

. Property 4 (ii) of [20] (see also ([21], Proposition 1.9)) shows that codes of bounded delay of decipherability also admit a unique factorisation of

ω

-words. Thus Theorem 2 yields the following.

Lemma 5

(Normal Form for quasiperiodic

ω

-words). Every ω-word

ξ \in P_{q}^{ω}

has a unique factorisation

ξ = v_{1} \cdot v_{2} \dots v_{i} \dots

into words

v_{i} \in \sqrt[*]{P_{q}}

.

5. Subword Complexity

In this section we investigate upper bounds on the subword complexity function

f (ξ, n)

for quasiperiodic

ω

-words. If

ξ \in X^{ω}

is quasiperiodic with quasiperiod q then Proposition 3 and Corollary 3 show

infix (ξ) \subseteq infix (P_{q}^{*})

. Thus

f (ξ, n) \leq | infix (P_{q}^{*}) \cap X^{n} | \quad \text{for } ξ \in P_{q}^{ω} .

(9)

Similar to ([22], Proposition 5.5) let

ξ_{q} : = \prod_{v \in P_{q}^{*} \ {e}} v

. This implies

infix (ξ_{q}) = infix (P_{q}^{*})

. Consequently, the tight upper bound on the subword complexity of quasiperiodic

ω

-words having a certain quasiperiod q is

f_{q} (n) : = f (ξ_{q}, n) = | infix (P_{q}^{*}) \cap X^{n} |

. Observe that in view of Propositions 1 and 2 the identity

infix (P_{q}^{*}) = infix (R_{q}^{*}) = infix (Q_{q})

(10)

holds.
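For small n the maximal complexity f_q(n) can be computed by enumeration. The sketch below relies on an observation that is an assumption of this illustration rather than a statement from the paper: every infix of length n of a word in P_q^* already occurs, starting within the first |q| positions, in some product of generators that has just reached length n + |q|. The function name `infix_count` is hypothetical.

```python
def prefix_generators(q):
    """P_q: nonempty prefixes v of q such that q is a prefix of v + q."""
    return [q[:i] for i in range(1, len(q) + 1) if (q[:i] + q).startswith(q)]

def infix_count(q, n):
    """Number of words of length n that occur as infixes of words in P_q^*."""
    gens = prefix_generators(q)
    target = n + len(q)
    infixes = set()
    stack = [""]
    while stack:
        w = stack.pop()
        if len(w) >= target:
            # windows that start inside the first generator of the product
            for i in range(len(q)):
                infixes.add(w[i:i + n])
        else:
            stack.extend(w + v for v in gens)
    return len(infixes)

print([infix_count("aba", n) for n in range(1, 7)])
```

For q = aba the infixes are the words over {a, b} that contain neither bb nor aaa, which gives the values 2, 3, 4, 5, 7, 9 for n = 1, …, 6. The enumeration grows exponentially with n, so it is only useful for small lengths.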

The asymptotic upper bound on the subword complexity

f_{q} (n)

is obtained from

λ_{q} = \limsup_{n \to \infty} \sqrt[n]{| infix (P_{q}^{*}) \cap X^{n} |},

(11)

that is, for large n,

f_{q} (n) \leq {\hat{λ}}^{n}

whenever

\hat{λ} > λ_{q}

.

The following facts are known from the theory of formal power series (cf. [23,24]). As

infix (P_{q}^{*})

is a regular language the power series

\sum_{n \in N} f_{q} (n) \cdot t^{n}

is a rational series and, therefore,

f_{q}

satisfies a recurrence relation

f_{q} (n + k) = \sum_{i = 0}^{k - 1} a_{i} \cdot f_{q} (n + i)

with integer coefficients

a_{i} \in Z

. Thus

f_{q} (n) = \sum_{i = 0}^{k^{'} - 1} g_{i} (n) \cdot θ_{i}^{n}

where

k^{'} \leq k

,

θ_{i}

are pairwise distinct roots of the polynomial

t^{k} - \sum_{i = 0}^{k - 1} a_{i} \cdot t^{i}

and

g_{i}

are polynomials of degree not larger than k.

In the subsequent parts we estimate values characterising the exponential growth of the family

{(| infix (P_{q}^{*}) \cap X^{n} |)}_{n \in N}

. This growth mainly depends on the root of largest modulus among the

θ_{i}

and the corresponding polynomial

g_{i}

.

First we show that, independently of the quasiperiod q, the root

θ_{i}

of largest modulus is always positive and the corresponding polynomial

g_{i}

is constant.

In the remainder of this section we use, without explicit reference, known results from the theory of formal power series, in particular about generating functions of languages and codes which can be found in the literature, e.g., in [14,23,24].

5.1. The Subword Complexity of a Regular Star Language

The language

P_{q}^{*}

is a regular star-language of special shape. Here we show that, generally, the number of subwords of regular star-languages grows only exponentially without a polynomial factor. We start with some easily derived relations between the number of words in a regular language and the number of its subwords.

Lemma 6.

If

L \subseteq X^{*}

is a regular language then there is an

m \in N

such that

\begin{matrix} | L \cap X^{n} | & \leq & | infix (L) \cap X^{n} | & \leq & m \cdot \sum_{i = 0}^{2 m} | L \cap X^{n + i} | \end{matrix}

(12)

If the finite automaton accepting L has m states then for every

w \in infix (L)

there are words

u, v

of length

\leq m

such that

u \cdot w \cdot v \in L

. Thus as a suitable m one may choose the number of states of an automaton accepting the language

L \subseteq X^{*}

.

A first consequence of Lemma 6 is that the identity

\underset{n \to \infty}{lim sup} \sqrt[n]{| L \cap X^{n} |} = \underset{n \to \infty}{lim sup} \sqrt[n]{| infix (L) \cap X^{n} |}

(13)

holds for regular languages

L \subseteq X^{*}

.

In order to derive the announced exponential growth we use Corollary 4 of [25] which shows that for every regular language

L \subseteq X^{*}

there are constants

c_{1}, c_{2} > 0

and a

λ \geq 1

such that

c_{1} \cdot λ^{n} \leq | pref (L^{*}) \cap X^{n} | \leq c_{2} \cdot λ^{n} .

(14)

A consequence of Lemma 6 is that Equation (14) holds also (with a different constant

c_{2}

) for

infix (L^{*})

.

5.2. The Subword Complexity of $Q_{q}$

In this part we estimate the value

λ_{q}

of Equation (11). In view of Equations (10) and (14) the value

λ_{q}

satisfies the inequality

c_{1} \cdot λ_{q}^{n} \leq | infix (P_{q}^{*}) \cap X^{n} | \leq c_{2} \cdot λ_{q}^{n}

.

As

P_{q}^{*}

is a regular language Equations (11) and (13) show that

λ_{q} = {lim sup}_{n \to \infty} \sqrt[n]{| P_{q}^{*} \cap X^{n} |}

which is the inverse of the convergence radius

rad s_{q}^{*}

of the power series

s_{q}^{*} (t) : = \sum_{n \in N} | P_{q}^{*} \cap X^{n} | \cdot t^{n}

. The series

s_{q}^{*}

is also known as the structure generating function of the language

P_{q}^{*}

.

Since

\sqrt[*]{P_{q}}

is a code, we have

s_{q}^{*} (t) = \frac{1}{1 - s_{q} (t)}

where

s_{q} (t) : = \sum_{v \in \sqrt[*]{P_{q}}} t^{| v |}

is the structure generating function of the finite language

\sqrt[*]{P_{q}}

. As

s_{q}^{*}

has non-negative coefficients Pringsheim’s theorem shows that

rad s_{q}^{*} = λ_{q}^{- 1}

is a singular point of

s_{q}^{*}

. Thus

λ_{q}^{- 1}

is the smallest positive root of

1 - s_{q} (t)

. Hence

λ_{q}

is the largest positive root of the polynomial

p_{q} (t) : = t^{| q |} - \sum_{v \in \sqrt[*]{P_{q}}} t^{| q | - | v |}

.
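As an illustration of this derivation, the following Python sketch counts the words of each length in P_q^* through the recurrence implied by s_q^*(t) = 1 / (1 - s_q(t)) and compares the growth ratio with the largest positive root of p_q(t). It assumes q = aba with star root {ab, aba} (word lengths 2 and 3), which matches p_q(t) = t^3 - t - 1; the function names are ours.

```python
def count_star(lengths, n_max):
    """c[n] = number of words of length n in C^* for a code C with the given word lengths.

    Unique factorisation gives c[0] = 1 and c[n] = sum of c[n - |v|] over code words v.
    """
    c = [1] + [0] * n_max
    for n in range(1, n_max + 1):
        c[n] = sum(c[n - length] for length in lengths if length <= n)
    return c


def largest_root(lengths, q_len, steps=60):
    """Bisection on [1, 2] for the root of p_q(t) = t^|q| - sum_v t^(|q| - |v|)."""
    def p(t):
        return t**q_len - sum(t**(q_len - length) for length in lengths)

    lo, hi = 1.0, 2.0          # p(1) <= 0 < p(2) for the prefix codes considered here
    for _ in range(steps):
        mid = (lo + hi) / 2
        if p(mid) <= 0:
            lo = mid
        else:
            hi = mid
    return lo


lengths = [2, 3]               # |ab| and |aba|, assumed star root for q = aba
c = count_star(lengths, 60)
lam = largest_root(lengths, q_len=3)
print(lam, c[60] / c[59])
```

The ratio of consecutive counts approaches the computed root, in line with the characterisation of λ_q as the inverse of the convergence radius.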

Remark 1.

If the length of

q_{0} = {min}_{⊑} P_{q}

does not divide

| q |

then

p_{q} (t)

is the reversed polynomial of

1 - s_{q} (t)

, that is, has as roots exactly the inverses of the roots of

1 - s_{q} (t)

.

If

| q_{0} |

divides

| q |

then

q \notin \sqrt[*]{P_{q}}

(cf. Corollary 5) and

p_{q} (t)

has additionally the root 0 with multiplicity

| q | - | q^{'} |

where

q^{'}

is the longest word in

\sqrt[*]{P_{q}}

.

Summarising our observations we obtain the following.

Lemma 7.

Let

q \in X^{*} \ {e}

. Then there are constants

c_{q, 1}, c_{q, 2} > 0

such that the structure function of the language

infix (P_{q}^{*})

satisfies

c_{q, 1} \cdot λ_{q}^{n} \leq | infix (P_{q}^{*}) \cap X^{n} | \leq c_{q, 2} \cdot λ_{q}^{n}

where

λ_{q}

is the largest (positive) root of the polynomial

p_{q} (t)

.

Remark 2.

One could prove Lemma 7 by showing that, for each polynomial

p_{q} (t)

, its largest (positive) root has multiplicity 1. Referring to Corollary 4 of [25] (see Equation (14)) we avoided these more detailed considerations of a particular class of polynomials.

Now we are able to formulate our main theorem.

As quasiperiods

q, | q | \leq 2,

have trivially

P_{q}^{*} = {q_{0}}^{*}

, that is,

λ_{q} = 1

, in the sequel we confine our considerations to quasiperiods q of length

| q | \geq 3

, and we will always assume that the first letter of a quasiperiod q is

a \in X

.

Define

Q_{max} : = {a^{n} b a^{n} : n \geq 1} \cup {a^{n} w a^{n} : | w | = 2, w \neq a a, n \geq 1}

.

Theorem 3

(Main theorem). Let

q \in a \cdot X^{*}, | q | \geq 3, q \notin Q_{max}

, be a quasiperiod and

n = ⌊ \frac{| q | - 1}{2} ⌋

. Then

λ_{q} < λ_{a^{n} b a^{n}}

or

λ_{q} < λ_{a^{n} b b a^{n}}

according to whether

| q |

is odd or even.

Moreover,

λ_{w} < λ_{a b a} = λ_{a a b a a}

if

w \in a \cdot X^{*} \ {a b a, a a b a a}

.

6. Polynomials

Before proceeding to the proof of our main theorem we derive some properties of polynomials of the form

p (t) = t^{n} - \sum_{i \in M} t^{i}

, where

M \subseteq {i : i \in N \land i < n}

. This class of polynomials includes the polynomials

p_{q} (t)

whose maximal roots

λ_{q}

characterise the growth of

infix (P_{q}^{*})

as described in Lemma 7. We focus on results which are useful for comparing their maximal roots.

The polynomials

p (t) \in \hat{P} : = \{t^{n} - \sum_{i \in M} t^{i} : \emptyset \neq M \subseteq {0, \dots, n - 1}\}

have the following easily verified properties.

p (0) \leq 0, p (1) \leq 0, p (2) \geq 1 and p (t) < 0 for 0 < t < 1 .

(15)

If ε > 0 and p (t^{'}) \geq 0 for some t^{'} > 0 then p ((1 + ε) \cdot t^{'}) > 0 .

(16)

Since

p (1) \leq 0

and

p (2) \geq 1

for

p (t) \in \hat{P}

, Equation (16) shows that once

p (t^{'}) \geq 0, t^{'} \geq 1,

the polynomial

p (t)

has no further root in the interval

(t^{'}, \infty)

and

p (t) \in \hat{P}

has exactly one root in the interval

[1, 2)

. This yields the following fundamental property.

Property 1.

If

t_{0}

is the positive root of the polynomial

p (t) \in \hat{P}

in

[1, 2)

and

1 \leq t^{'} < 2

then

p (t^{'}) \leq 0

if and only if

t^{'} \leq t_{0}

.

For the roots of maximal modulus we have the following theorem.

Theorem 4

(Cauchy). Let

p (t) = \sum_{i = 0}^{n} a_{i} \cdot t^{i}

be a complex polynomial. Then every root

t^{'}

of

p (t)

satisfies

| t^{'} | \leq t_{0}

where

t_{0}

is the maximal root of the polynomial

| a_{n} | \cdot t^{n} - \sum_{i = 0}^{n - 1} | a_{i} | \cdot t^{i}

.

This implies the following property of polynomials

p (t) \in \hat{P}

.

If p (t) = 0 then | t | \leq t_{0} .

(17)

From Property 1 we derive the following criterion to compare the maximal roots of polynomials in

\hat{P}

.

Criterion 1.

Let

p_{1} (t), p_{2} (t) \in \hat{P}

have maximal roots

t_{1}

and

t_{2}

, respectively. Then

p_{2} (t_{1}) > 0

if and only if

t_{1} > t_{2}

.
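Criterion 1 allows two maximal roots to be compared by a single evaluation. The Python sketch below illustrates this for the two polynomials t^3 - t - 1 and t^4 - t - 1 (chosen by us as examples from the class); the bisection helper `max_root` is only used to obtain t_1 and to cross-check the conclusion.

```python
def max_root(p, steps=60):
    """Bisection on [1, 2]; Property 1 guarantees a single sign change there."""
    lo, hi = 1.0, 2.0
    for _ in range(steps):
        mid = (lo + hi) / 2
        if p(mid) <= 0:
            lo = mid
        else:
            hi = mid
    return lo


def p1(t):
    return t**3 - t - 1


def p2(t):
    return t**4 - t - 1


t1 = max_root(p1)
t2 = max_root(p2)

# Criterion 1: p2(t1) > 0 holds exactly when t1 > t2.
print(p2(t1) > 0, t1 > t2)
```

In the proofs below the same idea is used symbolically: an identity between two polynomials gives the sign of one of them at the maximal root of the other.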

We conclude this section with a bound on the maximal root of certain polynomials in

\hat{P}

.

Lemma 8.

Let

p (t) = t^{n} - \sum_{i = 0}^{m} t^{i}, n > m \geq 1

. Then

p (t) < 0

for

1 \leq t \leq \sqrt[2 n - m]{{(m + 1)}^{2}}

and

p (t) > 0

for

\sqrt[n - m]{m + 1} \leq t

.

Proof.

The assertion follows from the inequality

t^{n} - (m + 1) \cdot t^{m} < p (t) < t^{n} - (m + 1) \cdot t^{m / 2}

when

t > 1

. The part

p (t) < t^{n} - (m + 1) \cdot t^{m / 2}

uses the arithmetic-geometric-means inequality

\sum_{i = 0}^{m} t^{i} > (m + 1) \cdot \sqrt[m + 1]{\prod_{i = 0}^{m} t^{i}} = (m + 1) \cdot t^{m / 2}

, and the other part is obvious. □

The following special case is needed below in Lemma 12.

Corollary 10.

If

p (t) = t^{n} - \sum_{i = 0}^{n - 3} t^{i}, n \geq 4,

then

p (t) < 0

for

1 \leq t \leq \sqrt[n + 3]{{(n - 2)}^{2}}

.
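The bounds of Lemma 8 can be checked numerically. The following Python sketch (a sanity check under floating-point arithmetic, not a proof; the function name is ours) computes the maximal root of p(t) = t^n - Σ_{i=0}^{m} t^i by bisection and returns it together with the two bounds of the lemma.

```python
def lemma8_bounds(n, m, steps=60):
    """Return (lower bound, maximal root, upper bound) for p(t) = t^n - sum_{i=0}^{m} t^i."""
    def p(t):
        return t**n - sum(t**i for i in range(m + 1))

    lo, hi = 1.0, 2.0
    for _ in range(steps):
        mid = (lo + hi) / 2
        if p(mid) <= 0:
            lo = mid
        else:
            hi = mid

    lower = (m + 1) ** (2 / (2 * n - m))   # p(t) < 0 up to this value
    upper = (m + 1) ** (1 / (n - m))       # p(t) > 0 from this value on
    return lower, lo, upper


for n, m in [(3, 1), (10, 4), (11, 8)]:
    print(n, m, lemma8_bounds(n, m))
```

The pair (n, m) = (11, 8) is an instance of Corollary 10, where m = n - 3.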

The subsequent sections are devoted to the proof of our main theorem.

7. Irreducible Quasiperiods

We start with irreducible quasiperiods.

7.1. Extremal Polynomials

The polynomials

p_{q} (t)

of irreducible quasiperiods have non-zero coefficients only for

| q |

and

i < \frac{| q |}{2}

. Therefore we investigate the set

P : = \{t^{n} - \sum_{i \in M} t^{i} : n \geq 2 \land \emptyset \neq M \subseteq {i : i \leq \frac{n - 1}{2}}\} .

Let

p_{n} (t) : = t^{n} - \sum_{i = 0}^{⌊ \frac{n - 1}{2} ⌋} t^{i} \in P

.

Property 2.

Let

p (t) \in P

be a polynomial of degree

n \geq 3

. Then

p_{n} (t) \leq p (t)

for

t \in [1, 2]

, and

p_{n} (t)

has the largest positive root among all polynomials of degree n in

P

.

Proof.

This follows from

t^{n} - \sum_{i = 0}^{⌊ \frac{n - 1}{2} ⌋} t^{i} < p (t)

for

p (t) \in P \ {p_{n} (t) : n \geq 3}

when

1 < t \leq 2

and Criterion 1. □

Observe that, for

n \geq 1

,

p_{2 n + 1} (t) = t^{2 n + 1} - \sum_{i = 0}^{n} t^{i} and p_{2 n + 2} (t) = t^{2 n + 2} - \sum_{i = 0}^{n} t^{i} .

Moreover, the words

a^{n} b a^{n} \in Q_{max}

and

a^{n} w a^{n} \in Q_{max}, w \in {x b, b x}, x \in X

are the quasiperiods corresponding to the extremal polynomials

p_{2 n + 1} (t) \in P

and

p_{2 n + 2} (t) \in P

, respectively.

Lemma 9.

Q_{max} = {q : q \in a \cdot X^{*} \land | q | \geq 3 \land p_{q} (t) = p_{| q |} (t)}

Proof.

If

q \in Q_{max}

then obviously

p_{q} (t) = p_{| q |} (t)

. Conversely, if

p_{q} (t) = t^{| q |} -

\sum_{v \in \sqrt[*]{P_{q}}} t^{| q | - | v |} = p_{| q |} (t)

then

\sqrt[*]{P_{q}} = {v : v ⊑ q \land | v | > \frac{| q |}{2}}

. Then, in view of

q ⊑ v \cdot q

, every prefix

w ⊑ q

of length

| w | < \frac{| q |}{2}

is also a suffix of q. This is possible only for

q \in Q_{max}

or

q \in {a}^{*}

. □

In the sequel the positive root of

p_{n} (t)

is denoted by

λ_{n}

. From Criterion 1 we immediately obtain the following.

Property 3.

Let

t \geq 1

. We have

t < λ_{n}

if and only if

p_{n} (t) < 0

.

Then Property 2 implies the following.

Theorem 5.

If

q \in a \cdot X^{*}, | q | \geq 3,

is an irreducible quasiperiod then

λ_{q} \leq λ_{| q |}

, and

λ_{q} = λ_{| q |}

if and only if

q \in Q_{max}

.

7.2. The Ordering of the Maximal Roots $λ_{n}$

Before we proceed to the case of reducible quasiperiods we determine the ordering of the maximal roots

λ_{n}

. This will not only be interesting for itself but also useful for proving

λ_{q} < λ_{| q |}

when q is reducible (see Equation (28) below).

The extremal polynomials

p_{n} (t), n \geq 2,

satisfy the following general relations (By convention,

\sum_{i = k}^{m} a_{i} = 0

if

k > m

).

\begin{matrix} t \cdot p_{2 n} (t) - 1 & = & p_{2 n + 1} (t), \end{matrix}

(18)

\begin{matrix} p_{2 n + 2} (t) - t^{2} \cdot p_{2 n} (t) & = & t^{n + 1} - t - 1, \end{matrix}

(19)

\begin{matrix} t^{n - 2} \cdot p_{2 n + 1} (t) - (t^{n} + 1) \cdot p_{2 n - 1} (t) & = & \sum_{i = 0}^{n - 3} t^{i}, and \end{matrix}

(20)

\begin{matrix} t^{n - 2} \cdot p_{2 n + 3} (t) - (t^{n + 1} + 1) \cdot p_{2 n} (t) & = & - t^{n} + \sum_{i = 0}^{n - 3} t^{i} . \end{matrix}

(21)
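Relations (18)–(21) can be verified mechanically. The Python sketch below evaluates both sides in exact integer arithmetic; since both sides are polynomials of degree at most 3n + 1, agreement at 3n + 2 integer points establishes the identity for that n. The helper names `p`, `geo` and `identities_hold` are ours, and the check is restricted to n ≥ 2 so that all exponents are non-negative.

```python
def p(n, t):
    """Extremal polynomial p_n(t) = t^n - sum_{i=0}^{floor((n-1)/2)} t^i."""
    return t**n - sum(t**i for i in range((n - 1) // 2 + 1))


def geo(k, t):
    """sum_{i=0}^{k} t^i; the empty sum (k < 0) is 0, as in the stated convention."""
    return sum(t**i for i in range(k + 1))


def identities_hold(n, t):
    """Check Equations (18)-(21) at an integer point t, for n >= 2."""
    eq18 = t * p(2 * n, t) - 1 == p(2 * n + 1, t)
    eq19 = p(2 * n + 2, t) - t**2 * p(2 * n, t) == t**(n + 1) - t - 1
    eq20 = (t**(n - 2) * p(2 * n + 1, t) - (t**n + 1) * p(2 * n - 1, t)
            == geo(n - 3, t))
    eq21 = (t**(n - 2) * p(2 * n + 3, t) - (t**(n + 1) + 1) * p(2 * n, t)
            == -t**n + geo(n - 3, t))
    return eq18 and eq19 and eq20 and eq21


ok = all(identities_hold(n, t) for n in range(2, 10) for t in range(3 * n + 2))
print(ok)
```

For n = 2, Equation (20) reduces to p_5(t) = (t^2 + 1) · p_3(t), the factorisation used in Lemma 10.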

Lemma 10.

The polynomials

t^{3} - t - 1

and

t^{5} - t^{2} - t - 1 = (t^{2} + 1) \cdot (t^{3} - t - 1)

have largest positive roots

λ_{3} = λ_{5}

among all polynomials in

P

,

λ_{5} > λ_{4}

and

λ_{2 n - 1} > λ_{2 n + 1} > λ_{2 n}

for

n \geq 3

.

Proof.

From Equation (18) we have

p_{2 n + 1} (λ_{2 n}) = - 1 < 0

and, therefore,

λ_{2 n} < λ_{2 n + 1}

when

n \geq 1

.

Similarly, Equation (20) yields

p_{2 n + 1} (λ_{2 n - 1}) = λ_{2 n - 1}^{- (n - 2)} \cdot \sum_{i = 0}^{n - 3} λ_{2 n - 1}^{i} > 0

which implies

λ_{2 n + 1} < λ_{2 n - 1}

for

n \geq 3

and

λ_{3} = λ_{5}

when

n = 2

. □

The largest (positive) root

λ_{3}

of the polynomial

t^{3} - t - 1

is also known as the smallest Pisot-Vijayaraghavan number.

So far we have ordered the ‘odd’ roots:

λ_{3} = λ_{5} > λ_{7} > λ_{9} > \dots

. Next we are going to investigate the ordering of the ‘even’ roots

λ_{2 n}, n \geq 2

.

To this end we derive the following bounds.

Lemma 11.

1. $\sqrt[3 n + 1]{n^{2}} \leq λ_{2 n} \leq \sqrt[n + 1]{n}$ and $\sqrt[3 n - 1]{n^{2}} \leq λ_{2 n - 1} \leq \sqrt[n]{n}$ for $n \geq 2$.
2. Let $n \geq 5$. Then $λ_{2 n} \geq \sqrt[n - 1]{2}$.

Proof.

1. follows from Lemma 8.

2. We calculate

p_{2 n} (\sqrt[n - 1]{2}) = 4 \cdot \sqrt[n - 1]{4} - \sum_{i = 0}^{n - 1} \sqrt[n - 1]{2^{i}} \leq 4 \cdot \sqrt[4]{4} - (2 + (n - 1)) = 4 \cdot \sqrt{2} - (n + 1) < 0

if

n \geq 5

and the assertion follows with Property 1. □
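The calculation in part 2 is easy to reproduce numerically. The Python sketch below (function names ours) evaluates p_{2n} at the (n-1)-th root of 2 and shows that the value is negative for n ≥ 5 but positive for n = 4, so the threshold n ≥ 5 in the lemma cannot be lowered with this argument.

```python
def p_even(n, t):
    """p_{2n}(t) = t^(2n) - sum_{i=0}^{n-1} t^i."""
    return t**(2 * n) - sum(t**i for i in range(n))


def value_at_bound(n):
    """Value of p_{2n} at the (n-1)-th root of 2."""
    return p_even(n, 2 ** (1 / (n - 1)))


for n in (4, 5, 6, 10):
    print(n, value_at_bound(n))
```

By Property 1, a negative value means that the evaluation point lies below λ_{2n}.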

Remark 3.

The lower bound of Lemma 11.2 does not exceed the lower bound in Lemma 11.1. However, the former is more convenient for the purposes of Lemma 12.

Lemma 12.

If

n \geq 5

then

λ_{2 n - 2} > λ_{2 n}

and

λ_{2 n} > λ_{2 n + 3}

.

Proof.

If

t \geq \sqrt[n - 1]{2}

then

t^{n} - t - 1 \geq t - 1 > 0

. Consequently, Equation (19) and Lemma 11.2 imply

p_{2 n - 2} (λ_{2 n}) < 0

whence

λ_{2 n} < λ_{2 n - 2}

.

If

n \geq 5

we have

\sqrt[n + 1]{n} \leq \sqrt[n + 3]{{(n - 2)}^{2}}

and, following Lemma 11.1

λ_{2 n} \leq \sqrt[n + 3]{{(n - 2)}^{2}}

. Then Equation (21) yields

- λ_{2 n}^{n - 2} \cdot p_{2 n + 3} (λ_{2 n}) = λ_{2 n}^{n} - \sum_{i = 0}^{n - 3} λ_{2 n}^{i}

, and Corollary 10 shows

p_{2 n + 3} (λ_{2 n}) > 0

whence

λ_{2 n} > λ_{2 n + 3}

. □

Since

p_{8} (\sqrt[3]{2}) > 0

, the proof of Lemma 12 cannot be applied to lower values of n. Thus it remains to establish the order of the

λ_{i}

for

i \leq 13

. To this end, we consider some special identities and use Property 3 and Lemma 12.

\begin{matrix} p_{12} (t) - (t^{8} + t^{5} + t^{4} + t^{2} + t) \cdot p_{4} (t) & = & t^{2} - 1, and \end{matrix}

(22)

\begin{matrix} p_{13} (t) - t \cdot (t^{8} + t^{5} + t^{4} + t^{2} + t) \cdot p_{4} (t) & = & t^{3} - t - 1 = p_{3} (t) . \end{matrix}

(23)

Lemma 13.

λ_{8} > λ_{10} > λ_{13} > λ_{4} > λ_{12}

Proof.

Lemma 12 shows

λ_{8} > λ_{10} > λ_{13}

. Equation (22) yields

p_{12} (λ_{4}) = λ_{4}^{2} - 1 > 0

whence

λ_{4} > λ_{12}

, and Equation (23) yields

p_{13} (λ_{4}) = p_{3} (λ_{4}) < 0

, that is,

λ_{13} > λ_{4}

. This shows our assertion. □

For the remaining part we consider the identities

\begin{matrix} t^{2} \cdot p_{11} (t) - (t^{5} + 1) \cdot p_{8} (t) & = & - t^{4} + t + 1 = - p_{4} (t), \end{matrix}

(24)

\begin{matrix} p_{11} (t) - (t^{5} + 1) \cdot p_{6} (t) & = & t^{3} \cdot p_{4} (t), and \end{matrix}

(25)

\begin{matrix} t \cdot p_{9} (t) - (t^{4} + 1) \cdot p_{6} (t) & = & - t^{3} + 1 . \end{matrix}

(26)

Lemma 14.

λ_{9} > λ_{6} > λ_{11} > λ_{8}

Proof.

We use Equations (24)–(26). Then

λ_{8}^{2} \cdot p_{11} (λ_{8}) = - p_{4} (λ_{8}) < 0

implies

λ_{11} > λ_{8}

,

p_{11} (λ_{6}) = λ_{6}^{3} \cdot p_{4} (λ_{6}) > 0

implies

λ_{6} > λ_{11}

, and, finally,

λ_{6} \cdot p_{9} (λ_{6}) = - λ_{6}^{3} + 1 < 0

implies

λ_{9} > λ_{6}

. □
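The special identities (22)–(26) used in Lemmas 13 and 14 can be confirmed by exact evaluation. In the Python sketch below (helper names ours) all polynomials involved have degree at most 13, so agreement at 21 integer points suffices.

```python
def p(n, t):
    """Extremal polynomial p_n(t) = t^n - sum_{i=0}^{floor((n-1)/2)} t^i."""
    return t**n - sum(t**i for i in range((n - 1) // 2 + 1))


def special_identities_hold(t):
    """Check Equations (22)-(26) at an integer point t."""
    g = t**8 + t**5 + t**4 + t**2 + t
    eq22 = p(12, t) - g * p(4, t) == t**2 - 1
    eq23 = p(13, t) - t * g * p(4, t) == p(3, t)
    eq24 = t**2 * p(11, t) - (t**5 + 1) * p(8, t) == -p(4, t)
    eq25 = p(11, t) - (t**5 + 1) * p(6, t) == t**3 * p(4, t)
    eq26 = t * p(9, t) - (t**4 + 1) * p(6, t) == 1 - t**3
    return eq22 and eq23 and eq24 and eq25 and eq26


ok = all(special_identities_hold(t) for t in range(-10, 11))
print(ok)
```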

Now Lemmas 10 and 12–14 yield the complete ordering of the values

λ_{n}

.

Theorem 6.

Let

λ_{n}, n \geq 3,

be the maximal root of the polynomial

p_{n} (t)

. Then the overall ordering of the values

λ_{n}

starts with

λ_{3} = λ_{5} > λ_{7} > λ_{9} > λ_{6} > λ_{11} > λ_{8} > λ_{10} > λ_{13} > λ_{4} > λ_{12}

and continues as follows

λ_{2 n + 1} > λ_{2 n} > λ_{2 n + 3}, n \geq 7

.
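The ordering of Theorem 6 can be reproduced numerically for small n. The following Python sketch (a floating-point illustration, not a replacement for the proofs; the function name `lam` is ours) computes the maximal roots of p_3, …, p_17 by bisection and sorts the indices by decreasing root.

```python
def lam(n, steps=60):
    """Maximal root of p_n(t) = t^n - sum_{i=0}^{floor((n-1)/2)} t^i, by bisection on [1, 2]."""
    def p(t):
        return t**n - sum(t**i for i in range((n - 1) // 2 + 1))

    lo, hi = 1.0, 2.0
    for _ in range(steps):
        mid = (lo + hi) / 2
        if p(mid) <= 0:
            lo = mid
        else:
            hi = mid
    return lo


roots = {n: lam(n) for n in range(3, 18)}

# lambda_3 and lambda_5 coincide, so n = 3 is left out of the sort
# to avoid a floating-point tie.
order = sorted(range(4, 18), key=lambda n: roots[n], reverse=True)
print(order)
print(roots[3], roots[5])
```

The neighbouring roots differ by roughly 10^{-3} or more in this range, so double precision is sufficient to separate them.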

In connection with Proposition 6 and Corollary 7 we obtain that the Pisot-Vijayaraghavan number

λ_{3} = λ_{5}

is an overall upper bound on the values

λ_{q}

.

Corollary 11.

If

q \in X^{*}, | q | \geq 3,

then

λ_{q} \leq λ_{3} = λ_{5}

.

From Lemma 11.1 we immediately obtain the following.

Corollary 12.

Let

M \subseteq N \ {0, 1, 2}

be infinite. Then

inf {λ_{i} : i \in M} = 1

.

8. Reducible Quasiperiods

Reducible quasiperiods q have a repeated prefix

q_{0} = {min}_{⊑} P_{q}

with

| q_{0} | \leq | q | / 2

and a repetition factor

k \geq 2

such that

q = q_{0}^{k} \cdot \bar{q}

where

\bar{q} ⊏ q_{0}

. Moreover

| \bar{q} | < | q_{0} | \leq | q | / 2

. Observe that

q_{0}

is primitive.

We shall consider three cases depending on the relation between the lengths

n = | q |

,

ℓ = | q_{0} |

, the length of the suffix

| \bar{q} | < | q_{0} |

and the repetition factor

k \geq 2

.

In the first case

| q_{0} | + | \bar{q} | \leq 2

, in view of

\bar{q} ⊏ q_{0}

, we have necessarily

\bar{q} = e

and

q \in a^{*} \cup {a b}^{*}, a, b \in X, a \neq b

and, therefore,

Q_{q} = {q_{0}}^{*}

and

λ_{q} = 1

.

Let now

| q_{0} | + | \bar{q} | \geq 3

. We divide the remaining cases according to the additional requirement

| q | - 2 | q_{0} | \geq 3

and its complementary one

| q | - 2 | q_{0} | \leq 2

. In the latter case we have necessarily

k = 2

and

| \bar{q} | \leq 2

.

8.1. The Case $| q_{0} | + | \bar{q} | \geq 3 \land | q | - 2 | q_{0} | \geq 3$

Thus, the preceding consideration shows that we have

| \bar{q} | \geq 3

(in particular, if

q = q_{0}^{2} \cdot \bar{q}

) or the repetition factor

k \geq 3

. This implies

| q | = 7

(where

q = {(a b)}^{3} a

) or

| q | \geq 9

.

From Equation (6) we have

\sqrt[*]{P_{q}} \subseteq {q_{0}} \cup {v : v ⊑ q \land | v | > | q | - | q_{0} | + 1}

(27)

This implies that for

| q_{0} | \leq | q | / 2

the polynomials

p_{q} (t)

have non-zero coefficients only for

| q | = n

,

| q | - | q_{0} | = n - ℓ

and

i < | q_{0} | - 1

, that is, are of the form

p_{q} (t) = t^{n} - t^{n - ℓ} - \sum_{i \in M_{q}} t^{i}

where

M_{q} \subseteq {i : i < ℓ - 1}

. Therefore, in the sequel we consider the positive roots of polynomials in

P_{red} : = \{t^{n} - t^{n - ℓ} - \sum_{i \in M} t^{i} : n \geq 1 \land ℓ \leq \frac{n}{2} \land M \subseteq {i : i < ℓ - 1}\}

Let

p_{n, ℓ} (t) : = t^{n} - t^{n - ℓ} - \sum_{i = 0}^{ℓ - 2} t^{i} \in P_{red}

and

λ_{n, ℓ}

be its maximal root. (In the preceding paper [8] we used a slightly different definition of

P_{red}

, and, therefore, of

p_{n, ℓ} (t)

and

λ_{n, ℓ}

.) Similar to Property 2, Property 3 and Theorem 5 we have the following.

Property 4.

Let

n \geq 3, ℓ \leq \frac{n}{2}

and

p (t) \in P_{red}

. Then

p (t) \geq p_{n, ℓ} (t)

for

t \in [1, 2]

, and

p_{n, ℓ} (t)

has the largest positive root among all polynomials of degree n and parameter ℓ in

P_{red}

.

Lemma 15.

If

q, | q | = n

, is a quasiperiod with

| q_{0} | = ℓ \leq n / 2

then

p_{q} (t) \geq p_{n, ℓ} (t)

for

t \geq 1

, in particular,

λ_{q} \leq λ_{n, ℓ}

.

Remark 4.

In contrast to Property 2 not for every polynomial

p_{n, ℓ} (t)

there is a quasiperiod q such that

p_{n, ℓ} (t) = p_{q} (t)

, see Remark 5 below.

We have the following relation between the polynomials

p_{n} (t)

and

p_{n, ℓ} (t)

.

p_{n} (t) - t^{ℓ} \cdot p_{n - 2 ℓ} (t) = p_{n, ℓ} (t) - t^{ℓ - 1}, for n - 2 ℓ \geq 3

(28)

This yields

Corollary 13.

Let

n - 2 \cdot ℓ \geq 3

. If

λ_{n} < λ_{n - 2 ℓ}

then

λ_{n, ℓ} < λ_{n}

.

Proof.

If

λ_{n} < λ_{n - 2 ℓ}

then

p_{n - 2 ℓ} (λ_{n}) < p_{n - 2 ℓ} (λ_{n - 2 ℓ}) = 0

. Thus

p_{n, ℓ} (λ_{n}) = - λ_{n}^{ℓ} \cdot p_{n - 2 ℓ} (λ_{n}) + λ_{n}^{ℓ - 1} > 0

, that is,

λ_{n} > λ_{n, ℓ}

. □
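Relation (28), on which Corollary 13 rests, can be checked in exact integer arithmetic. The Python sketch below (helper names ours) uses p_{n,ℓ}(t) = t^n - t^{n-ℓ} - Σ_{i=0}^{ℓ-2} t^i as defined above; both sides have degree at most n, so n + 1 evaluation points per pair (n, ℓ) suffice.

```python
def p(n, t):
    """Extremal polynomial p_n(t) = t^n - sum_{i=0}^{floor((n-1)/2)} t^i."""
    return t**n - sum(t**i for i in range((n - 1) // 2 + 1))


def p_red(n, ell, t):
    """p_{n,l}(t) = t^n - t^(n-l) - sum_{i=0}^{l-2} t^i."""
    return t**n - t**(n - ell) - sum(t**i for i in range(ell - 1))


def eq28_holds(n, ell, t):
    """Check Equation (28) at an integer point t, assuming n - 2*ell >= 3."""
    return p(n, t) - t**ell * p(n - 2 * ell, t) == p_red(n, ell, t) - t**(ell - 1)


ok = all(eq28_holds(n, ell, t)
         for n in range(5, 25)
         for ell in range(1, (n - 3) // 2 + 1)
         for t in range(n + 1))
print(ok)
```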

Next we show the relation

λ_{q} < λ_{| q |}

for all quasiperiods q having

| q_{0} | \leq | q | / 2

and

| q_{0} | + | \bar{q} | \geq 3

.

Lemma 16.

Let

| q | - 2 | q_{0} | \geq 3

and

| q_{0} | + | \bar{q} | \geq 3

. Then

λ_{q} < λ_{| q |}

.

Proof.

Above we have shown that

| q | - 2 | q_{0} | \geq 3

and

| q_{0} | + | \bar{q} | \geq 3

imply

| q | \geq 7

or

| q | \geq 10

according to whether

| q |

is odd or even.

The ordering of Theorem 6 and Corollary 13 show

λ_{n} > λ_{n, ℓ}

for all odd values

n \geq 7

and for all even values

n \geq 12

.

It remains to consider the exceptional case when

n = | q | = 10

. Here

| q | - 2 | q_{0} | \geq 3

and

| q_{0} | + | \bar{q} | \geq 3

imply

ℓ = | q_{0} | = 3

. Consider

p_{10, 3} (t) = t^{10} - t^{7} - t - 1 = p_{10} (t) - t^{2} \cdot p_{5} (t)

.

From

λ_{5} > λ_{10}

and

p_{10} (λ_{10}) = 0

we have

p_{10, 3} (λ_{10}) = - λ_{10}^{2} \cdot p_{5} (λ_{10}) > 0

, that is,

λ_{10, 3} < λ_{10}

. □

Remark 5.

Equation (6) shows that for

n = | q | = 10

and

ℓ = | q_{0} | = 3

we have

\sqrt[*]{P_{q}} = {q_{0}, q}

, that is,

p_{q} (t) = t^{10} - t^{7} - 1

. Thus there is no quasiperiod q such that

p_{q} (t) = p_{10, 3} (t) = t^{10} - t^{7} - t - 1

.

8.2. The Case $| q_{0} | + | \bar{q} | \geq 3 \land | q | - 2 | q_{0} | \leq 2$

This amounts to

| q | = 2 \cdot | q_{0} | + | \bar{q} |

where

| \bar{q} | \in {0, 1, 2}

.

Here we have to go into more detail and to take into consideration also the reduced quasiperiod

\hat{q} = q_{0} \cdot \bar{q}

of q and its repeated prefix

{\hat{q}}_{0} = {min}_{⊑} P_{\hat{q}}

. Observe that both repeated prefixes

q_{0}, {\hat{q}}_{0}

are primitive.

For

q = q_{0}^{k} \cdot \bar{q}, k \geq 2,

we have from Equations (7) and (8)

p_{q} (t) \in \{t^{| q |} - t^{| q | - | q_{0} |} - \sum_{i \in M} t^{i} : M \subseteq {0, \dots, | \hat{q} | - | {\hat{q}}_{0} |}\} .

Observe that

| {\hat{q}}_{0} | > | \bar{q} |

(in view of Lemma 3 even

| {\hat{q}}_{0} | > | \bar{q} | + 1

if

{\hat{q}}_{0} \neq q_{0}

) and thus

| \hat{q} | - | {\hat{q}}_{0} | = | q_{0} | - (| {\hat{q}}_{0} | - | \bar{q} |) < | q_{0} |

.

Let

P_{red}^{'} : = \{t^{n} - t^{n - ℓ} - \sum_{i \in M} t^{i} : n > ℓ > j \land M \subseteq {0, \dots, ℓ - j}\}

and

p_{n, ℓ, j} (t) = t^{n} - t^{n - ℓ} - \sum_{i = 0}^{ℓ - j} t^{i}

. Here the parameter j corresponds to the value

| {\hat{q}}_{0} | - | \bar{q} |

. Then similar to Property 4 and Lemma 15 we have

Property 5.

Let

n, ℓ \geq 3, ℓ \leq \frac{n}{2}, ℓ > j,

and

p (t) \in P_{red}^{'}

. Then

p (t) \geq p_{n, ℓ, j} (t)

for

t \in [1, 2]

, and

p_{n, ℓ, j} (t)

has the largest positive root among all polynomials of degree n and parameters ℓ and j in

P_{red}^{'}

.

Lemma 17.

If

q, | q | = n

, is a quasiperiod with

| q_{0} | = ℓ \leq n / 2

and

| {\hat{q}}_{0} | - | \bar{q} | \geq j

then

p_{q} (t) \geq p_{n, ℓ, j} (t)

for

t \geq 1

, in particular,

λ_{q} \leq λ_{n, ℓ, j}

.

We consider the cases

| \bar{q} | \in {0, 1, 2}

separately. In the sequel we shall make use of the relation

t^{2} - t - 1 \leq t^{3} - t^{2} - 1 < 0 for 1 \leq t \leq λ_{3} = max {λ_{n} : n \in N} .

(29)
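What the subsequent lemmas use from relation (29) is that both t^2 - t - 1 and t^3 - t^2 - 1 are negative on the interval [1, λ_3]. The Python sketch below (a numerical sanity check on a sample grid; names ours) confirms this, computing λ_3 as the root of t^3 - t - 1 by bisection.

```python
def pisot_root(steps=60):
    """Root of t^3 - t - 1 in [1, 2], by bisection."""
    lo, hi = 1.0, 2.0
    for _ in range(steps):
        mid = (lo + hi) / 2
        if mid**3 - mid - 1 <= 0:
            lo = mid
        else:
            hi = mid
    return lo


lam3 = pisot_root()
samples = [1 + k * (lam3 - 1) / 1000 for k in range(1001)]

both_negative = all(t**2 - t - 1 < 0 and t**3 - t**2 - 1 < 0 for t in samples)
print(lam3, both_negative)
```

At t = λ_3 itself one has t^3 = t + 1, so t^3 - t^2 - 1 = t - t^2, which is negative because t > 1.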

8.2.1. The Case $q = q_{0}^{2} \land | \bar{q} | = 0$

As shown above the case

| q_{0} | \leq 2

and

| \bar{q} | = 0

amounts to

λ_{q} = 1

. Thus we may consider only the case when

| q_{0} | \geq 3

. Here we have the following relation between

p_{2 ℓ} (t)

and

p_{2 ℓ, ℓ, 3} (t)

.

p_{2 ℓ} (t) - p_{2 ℓ, ℓ, 3} (t) = t^{ℓ - 2} (t^{2} - t - 1)

(30)

Lemma 18.

If

q = q_{0}^{2}

and

| q_{0} | = ℓ \geq 3

then

λ_{q} < λ_{| q |}

.

Proof.

First we suppose

| {\hat{q}}_{0} | \geq 3

. Then

| {\hat{q}}_{0} | - | \bar{q} | \geq 3

, and Property 5 and Lemma 17 yield

p_{q} (t) \geq p_{2 ℓ, ℓ, 3} (t)

for

t \in [1, 2]

. Now Equations (29) and (30) show

p_{q} (λ_{2 ℓ}) \geq p_{2 ℓ, ℓ, 3} (λ_{2 ℓ}) = - λ_{2 ℓ}^{ℓ - 2} (λ_{2 ℓ}^{2} - λ_{2 ℓ} - 1) > 0

, that is

λ_{q} < λ_{2 ℓ}

.

It remains to consider

1 \leq | {\hat{q}}_{0} | \leq 2

. If

{\hat{q}}_{0} \in a^{*}

then

q_{0} = a^{ℓ}

which is not primitive. Thus

{\hat{q}}_{0} = a b

and, since

q_{0}

is primitive,

q_{0} = {(a b)}^{m} a, m \geq 1

whence

q = q_{0}^{2} = {(a b)}^{m} a \cdot {(a b)}^{m} a

.

We obtain

\sqrt[*]{P_{q}} = {{(a b)}^{m} a \cdot {(a b)}^{i} : i = 0, \dots, m}

and, consequently,

p_{q} (t) = t^{4 m + 2} - \sum_{i = 0}^{m} t^{2 i + 1}

. Then (Observe again

\sum_{i = k}^{m} a_{i} = 0

if

k > m

).

\begin{matrix} p_{q} (t) - p_{4 m + 2} (t) & = & - t^{2 m + 1} + \sum_{i = 0}^{m} t^{2 i} = - t^{2 m + 1} + t^{2 m} + t^{2 m - 2} + \sum_{i = 0}^{m - 2} t^{2 i} \\ & = & - t^{2 m - 2} \cdot (t^{3} - t^{2} - 1) + \sum_{i = 0}^{m - 2} t^{2 i}, \end{matrix}

and from Equation (29) we obtain

p_{q} (λ_{4 m + 2}) \geq - λ_{4 m + 2}^{2 m - 2} (λ_{4 m + 2}^{3} - λ_{4 m + 2}^{2} - 1) > 0

. □

8.2.2. The Case $q = q_{0}^{2} \cdot \bar{q} \land | \bar{q} | = 1$

Here we have the following relation between

p_{2 ℓ + 1} (t)

and

p_{2 ℓ + 1, ℓ, 2} (t)

.

p_{2 ℓ + 1} (t) - p_{2 ℓ + 1, ℓ, 2} (t) = t^{ℓ - 1} (t^{2} - t - 1)

(31)

Lemma 19.

If

q = q_{0}^{2} \cdot a, a \in X,

then

λ_{q} < λ_{| q |}

.

Proof.

First we suppose

| {\hat{q}}_{0} | - | \bar{q} | \geq 2

. Then

ℓ = | q_{0} | \geq | {\hat{q}}_{0} | \geq 3

, and Property 5 and Equation (31) yield

p_{q} (λ_{2 ℓ + 1}) \geq p_{2 ℓ + 1, ℓ, 2} (λ_{2 ℓ + 1}) = p_{2 ℓ + 1} (λ_{2 ℓ + 1}) - λ_{2 ℓ + 1}^{ℓ - 1} (λ_{2 ℓ + 1}^{2} - λ_{2 ℓ + 1} - 1)

. The assertion

p_{q} (λ_{2 ℓ + 1}) > 0

, that is

λ_{q} < λ_{2 ℓ + 1}

follows from Equation (29).

It remains to consider

| {\hat{q}}_{0} | = 2

. By Lemma 3

{\hat{q}}_{0} \neq q_{0}

implies

| {\hat{q}}_{0} | > | \bar{q} | + 1 = 2

. Hence

{\hat{q}}_{0} = q_{0} = a b

,

q = a b a b a

and

p_{q} (t) = t^{5} - t^{3} - 1 = t^{2} \cdot p_{3} (t) + t^{2} - 1

. Then

λ_{a b a b a} < λ_{5}

follows from

λ_{5} = λ_{3}

and

p_{q} (λ_{5}) = λ_{5}^{2} - 1 > 0

. □

8.2.3. The Case $q = q_{0}^{2} \cdot \bar{q} \land | \bar{q} | = 2$

Here we have the following relation between

p_{2 ℓ + 2} (t)

and

p_{2 ℓ + 2, ℓ, 2} (t)

.

p_{2 ℓ + 2} (t) - p_{2 ℓ + 2, ℓ, 2} (t) = t^{ℓ - 1} (t^{3} - t - 1) = t^{ℓ - 1} \cdot p_{3} (t)

(32)

Lemma 20.

If

q = q_{0}^{2} \cdot \bar{q}

with

| \bar{q} | = 2

then

λ_{q} < λ_{| q |}

.

Proof.

First we suppose

| {\hat{q}}_{0} | \geq 4

. Then Property 5, Equation (32) and

λ_{2 ℓ + 2} < λ_{3}

yield

p_{q} (λ_{2 ℓ + 2}) \geq p_{2 ℓ + 2, ℓ, 2} (λ_{2 ℓ + 2}) = - λ_{2 ℓ + 2}^{ℓ - 1} \cdot p_{3} (λ_{2 ℓ + 2}) > 0

, that is,

λ_{q} < λ_{2 ℓ + 2}

.

It remains to consider

| {\hat{q}}_{0} | = 3

. If

{\hat{q}}_{0} \neq q_{0}

Lemma 3 implies

| {\hat{q}}_{0} | > | \bar{q} | + 1

. Consequently,

{\hat{q}}_{0} = q_{0}

. Then

| q_{0} | = 3

and

| q | = 8

, and Equation (6) yields

\sqrt[*]{P_{q}} \subseteq {q_{0}, v, q}

where

v ⊏ q

and

| v | = | q | - 1 = 7

. Thus

p_{q} (t) \geq t^{8} - t^{5} - t - 1 = p_{8} (t) - t^{2} \cdot p_{3} (t)

for

1 \leq t \leq λ_{3}

.

This shows

p_{q} (λ_{8}) \geq - λ_{8}^{2} \cdot p_{3} (λ_{8}) > 0

, that is,

λ_{q} < λ_{8}

. □
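Relations (30)–(32) can be checked in the same mechanical way as the earlier identities. The Python sketch below reads p_{n,ℓ,j}(t) as t^n - t^{n-ℓ} - Σ_{i=0}^{ℓ-j} t^i, that is, with the same middle term t^{|q| - |q_0|} as in the set given for p_q(t) at the beginning of Section 8.2; this reading is an assumption of the sketch, and the helper names are ours.

```python
def p(n, t):
    """Extremal polynomial p_n(t) = t^n - sum_{i=0}^{floor((n-1)/2)} t^i."""
    return t**n - sum(t**i for i in range((n - 1) // 2 + 1))


def p_red3(n, ell, j, t):
    """p_{n,l,j}(t), read here as t^n - t^(n-l) - sum_{i=0}^{l-j} t^i (assumption)."""
    return t**n - t**(n - ell) - sum(t**i for i in range(ell - j + 1))


def relations_hold(ell, t):
    """Check Equations (30)-(32) at an integer point t, for ell >= 3."""
    eq30 = (p(2 * ell, t) - p_red3(2 * ell, ell, 3, t)
            == t**(ell - 2) * (t**2 - t - 1))
    eq31 = (p(2 * ell + 1, t) - p_red3(2 * ell + 1, ell, 2, t)
            == t**(ell - 1) * (t**2 - t - 1))
    eq32 = (p(2 * ell + 2, t) - p_red3(2 * ell + 2, ell, 2, t)
            == t**(ell - 1) * (t**3 - t - 1))
    return eq30 and eq31 and eq32


# Degrees are at most 2*ell + 2, so 2*ell + 3 points settle each value of ell.
ok = all(relations_hold(ell, t) for ell in range(3, 12) for t in range(2 * ell + 3))
print(ok)
```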

Summarising, the results of Section 8 yield the following.

Theorem 7.

If

q \in X^{*}, | q | \geq 3

, is a reducible quasiperiod then

λ_{q} < λ_{| q |}

.

Our main theorem (Theorem 3) then follows from Theorems 5 and 7.

Together with Corollary 12 our theorem yields a new proof of a theorem of [5] which shows that multi-scale quasiperiodic infinite words have zero topological entropy. In [5] a multi-scale quasiperiodic infinite word is a quasiperiodic infinite word which admits infinitely many quasiperiods.

9. Concluding Remark

In this paper we dealt with the function

f (ξ, n) = | infix (ξ) \cap X^{n} |

for quasiperiodic

ω

-words. Their factor complexity (or topological entropy) is defined as

τ (ξ) : = {lim}_{n \to \infty} \frac{{log}_{| X |} | infix (ξ) \cap X^{n} |}{n}

(e.g., [4], Section 4.2.2 or [5,22]). Thus the upper bound for

ξ \in P_{q}^{ω}

is

{log}_{| X |} λ_{q} \leq {log}_{| X |} t_{P}

which is bounded away from the value 1 for almost periodic

ω

-words.
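For a rough sense of scale, the bound can be evaluated numerically. The Python sketch below uses the overall bound λ_q ≤ λ_3 of Corollary 11 and assumes that t_P denotes this same value, the Pisot-Vijayaraghavan number; the alphabet sizes are chosen by us as examples.

```python
import math


def pisot_root(steps=60):
    """Root of t^3 - t - 1 in [1, 2], by bisection."""
    lo, hi = 1.0, 2.0
    for _ in range(steps):
        mid = (lo + hi) / 2
        if mid**3 - mid - 1 <= 0:
            lo = mid
        else:
            hi = mid
    return lo


lam3 = pisot_root()
for alphabet_size in (2, 3, 4):
    # Upper bound log_{|X|} lambda_3 on the entropy of quasiperiodic words.
    print(alphabet_size, math.log(lam3, alphabet_size))
```

For a binary alphabet the value is about 0.41, well below the maximal entropy 1.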

Along with the subword complexity in [5] the Kolmogorov complexity of quasiperiodic

ω

-words was addressed. Obviously, subword complexity upper bounds Kolmogorov complexity (e.g., [22]). Since the

ω

-languages

P_{q}^{ω}

are regular ones, the results of [22] show that there are

ω

-words

ξ \in P_{q}^{ω}

whose Kolmogorov complexity achieves their subword complexity. Moreover, as

P_{q}^{ω} = q \cdot R_{q}^{ω}

where

\sqrt[*]{R_{q}}

is a finite prefix code, the results of [22,26,27] give more detailed bounds for most complex quasiperiodic

ω

-words w.r.t. several notions of Kolmogorov complexity [28].

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

Marcus, S. Bridging two hierarchies of infinite words. J. UCS 2002, 8, 292–296. [Google Scholar]
Marcus, S. Quasiperiodic infinite words (column: Formal language theory). Bull. EATCS 2004, 82, 170–174. [Google Scholar]
Marcus, S.; Păun, G. Infinite (almost periodic) words, formal languages and dynamical systems. Bull. EATCS 1994, 54, 224–231. [Google Scholar]
Cassaigne, J.; Nicolas, F. Factor complexity. Combinatorics, automata and number theory. In Encyclopedia of Mathematics and Its Applications; Cambridge University Press: Cambridge, UK, 2010; Volume 135, pp. 163–247. [Google Scholar]
Monteil, T.; Marcus, S. Quasiperiodic infinite words: Multi-scale case and dynamical properties. arXiv 2006, arXiv:math/0603354. [Google Scholar]
Polley, R.; Staiger, L. The maximal subword complexity of quasiperiodic infinite words. In Proceedings of the Twelfth Annual Workshop on Descriptional Complexity of Formal Systems, Saskatoon, SK, Canada, 8–10 August 2010. [Google Scholar]
Staiger, L. Quasiperiods of infinite words. In Mathematics almost Everywhere. In Memory of Solomon Marcus; Bellow, A., Calude, C.S., Zamfirescu, T., Eds.; World Scientific: Hackensack, NJ, USA, 2018; pp. 17–36. [Google Scholar]
Staiger, L. On the generative power of quasiperiods. In Lecture Notes in Computer Science, Proceedings of the 22nd International Conference Descriptional Complexity of Formal Systems, Vienna, Austria, 24–26 August 2020; Jirásková, G., Pighizzini, G., Eds.; Springer: Cham, Switzerland, 2020; Volume 12442, pp. 219–230.
Levé, F.; Richomme, G. Quasiperiodic infinite words: Some answers (column: Formal language theory). Bull. EATCS 2004, 84, 128–138.
Apostolico, A.; Ehrenfeucht, A. Efficient detection of quasiperiodicities in strings. Theor. Comput. Sci. 1993, 119, 247–265.
Apostolico, A.; Farach, M.; Iliopoulos, C.S. Optimal superprimitivity testing for strings. Inform. Process. Lett. 1991, 39, 17–20.
Mouchard, L. Normal forms of quasiperiodic strings. Theor. Comput. Sci. 2000, 249, 313–324.
Brzozowski, J.A. Roots of star events. J. ACM 1967, 14, 466–477.
Berstel, J.; Perrin, D. Theory of codes. In Pure and Applied Mathematics; Academic Press Inc.: Orlando, FL, USA, 1985; Volume 117.
Shyr, H.-J. Free Monoids and Languages, 3rd ed.; Hon Min Book Company: Taichung, Taiwan, 2001.
Lothaire, M. Combinatorics on Words, 2nd ed.; Cambridge University Press: Cambridge, UK, 1997; Volume 17.
Fine, N.J.; Wilf, H.S. Uniqueness theorems for periodic functions. Proc. Am. Math. Soc. 1965, 16, 109–114.
Bruyère, V.; Wang, L.M.; Zhang, L. On completion of codes with finite deciphering delay. Eur. J. Combin. 1990, 11, 513–521.
Fernau, H.; Reinhardt, K.; Staiger, L. Decidability of code properties. Theor. Inform. Appl. 2007, 41, 243–259.
Staiger, L. On infinitary finite length codes. RAIRO Inform. Théor. Appl. 1986, 20, 483–494.
Devolder, J.; Latteux, M.; Litovsky, I.; Staiger, L. Codes and infinite words. Acta Cybernet. 1994, 11, 241–256.
Staiger, L. Kolmogorov complexity and Hausdorff dimension. Inform. Comput. 1993, 103, 159–194.
Berstel, J.; Reutenauer, C. Rational series and their languages. In EATCS Monographs on Theoretical Computer Science; Springer: Berlin/Heidelberg, Germany, 1988; Volume 12.
Salomaa, A.; Soittola, M. Automata-Theoretic Aspects of Formal Power Series; Springer: New York, NY, USA, 1978.
Staiger, L. The entropy of finite-state ω-languages. Probl. Control Inform. Theory/Probl. Upravlen. Teor. Inform. 1985, 14, 383–392.
Mielke, J.; Staiger, L. On oscillation-free ε-random sequences II. In Computability and Complexity in Analysis; Dagstuhl Seminar Proceedings; Bauer, A., Hertling, P., Ko, K.-I., Eds.; Schloss Dagstuhl-Leibniz-Zentrum für Informatik: Wadern, Germany, 2009; Volume 09003.
Staiger, L. Bounds on the Kolmogorov complexity function for infinite words. In Information and Complexity; World Scientific Series in Information Studies; Chapter 8; Burgin, M., Calude, C.S., Eds.; World Scientific: Hackensack, NJ, USA, 2017; Volume 6, pp. 200–224.
Uspensky, V.A.; Shen, A. Relations between varieties of Kolmogorov complexities. Math. Syst. Theory 1996, 29, 271–292.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
