Algebraic Theory of Patterns as Generalized Symmetries

Rupe, Adam; Crutchfield, James P.

doi:10.3390/sym14081636

Open AccessArticle

Algebraic Theory of Patterns as Generalized Symmetries

by

Adam Rupe

^1,*

and

James P. Crutchfield

²

¹

Center for Nonlinear Studies and Computational Earth Science, Los Alamos National Laboratory, Los Alamos, NM 87545, USA

²

Complexity Sciences Center, Department of Physics and Astronomy, University of California Davis, Davis, CA 95616, USA

^*

Author to whom correspondence should be addressed.

Symmetry 2022, 14(8), 1636; https://doi.org/10.3390/sym14081636

Submission received: 1 July 2022 / Revised: 4 August 2022 / Accepted: 5 August 2022 / Published: 9 August 2022

(This article belongs to the Special Issue Symmetry in Nonlinear Dynamics and Chaos)

Download

Browse Figures

Versions Notes

Abstract

:

We generalize the exact predictive regularity of symmetry groups to give an algebraic theory of patterns, building from a core principle of future equivalence. For topological patterns in fully-discrete one-dimensional systems, future equivalence uniquely specifies a minimal semiautomaton. We demonstrate how the latter and its semigroup algebra generalizes translation symmetry to partial and hidden symmetries. This generalization is not as straightforward as previously considered. Here, though, we clarify the underlying challenges. A stochastic form of future equivalence, known as predictive equivalence, captures distinct statistical patterns supported on topological patterns. Finally, we show how local versions of future equivalence can be used to capture patterns in spacetime. As common when moving to higher dimensions, there is not a unique local approach, and we detail two local representations that capture different aspects of spacetime patterns. A previously developed local spacetime variant of future equivalence captures patterns as generalized symmetries in higher dimensions, but we show that this representation is not a faithful generator of its spacetime patterns. This motivates us to introduce a local representation that is a faithful generator, but we demonstrate that it no longer captures generalized spacetime symmetries. Taken altogether, building on future equivalence, the theory defines and quantifies patterns present in a wide range of classical field theories.

Keywords:

organization; structure; pattern; computational mechanics; spacetime; translation symmetry; predictive equivalence

1. Patterns in Nature

Symmetry plays a central role in fundamental physics. When we look out at the world around us, on the human scale, however, there is a notable lack of exact symmetries. Cows are not spherical, for instance. The disconnect between physics at the fundamental level and the human scale is often described in terms of spontaneous symmetry breaking, the most famous example being the Higgs et. al. mechanism of spontaneously broken gauge symmetries [1,2,3,4]. How and why spontaneous symmetry breaking occurs so ubiquitously in natural systems are interesting and challenging questions, but not our concern here. Rather, we are interested in the question of what structures result from broken symmetries. In particular, a special case of spontaneous symmetry breaking is spontaneous self-organization. However, what is organization in the first place? Can we mathematically formalize it and quantitatively measure it?

We will use the general rubric pattern to refer to the forms of organization that spontaneously emerge in natural systems. When a system undergoes a spontaneous symmetry breaking event it self-organizes into some pattern, either in time, space, or both.

In many canonical examples of pattern formation near equilibrium [5,6,7]—recall the convection roles that emerge in Bénard flow [8,9] or the spiral waves in the Belousov-Zhabotinsky chemical reaction [10,11]—a system undergoes a continuous-to-discrete symmetry breaking bifurcation event. This occurs when it self-organizes, going from a homogeneous state with trivial continuous spacetime symmetries to a state with nontrivial discrete symmetries.

Hexagonal convection cells form in a fluid with velocity initially zero everywhere during a Bénard instability. Belousov first discovered a “chemical clock” with a discrete-time symmetry oscillation that arises from an initially stationary mixture. Farther from equilibrium, these discrete symmetries may be further broken during subsequent bifurcations, resulting in states we may consider “patterned”, but that have no discernible simple symmetries remaining. Turbulent fluid flows containing coherent structures [12,13], such as Jupiter’s Great Red spot [14,15], provide many commonly encountered examples.

What we would like then, and what the following contributes, is a unified account of patterns—an account that rigorously and formally describes the full range of phenomenon including, but going beyond, organizations with exact symmetries. Given that symmetries are formally captured using the mathematics of group theory and given the enormous success group theory has had in formulating physical theory [16], we take an algebraic approach to framing patterns as generalized symmetries.

We start with the simplest setting of discrete one-dimensional spatial systems (e.g., spin lattices) and show how the semigroup algebra of semiautomata—a mathematical representation originating in symbolic computation—generalizes translational symmetry. In doing so, we rigorously clarify subtleties posed by this generalization—subtleties that have not been previously addressed. Based on this, we introduce a classification hierarchy in terms of exact symmetries, partial symmetries, hidden symmetries, and general patterns. We also describe distinct statistical structures supported on these one-dimensional patterns and show that stochastic generalizations of semiautomata provide mathematical representations of these statistical patterns.

In addition, we explore generalizations to patterns in higher dimensions. We introduce a class of local models that generalize the semiautomata approach for spatiotemporal systems. Two models in this class are shown to be particularly useful. Intriguingly, the uses of these models appear mutually exclusive. The first model, introduced previously, can discover hidden spacetime symmetries and coherent structures [17], such as those in turbulent fluid flows [18]. However, we describe a previously unknown shortcoming of these models: they are not consistent generators of their associated spacetime field patterns. The second model, introduced here for the first time, corrects this flaw and introduces a consistent generator of spacetime field patterns. Unfortunately, it loses the first model’s useful generalized spacetime symmetries.

These representations’ conflicting strengths—generalized spacetime symmetries and consistent spacetime generation—add new questions and suggest new paths of inquiry to the enigma of patterns in higher dimensions [19,20,21].

2. One-Dimensional Patterns

Abstractly, we can think of a pattern as a predictive regularity [22]:

… some object O has a pattern P—O has a pattern ‘represented’, ‘described’, ‘captured’, and so on by P—if and only if we can use P to predict or compress O.

On one extreme, symmetries represent an exact predictive regularity. If the symmetries are known, the pattern can be perfectly predicted at any other point in time or space. On the other extreme, a completely random system is entirely devoid of predictive regularity. If every point in spacetime is an independent, identically distributed random variable, there is no regularity. So, knowledge of any part of the system cannot be leveraged to predict other parts of the system. The notion of pattern that we seek encompasses both of these extremes and systems in between. A general pattern will be neither perfectly predicable nor entirely unpredictable—it will be an amalgamation of regularity and randomness.

Before proceeding, let us briefly compare and contrast the theory developed here with the Pattern Theory of Ulf Grenander and colleagues [23]. As both aim at a general quantitative understanding of what patterns are and how to discover them in the world, there are many conceptual similarities. While some quantitative similarities emerge, in particular the use of nonparametric learning algorithms for hidden Markov models [24], most of the quantitative machinery differs. Pattern Theory is grander in scope than what is developed here, and so we do not require the very general constructs of bonds, connectors, configurations, images, and the like [23]. Likewise, Pattern Theory does not employ the machinery of symbolic dynamics, sofic shifts, and predictive equivalence used here. Thus, our work is complementary to the more general approach of Pattern Theory.

2.1. Statistical Field Theories

The following mainly concerns fully-discrete one-dimensional spatial systems. These are given as a shift space

X

—a set of indexed bi-infinite sequences, or strings, of symbols taken from a finite alphabet

A

. Before diving into details, let us first take a moment to compare shift spaces to the analogous setup from statistical mechanics for analyzing ordered systems.

A shift space can be thought of as a topological ensemble—a set of strings—in contrast with a statistical ensemble that is a distribution over a set of strings. This is an abstraction of discrete-spin models in one-dimension—e.g.,

A = {- 1, 1}

for a standard Ising model. Rather than specify interactions on the spin lattice and analyze the resulting statistical field theory, we wish to analyze any pattern present for a given (topological) ensemble

X

.

A key distinction between a shift space

X

as a topological ensemble and a spin lattice ensemble in a statistical field theory is that all elements

x \in X

are related to one another through the shift operator

σ

. (Formally described below.) In fact, for the irreducible sofic shifts we consider,

(X, σ)

is an ergodic dynamical system. So, every member x of the ensemble can eventually be sampled through

σ

’s action. Thus, we equivalently consider (i)

X

as an ensemble of points and

σ

as a deterministic mapping between those points or (ii)

X

as a single infinite lattice and

σ

moves indices on that lattice. The difference is that of active versus passive transformation.

Spontaneous symmetry breaking in statistical field theories is monitored through an order parameter [25]; such as total magnetization for an Ising model. In the symmetric “ordered” phase, the order parameter has a nonzero value and, after a symmetry-breaking phase transition, the order parameter vanishes. For the Ising model, below the transition—below the critical temperature—spins tend to align giving nonzero magnetization. At zero temperature the model reaches its ground state with all spins aligned. This is a fully symmetric state with maximal magnetization, corresponding to strings of the form

{\dots, 1, 1, 1, \dots}

or

{\dots, - 1, - 1, - 1, \dots}

. Above the critical temperature, this symmetry is fully broken, with zero magnetization.

While effective as an approach to thermal spin lattices, such as the Ising and related lattice models, abstractly quantifying “order” with a single scalar quantity—the order parameter—is far from ideal.

First, for the Ising model, there are only two configurations with maximal magnetization, as given above. Second, these configurations are maximally symmetric, with

σ^{p} (x) = x

for integer p. Consider, though, configurations of the form

{\dots, - 1, 1, - 1, 1, - 1, \dots}

. These configurations are still symmetric, with

σ^{2 p} (x) = x

, although they have vanishing order parameter. There are many such symmetric configurations with zero magnetization: e.g., those of the form

{\dots, {(- 1)}^{n}, {(1)}^{n}, \dots}

, with

σ^{2 n p} (x) = x

. More novelly, there are zero order-parameter configurations that are neither completely symmetric nor completely random. Third, these symmetric sequences with zero order parameter are not the ground state and they are not stable under thermal perturbations. Thus, though singled out by the choice of total magnetization as the order parameter, they are edge cases that will almost never be seen. Finally and more generally, order parameters in statistical mechanics are not determined from first physical principles. They must be posited initially and then proved appropriate.

Similarly, correlation functions and structure factors are additional and commonly-employed scalar quantities that capture one or another notion of order. Conceptually, a system considered highly ordered will surely be highly correlated. Patterns that emerge on the macroscopic scale correspond to collective behaviors on the microscale that certainly exhibit nonzero correlations. However, as with order parameters, there can be many degeneracies between specific patterns and correlation values. In short, order is something beyond correlation. A diverging correlation length in an Ising model at the critical temperature does not signify the presence of intricate patterns and organization, such as spiral wave patterns in lattice models of excitable media [26]. To remedy these failings, we seek a definition of pattern that is not scalar.

This is not to banish all scalar quantities. Many, in given settings, can be insightful [27]. We will show that the algebraic presentations for topological patterns have a natural extension to patterns in statistical field theories. Moreover, scalar quantities of interest, like correlation functions, can be computed in closed-form from the stochastic presentations. We also return later, briefly, to discuss generalized order parameters in light of the algebraic theory.

2.2. Symbolic Dynamics

We now detail shift spaces and how they quantify topological patterns as generalized symmetries. Consider a finite alphabet of n symbols

A = {0, 1, \dots, (n - 1)}

and (indexed) bi-infinite symbol sequences or strings. The set

A^{Z}

of all possible bi-infinite sequences is known as the full-n shift. A particular sequence

x = \dots x_{- 1} x_{0} x_{1} \dots \in A^{Z}

is described as a point in

A^{Z}

. For now, we need not specify whether sequence indices are time coordinates or space coordinates. In either case, translations are generated via the shift operator

σ

that maps a point

x \in A^{Z}

to another point

y = σ (x)

whose

i th

coordinate is

y_{i} = x_{i + 1}

for all i. (That is,

σ

shifts every element of x one place to the left.) Our interest is in patterns as predictive regularity, and the predictions are made over translations generated by

σ

. Thus, we want to capture patterns in closed,

σ

-invariant subsets of

A^{Z}

. The subsets are called shift spaces (or subshifts or simply shifts).

Often one can concisely specify a shift space as the set of all strings that do not contain a collection of forbidden words. A word is a finite block of symbols

a_{j} \in A

and a point x is said to contain or admit a word

w = a_{0} a_{1} \dots a_{k}

if there are indices i and

j = i + k

such that

x_{i : j} = w

; explicitly,

x_{i} = a_{0}, x_{i + 1} = a_{1}, \dots, x_{j - 1} = a_{k}

. Again, a word is a finite sequence of symbols; a string, bi-infinite.

For a collection

F

of forbidden words, define

X_{F}

to be the subset of strings in

A^{Z}

that do not contain any words

w \in F

. A shift space

X

is a subset of the full shift

A^{Z}

such that

X = X_{F}

for some collection

F

of forbidden words [28]. The language

W (X) = F^{c}

of a shift space

X

is the collection of all words that occur in some point in

X

.

If

F

is a finite set, the resulting shift space is called a subshift of finite type [29] or an intrinsic or a topological Markov chain [30]. A wider class of finitely-definable shift spaces are the sofic shifts. These are the closure of subshifts of finite type under continuous local mappings—k-block factor maps [31]. Though, note that there are many equivalent definitions of sofic shifts; several of which are given below as needed. A sofic shift is irreducible if, for every ordered pair of words

u, v \in W (X)

, there is a word w such that

u w v \notin F

. For reasons elaborated on shortly, we define general discrete one-dimensional patterns as irreducible sofic shifts.

2.3. Sofic Shifts as Topological Patterns

To recap, we seek a mathematical specification of patterns in strings that captures a range of organizations spanning fully-symmetric sequences to arbitrary (“random”) sequences. Moreover, we wish to identify, from first principles, an associated algebra that generalizes the group algebra of symmetries. For physical consistency, we started with shift spaces since they are shift-invariant subspaces of strings. To fulfill the algebraic requirement, we now further restrict to sofic shifts as they are shift spaces defined in terms of a finite semigroup [31].

Recall that a group is a set of elements closed under an associative and invertible binary operation with an identity element. In this, they are too restrictive and impose only exact symmetry. In contrast, semigroups require neither invertibility nor an identity operation. This relaxation is key to defining generalized patterns, as we laid out above. It permits exact symmetries but also allows expressing noisy and approximate symmetries.

The elements of a sofic shift’s semigroup are words and the binary operation is word concatenation. For example, the set

W (A^{Z})

of all words in

A^{Z}

and their concatenation products together form the free semigroup. For example, the product of

u = 00

and

v = 11

in the free semigroup is the word

w = u v = 0011

. A sofic shift

X = X_{F}

is defined in terms of a finite semigroup G with an absorbing element e, whose product is

g e = e g = e

for all

g \in G

. The absorbing element e together with the elements from the alphabet

A = {0, 1, \dots, (n - 1)}

generates G via single-symbol concatenations. G’s production rules are such that for any pair of allowed words

u, v \notin F

, if their concatenation

w = u v

contains a forbidden word

f \in F

, then their product in G gives the absorbing element

u v = e

.

A semigroup of word concatenations is also associated with a simple presentation in the form of a semiautomaton finite-state machine [32]—the triple

(Ξ, A, M)

, where

Ξ = {ξ_{0}, ξ_{1}, \dots, ξ_{m}}

is a set of internal states,

A = {0, 1, \dots, (n - 1)}

is the symbol alphabet, and

M = {M_{0}, M_{1}, \dots, M_{(n - 1)}}

is a set of mappings from

Ξ

into

Ξ

. To be explicit, we consider deterministic and fully-specified semiautomata for which each

M_{a}

is a function—each input has one and only one output—with domain over the full set

Ξ

of internal states. Semiautomata can be usefully depicted as an edge-labeled directed graph. The vertices represent the internal states in

Ξ

and for every pair

(ξ_{i}, ξ_{j})

such that

ξ_{j} = M_{a} (ξ_{i})

there is an edge labeled

a \in A

that leads from

ξ_{i}

to

ξ_{j}

. For a deterministic and fully-specified semiautomaton, there is an edge labeled with each symbol in

A

emanating from every state in

Ξ

.

A fully-specified semiautomaton directly determines a subshift’s algebra from the free semigroup as follows. For every state

ξ_{i} \in Ξ

and any element

a_{0} a_{1} \dots a_{k}

of the the free semigroup

A^{Z}

there is a map

M_{a_{0}} \circ M_{a_{1}} \circ \dots \circ M_{a_{k}}

from

ξ_{i}

to another state

ξ_{j} \in Ξ

[32]. A deterministic and fully-specified semiautomaton is a presentation of a sofic shift if we include an absorbing “forbidden” state

ξ_{e} \in Ξ

. That is, the mappings associated with all elements of the free semigroup containing a forbidden word in

F

lead to the forbidden state. Additionally, all mappings from the forbidden state return the forbidden state. That is,

M_{w} (ξ_{i}) = ξ_{e}

for all

ξ_{i}

and

w \in F

, and

M_{a} (ξ_{e}) = ξ_{e}

for all

a \in A

. See Figure 1 for presentations of example shifts.

Since sofic shifts are defined by a finite semigroup, every sofic shift can be presented by a semiautomaton with a finite set of states

Ξ

. Recall from above that the idea of compression is related to our intuition of pattern. A pattern—a predictive regularity—allows for a compressed representation of a system’s behaviors. Note that sofic shifts and their presentations provide a finite representation of an ensemble of infinite strings through their finite semigroup.

Here, we distinguish between three types of sofic-shift semiautomaton presentation. Appendix A gives example constructions of these three types of presentation.

The most straightforward presentation assigns a state

ξ \in Ξ

to each element of a semigroup G of

X

and fills in the state transitions

M_{a}

using G’s production rules [33]. While straightforward to construct, if a G is known, this semiautomaton presentation is not necessarily minimal, in terms of the state set size

| Ξ |

. To specify a particular pattern as a sofic shift, it is crucial to have a minimal and unique presentation associated with the given sofic shift. This also allows extracting unambiguous quantitative measures of the ensemble of strings, such as measures of correlation, from the finite presentation.

An important presentation that is minimal and constructible without knowing any G is

X

’s future cover [34], defined below. The future cover semiautomaton of every irreducible sofic shift

X

has a unique strongly connected component [35,36]. This irreducible component is our mathematical representation of patterns as generalized symmetries. We refer to it as the canonical machine presentation

P (X)

of

X

. The future cover and its recurrent component

P (X)

provide a unique, minimal mathematical representation of

X

.

The future set (sometimes follower set)

F_{X} (w)

of a word

w \in W (X)

is the collection of all words u such that

w u \in W (X)

. Define the future equivalence relation

\sim_{F}

on

X

as:

\begin{matrix} u \sim_{F} v \Leftrightarrow F_{X} (u) = F_{X} (v), \end{matrix}

(1)

for all

u, v \in W (X)

.

An important definition of sofic shifts that we use shortly is:

Theorem 1.

([28], Theorem 3.2.10) A shift space is sofic if and only if it has a finite number of future sets.

Therefore, sofic shifts have finitely-many equivalence classes, denoted

{[\cdot]}_{F}

, and these equivalence classes plus the absorbing forbidden state are the internal states

Ξ

of the future cover semiautomaton.

The mappings

M_{a}

that give the state transitions are defined from the allowed concatenations that do not contain a forbidden word in

F

. That is, each state

ξ_{i} \in Ξ \ {ξ_{e}}

is an equivalence class

{[u]}_{F}

and, for each symbol

a \in A

, the concatenation

v = u a \neq e

belongs to the equivalence class

{[v]}_{F}

assigned as state

ξ_{j} \in Ξ

, giving

M_{a} (ξ_{i}) = ξ_{j}

. Note that this is independent of the choice

u \in {[u]}_{F}

and that

{[v]}_{F}

in some cases may be equal to

{[u]}_{F}

. This gives a self-edge transition in the semiautomaton:

M_{a} (ξ_{i}) = ξ_{i}

. If

u a = e

, then

M_{a} (ξ_{i})

maps to the forbidden state

ξ_{e}

.

This natural transition structure follows from future equivalence and it leads to a very important property of the future-cover semiautomaton. They are called unifilar [37] in information theory, equivalently also called right resolving in symbolic dynamics ([28], Corollary 3.3.19) or deterministic in automata theory [38]. A fully-specified semiautomaton is unifilar if for every internal state

ξ_{i}

and every word (element of the free semigroup)

a_{0} a_{1} \dots a_{k}

the map

M_{a_{0}} \circ M_{a_{1}} \circ \dots M_{a_{k}}

leads to one and only one internal state

ξ_{j}

. (It may be that j = i.)

Unifilarity is the defining property of a predictive semiautomaton. Since the goal is to formalize patterns as predictive regularities, this is an important point to stress. By way of contrast, first note that any presentation of a sofic shift

X

, whether unifilar or not, is a generator of

X

. Every string in

X

can be generated by following the symbol-labeled transitions of the presentation and no forbidden words can be generated. Thus, being generative can be thought of as the baseline property of any model of a sofic shift. Prediction is an additional capability beyond generation that arises from a presentation being unifilar.

Unifilarity establishes a presentation as predictive in the following way. For topological patterns, the task of prediction is to establish what may happen in the future, given what has happened in the past. Specifically, given a word w, what words are allowed to follow in the shift space? That is, what is w’s future set? Due to the implied determinism, each internal state of a unifilar presentation is uniquely specified by the word w leading to that state. Notably, this is not guaranteed for an arbitrary generator of a shift. Furthermore, in a unifilar presentation every subsequent word again leads to a unique internal state. It is straightforward to see that the future cover is a predictive presentation: By definition, its internal states are future separated—the set of all words that may follow from each internal state is unique.

To establish that sofic shifts and their canonical machine presentations express patterns as generalized symmetries, it is helpful to first describe how they capture exact translation symmetries of symbolic sequences.

2.4. Exact Symmetries

A string x has a discrete translation symmetry if

σ^{p} (x) = x

, where the minimal such

p \in N

is the symmetry’s period. The symmetry group is the set

{σ^{n p} : n \in N}

with

σ^{i p} σ^{j p} = σ^{k p}

where

k = i + j

. Since here p is finite, the action of

σ

on x produces a compact shift-invariant subspace of

A^{Z}

. Therefore, translation symmetric strings are shift spaces

X

.

We now show that the shift space

X

of translation symmetric strings is sofic. The action of the symmetry group used to define x is determined by the shift operator

σ

, while the action of sofic semigroups is word concatenation.

To connect these, consider windows

x_{i : j}

(

i < j)

that return the word w from coordinates i through j in x. For

(i < j < k)

, if

x_{i : j} = u

and

x_{j : k} = v

, then

x_{i : k} = w = u v

gives the concatenated window. Recall that

σ

shifts indices in x so that

σ^{k} {(x)}_{i} = x_{i + k}

. If we have a word

u \in W (X)

, there is some

(i, j)

such that

x_{i : j} = u

. Then the allowed concatenations

u v \neq e

are determined by the shift operator

σ

, since we can write

v = x_{j : j + k} = (x_{j}) σ {(x)}_{j} σ^{2} {(x)}_{j} \dots σ^{k} {(x)}_{j}

. All such concatenations determine

X

’s semigroup G. For translation symmetric strings with

σ^{p} (x) = x

, we have that

σ^{p} {(x)}_{i} = x_{i}

. So, intuitively, there is only a finite number of elements in G because there is a closure of allowed single-symbol concatenations after a finite number of unit-shifts. Therefore,

X

for translation symmetric strings is sofic.

We now show this explicitly by constructing the canonical machine presentation

P (X)

with a finite number of states.

Proposition 1.

Translation-symmetric strings,

σ^{p} (x) = x

for some

p \in N

, together with their shifts

y = σ^{n} (x)

for all

n \in N

, form an irreducible sofic shift space.

Proof.

First, note that

σ^{p} (x) = x

implies x can be written as a tiling

\dots b b b b b \dots

, where b is a word of length p. Pick any

x_{i}

as the first symbol

b_{0}

in the word b. Then

σ {(x)}_{i} = b_{1}

,

σ^{2} {(x)}_{i} = b_{2}

,…, with

σ^{p - 1} {(x)}_{i} = b_{p - 1}

. Applying

σ

one more time gives

σ^{p} {(x)}_{i} = x_{i} = b_{0}

, arriving at the next tile b.

Second, using this observation we create

P (X)

using

p + 1

internal states, where we have a state

ξ_{b_{i}}

for each symbol

b_{i} \in A

in the tile b (there are p of these) and one absorbing state

ξ_{e}

. Let

ξ_{b_{p - 1}}

be the future-set equivalence class

{[b]}_{F}

of the word b (

b_{p - 1}

is the last symbol in b). Due to x’s exact symmetry, there is one and only one symbol

a \in A

such that

u a \neq e

for

u \in {[b]}_{F}

; namely

b_{0}

. Therefore, let

M_{b_{0}} (ξ_{b_{p - 1}}) = ξ_{b_{0}}

, and then all other

M_{a}

map to

ξ_{e}

, for

a \neq b_{0}

. Now,

ξ_{b_{0}} = {[b b_{0}]}_{F}

and

b_{1}

is the only symbol such that

u a \neq e

for

u \in {[b b_{0}]}_{F}

. Thus,

M_{b_{1}} (ξ_{b_{0}}) = ξ_{b_{1}}

, with

ξ_{b_{1}} = {[b b_{0} b_{1}]}_{F}

, and all other

M_{a}

map to

ξ_{e}

, for

a \neq b_{1}

. Repeat this argument for all generators in b until we arrive at

{[b b_{0} b_{1} \dots b_{p - 1}]}_{F} = {[b b]}_{F}

.

Third, as with

ξ_{b_{p - 1}} = {[b]}_{F}

, the only symbol that can follow is

b_{0}

and we repeat the full argument again, where each

b_{i}

sequentially follows. Therefore,

{[b b]}_{F}

’s future set equals that of

{[b]}_{F}

and so

{[b b]}_{F} = {[b]}_{F}

. Thus, the only transition from

ξ_{b_{p - 2}} = {[b b_{0} b_{1} \dots b_{p - 2}]}_{F}

that is allowed returns to the original state

ξ_{b_{p - 1}} = {[b b_{0} b_{1} \dots b_{p - 1}]}_{F} = {[b b]}_{F} = {[b]}_{F}

. This completely specifies

P (X)

with a finite number of states

Ξ = {ξ_{b_{0}}, \dots, ξ_{b_{p}}, ξ_{e}}

. Given Theorem (1) above, one concludes that the shift space

X

for a translation symmetric sequence is sofic. □

Figure 1a gives an example of

P (X)

for the set of translation symmetric strings

\dots 000111000111000 \dots

with

b = 111000

. For visual clarity it omits the self-loops on state

ξ_{e}

.

Recall that

P (X)

is the irreducible component of the future cover, so for a given shift there may be additional states beyond those just described. However, these are equivalence classes for words of length less than p, since we started with

{[b]}_{F}

. Since p is finite, there are only finitely many additional equivalence classes. These correspond to transient (nonrecurrent) states of the future cover semiautomaton.

In Figure 1a’s example, knowing

x_{i} = 0

does not fully specify which state

P (X)

is in, since three states

ξ_{3}

,

ξ_{4}

, and

ξ_{5}

correspond to observing the symbol 0. The additional transient states of the future cover specify how to synchronize to

P (X)

’s states from the generators

a \in A

(single-symbol words).

Having constructed the canonical machine presentation

P (X)

, we can further relate

X

’s semigroup action to x’s translation symmetry group. For each state

ξ_{b_{i}} \in Ξ

there is one and only one transition that does not lead to the absorbing forbidden state

ξ_{e}

. That is, only one generator

a \in A

can be concatenated to the words

u \in {[u]}_{F} = ξ_{b_{i}}

. Similarly, if we consider a word

u \in ξ_{b i}

as the window

u = x_{i : j}

, then a unit shift by

σ

reveals one and only one new symbol at index j in

{σ (x)}_{i : j}

. Therefore, ignoring the absorbing state and transitions to it, the graph of

P (X)

is a cyclic graph with period p: Every p-length path from

ξ_{i}

returns to

ξ_{i}

for all

ξ_{i} \in Ξ \ {ξ_{e}}

. Thus, the permutation symmetries of this (edge-labeled) graph correspond to elements of x’s translation symmetry group.

In Figure 1a’s example, the state labeled

ξ_{b_{0}}

corresponds to the start of the tile

b = 111000

, but we could equivalently use

ξ_{b_{0}}

as the start of tile 000111. Furthermore, the internal states have a functional meaning. They are the elements of the quotient group of the translation symmetry—counters that track the symmetry’s phase.

It must be emphasized that sofic shifts and their semigroups do not formally generalize such exact symmetries in the obvious way. That is, the semigroup of a sofic shift for exact symmetry strings does not simply become a symmetry group. G’s absorbing semigroup element e is still required for an exact symmetry sofic shift

X

. From our construction of translation symmetric

P (X)

, we see that, for every internal state

ξ_{b_{i}} \neq ξ_{e}

, there is one and only one transition that does not lead to

ξ_{e}

. This makes it clear that exact symmetries are a highly restrictive form of pattern. By representing exact symmetries using sofic shifts and their machine presentations, though, it is now straightforward to generalize by relaxing the restrictions that impose exact symmetries.

2.5. Generalized Symmetries

Ignoring

ξ_{e}

, a sofic shift

X

whose machine presentation is not cyclic then represents a pattern as a generalized symmetry. We can again either consider

X

as an ensemble of strings or one infinite sequence that possesses a generalized symmetry described by

X

’s semigroup, which is well represented by the machine presentation

P (X)

. By removing the restriction of a cyclic graph in

P (X)

—that imposed the perfect regularity of

x = \dots b b b b b \dots

—we now can capture a much wider class of patterned strings with approximate or partial regularities.

Consider the extreme case of the full shift

A^{Z}

that has no regularity. There are no forbidden products such that

u v = e

in G for

A^{Z}

, and so there are no restrictions on its words;

F = \emptyset

. The algebra of the full shift is the free semigroup. Its machine presentation

P (A^{Z})

is a single state with all

M_{a}

mapping that one state back to itself—i.e., all transitions are self transitions. Since all words can be concatenated to each other, they all belong to a single future equivalence class.

We interpret

A^{Z}

’s complete lack of regularity to be a null pattern. Analogously, the opposite extreme of total regularity with strings of the form

x = \dots a a a a a \dots

for

a \in A

is also a null pattern with a single-state (again, ignoring

ξ_{e}

) machine presentation that has a single self-transition. The null pattern, in these cases, has zero memory—the logarithm of

| Ξ \ {ξ_{e}} |

vanishes. While both extremes at first seem to be polar opposites, recall our goal is that “pattern” represents a predictive regularity and this is lacking in both cases. The full shift is completely “random” and thus unpredictable. Whereas, for trivially translation symmetric strings

x = \dots a a a a \dots

, the future is always the same. There is nothing to predict.

Between the complete regularity of exact symmetries and lack of predictive regularity in null patterns, we identify several categories of partial predictive regularity.

First, note that for translation symmetric strings

σ^{p} {(x)}_{i} = x_{i}

for all i.

Second, there are string classes for which

σ^{p} {(x)}_{i} = x_{i}

for only some i. We call these partial symmetries. A particular case of partial symmetries are stochastic symmetries. For simplicity, consider binary sequences with

A = {0, 1}

, and let

ω

denote a “wildcard” that can be either 0 or 1 [39]. Sofic shifts with stochastic (partial) symmetries are fully translation symmetric after making wildcard substitutions. For example, we can specify a sofic shift with sequences of the form

x = \dots ω ω 0 ω ω 0 ω ω 0 \dots

, say. Examples of such strings are

\dots 110010000100110 \dots

, where spaces help emphasize the “fixed” 0s that are the scaffolding of the partial symmetry. Note that the canonical machine presentation

P (X)

for such stochastic symmetries are also cyclic graphs, as shown in Figure 1b.

Third, recall that if the canonical presentation

P (X)

is a cyclic graph, every p-length path from

ξ_{i}

returns to

ξ_{i}

for all

ξ_{i} \in Ξ \ {ξ_{e}}

. Similar to how we generalized from exact to partial symmetries, we define hidden symmetries for which only some states

ξ_{i} \in Ξ \ {ξ_{e}}

return to themselves on all p-length paths in

P (X)

. We exclude the case of

p = 1

, so that self-loops do not count as a hidden symmetry. Figure 1c shows an example with

ξ_{2}

as the symmetric state. The canonical machine presentation specifies a sofic shift consisting of arbitrary arrangements of blocks

a = 000

and

b = 111

; e.g.,

x = \dots a a b a b b a \dots = \dots 000000111000111111000 \dots

. The exact symmetry shift in Figure 1a is the special case of the symmetric tiling

x = \dots a b a b a b a \dots

.

Finally, Figure 1d gives an example of a general nonnull pattern that is not an exact, partial, or hidden symmetry. This is the well-known Even Shift [31,33]—the set of binary strings in which only even blocks of 1s bounded by 0s are allowed. This

\dots 01^{2 n} 0 \dots

pattern extends to arbitrary lengths, despite being specified by only two internal states. While there are no states in the presentation

P (X)

that always return to themselves after

p \neq 1

transitions, there is still predictive regularity. In particular, if a 1 is seen after a 0, it is guaranteed that the next symbol will be a 1. This is specified by

ξ_{B}

having only one allowed transition to

ξ_{A}

on a 1. Appendix A.2 discusses this example in more detail, along with its semigroup and three semiautomaton presentations.

Before moving to probabilistic patterns represented by sofic measures, we note that Krohn–Rhodes theory [40,41] was the first to connect finite automata with a semigroup algebra. Moreover, it showed that finite semigroups and their corresponding automata naturally decompose into simpler components, including finite simple groups. This is yet another perspective showing finite automata and their semigroup algebra capture patterns as generalized symmetries. Further exploration of the connection between Krohn–Rhodes theory and the perspective developed here is left for future work. One important difference to note is that their approach did not address statistical or noisy patterns, as the following now does.

2.6. Statistical Patterns Supported on Sofic Shifts

The exposition on sofic shift patterns did not invoke probabilities over symbols, words, or strings. Shift spaces are not concerned with the probability of a word occurring, only whether a word can possibly occur or not. This is why we referred to sofic shifts as topological patterns. We just saw that exact symmetries are given as sofic shifts and so are topological patterns. Recall that our key motivation for showing this was to argue for sofic shifts as a mathematical formalism that captures a notion of (topological) pattern that greatly expands exact symmetry to generalized symmetry. However, as we now describe, we can generalize further to formalize statistical patterns that are supported on sofic shifts. Doing so provides a direct link with the more familiar statistical measures of order and organization used in statistical mechanics, such as correlation functions.

Our goal is to build a probability space on top of a shift space. The key property of this probability space is that words are assigned positive probability if and only if they are allowed in the shift space. This is accomplished through the use of cylinder sets as the sigma algebra on a shift space

X

[42]. For a shift space

X

and a length-n word w, the cylinder set

C_{i} (w)

is defined as:

\begin{matrix} C_{i} (w) : = {x \in X : x [i : i + n - 1] = w} . \end{matrix}

Naturally then, a probability measure

μ

is assigned such that the probability of a word w occurring is given as:

\begin{matrix} Pr (w) = μ (C_{i} (w)) . \end{matrix}

By definition,

Pr (w) = 0

if

w \in F

, as the associated cylinder sets will be empty for forbidden words.

Such a probability measure defines a stationary stochastic process over the shift space

X

if it is shift-invariant, such that the probability of a word is independent of the index i in

C_{i} (w)

, and each word satisfies prefix and suffix marginalization:

\begin{matrix} μ (w) & = \sum_{{a : a w \in W (X)}} μ (a w) \end{matrix}

and:

\begin{matrix} μ (w) & = \sum_{{a : w a \in W (X)}} μ (w a) . \end{matrix}

This is the Kolmogorov extension theorem that guarantees the finite-dimensional word distributions consistently define a stochastic process [43]. We only consider shift-invariant measures and, so without loss of generality, we simply use

C_{0} (ω)

for the cylinder sets.

Crucially, the semigroup algebra and canonical machine presentation machinery for topological patterns have natural generalizations to stochastic patterns, as we now describe.

Following Ref. [44] in the context of shift spaces, a (free) stochastic semigroup is a function F defined on the free semigroup S, with identity element

η

(the empty symbol), that satisfies the following properties ([42], Definition 4.29):

$F (η) = 1$ ,
$F (s) \geq 0$ for each $s \in S$ and $F (a) \geq 0$ for each $a \in A$ , and
$\sum_{a_{i} \in A} F (a_{i} s) = \sum_{a_{i} \in A} F (s a_{i})$ for each $s \in S$ .

For a sofic shift

X

, we define a stochastic semigroup on

X

using the semigroup G defined above, with absorbing element e corresponding to forbidden words. We simply set

F (e) = 0

. Then, a shift-invariant measure

μ

satisfying Kolmogorov extension on a sofic shift

X

with semigroup G forms a stochastic semigroup as follows. For all elements of the free semigroup

a_{0} a_{1} \dots a_{k}

—i.e., for every word

w = a_{0} a_{1} \dots a_{k}

—define F such that

F (η) = 1

,

F (w) = 0

if and only if

w \in F

(equivalently,

a_{0} a_{1} \dots a_{k} = e

in the semigroup G), and otherwise

F (w) = μ (C_{0} (w))

. Such a measure

μ

is called a sofic measure [42].

In this way, sofic measures allow for statistical structure on top of a sofic shift

X

, while maintaining an algebraic structure related to

X

’s semigroup algebra. More importantly, there is a canonical machine presentation associated with sofic measures, analogous to the canonical machine presentation of sofic shifts. As in the topological case for sofic shifts, the canonical machine presentation of sofic measures provides the mathematical formulation of statistical patterns.

Recall that the future cover semiautomaton—the canonical topological machine presentation—is defined from Equation (1)’s future equivalence relation. The canonical stochastic machine presentation is defined through a stochastic generalization of future equivalence, called predictive or causal equivalence, defined on semi-infinite words. Each index i partitions a sequence into a semi-infinite past

{\overset{\leftarrow}{x}}_{i} = {x_{j}}, j \leq i

, and semi-infinite future

{\vec{x}}_{i} = {x_{k}}, k > i

. In the topological case, two pasts are considered future-equivalent if they have the same future—the same set of futures that follow. In the stochastic case, two pasts are predictively or causally equivalent if they have the same distribution

Pr (\vec{X} | \overset{\leftarrow}{X})

over futures conditioned on the past:

\begin{matrix} {\overset{\leftarrow}{x}}_{i} \sim_{ϵ} {\overset{\leftarrow}{x}}_{j} \Leftrightarrow Pr (\vec{X} | \overset{\leftarrow}{X} = {\overset{\leftarrow}{x}}_{i}) = Pr (\vec{X} | \overset{\leftarrow}{X} = {\overset{\leftarrow}{x}}_{j}) . \end{matrix}

(2)

Just as the future equivalence relation

\sim_{F}

defines the unique minimal semiautomaton presenting a sofic shift, the causal equivalence relation

\sim_{ϵ}

defines the unique minimal hidden Markov chain (HMC) that presents a sofic measure and its stationary stochastic process [22].

Speaking simply, a hidden Markov chain

(Ξ, A, T)

is a semiautomaton whose deterministic symbol-labeled transitions

M = {M_{0}, M_{1}, \dots, M_{(n - 1)}}

are replaced by symbol-labeled transition probabilities

T = {T^{0}, T^{1}, \dots, T^{(n - 1)}}

, where

T_{ξ, ξ^{'}}^{a}

is the probability of transitioning from state

ξ

to

ξ^{'}

on the symbol

a \in A

.

Paralleling the topological setting, the canonical stochastic machine presentation is a hidden Markov chain whose internal states

Ξ

—the predictive or causal states—are the equivalence classes of Equation (2)’s causal equivalence relation. The symbol-labeled transitions are then defined through the one-step conditional distributions

Pr (X^{1} = a | \overset{\leftarrow}{X} = {\overset{\leftarrow}{x}}_{i})

. For a given causal state

ξ

, we write

Pr (\vec{X} | \overset{\leftarrow}{X} \in ξ)

since by definition each past

{\overset{\leftarrow}{x}}_{i}

in the equivalence class

ξ = {[{\overset{\leftarrow}{x}}_{i}]}_{ϵ}

has the same predictive distribution.

The transition probability

T_{ξ, ξ^{'}}^{a}

is then given as

Pr (X^{1} = a | \overset{\leftarrow}{X} = {\overset{\leftarrow}{x}}_{i} \in ξ)

, with

ξ^{'} = {[{\overset{\leftarrow}{x}}_{i} a]}_{ϵ}

, where

{\overset{\leftarrow}{x}}_{i} a

is the new past given by concatenating the observed symbol a onto the current past

{\overset{\leftarrow}{x}}_{i}

. This follows from unifilarity [22]: in the stochastic setting for each internal state

ξ \in Ξ

and symbol

a \in A

there is at most one internal state

ξ^{'}

such that

T_{ξ, ξ^{'}}^{a} > 0

. It then follows that

{[{\overset{\leftarrow}{x}}_{i} a]}_{ϵ}

is the same for all

{\overset{\leftarrow}{x}}_{i} \in ξ

.

Historically, the canonical stochastic machine presentation

(Ξ, A, T)

is known as the ϵ-machine [22] of the associated stochastic process. As described shortly, the

ϵ

-machine is a generator of its associated statistical field theory. It generates all words with their corresponding probabilities. Similar to its topological counterpart, unifilarity additionally elevates the

ϵ

-machine to a predictive presentation. Prediction in the stochastic setting means identifying the predictive distribution associated with a given past. Additionally, by definition, an

ϵ

-machine’s causal states carry unique predictive distributions.

Summing over all symbol-labeled transitions produces a Markov transition operator over the internal causal states:

T_{ϵ} = \sum_{a \in A} T^{a}

. This operator evolves probability distributions over the causal states, regardless of the symbols involved, and specifies an order-1 Markov process over the states [22]. The left eigenvector of

T_{ϵ}

with unit eigenvalue provides the stationary distribution over causal states:

π T_{ϵ} = π

. For ergodic systems, as we consider here,

π

is unique.

We emphasize that the causal states identify a hidden, internal Markov process underlying the non-Markovian process over the symbols in

A

, specified by a sofic measure on a sofic shift. This inverts the abstract definition of sofic measures, given as factor maps on Markov processes [42]. There, a shift-invariant measure obeying Kolmogorov extension on a subshift of finite type produces a Markov chain of finite order and sofic shifts are abstractly defined as factor maps of subshifts of finite type. Hidden Markov chains are then given as the pushforward of the Markov measure along the factor map. Here, in contrast, we start with a statistical field theory supported on a sofic shift. The causal equivalence relation then identifies the underlying causal states and the Markov process defined over them.

With our given assumption of a stationary ergodic process over symbols we can use the stationary distribution

π

and the symbol-labeled transition operators to directly extract the word probabilities

Pr (w) = μ (C_{0} (w))

from the

ϵ

-machine presentation. First, single-symbol probabilities are given as:

\begin{matrix} Pr (a) = \sum_{ξ \in Ξ} π (ξ) \sum_{ξ^{'} \in Ξ} T_{ξ ξ^{'}}^{a} . \end{matrix}

(3)

where

\sum_{ξ^{'}} T_{ξ ξ^{'}}^{a}

gives the probability of observing the symbol

a \in A

, conditioned on being in causal state

ξ

, and this is summed over all states in

Ξ

weighted by their stationary probabilities

π

. We write this compactly as:

\begin{matrix} Pr (a) = 〈 π | T^{a} | 1 〉, \end{matrix}

(4)

where

〈 π |

indicates

π

as a row vector and

| 1 〉

is the column vector of all 1s.

The probability of a word

w = a_{0} a_{1} \dots a_{k}

is then:

\begin{matrix} Pr (a_{0} a_{1} \dots a_{k}) & = μ (C_{0} (a_{0} a_{1} \dots a_{k})) \\ = 〈 π | T^{a_{0}} T^{a_{1}} \dots T^{a_{k}} | 1 〉 . \end{matrix}

(5)

Recall that in the topological case, the semigroup algebra of the canonical machine presentation is given through composition of the mappings

M = {M_{a}}

. Now, we see the same semigroup algebraic structure in the products of the symbol-labeled transition matrices

T = {T^{a}}

. In fact, the topological structure, the set of mappings

M

, is recovered from the statistical structure, the set of transitions

T

, by setting all nonzero elements of each

T^{a} \in T

to unity and then applying future equivalence.

It must be emphasized that this last step, applying future equivalence, is essential. There may be distinct

ϵ

-machines—presentations of statistical patterns—that are supported on the same sofic shift—topological pattern. Said another way, statistical patterns signify distinct structure supported on topological patterns. They are not merely “adding probabilities onto” topological patterns. Formally, the symbol-labeled transition matrices

T

can represent a different semigroup than that represented by their topological counterparts

M

on which they are supported. Appendix B illustrates this distinction.

Let us briefly turn to mention the quantitative benefits of having these presentations. Using the word probabilities as just described, in addition to the underlying pattern of the sofic shift that supports the ensemble, a wide range of statistical properties of the sequence ensembles, such as correlation functions, power spectral densities, and informational properties [45,46] can be directly determined via the

ϵ

-machine. For example, the Shannon entropy rate

h_{μ}

of an ensemble measures its degree of randomness and can be calculated as:

\begin{matrix} h_{μ} & = lim_{ℓ \to \infty} \frac{H [X_{0 : ℓ}]}{ℓ} \\ = lim_{ℓ \to \infty} H [X_{0} | X_{- ℓ : 0}] \\ = - \sum_{ξ \in Ξ} π (ξ) \sum_{a \in A} \sum_{ξ^{'} \in Ξ} T_{ξ ξ^{'}}^{(a)} {log}_{2} T_{ξ ξ^{'}}^{(a)}, \end{matrix}

where

X_{0 : ℓ}

for

ℓ > 0

is the random variable for the subsequence of symbols

x_{0} x_{1} \dots x_{ℓ - 1}

. Moreover, the Shannon information in the causal states measures a process’ historical memory [47]. This is the amount of memory about the past that must be stored in order to optimally predict the future of the process.

3. Patterns in Spacetime

The preceding demonstrated that sofic shifts and their semiautomaton presentations provide a formulation of patterns in discrete one-dimensional systems. Additionally, it showed how they generalize to stochastic ensembles with associated information-theoretic measures. With this, one can argue that the theory of one-dimensional patterns in discrete stationary processes, augmented with the cited extensions, is largely complete. Now, we turn to the question of how to capture patterns in higher dimensions. While patterns in this new setting are amenable to similar analysis, there are key differences, new phenomena, and open problems.

Consider now the time evolution of symbols

A

on the sites of a spatial lattice

L

. A spacetime field

x_{}^{} \in A^{L \otimes Z}

is a time series

x_{0}, x_{1}, \dots

of spatial configurations

x_{t} \in A^{L}

. With a d-dimensional lattice

L = Z^{d}

a spacetime field

x_{}^{}

is an element of a spacetime shift space

A^{Z^{d + 1}}

[19,21]. In what follows, upper indices on field values are spatial coordinates (e.g., at time t site i has value

x^{i}

) and lower indices are time coordinates (e.g.,

x_{t}

).

Multidimensional shift spaces are notoriously difficult to study, with many simple properties being uncomputable [21,48]. Similarly, while sofic shifts and their canonical machine presentations provide a mathematical formalism for patterns as generalized symmetries in one dimension, generalizing to higher dimensions is not straightforward. Significantly, there is not a unique generalization for finite-state machines and regular languages in higher dimensions [20]. If there were a unique generalization, we could use the semiautomata special case as the mathematical representation of high-dimensional patterns.

As we argued, models that create a compressed representation of a system’s behavior must do so by harnessing patterns—predictive regularities—in the system. Fully-discrete one-dimensional systems are ideal as there is a unique minimal presentation of the system and, in this case, that predictive presentation is the pattern. The situation is more complicated in higher dimensions. A conflict arises between useful generalized spacetime symmetries and predictive presentations that faithfully generate fields in their spacetime shift spaces.

We now outline the local spacetime generalization of predictive equivalence for constructing presentations of spacetime patterns. As with finite-state machines and regular languages, degeneracy is broken when moving to higher dimensions as the spacetime generalization is not unique. In particular, we demonstrate that the shape of local “futures” determines the algebraic properties of the resulting local presentation.

3.1. Local Spacetime Presentations

Since an evolving spacetime field is a time series of spatial lattice configurations, it can be interpreted as a one-dimensional shift space over an exponentially-large alphabet (of lattice configurations). While formally well defined, however, this perspective is an unwieldy basis for a mathematical formulation of patterns in spacetime. First, for space and time translation symmetries, we typically consider the idealized case of infinitely-large spatial lattices. Thus, one must work with shift spaces over an infinitely-large alphabet. Second, capturing patterns within the spatial configurations themselves means not treating an entire lattice configuration simply as a single symbol, as done with the one-dimensional framing. There is internal organization within the spatial configurations that is dynamically and structurally important.

Therefore, we take a local approach to generalizing machine presentations of spacetime shift spaces [17,49]. This is implemented by identifying equivalence classes over local pasts that have the same future set of local futures in spacetime. The latter can be either topological (sets) or statistical (predictive distributions).

Motivated by the causal restrictions of local interactions, past lightcones in spacetime are the natural choice for local pasts when building local presentations. Formally, the past lightcone

L^{-}

of a site

(r, t)

in spacetime is the set of all field values

x_{t^{'}}^{r^{'}}

at previous times

(t^{'} \leq t)

that could influence

x_{t}^{r}

through local interactions:

\begin{matrix} L^{-} (r, t) \equiv \{x_{t^{'}}^{r^{'}} : t^{'} \leq t and | r^{'} - r | \leq c (t - t^{'})\}, \end{matrix}

(6)

where c is the finite speed of information propagation in the system. We include the present field value

x_{t}^{r}

in

(r, t)

’s past lightcone, but not in its local future.

Due to the richness and complication of multidimensional shift spaces, the following explores only topological spacetime patterns. Paralleling the one-dimensional development, local presentations of topological spacetime patterns are defined through the local analog of the future equivalence relations:

\begin{matrix} L_{i}^{-} \sim_{F} L_{j}^{-} \Leftrightarrow F (L_{i}^{-}) = F (L_{j}^{-}), \end{matrix}

(7)

where

F (L_{i}^{-}) = {

Local futures co-occurring with

L_{i}^{-}}

. We define co-occurring local futures more precisely shortly, as there are alternatives.

Relation (7)’s equivalence classes determine the internal states for a local spacetime presentation. The generalization of symbol-labeled transitions between the internal states is constructed in terms of spatial

{[σ^{s} (x_{}^{})]}_{t}^{r} = x_{t}^{r + s}

and temporal

{[σ_{τ} (x_{}^{})]}_{t}^{r} = x_{t + τ}^{r}

shift operators. (Note that all space-time shift operators commute.) For a past lightcone

L^{-} (r, t)

at spacetime site

(r, t)

, let

σ_{τ}^{s} (L^{-} (r, t))

denote the action of the shift operator on all of the field values in

L^{-} (r, t)

. That is, an entire lightcone is shifted analogously to an individual spacetime site.

The right spatial transition fringe is defined as the set difference between

L^{-} (r, t)

and

σ^{1} (L^{-} (r, t))

. This is the generalization of the “symbol” emitted during a rightwards move in one spatial dimension. Similarly, the left spatial transition fringe is defined as the set difference between

L^{-} (r, t)

and

σ^{- 1} (L^{-} (r, t))

, and the forward temporal transition fringe is the set difference between

L^{-} (r, t)

and

σ_{1} (L^{-} (r, t))

(see Figure 4 in Ref. [49]). For simplicity, we consider spacetime fields with one spatial dimension, but the generalization to higher dimensions is straightforward.

Time and space transition fringes form the appropriate alphabet to define local spacetime presentations

(Ξ, A, M)

, with

Ξ

the set of past lightcone equivalence classes,

A

the set of space and time transition fringes, and

M

the fringe-labeled state transitions [49]. Briefly, a site

(r, t)

in spacetime has an associated internal state

ξ (r, t)

that is the equivalence class of the past lightcone

L^{-} (r, t)

. Consider the neighboring site

(r + 1, t)

, which similarly has an associated internal state

ξ (r + 1, t) = {[L^{-} (r + 1, t)]}_{F}

. The right transition fringe provides the missing information to construct

L^{-} (r + 1, t)

from

L^{-} (r, t)

. Therefore,

ξ (r, t)

plus a right transition fringe uniquely determines

ξ (r + 1, t)

. That is, such local presentations are unifilar, and therefore have the requisite structure of predictive presentations. Recall that predictive models are also generative.

Generating words is rather straightforward for machine presentations in one dimension, markedly less so for local spacetime presentations. The fringe-labeled transitions establish local spacetime presentations

(Ξ, A, M)

as local generative spacetime models. Local spacetime patches, local “words”, are generated via concatenation of transition fringe “symbols”. Denote concatenations of right spatial fringes generally as

V_{r}

, to signify the particular shape of the resulting spacetime patches. Similarly, let

V_{l}

be the spacetime patches resulting from concatenations of left spatial fringes and

V_{t}

the patches from forward temporal fringes.

Note that spacetime patches are more than merely collections of values from spacetime sites. The configuration and space-time relation among the values matter. This is why we refer to their shape. The

V_{r}

patches are a distinct form of local spacetime words apart from

V_{l}

and

V_{t}

. Only patches of the same shape may be concatenated together to extend patches.

Let

V

be the spacetime patches resulting from the union of forward, left, and right transition fringes. Let depth denote the number of single-fringe symbols concatenated together in a particular spacetime patch word. See Figure 2.

Concatenating fringes into local spacetime patches has the same semigroup algebra structure as in the one-dimensional setting. Similarly, the semigroup algebra is captured by the local presentations through the action of the fringe-labeled transitions

M

, analogous to the one-dimensional presentations

P (X)

. We now examine possible choices of local futures and the algebraic properties of the induced local presentations.

3.2. The Shape of Local Futures

Recall that Relation (7) did not specify local futures when defining local presentations. The future sets

F (L^{-})

of past lightcones there were intentionally left ambiguous to allow for possible alternatives—alternatives that arise to address several subtleties of spacetime shifts.

Prior work assumed that the natural choice for a local future is a future lightcone

L^{+}

[17,18,49]. The latter is defined as all field values at subsequent times that could possibly be influenced from the given spacetime site

x_{t}^{r}

through the local interactions:

\begin{matrix} L^{+} (r, t) \equiv \{x_{t^{'}}^{r^{'}} : t^{'} > t and | r^{'} - r | \leq c (t^{'} - t)\} . \end{matrix}

(8)

Co-occurring past

(L^{-})

and future

(L^{+})

lightcones at spacetime point

(r_{0}, t_{0})

are depicted in Figure 3 for a

1 + 1

dimensional spacetime field with

c = 1

. Local presentations constructed as equivalence classes of past lightcones that have the same future set of future lightcones are known as local causal states.

Interestingly, the benefit of local causal-state models derives from generalized symmetries [17,18]. However, this is a spacetime symmetry distinct from the semigroup algebra of fringe concatenations. While we describe these spacetime symmetries in more detail below, we first demonstrate that such symmetries are at odds with the semigroup algebra of fringe concatenations for local causal states. That is, while employing lightcones as local futures allows one to discover useful spacetime symmetries, the resulting local machine presentation

(Ξ, A, M)

is not a faithful generator of the underlying spacetime shift space. Even though all local presentations possess the unifilarity property of predictive models, the more basic generative ability falls short unless the appropriate local futures are used. We again emphasize that phenomena in higher dimensions are complicated—sometimes in counterintuitive ways, as the following demonstrates.

Elementary cellular automata (ECA)—see Appendix C—produce spacetime shift spaces with nontrivial patterns and structure in their spacetime fields; e.g., domains, particles, and particle interactions [17,50]. In addition, their one-dimensional fully-discrete spatial lattices are shift spaces and so provide a link between one-dimensional shifts and

1 + 1

dimensional shifts of their spacetime fields.

Following convention, denote the mapping from a past lightcone to its equivalence class or, equivalently, to its associated local (causal) state

ξ

, as

ϵ (L^{-}) = {[L^{-}]}_{F} = ξ

. Crucially, this provides a local pointwise mapping over any spacetime field

x_{}^{}

. Each site

(r, t)

has a unique past lightcone

L^{-}

that is then mapped to its local state via

ϵ (L^{-})

. We use this pointwise mapping to transform a spacetime field

x_{}^{}

to an associated local-state field

S_{}^{} = ϵ (x_{}^{})

that shares the same coordinate geometry as

x_{}^{}

. Each site

S_{r}^{t}

in the local state field

S_{}^{}

is the local state

S_{r}^{t} = ξ = ϵ (L^{-})

that is the image under the

ϵ

-map of the past lightcone

L^{-}

at the spacetime point

x_{t}^{r}

.

An example spacetime field

x_{}^{}

from ECA Rule 90 is shown in Figure 4 consisting of black (

x_{t}^{r} = 1

) and white (

x_{t}^{r} = 0

) squares. The associated local causal-state field

S_{}^{}

is shown as the overlaid colored letters, using lightcones as local futures. In the case of Rule 90, there is a single local causal state, labeled as

A

—all past lightcones are future-lightcone equivalent.

This is consistent with prior findings [17] that connect the local causal states with the canonical machine presentations of one-dimensional sofic shifts that represent invariant sets of spatial configurations for the ECA. The full 2-shift is invariant under Rule 90, meaning there are no forbidden words in the one-dimensional spatial configurations generated by Rule 90. All binary strings have pre-images under the global dynamic

Φ

of Rule 90. Recall that the canonical machine presentation of full shifts consists of a single internal state and, hence, we expect the single local causal-state for the spacetime fields produced by Rule 90.

Even though there are no forbidden words in Rule 90’s one-dimensional spatial configurations, there certainly are forbidden patches in its spacetime fields. In particular, note that the ECA’s local update rule

ϕ

corresponds to a spacetime patch in the shape of a depth-1 past lightcone—the present spacetime site together with its local neighborhood one time-step in the past. Therefore, any spacetime patch of the same shape that is inconsistent with local update rule

ϕ

is forbidden in the spacetime shift space of the ECA’s spacetime fields. For example, the following patch is allowed by Rule 90, since

ϕ_{90} (010) = 0

:

\begin{matrix} \begin{matrix} 0 & 1 & 0 \\ 0 \end{matrix} \end{matrix}

Therefore:

\begin{matrix} \begin{matrix} 0 & 1 & 0 \\ 1 \end{matrix} \end{matrix}

is a forbidden spacetime patch in Rule 90’s spacetime shift space.

From ECA Rule 90’s single local causal-state, it is easy to see that the local causal-state machine presentation is not a consistent spacetime generator. Since there is only one state,

A

, all fringe-labeled transitions occur from state

A

back to itself. Thus, all like-shaped fringe symbols that occur may be concatenated together to form spacetime patches. Figure 4 highlights three right-transition fringes. If they are concatenated in order of orange, green, magenta (left to right) they form the depth-3

V_{r}

patch:

\begin{matrix} \begin{matrix} 1 & 1 & 0 \\ 0 & 1 & 0 \\ 0 & 1 & 1 \end{matrix} \end{matrix}

This patch contains the spacetime word given above that is forbidden in Rule 90’s spacetime shift. In fact, the other word of the same shape in this patch is also forbidden by Rule 90:

\begin{matrix} \begin{matrix} 1 & 1 & 0 \\ 0 \end{matrix} \end{matrix},

since

ϕ_{90} (110) = 1

.

Concatenations of left and forward fringes produce, respectively,

V_{l}

and

V_{t}

spacetime patches that similarly contain words forbidden by Rule 90’s spacetime shift space, as shown algorithmically in a supplementary Jupyter Notebook [51]. Similar results for another ECA shift space, the domain of Rule 18 discussed in more detail below, are also given in a supplementary Jupyter Notebook [52]. Empirically, we found that local causal state presentations are generically not faithful generative models of their CA spacetime shift spaces.

Comparing Figure 3 with the

V_{i}

patch shapes in Figure 2, there is little to no overlap between the “futures” (lightcones in this case) used to define the equivalence classes of past lightcones and the spacetime patches generated from fringe-labeled transitions between the equivalence classes. Therefore, it is not surprising that local presentations created with future lightcones are not faithful generators of their spacetime shifts.

In one dimension, there is only one “future” that always follows from the past. When translating forward, what once was a future becomes part of the past. This is not necessarily the case for local pasts and futures in spacetime. The predictive ability of unifilar models follows from this succession of futures becoming pasts, when combined with future equivalence. As we now see, future equivalence without the successional relation between local pasts and futures yields models that, while unifilar, are not consistent spacetime generators.

Constructively, this insight points the way to creating local presentations that are faithful generators. As the future lightcones do not play a role in generating local spacetime patches, they should not be used to define past lightcone equivalence in Relation (7). Rather, we should use spacetime patch shapes—denote them

V_{i}

—that are to be generated as our notion of a local future for defining future equivalence in Relation (7). Local causal states are defined using the future sets with future lightcones

F (L_{i}^{-}) = {L^{+} co - occurring with L_{i}^{-}}

. We can alternatively define local machine presentations whose internal states are defined as equivalence classes from Relation (7) using

F (L_{i}^{-}) = {V_{i} co - occurring with L_{i}^{-}}

.

Indeed, as demonstrated computationally in the supplementary Jupyter Notebook Ref. [51], using

V_{r}

shapes with

F (L_{i}^{-}) = {V_{r} co - occurring with L_{i}^{-}}

yields a local machine presentation that is a faithful generator of

V_{r}

spacetime patches. However, this presentation is not a faithful generator of

V_{l}

or

V_{t}

spacetime patches. Similar to the use of future lightcones, local presentations can only generate faithful spacetime patches in the shape used to define

F (L^{-})

. Therefore, as also demonstrated in Ref. [51], using

V_{l}

to define

F (L^{-})

results in local presentations that faithfully generate

V_{l}

, but not

V_{r}

nor

V_{t}

. Similar for

V_{t}

.

This motivates the definition of

V

in Figure 2d as the union of

V_{r}

,

V_{l}

, and

V_{t}

. Local presentations defined using

F (L_{i}^{-}) = {V co - occurring with L_{i}^{-}}

are faithful generators of spacetime patches in all directions emanating from a seed past lightcone. As the resulting presentations are also unifilar, additionally they are consistent predictive models of local spacetime patches. These

V

presentations can thus be seen as the most natural generalization of the predictive canonical machine presentations

P (X)

for one-dimensional sofic shifts. Moreover, they possess the same semigroup algebra of “symbol” (fringe) concatenation that allows the generation and prediction of arbitrarily-long “words”—arbitrarily large spacetime patches.

It is important also to note that the spacetime fields considered here are generated by deterministic cellular automata. As pointed out above, each site in spacetime is fully determined by its depth-1 past lightcone. From Figure 2d, we see that

L^{-} \cup V

is an enlarged past lightcone. Moreover, the depth-1 past lightcone for each site in

V

is contained within

L^{-} \cup V

. Therefore, the internal states of the

V

presentations are sets of past lightcones that make the same prediction over spacetime regions that contain the full predictive information necessary to make such predictions. This is not the case for local causal states that make predictions over future lightcones, as

L^{-} \cup L^{+}

regions do not contain the depth-1 past lightcones required to make local spacetime patch predictions.

That said, as the following now details, local causal-state presentations that use future lightcones to define

F (L^{-})

possess interesting and useful generalized spacetime symmetries that are lost when using

V

presentations.

3.3. Generalized Spacetime Symmetries

Recall from above that machine transitions in one-dimension determine the structural relations among internal states and that they are driven by the shift operator. We just saw that the analogous fringe-labeled transition structure in spacetime is similarly related to spacetime shift operators through the definition of the fringes. However, the spacetime shift operators provide structural algebraic relations among local states independent of fringe symbols and their concatenated spacetime words. This is due to the local pointwise

ϵ

-map and the resulting shared coordinate geometry between spacetime fields

x_{}^{}

and corresponding local state fields

S_{}^{} = ϵ (x_{}^{})

.

To see this, consider two space-adjacent sites in a spacetime field,

x_{t}^{r}

and

x_{t}^{r + 1} = σ^{1} x_{t}^{r}

. Under the

ϵ

-map, this produces

ξ_{i} = S_{t}^{r}

and

ξ_{j} = S_{t}^{r + 1} = σ^{1} ξ_{i}

. As past lightcones are defined solely in terms of spacetime distances, they are equivariant under spacetime isometries. In the present setting, these are translations.

For example, if a spacetime field

x_{}^{}

has exact time and space translation symmetries

σ_{τ} x_{}^{} = x_{}^{}

and

σ^{s} x_{}^{} = x_{}^{}

, for some s and

τ

, then the corresponding local state field

S_{}^{} = ϵ (x_{}^{})

shares these symmetries:

σ_{τ} S_{}^{} = S_{}^{}

and

σ^{s} S_{}^{} = S_{}^{}

. This is because the spacetime shift operators act equivalently on lightcones, so that

σ^{s} L_{i}^{-} = L_{i}^{-}

and

σ_{τ} L_{i}^{-} = L_{i}^{-}

. Therefore,

ϵ (σ^{s} L_{i}^{-}) = ϵ (L_{i}^{-})

and

ϵ (σ_{τ} L_{i}^{-}) = ϵ (L_{i}^{-})

. Note that this argument is independent of which local future shape is used, as they all define a local

ϵ

-map on past lightcones.

When future lightcones specifically are used as local futures, we previously showed (i) there are spacetime fields

x_{}^{}

that do not have translation symmetries, but (ii) the associated local causal state field

S_{}^{} = ϵ (x_{}^{})

does [17]. These are the spacetime generalizations of partial and hidden symmetries; cf. Figure 1.

For the cellular automata examples in Ref. [17] and shown here in Figure 5, generalized symmetries—exact, partial, and hidden—in the spacetime fields are generated by the evolution of invariant one-dimensional sofic shifts (spatial configurations) under the CA dynamic. Such spacetime regions are known as domains [17,50]. Interestingly, exact symmetry domains are generated from the evolution of exact symmetry sofic shifts. Moreover, stochastic partial symmetry domains are generated from stochastic partial symmetry sofic shifts and hidden symmetry domains from hidden symmetry shifts. Examples of each of these cases are shown in Figure 5 as the generalizations of the one-dimensional cases in Figure 1. Appendix D displays the presentations

P (X)

for each sofic shift

X

used to generate the fields in Figure 5.

Exact symmetries are straightforward, as just described. There is a finite s and

τ

such that for every site

(r, t)

in spacetime we have that

σ_{τ} x_{t}^{r} = x_{t}^{r}

and

σ^{s} x_{t}^{r} = x_{t}^{r}

. Due to the shared coordinate geometry and isometry equivariance of past lightcones, this applies for the local causal state field as well:

σ_{τ} S_{t}^{r} = S_{t}^{r}

and

σ^{s} S_{t}^{r} = S_{t}^{r}

. The exact symmetry field shown in Figure 5a is a sample of the domain of ECA Rule 54. As can be seen, both the spacetime fields and corresponding local state fields have a period-4 translation symmetry in both time and space.

For partial symmetries, some, but not all, of the spacetime coordinates return to themselves after fixed translation, just as in one dimension. For example, there are some sites

(r^{*}, t^{*})

such that

x_{t^{*}}^{r^{*} + s} = σ^{s} x_{t^{*}}^{r^{*}} = x_{t^{*}}^{r^{*}}

, but this does not hold for all

(r, t)

. As in one-dimension, a special case of partial symmetries are stochastic symmetries that become symmetric after a wildcard substitution. For example, consider a 1+1 dimensional spacetime field with

A = {0, 1}

such that it has a checkerboard layout with 0s on the black squares and wildcards (either 0 or 1) on the white squares. The partial (stochastic) symmetry example shown in Figure 5b is ECA Rule 18’s domain, which is a strict subset (subshift) of the 0-wildcard checkerboard shift space. We examine these two spacetime shift spaces in more detail below.

Unlike in one-dimension, hidden symmetries in higher dimensions correspond to exact symmetries in the local causal-state field for spacetime fields that themselves have no symmetries, exact or partial. The hidden symmetry spacetime fields in Figure 5c and Figure 6c are samples of ECA Rule 22’s domain and their structure is manifestly harder to detect from visual inspection.

What is most important to note here is that the observable field

x_{}^{}

does not have any space or time translation symmetries, exact or partial. The corresponding local causal-state field

S_{}^{} = ϵ (x_{}^{})

(shown in Figure 5), though, does have symmetries that are period-4 in both time and space. Essentially, there are motifs that appear in

x_{}^{}

—such as, the black “triangles” that occur with a local period-4 structure (e.g., 1110 in space and 1100 in time). However, in contrast to a stochastic symmetry field, there is no global symmetry that captures how these stochastic motifs occur. The global hidden symmetry is more complicated than can be revealed by simple wildcard substitutions. It is uncovered, though, by the local causal-state field.

A general spacetime pattern then is one for which the local causal-state field does not exhibit spacetime symmetries. There is still an algebraic relation among the local states from the spacetime shift operators, but they do not correspond to spacetime symmetries.

Note that the example in Figure 5d has regions that are locally symmetric in both

x_{}^{}

and

S_{}^{}

. However, this symmetry is globally broken by localized defects or coherent structures [17]. Figure 5d is produced from from ECA Rule 54 evolving a random initial configuration. The local symmetry regions are instances Rule 54’s domain—the exact symmetry field shown in Figure 5a.

We emphasize that, with the exception of exact symmetries, these generalized spacetime symmetries are empirically observed only with local causal-state presentations that employ future lightcone shapes. For example, the corresponding local state fields of presentations using

V

local futures are shown in Figure 6 with the same spacetime fields as Figure 5. As expected, the exact symmetry case also has space-time translation symmetries in the local state field in (a). However, there are no space-time symmetries present in the local state fields for partial (b) or hidden (c) symmetry spacetime fields when

V

local futures are employed.

Altogether, these observations portray a somewhat unsatisfactory scenario. On the one hand, lightcone local futures and the resulting local causal-state models produce insightful and predictive generalized symmetries in spacetime. Moreover, these clearly connect to the one-dimensional sofic shifts of CA spatial configurations, as demonstrated in Ref. [17]. However, we just demonstrated that local causal-state models are unfaithful generators of their underlying CA shift spaces.

On the other hand, we also showed how to create faithful local generators using

V

future shapes. These presentations thus appear to be the most natural local generalization of the canonical machine presentations in one dimension. In this case, though, the useful spacetime generalized symmetries are lost with these generative models.

Let us now examine in more detail the trade-offs between these two local presentations by diving deeper into the case of stochastic symmetries.

4. Case Study: Stochastic Symmetries

The partial (stochastic) symmetry spacetime fields in Figure 5b and Figure 6b are generated from ECA Rule 18 evolving a string from its domain’s invariant sofic shift. Appendix D shows that these are points in the stochastic symmetry sofic shift with the form 0-

Σ

, where

Σ

is a wildcard that can be either 0 or 1.

It is easy to see from Rule 18’s lookup table

ϕ_{18}

that the wildcard locations oscillate each time step between even- and odd-indexed lattice sites. Thus, the spacetime field can be interpreted as a checkerboard pattern of fixed 0s and wildcards

Σ = {0, 1}

; giving the checkerboard pattern seen in the local causal-state field shown in Figure 5b. In this way, the two local causal states can be interpreted as a “fixed” 0 state and a wildcard state. The local states of the

V

machine presentation, though, do not occur in a checkerboard pattern in Figure 6b. So, they clearly cannot be assigned such semantic labels.

While the fixed-0 and wildcard semantic labels are appealing for the two local causal states, they are also misleading. For a given spatial configuration, the fixed-0 and wildcard semantics are appropriate, as it describes the (invariant) one-dimensional sofic shift of the spatial configurations. However, it is again easy to see from the Rule 18 lookup table that the wildcard semantics can no longer be assigned to the full spacetime field. Specifically, in spacetime the local update rule

ϕ_{18}

forbids certain spacetime patches of the form:

\begin{matrix} \begin{matrix} Σ & 0 & Σ \\ Σ \end{matrix} \end{matrix}

Therefore, the spacetime shift space of Rule 18’s domain is a proper subset (subshift) of the 0-

Σ

checkerboard shift space for which all realizations of the above spacetime patch are allowed.

Figure 7 demonstrates that both the local causal states and the

V

machine presentation local states reveal a checkerboard symmetry for spacetime fields in the 0-

Σ

shift space. In this case, both types of local states certainly do carry the semantics of fixed-0 and wildcard

Σ

in spacetime.

What does this say then about Rule 18’s domain in the context of the alternative local-state presentations in Figure 5b and Figure 6b?

First, it is interesting and not entirely clear why the choice of lightcone local futures produces local causal states that carry the strictly-spatial semantics of fixed-0 and wildcard

Σ

, especially when we know the wildcard semantics do not hold in spacetime. Whatever the reason, it is key to interpreting spacetime patterns through the local causal-state fields, including identifying coherent structures as locally-broken generalized symmetries in spacetime [17].

Second, this implies that the

V

machine presentation states are a more nuanced representation of the spacetime pattern of Rule 18’s domain. From above, we know the non-fixed-0 sites cannot be strictly interpreted as wildcards in spacetime. However, we can interpret these as contextually-constrained wildcards. From the 0-

Σ

patch above, the outcome of the bottom

Σ

is constrained by

ϕ_{18}

and the outcomes of the preceding

Σ

s. Propagating these constraints through space and time clearly becomes complicated very quickly. However, this is a more appropriate semantic interpretation of Rule 18’s domain. So, this seems to be to what the

V

machine states correspond. As shown in a supplementary Jupyter Notebook [53], the local state field displayed in Figure 6b is reconstructed from past lightcones of depth-8 and

V

s of depth-3. This produces 767 local

V

states to (approximately) capture the spacetime pattern of Rule 18’s domain. This is in contrast to the three local causal states reconstructed with the same past and future depths. (For finite-depth past lightcones, there is a third “indeterminate” state for the all-0 past lightcone.)

Note that the fixed-0 semantics still holds in spacetime and that this is not captured by the

V

presentation states. Given that

V

machine presentations are constructed to be faithful local generators of their spacetime shift spaces, it seems there is a similar spacetime contextuality to the fixed-0 sites that is necessary for faithful local generation. The spacetime contextuality present in the

V

presentation states is absent in the local causal states. Additionally, it is the latter that reveals the spacetime symmetries observed in Figure 5.

Taken all together, the spacetime patterns of partial and hidden symmetry ECA spacetime shift spaces are exceedingly complicated. Local causal-state presentations constructed from future lightcones and the

V

machine presentations capture different aspects of these patterns. One the one hand,

V

machine presentations are faithful generators and predictors of the spacetime patterns and so capture the patterns in a more direct connection with the one-dimensional case. However, local causal states capture generalized spacetime symmetries that, in the case of ECA domain shift spaces, closely connect with the (evolution of) the one-dimensional shift spaces of the invariant spatial configurations that, in turn, possess the corresponding generalized symmetries.

5. Conclusions

There is a growing body of results that use machine presentations defined from future or predictive equivalence to discover inherent, often hidden, pattern and structure in natural and engineered systems [47]. The first half of the preceding development synthesized the arguments for sofic shifts and their machine presentations as mathematical formulations of pattern and structure. In particular, the manner in which they generalize the perfect regularity of exact symmetries and the associated group algebra has been rigorously clarified. It also connected to recent results on sofic measures and their relation to stochastic

ϵ

-machine presentations of statistical patterns supported on sofic shifts.

The development’s second half overviewed the local approach to spacetime machine presentations. We showed that the standard local causal-state approach that uses lightcones as local futures reveals useful generalized spacetime symmetries. However, this comes at a cost: it does not lead to faithful generative models of the underlying shift space. This motivated introducing an alternative local presentation model using spacetime

V

shapes as local futures. We showed that these presentations are faithful generative models and that they do not possess the same generalized spacetime symmetries of the local causal states. The seeming mutual-exclusion of multiple local presentations is emblematic of the difficulty of high-dimensional shift spaces. The novel constructions given here—in particular the generative local presentations using

V

futures—may provide new paths of inquiry for investigating the organization of these rich and challenging spaces.

One advantage of the nongenerative approach using local predictive equivalence over future lightcones is that it does not rely on a finite alphabet for labeled transitions to provide the algebraic structure among local causal states. Thus, local causal states are well-defined and can be algorithmically approximated for continuum field theories. For example, they have been used to extract coherent structures in complex fluid flows [18]. The results presented here provide a theoretical underpinning and a “physics of organization” behind these unsupervised physics-informed machine learning algorithms.

Author Contributions

Conceptualization, A.R. and J.P.C.; methodology, A.R. and J.P.C.; software, A.R.; formal analysis, A.R.; writing, A.R. and J.P.C. All authors have read and agreed to the published version of the manuscript.

Funding

AR acknowledges the support of the U.S. Department of Energy through the LANL/LDRD Program and the Center for Nonlinear Studies. This material is based upon work supported by, or in part by, FQXi Grant number FQXi-RFP-IPW-1902 and U.S. Army Research Laboratory and the U.S. Army Research Office under grants W911NF-21-1-0048 and W911NF-18-1-0028.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Supporting Python code and supplementary Jupyter Notebooks can be found at https://github.com/adamrupe/ca_patterns (accessed on 30 June 2022).

Acknowledgments

The authors thanks Mikhael Semaan and Nicolas Brodu for helpful comments and discussions. The authors also thank the Telluride Science Research Center for hospitality during visits and the participants of the Information Engines Workshops there. JPC acknowledges the kind hospitality of the Santa Fe Institute, Institute for Advanced Study at the University of Amsterdam, and California Institute of Technology for their hospitality during visits.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Examples and Constructions

To clarify the development and build intuition about patterns, as we use the term, the following works through several examples in detail. Specifically, we show explicitly how to construct a semigroup G for exact and generalized symmetries, how to construct a semiautomaton presenting G (and thus

X

), and how this general semiautomaton simplifies to the future cover, whose irreducible component is the minimal presentation

P (X)

.

Appendix A.1. Exact Symmetry Shifts

The simplest class of exact symmetry strings to characterize as sofic shifts are the k-clock shifts for which

A = {0, 1, \dots, (k - 1)}

and

b = 01 \dots (k - 1)

. For example, points in the 3-clock shift are of the form

\dots 012012012012 \dots

with

b = 012

. Intuitively, this is the simplest case since each

a \in A

fully specifies the period of the translation symmetry. Recall that this corresponds to the internal states of

P (X)

, which are the equivalence classes

{[\cdot]}_{F}

. For k-clock sequences each internal state (excluding the forbidden state), and thus equivalence class, is represented by a generator

a \in A

. This fully specifies

X

since there are no transient states. Each

a \in A

is synchronizing. In terms of the defining semigroup of

X

for k-clock shift, G is consists solely of

A

plus the absorbing element e. The 3-clock shift is given by

G = {0, 1, 2, e}

with

01 = 1

,

12 = 2

,

20 = 0

,

0^{2} = 1^{2} = 2^{2} = e

, and

02 = 10 = 21 = e

.

For more general exact symmetry shifts, there are additional elements in G beyond the generators

A \cup {e}

. Additionally, the future cover will have transient states; i.e., there are additional equivalence class

{[\cdot]}_{F}

beyond those in the minimal presentation

P (X)

. To illustrate, consider the shift

X

with

b = 001

and points of the form

\dots 001001001 \dots

.

A simple construction of a finite G for an exact symmetry shift

X

is as follows. Start by constructing the asymptotic recurrent component of the presenting semiautomaton—this is

P (X)

. We already showed the states of

P (X)

are the equivalence classes

{[b]}_{F}

,

{[b b_{0}]}_{F}

,

{[b b_{0} b_{1}]}_{F}

, and so on. These equivalence classes can generally be represented by the allowed words of length

p - 1

so that concatenation with a generator gives a (shift of) the tiling block b. This represents shifts of windows of length

p - 1

on points

x \in X

.

For our example with

b = 001

and so

p = 3

, these words are 00, 01, and 10. The word 11 is not included because it is forbidden in

X

, and so

11 = 1^{2} = e

in G. If our

p - 1

window is on 00 in

X

, then a unit shift reveals the generator 1 and the window now shows 01. This gives the production rule

00 \cdot 1 = 001 = 01

in G. In the semiautomaton presentation, there are states

ξ_{00} = {[00]}_{F}

and

ξ_{01} = {[01]}_{F}

with

M_{1} (ξ_{00}) = ξ_{01}

. If we again shift the window on 01, we reveal the generator 0 and the window now shows 10, giving the production rule

010 = 10

. To complete the cycle we have

100 = 00

, giving the finite closure for G. The rest of the elements in G can be filled in with the free semigroup: e.g.,

0 \cdot 1 = 01

. Therefore, we can give a finite G for our example as

G = {0, 1, e, 00, 01, 10}

with production rules

001 = 01, 010 = 10, 100 = 00

and

1^{2} = 0^{3} = 101 = e

. The semiautomaton presentation for this G is shown in Figure A1a. Teal colored states are transient, orange is the absorbing forbidden state

ξ_{e}

, and black states are the recurrent component (excluding

ξ_{e}

). Again, self-loops on

ξ_{e}

are omitted for visual clarity.

Figure A1. Semiautomaton presentations for straightforward semigroup construction (a) and its simplification under the future cover equivalence relation (b).

While this straightforward construction always produces a finite semigroup G for a given exact symmetry shift

X

, it is not necessarily a minimal semigroup. (Or, equivalently, not a minimal presenting semiautomaton.) The future cover equivalence relation exploits additional structure in

X

to give a minimal description and may thus reduce and simplify the straightforward G and its presenting automaton. The k-clock shifts are the extreme example, since we only need the generators as the elements of G since each

a \in A

is synchronizing. In the

b = 001

example, while 0 is not synchronizing, 1 is. Applying the future cover equivalence relations exploits this to simplify G and its presenting semiautomaton. From visual inspection of Figure A1a’s semiautomaton, we see that

ξ_{1}

and

ξ_{01}

are equivalent, since their transitions lead to the same states with the same labeled edges. Applying the future cover equivalence relation gives the future cover, with its reduced semiautomaton presentation shown in Figure A1b. The simplified semigroup G of the future cover is given as

G = {0, 1, e, 00, 10}

with

01 = 1

and

100 = 00

, and the same production rules for forbidden words.

Since we designed our straightforward construction around the asymptotic cycle of the translation symmetry, the recurrent components in both cases are isomorphic and represent the canonical presentation

P (X)

that captures the symmetry algebra.

Appendix A.2. General Pattern: The Even Shift

To contrast with exact symmetry shifts, we now go through the well-studied Even Shift [31,33]. Recall that the Even Shift is the set of sequences that have even-length blocks of 1s bounded by 0s. Thus, it is defined by the set of irreducible forbidden words

F = {010, 01^{3} 0, 01^{5} 0, \dots}

. Since

F

is not finite, the Even Shift is not of finite type—it is strictly sofic. However, since it is sofic it can still be finitely defined in terms of a finite semigroup G. Following Ref. [33] we use

G = {0, 1, e, 01, 10, 11, 101}

with production rules

0^{2} = 0

,

1^{3} = 1

,

01^{2} = 1^{2} 0 = 0

and

010 = e

.

The production rule

010 = e

represents the shortest forbidden word and the other rules allow for all the other forbidden rules to reduce to 010. For example, we use

1^{3} = 1

to reduce 01110 to 010 which then maps to the absorbing element e. We can see that while there is a countably-infinite number of forbidden words in the Even Shift, there is structure in these forbidden words that can be captured in a finite semigroup G.

As with our explicit symmetry example

b = 001

, the presenting semiautomaton for the Even Process using the G above is not minimal. Shown in Figure A2, we see that there are two recurrent components

P_{A} (X)

and

P_{B} (X)

. While these components correspond to different elements in G, we can see that they are again isomorphic and so collapse together under

\sim_{F}

. The resulting recurrent component is the canonical machine presentation

P (X)

, shown in Figure 1c.

Figure A2. Presenting semiautomaton for the Even Shift using

G = {0, 1, e, 01, 10, 11, 101}

and

0^{2} = 0

,

1^{3} = 1

,

01^{2} = 1^{2} 0 = 0

,

010 = e

. The two components

P_{A} (X)

and

P_{B} (X)

are isomorphic and thus collapse together under

\sim_{F}

. The recurrent component is the canonical machine presentation

P (X)

, shown in Figure 1c.

Figure A2. Presenting semiautomaton for the Even Shift using

G = {0, 1, e, 01, 10, 11, 101}

and

0^{2} = 0

,

1^{3} = 1

,

01^{2} = 1^{2} 0 = 0

,

010 = e

. The two components

P_{A} (X)

and

P_{B} (X)

are isomorphic and thus collapse together under

\sim_{F}

. The recurrent component is the canonical machine presentation

P (X)

, shown in Figure 1c.

Though we discussed and described three forms of presenting semiautomata, we want to emphasize the importance of the canonical presentation

P (X)

as the mathematical representation of pattern as generalized symmetry. There may be many semigroups G that describe a given sofic shift

X

, and so there are many different semiautomata that can present

X

. However, for irreducible sofic shifts the recurrent components of all such G will be isomorphic [33], which is why the future cover is guaranteed to have a single recurrent component. Therefore, the future cover is a unique representation of the structure of

X

. Patterns and the symmetries they generalize are asymptotic properties and so the transients of the future cover are not of interest for our current purposes. This is why the canonical presentation

P (X)

—the recurrent component of the future cover—is the unique mathematical representation of patterns.

Recall that the symmetry group of a translation symmetry is captured by the canonical presentation, whose semiautomaton is a circle graph, neglecting the absorbing state and its transitions. Similarly, we can see the canonical presentation of the Even Shift captures the essential details of the pattern. From inspecting its edge-labeled graph, shown in Figure 1c, we can see that an arbitrary number of 0s are allowed from state

ξ_{A}

, but once a 1 occurs it must be followed by another 1, ensuring the pattern of an even number of 1s bounded by 0s. This also highlights the partial regularity that motivated our definition of patterns. Starting in state

ξ_{A}

there is a coin flip, either 0 or 1 may occur. If it is a 0, repeat. However, if it is a 1, there is now additional regularity and structure that enforces another 1 to follow.

Appendix B. Distinct Statistical Patterns Supported on the Same Sofic Shift Topological Pattern

The following illustrates how statistical patterns, in the form of sofic measures, are additional structure on top of topological patterns. In particular, there can be complex statistical structure supported on simple topological structure. The topological pattern in this example is the full-2 shift—the set of all binary strings. As described above, the full-2 shift represents a “null” pattern, in the sense that there is no predictability leveraged from knowing the pattern. This is captured quantitatively by the single-state canonical machine presentation

P (X)

. Its memory—the log of the number of states—is zero.

The simplest statistical patterns supported on the full-2 shift is given by assigning IID single-symbol probabilities, e.g.,

Pr (1) = 0.3

and

Pr (0) = 0.7

. The simplicity of this statistical pattern is also captured by a single-state

ϵ

-machine, where the above probabilities are assigned to the correspond transitions from the state back to itself on the given symbol. In this example it is easy to see the statistical pattern is supported on the full-2 shift. Simply remove the probabilities from the single-state transitions and the single-state canonical machine presentation of the full-2 shift is recovered. Since the

ϵ

-machine (statistical) and canonical machine presentation (topological) have the same number of the states, the statistical and topological patterns they capture can be thought of as comparable.

However, consider the

ϵ

-machine shown in Figure A3. Its two states signify a more complex statistical pattern with nonzero memory. From the machine diagram, we can see that the single-symbol probabilities depend on which of the two internal causal states A and B the process is in. For example, the probability of seeing a 1 is

0.75

if in causal state A and

0.25

if in B. Note also that the symbol 0 always leads to causal state A and 1 always leads to B. Thus, single symbols are synchronizing in this example. This stochastic process is an order-1 Markov process. The simple one-state case above is an IID (order-0 Markov) process, by contrast.

Figure A3. Two state stochastic

ϵ

-machine presentation for a statistical pattern supported on the full-2 shift.

Figure A3. Two state stochastic

ϵ

-machine presentation for a statistical pattern supported on the full-2 shift.

It is perhaps clear from Figure A3’s machine diagram that this statistical pattern is supported on the full-2 shift, since both symbols have positive probability from each of the two causal states. However, it is instructive to show this using the symbol-labeled transition matrices. Recall that

T_{i j}^{a}

gives the probability of transitioning from state

ξ_{i}

to

ξ_{j}

on the symbol

a \in A

. For Figure A3’s

ϵ

-machine we have:

\begin{matrix} T^{0} = (\begin{matrix} 0.25 & 0.0 \\ 0.75 & 0.0 \end{matrix}), \end{matrix}

and

\begin{matrix} T^{1} = (\begin{matrix} 0.0 & 0.75 \\ 0.0 & 0.25 \end{matrix}), \end{matrix}

where state A is given index 1 and B is index 2, so that

T_{12}^{a}

is the probability of transitioning from A to B.

Matrix representations

M^{a}

of the topological transition maps are given by setting non-zero elements of

T^{a}

to unity. In this example, we have:

\begin{matrix} M^{0} = (\begin{matrix} 1 & 0 \\ 1 & 0 \end{matrix}), \end{matrix}

and

\begin{matrix} M^{1} = (\begin{matrix} 0 & 1 \\ 0 & 1 \end{matrix}) . \end{matrix}

Recall that

M_{i j}^{a} = 1

signifies a transition from internal state i to state j is allowed on the symbol a and

M_{i j}^{a} = 0

is a forbidden transition that produces a forbidden word.

To see that these topological transitions correspond to the full-2 shift, note that the action of

M^{a}

on both internal states yields the same output state, for both

M^{0}

and

M^{1}

. That is, for both states, call them also

A = (1 0)

and

B = (0 1)

, a transition on a 0 goes to state A and a transition on a 1 goes to B. For example:

\begin{matrix} (0 1) (\begin{matrix} 1 & 0 \\ 1 & 0 \end{matrix}) = (1 0), \end{matrix}

and

\begin{matrix} (1 0) (\begin{matrix} 1 & 0 \\ 1 & 0 \end{matrix}) = (1 0) . \end{matrix}

Thus, as described above, the two states are topologically equivalent

A \sim_{F} B

and so reduce to the single-state canonical machine presentation of the full-2 shift.

Finally, as shown in Figure A4 this construction of probabilistically-distinct causal states supported on the full-2 shift can be extended to

ϵ

-machine with a larger number of states. Clearly, the construction can be extended indefinitely with an arbitrary number of causal states

ξ_{i}

such that

Pr (0 | ξ_{i}) = p

and

Pr (1 | ξ_{i}) = 1 - p

, since there can be uncountably-many p. This shows that there can be arbitrarily complex statistical patterns supported on simple topological patterns.

Figure A4. Four-state stochastic

ϵ

-machine presentation for a statistical pattern supported on the full-2 shift.

Figure A4. Four-state stochastic

ϵ

-machine presentation for a statistical pattern supported on the full-2 shift.

Appendix C. Cellular Automata

A one-dimensional cellular automaton or CA

(A^{L}, Φ)

consists of a spatial lattice

L = Z

whose sites take values from a finite alphabet

A

. A CA state

x \in A^{Z}

is the configuration of all site values

x^{r} \in A

on the lattice. (For states x, subscripts denote time; superscripts sites.) CA states evolve in discrete time steps according to the global evolution

Φ : A^{Z} \to X \subseteq A^{Z}

, where:

\begin{matrix} x_{t + 1} = Φ (x_{t}) . \end{matrix}

Φ

is implemented through parallel, synchronous application of a local update rule

ϕ

that evolves individual sites

x_{t}^{r}

based on their radius R neighborhoods

η (x^{r}) =

{x^{r^{'}} : | r - r^{'} | < R}

:

\begin{matrix} x_{t + 1}^{r} = ϕ (η (x_{t}^{r})) . \end{matrix}

Stacking the states in a CA orbit

x_{0 : t} = {x_{0}, x_{1}, \dots, x_{t - 1}}

in time-order produces a spacetime field

x_{0 : t}^{} \in A^{Z \otimes Z}

. Visualizing CA orbits as spacetime fields reveals the fascinating patterns and localized structures that CAs produce and how the patterns and structures evolve and interact over time.

Elementary Cellular Automata

The parameters

(A, R)

define a CA class. One simple but nontrivial class is that of the so-called elementary cellular automata (ECAs) [54] with a binary local alphabet

A = {0, 1}

and radius

R = 1

local-interaction neighborhood

η (x_{t}^{r}) = x_{t}^{r - 1} x_{t}^{r} x_{t}^{r + 1}

. Due to their definitional simplicity and wide study, we explore ECAs in our examples.

A local update rule

ϕ

is generally specified through a lookup table that enumerates all possible neighborhood configurations

η

and their outputs

ϕ (η)

. The lookup table for ECAs is given as:

where each output

O_{η} = ϕ (η) \in A

and the

η

s are listed in lexicographical order. There are

2^{8} = 256

possible ECA lookup tables, as specified by the possible strings of output bits:

O_{7} O_{6} O_{5} O_{4} O_{3} O_{2} O_{1} O_{0}

. A specific ECA lookup table is often referred to as an ECA rule with a rule number given as the binary integer

o_{7} o_{6} o_{5} o_{4} o_{3} o_{2} o_{1} o_{0} \in [0, 255]

. For example, ECA 172’s lookup table has output bit string 10101100.

The

n th

-order lookup table

ϕ^{n}

maps the radius

n \cdot R

neighborhood of a site to that site’s value n time steps in the future. Said another way, a spacetime site

x_{t + n}^{r}

is completely determined by the radius

n \cdot R

neighborhood n time-steps in the past according to:

\begin{matrix} x_{t + n}^{r} = ϕ^{n} (η^{n} (x_{t}^{r})) . \end{matrix}

The depth-n past lightcone is the collection of all

ϕ^{ℓ}

for

1 \leq ℓ \leq n

, plus the present site value itself (i.e., “depth-0”).

Appendix D. ECA Domain Sofic Shifts

The generalization of invariant sets from low-dimensional dynamical systems to high-dimensional cellular automata are known as domains [50]. These are sets of spatial configurations that are invariant under the global CA dynamic

Φ

. Spatial configurations of one-dimensional cellular automata are strings of symbols from an alphabet

A

. Thus, shift spaces are natural choices for sets of spatial configurations. In fact, the Curtis–Hedlund–Lyndon theorem shows that a CA’s dynamic—a sliding-block code—naturally induces shift-invariance in the set of images under

Φ

[55]. Said another way, the dynamic of a CA on a shift space maps to a shift space.

For a CA

Φ

, a domain

Λ = {Λ_{1}, Λ_{2}, \dots, Λ_{p}}

is a set of irreducible sofic shifts such that

Φ (Λ_{i}) \in Λ

. That is, if x is a spatial configuration that is a point in one of the sofic shifts

Λ_{i} \in Λ

, then the image

Φ (x)

is also a point in one of the sofic shifts in

Λ

. As irreducible sofic shifts, domains may possess patterns that are exact symmetries, partial symmetries, hidden symmetries, or general patterns with no symmetries. Empirically, the spacetime fields produced from the evolution of ECA domains possess the same type of pattern as their invariant sofic shifts. The generalized spacetime symmetries are revealed by the local causal states, as shown above in Figure 5. Here, we provide the machine presentations for the domain sofic shifts used in each case.

The exact symmetry spacetime field is given by the evolution of ECA Rule 54’s domain, whose machine presentation is shown in Figure A5. As can be seen, there are two phase

Λ_{A}

and

Λ_{B}

that are cycled between under

Φ_{54}

. Each phase has an exact symmetry, with

Λ_{A}

tiled by blocks of 0001 and

Λ_{B}

tiled by blocks of 1110. As seen in Figure 5a, this creates an exact symmetry spacetime field that is period-4 in both time and space.

The partial symmetry spacetime field possesses a stochastic symmetry and is given by the evolution of ECA Rule 18’s domain. Its machine presentation is shown in Figure A6. This is a single stochastic symmetry sofic shift with a period-2 tiling of

0 Σ

, where again

Σ

is a wildcard that can be either 0 or 1. Spacetime fields generated by the domain of Rule 18, as shown in Figure 5b, form a subshift of the 0-

Σ

checkerboard spacetime shift space. Both are discussed in detail above in the stochastic symmetry Case Study of Section 4.

Figure A5. Presenting semiautomaton for the domain of ECA Rule 54.

Figure A6. Presenting semiautomaton for the domain of ECA Rule 18.

Finally, the hidden symmetry spacetime field in Figure 5c is generated by the hidden symmetry domain sofic shift of ECA Rule 22. The presenting automaton is shown in Figure A7. Like Rule 54’s domain, the domain of ECA Rule 22 comes in two phases. As can be seen, phase

Λ_{A}

is actually a stochastic symmetry, as each state in

P (Λ_{A})

returns to itself after four translations. However, this is not the case for phase

Λ_{B}

, which is a hidden symmetry sofic shift. This is because only states

F

and

G

of

P (Λ_{B})

return to themselves after four translations. As drawn, one can imagine “folding”

P (Λ_{B})

up onto itself to create a period-4 symmetry in the states. Said another way, one can think of a “length-3 wildcard” that produces either three 0s or three 1s. The

Λ_{B}

sofic shift is then given by tilings of this length-3 wildcard followed by a fixed 0. This is the nature of hidden symmetries. Blocks of three 0s and three 1s are forced to occur, followed by a fixed 0, but the sequence of these blocks is not constrained. For example, the string of all 0s and the string of all 1s are both in

Λ_{B}

.

Figure A7. Presenting semiautomaton for the domain of ECA Rule 22.

References

Englert, F.; Brout, R. Broken symmetry and the mass of gauge vector mesons. Phys. Rev. Lett. 1964, 13, 321. [Google Scholar] [CrossRef]
Higgs, P.W. Broken symmetries and the masses of gauge bosons. Phys. Rev. Lett. 1964, 13, 508. [Google Scholar] [CrossRef]
Guralnik, G.S.; Hagen, C.R.; Kibble, T.W. Global conservation laws and massless particles. Phys. Rev. Lett. 1964, 13, 585. [Google Scholar] [CrossRef]
Weinberg, S. A model of leptons. Phys. Rev. Lett. 1967, 19, 1264. [Google Scholar] [CrossRef]
Cross, M.C.; Hohenberg, P.C. Pattern Formation Outside of Equilibrium. Rev. Mod. Phys. 1993, 65, 851–1112. [Google Scholar] [CrossRef]
Ball, P. The Self-Made Tapestry: Pattern Formation in Nature; Oxford University Press: New York, NY, USA, 1999. [Google Scholar]
Cross, M.; Greenside, H. Pattern Formation and Dynamics in Nonequilibrium Systems; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
Bernard, H. Les Tourbillons Cellulaires dans une Nappe Liquide [The Cellular Vortices in a Liquid Layer]. Rev. Gén. Sci. Pure Appl. 1900, 11, 1261–1271. [Google Scholar]
Paul, M.; Chiam, K.H.; Cross, M.; Fischer, P.; Greenside, H. Pattern Formation and Dynamics in Rayleigh–Bénard Convection: Numerical Simulations of Experimentally Realistic Geometries. Phys. Nonlinear Phenom. 2003, 184, 114–126. [Google Scholar] [CrossRef]
Zhabotinsky, A.M. Periodical Oxidation of Malonic Acid in Solution (a Study of the Belousov Reaction Kinetics). Biofizika 1964, 9, 306–311. [Google Scholar]
Epstein, I.; Pojman, J. An Introduction to Nonlinear Chemical Dynamics: Oscillations, Waves, Patterns, and Chaos; Oxford University Press: Oxford, UK, 1998. [Google Scholar]
Holmes, P.; Lumley, J.; Berkooz, G.; Rowley, C. Turbulence, Coherent Structures, Dynamical Systems and Symmetry; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
Haller, G. Lagrangian Coherent Structures. Ann. Rev. Fluid Mech. 2015, 47, 137–162. [Google Scholar] [CrossRef]
Marcus, P.S. Numerical simulation of Jupiter’s Great Red Spot. Nature 1988, 331, 693–696. [Google Scholar] [CrossRef]
Sommeria, J.; Meyers, S.D.; Swinney, H.L. Laboratory simulation of Jupiter’s Great Red Spot. Nature 1988, 331, 689–693. [Google Scholar] [CrossRef]
Tung, W.K. Group Theory in Physics: An Introduction to Symmetry Principles, Group Representations, and Special Functions in Classical and Quantum Physics; World Scientific Publishing Co. Inc.: Philidelphia, PL, USA, 1985. [Google Scholar]
Rupe, A.; Crutchfield, J.P. Local Causal States and Discrete Coherent Structures. Chaos 2018, 28, 1–22. [Google Scholar] [CrossRef] [PubMed]
Rupe, A.; Kumar, N.; Epifanov, V.; Kashinath, K.; Pavlyk, O.; Schlimbach, F.; Patwary, M.; Maidanov, S.; Lee, V.; Prabhat; et al. DisCo: Physics-Based Unsupervised Discovery of Coherent Structures in Spatiotemporal Systems. In Proceedings of the 2019 IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments (MLHPC), Denver, CO, USA, 18 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 75–87. [Google Scholar]
Schmidt, K. Dynamical Systems of Algebraic Origin; Springer Science & Business Media: Berlin/Heidelberg, Germany, 1995. [Google Scholar]
Lindgren, K.; Moore, C.; Nordahl, M. Complexity of Two-Dimensional Patterns. J. Stat. Phys. 1998, 91, 909–951. [Google Scholar] [CrossRef]
Lind, D. Multi-dimensional symbolic dynamics. Proc. Symb. Dyn. Appl. Am. Math. Soc. 2004, 60, 61–80. [Google Scholar]
Shalizi, C.R.; Crutchfield, J.P. Computational Mechanics: Pattern and Prediction, Structure and Simplicity. J. Stat. Phys. 2001, 104, 817–879. [Google Scholar] [CrossRef]
Grenander, U. Elements of Pattern Theory; JHU Press: Baltimore, ML, USA, 1996. [Google Scholar]
Mumford, D.; Desolneux, A. Pattern Theory: The Stochastic Analysis of Real-World Signals; CRC Press: Boca Raton, FL, USA, 2010. [Google Scholar]
Sethna, J.P. Order Parameters, Broken Symmetry, and Topology. arXiv 1992, arXiv:9204009. [Google Scholar]
Greenberg, J.M.; Hastings, S.P. Spatial patterns for discrete models of diffusion in excitable media. SIAM J. Appl. Math. 1978, 34, 515–523. [Google Scholar] [CrossRef]
Robinson, M.D.; Feldman, D.P.; McKay, S.R. Local Entropy and Structure in a Two-dimensional Frustrated System. Chaos 2011, 21, 037114. [Google Scholar] [CrossRef]
Lind, D.; Marcus, B. An Introduction to Symbolic Dynamics and Coding; Cambridge University Press: New York, NY, USA, 1995. [Google Scholar]
Smale, S. Differentiable Dynamical Systems. Bull. Am. Math. Soc. 1967, 73, 797–817. [Google Scholar] [CrossRef]
Parry, W. Intrinsic Markov Chains. Trans. Am. Math. Soc. 1964, 112, 55. [Google Scholar] [CrossRef]
Weiss, B. Subshifts of Finite Type and Sofic Systems. Monastsh. Math. 1973, 77, 462. [Google Scholar] [CrossRef]
Ginzburg, A. Algebraic Theory of Automata; Academic Press: Cambridge, MA, USA, 1968. [Google Scholar]
Kitchens, B.; Tuncel, S. Semi-groups and Graphs. Israel J. Math. 1986, 53, 231. [Google Scholar] [CrossRef]
Krieger, W. On sofic systems I. Israel J. Math. 1984, 48, 305–330. [Google Scholar] [CrossRef]
Fischer, R. Sofic Systems and Graphs. Monastsh. Math 1975, 80, 179. [Google Scholar] [CrossRef]
Fischer, R. Graphs and Symbolic Dynamics. Coll. Math. Soc. János Bólyai 16 Top. Inf. Theory 1975. [Google Scholar]
Ash, R.B. Information Theory; John Wiley and Sons: New York, NY, USA, 1965. [Google Scholar]
Hopcroft, J.E.; Motwani, R.; Ullman, J.D. Introduction to Automata Theory, Languages, and Computation, 3rd ed.; Prentice-Hall: New York, NY, USA, 2006. [Google Scholar]
Marques-Pita, M.; Rocha, L. Schema Redescription in Cellular Automata: Revisiting Emergence in Complex Systems. In Proceedings of the Artificial Life (ALIFE), 2011 IEEE Symposium on, Paris, France, 13–15 April 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 233–240. [Google Scholar]
Krohn, K.; Rhodes, J. Algebraic theory of machines. I. Prime decomposition theorem for finite semigroups and machines. Trans. Am. Math. Soc. 1965, 116, 450–464. [Google Scholar] [CrossRef]
Rhodes, J. Applications of Automata Theory and Algebra via the Mathematical Theory of Complexity to Biology, Physics, Psychology, Philosophy, Games, and Codes; Nehaniv, C., Ed.; University of California: Berkeley, CA, USA; World Scientific Publishing Company: Singapore, 1971. [Google Scholar]
Boyle, M.; Petersen, K. Hidden Markov processes in the context of symbolic dynamics. In Entropy of Hidden Markov Processes and Connections to Dynamical Systems; Marcus, B., Petersen, K., Weissman, T., Eds.; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Dudley, R.M. Real Analysis and Probability, 1st ed.; Cambridge University Press: Cambridge, UK, 2002. [Google Scholar]
Furstenberg, H. Stationary Processes and Prediction Theory; Princeton University Press: Princeton, NJ, USA, 1960; Volume 44. [Google Scholar]
Riechers, P.; Crutchfield, J.P. Spectral Simplicity of Apparent Complexity, Part I: The Nondiagonalizable Metadynamics of Prediction. Chaos 2018, 28, 033115. [Google Scholar] [CrossRef]
Riechers, P.; Crutchfield, J.P. Spectral Simplicity of Apparent Complexity, Part II: Exact Complexities and Complexity Spectra. Chaos 2018, 28, 033116. [Google Scholar] [CrossRef]
Crutchfield, J.P. Between Order and Chaos. Nat. Phys. 2012, 8, 17–24. [Google Scholar] [CrossRef]
Hochman, M. Multidimensional shifts of finite type and sofic shifts. In Combinatorics, Words and Symbolic Dynamics; Cambridge University Press: Cambridge, UK, 2016; pp. 296–359. [Google Scholar]
Shalizi, C. Optimal Nonlinear Prediction of Random Fields on Networks. Discret. Math. Theor. Comput. Sci. 2003. [Google Scholar] [CrossRef]
Hanson, J.E.; Crutchfield, J.P. The Attractor-Basin Portrait of a Cellular Automaton. J. Stat. Phys. 1992, 66, 1415–1462. [Google Scholar] [CrossRef]
Rupe, A. Rule 90 Patterns Jupyter Notbook. 2022. Available online: https://github.com/adamrupe/ca_patterns/blob/main/notebooks/rule90_patterns.ipynb (accessed on 30 June 2022).
Rupe, A. Rule 18 Domain Patterns Jupyter Notbook. 2022. Available online: https://github.com/adamrupe/ca_patterns/blob/main/notebooks/domain18_patterns.ipynb (accessed on 30 June 2022).
Rupe, A. General CA Patterns Jupyter Notbook. 2022. Available online: https://github.com/adamrupe/ca_patterns/blob/main/notebooks/general_patterns.ipynb (accessed on 30 June 2022).
Wolfram, S. Statistical Mechanics of Cellular Automata. Rev. Mod. Phys. 1983, 55, 601. [Google Scholar] [CrossRef]
Hedlund, G.A. Endomorphisms and Automorphisms of the Shift Dynamical System. Theory Comput. Syst. 1969, 3, 320–375. [Google Scholar] [CrossRef]

Figure 1. Semiautomata presentations

P (X)

: (a) Exact symmetry shift, (b) partial symmetry shift, (c) hidden symmetry shift, and (d) general pattern sofic shift.

Figure 1. Semiautomata presentations

P (X)

: (a) Exact symmetry shift, (b) partial symmetry shift, (c) hidden symmetry shift, and (d) general pattern sofic shift.

Figure 2. Fringes induced by spacetime shifts: Co-occurring depth-4 past lightcone

L^{-} (r_{0}, t_{0})

(red) and depth-4 spacetime patches resulting from concatenations of (a) left transition fringes

V_{ℓ} (r_{0}, t_{0})

(blue), (b) right transition fringes

V_{r} (r_{0}, t_{0})

(blue), (c) forward transition fringes

V_{t} (r_{0}, t_{0})

(blue), and (d) unions of left, right, and forward transition fringes

V (r_{0}, t_{0})

(blue). Arrows indicate the direction(s) in which local spacetime patches may be generated with successive concatenations to the seed past lightcone

L^{-} (r_{0}, t_{0})

.

Figure 2. Fringes induced by spacetime shifts: Co-occurring depth-4 past lightcone

L^{-} (r_{0}, t_{0})

(red) and depth-4 spacetime patches resulting from concatenations of (a) left transition fringes

V_{ℓ} (r_{0}, t_{0})

(blue), (b) right transition fringes

V_{r} (r_{0}, t_{0})

(blue), (c) forward transition fringes

V_{t} (r_{0}, t_{0})

(blue), and (d) unions of left, right, and forward transition fringes

V (r_{0}, t_{0})

(blue). Arrows indicate the direction(s) in which local spacetime patches may be generated with successive concatenations to the seed past lightcone

L^{-} (r_{0}, t_{0})

.

Figure 3. Co-occurring past (

L^{-}

) and future (

L^{+}

) lightcones at a spacetime site

(r_{0}, t_{0})

in

1 + 1

dimensions with

c = 1

.

Figure 3. Co-occurring past (

L^{-}

) and future (

L^{+}

) lightcones at a spacetime site

(r_{0}, t_{0})

in

1 + 1

dimensions with

c = 1

.

Figure 4. ECA Rule 90 spacetime field depicted as white (0) and black (1) squares. The corresponding local causal-state field is overlaid with colored letters; simply the single causal state A. Three sample right-transition fringes for past lightcone depth-2 are highlighted in colored (orange, green, purple) boxes.

Figure 5. Spacetime pattern classes: Spacetime fields

x_{}^{}

(above) and corresponding local causal state fields

S_{}^{} = ϵ (x_{}^{})

(below) for (a) an exact symmetry (ECA Rule 54 domain), (b) a partial symmetry (ECA Rule 18 domain), (c) a hidden symmetry (ECA Rule 22 domain), and (d) a general pattern (ECA Rule 54 evolving random initial configuration).

Figure 5. Spacetime pattern classes: Spacetime fields

x_{}^{}

(above) and corresponding local causal state fields

S_{}^{} = ϵ (x_{}^{})

(below) for (a) an exact symmetry (ECA Rule 54 domain), (b) a partial symmetry (ECA Rule 18 domain), (c) a hidden symmetry (ECA Rule 22 domain), and (d) a general pattern (ECA Rule 54 evolving random initial configuration).

Figure 6. Spacetime fields

x_{}^{}

shown in Figure 5 (above) and corresponding V state fields

S_{}^{} = ϵ (x_{}^{})

(below) for (a) an exact symmetry, (b) a partial symmetry, (c) a hidden symmetry, and (d) a general pattern.

Figure 6. Spacetime fields

x_{}^{}

shown in Figure 5 (above) and corresponding V state fields

S_{}^{} = ϵ (x_{}^{})

(below) for (a) an exact symmetry, (b) a partial symmetry, (c) a hidden symmetry, and (d) a general pattern.

Figure 7. Sample field from 0-Wildcard shift space in black (1) and white (0) squares with (a) local causal states and (b)

V

presentation local states (b) overlaid. In both cases, the local states can be assigned fixed-0 and wildcard semantics. So, they are labeled

F

and

W

, respectively. The local causal states in (a) have an additional “indeterminate” state assigned to the all-0 past lightcone, labeled as

X

; see, for example, the field at

r = 60

and

t = 34

).

Figure 7. Sample field from 0-Wildcard shift space in black (1) and white (0) squares with (a) local causal states and (b)

V

presentation local states (b) overlaid. In both cases, the local states can be assigned fixed-0 and wildcard semantics. So, they are labeled

F

and

W

, respectively. The local causal states in (a) have an additional “indeterminate” state assigned to the all-0 past lightcone, labeled as

X

; see, for example, the field at

r = 60

and

t = 34

).

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rupe, A.; Crutchfield, J.P. Algebraic Theory of Patterns as Generalized Symmetries. Symmetry 2022, 14, 1636. https://doi.org/10.3390/sym14081636

AMA Style

Rupe A, Crutchfield JP. Algebraic Theory of Patterns as Generalized Symmetries. Symmetry. 2022; 14(8):1636. https://doi.org/10.3390/sym14081636

Chicago/Turabian Style

Rupe, Adam, and James P. Crutchfield. 2022. "Algebraic Theory of Patterns as Generalized Symmetries" Symmetry 14, no. 8: 1636. https://doi.org/10.3390/sym14081636

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Algebraic Theory of Patterns as Generalized Symmetries

Abstract

1. Patterns in Nature

2. One-Dimensional Patterns

2.1. Statistical Field Theories

2.2. Symbolic Dynamics

2.3. Sofic Shifts as Topological Patterns

2.4. Exact Symmetries

2.5. Generalized Symmetries

2.6. Statistical Patterns Supported on Sofic Shifts

3. Patterns in Spacetime

3.1. Local Spacetime Presentations

3.2. The Shape of Local Futures

3.3. Generalized Spacetime Symmetries

4. Case Study: Stochastic Symmetries

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Examples and Constructions

Appendix A.1. Exact Symmetry Shifts

Appendix A.2. General Pattern: The Even Shift

Appendix B. Distinct Statistical Patterns Supported on the Same Sofic Shift Topological Pattern

Appendix C. Cellular Automata

Elementary Cellular Automata

Appendix D. ECA Domain Sofic Shifts

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI