Next Article in Journal
FedUB: Federated Learning Algorithm Based on Update Bias
Previous Article in Journal
Navigating Supply Chain Resilience: A Hybrid Approach to Agri-Food Supplier Selection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assembly Theory of Binary Messages

by
Szymon Łukaszyk
* and
Wawrzyniec Bieniawski
Łukaszyk Patent Attorneys, ul. Głowackiego 8, 40-052 Katowice, Poland
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(10), 1600; https://doi.org/10.3390/math12101600
Submission received: 23 April 2024 / Revised: 16 May 2024 / Accepted: 17 May 2024 / Published: 20 May 2024
(This article belongs to the Section Mathematical Physics)

Abstract

:
Using assembly theory, we investigate the assembly pathways of binary strings (bitstrings) of length N formed by joining bits present in the assembly pool and the bitstrings that entered the pool as a result of previous joining operations. We show that the bitstring assembly index is bounded from below by the shortest addition chain for N, and we conjecture about the form of the upper bound. We define the degree of causation for the minimum assembly index and show that, for certain N values, it has regularities that can be used to determine the length of the shortest addition chain for N. We show that a bitstring with the smallest assembly index for N can be assembled via a binary program of a length equal to this index if the length of this bitstring is expressible as a product of Fibonacci numbers. Knowing that the problem of determining the assembly index is at least NP-complete, we conjecture that this problem is NP-complete, while the problem of creating the bitstring so that it would have a predetermined largest assembly index is NP-hard. The proof of this conjecture would imply P ≠ NP since every computable problem and every computable solution can be encoded as a finite bitstring. The lower bound on the bitstring assembly index implies a creative path and an optimization path of the evolution of information, where only the latter is available to Turing machines (artificial intelligence). Furthermore, the upper bound hints at the role of dissipative structures and collective, in particular human, intelligence in this evolution.

1. Introduction

Assembly theory (AT) [1,2,3,4,5,6,7] provides a distinctive complexity measure superior to established complexity measures used in information theory, such as Shannon entropy or Kolmogorov complexity [1,5]. AT does not alter the fundamental laws of physics [6]. Instead, it redefines objects on which these laws operate. In AT, objects are not considered sets of point particles (as in most physics) but instead are defined via the histories of their formation (assembly pathways) as an intrinsic property where, in general, there are multiple assembly pathways to create a given object.
AT explains and quantifies selection and evolution, capturing the amount of memory necessary to produce a given object [6] (this memory is the object [8]). This is because, the more complex a given object, the less likely an identical copy can be observed without the selection of some information-driven mechanism that generated that object. Formalizing assembly pathways as sequences of joining operations, AT begins with basic units (such as chemical bonds) and ends with a final object. This conceptual shift captures evidence of selection in objects [1,2,6].
The assembly index of an object corresponds to the smallest number of steps required to assemble this object, and—in general—it increases with the object’s size but decreases with symmetry, so large objects with repeating substructures may have a smaller assembly index than smaller objects with greater heterogeneity [1]. The copy number specifies the observed number of copies of an object. Only these two quantities describe the evolutionary concept of selection by showing how many alternatives were excluded to assemble a given object [6,8].
AT has been experimentally confirmed in the case of molecules and has been probed directly via experiments with high accuracy by means of spectroscopy techniques, including mass spectroscopy, IR, and NMR spectroscopy [6,7]. It is a versatile concept with applications in various domains. Beyond its application in the fields of biology and chemistry [7], its adaptability to different data structures, such as text, graphs, groups, music notations, image files, compression algorithms, human languages, memes, etc., showcases its potential in diverse fields [2].
In this study, we investigated the assembly pathways of binary strings (bitstrings) by joining individual bits present in the assembly pool and bitstrings that entered the pool as a result of previous joining operations.
A bit is the smallest amount and the quantum of information. Perceivable information about any object can be encoded via a bitstring [9,10], but this does not imply that a bitstring defines an object. Information that defines a chemical compound, a virus, a computer program, etc. can be encoded via a bitstring. However, a dissipative structure [11], such as a living biological cell (or its conglomerate, such as a human, for example) cannot be represented with a bitstring (even if its genome can). This information can only be perceived (so this is not object-defining information). Therefore, we use emphasis for the object in this paper since this term, understood as a collection of matter, is a misnomer in that it neglects (quantum) nonlocality [12]. Nonlocality is independent of the entanglement among particles [13], as well as quantum contextuality [14], and it increases as the number of particles [15] grows [16,17]. Furthermore, the ugly duckling theorem [9,10] asserts that every two objects we perceive are equally similar (or equally dissimilar).
Furthermore, a bitstring as such is neither dissipative nor creative. Its assembly process is what can be dissipative or creative. The perceivable universe is not big enough to contain the future; it is deterministic when going back in time and non-deterministic when going forward in time [18]. But we know  [2,11,19,20,21,22,23,24,25,26,27,28,29] that it has evolved to the present since the Big Bang. Evolution is about assembling a novel structure of information and optimizing its assembly process until it reaches the assembly index. Once the new information is assembled (via a dissipative structure operating far from thermodynamic equilibrium, including humans), it enters the realm of the 2nd law of thermodynamics, and nature seeks how to optimize its assembly pathway.
At first, the newly assembled structure of information is discovered via groping [19], and its assembly pathway does not attain its most economical or efficient form at once. For a certain period of time, its evolution gropes about within itself. The try-out follows the try-out, not being finally adopted. Then, finally, perfection comes within sight, and from that moment, the rhythm of change slows down [19]. The new information, having reached the limit of its potentialities, enters the phase of conquest. Stronger now than its less-perfected neighbours, the new information multiplies and consolidates. When the assembly index is reached, new information attains equilibrium, and its evolution terminates. It becomes stable.
Thanks to its characteristic additive power, living matter (unlike the matter of the physicists) finds itself ”ballasted” with complications and instability. It falls, or rather rises, towards forms that are more and more improbable. Without orthogenesis life would only have spread; with it there is an ascent of life that is invincible.
[19]
The paper is structured as follows. Section 2 introduces the basic concepts and definitions used in the paper. Section 3 shows that the bitstring assembly index is bounded from below and provides the form of this bound. Section 4 defines the degree of causation for the smallest assembly-index bitstrings. Section 5 shows that the bitstring assembly index is bounded from above, and we conjecture about the exact form of this bound. Section 6 introduces the concept of a binary assembling program and shows that, in general, the trivial assembling program assembles the smallest assembly index bitstrings. Section 7 discusses and concludes the findings of this study.

2. Preliminaries

For K subunits of an object, O, the assembly index, a O , of this object is bounded [1] from below via
min a O   =   log 2 ( K ) ,
and from above via
max a O   =   K 1 .
The lower bound (1) represents the fact that the simplest way to increase the size of a subunit in a pathway is to take the largest subunit assembled so far and join it to itself [1], and in the case of the upper bound (2), subunits must be distinct so that they cannot be reused from the pool, decreasing the index.
Here, we consider bitstrings C k ( N ) containing bits { 1 , 0 } , with  N 0 zeros and N 1 ones, having a length of N   =   N 0 + N 1 . N 1 is called the binary Hamming weight or the bit summation of a bitstring. Bitstrings are our basic AT objects [2], and we consider the process of their formation within the AT framework. Where the bit value can be either 1 or 0, we write   =   { 1 , 0 } , with being the same within the bitstring C k ( N ) . If we allow for the second possibility that can be the same as or different from , we write   =   { 1 , 0 } . Thus, C k ( 2 )   =   [ ] , for example, is a placeholder for all four 2-bit strings.
In general, we consider bitstrings, C k ( N ) , to be messages transmitted through a communication channel between a source and a receiver, similar to the Claude Shannon approach [30] used in the derivation of binary information entropy
H C k ( N )   =   p 0 log 2 ( p 0 ) p 1 log 2 ( p 1 ) ,
where
p 0   =   N 0 N and p 1   =   N 1 N
are the ratios of occurrences of zeros and ones within the bitstring C k ( N ) , and the unit of entropy (3) is a bit.
Definition 1.
A bitstring assembly index, a ( N ) , is the smallest number of steps, s, required to assemble a bitstring, C k ( N ) , of length N by joining two distinct bits contained in the initial assembly pool, P   =   { 1 , 0 } , and bitstrings assembled in previous steps that were added to the assembly pool. Therefore, the assembly index, a ( N ) ( C k ) , is a function of the bitstring C k ( N ) .
For example, the 8-bit string
C k ( 8 )   =   [ 01010101 ]
can be assembled in at most seven steps:
1.
Join 0 with 1 to form C k ( 2 )   =   [ 01 ] , adding [ 01 ] to P   =   { 1 , 0 , 01 } ;
2.
Join C k ( 2 )   =   [ 01 ] with 0 to form C k ( 3 )   =   [ 010 ] , adding [ 010 ] to P   =   { 1 , 0 , 01 , 010 } ;
3.
7.
Join C k ( 7 )   =   [ 0101010 ] with 1 to form C k ( 8 )     =     [ 01010101 ]
(i.e., not using the assembly pool P), in six, five, or four steps:
  • Join 0 with 1 to form C k ( 2 )   =   [ 01 ] , adding [ 01 ] to P;
  • Join C k ( 2 )   =   [ 01 ] with [ 01 ] taken from P to form C k ( 4 )   =   [ 0101 ] , adding [ 0101 ] to P;
  • Join C k ( 4 )   =   [ 0101 ] with [ 01 ] taken from P to form C k ( 6 )   =   [ 010101 ] , adding [ 010101 ] to P;
  • Join C k ( 6 )   =   [ 010101 ] with [ 01 ] taken from P to form C k ( 8 )   =   [ 01010101 ]
or at least three steps:
  • Join 0 with 1 to form C k ( 2 )   =   [ 01 ] , adding [ 01 ] to P;
  • Join C k ( 2 )   =   [ 01 ] with [ 01 ] taken from P to form C k ( 4 )   =   [ 0101 ] , adding [ 0101 ] to P;
  • Join C k ( 4 )   =   [ 0101 ] with [ 0101 ] taken from P to form C k ( 8 )   =   [ 01010101 ] .
Meanwhile, the 8-bit string
C l ( 8 )   =   [ 00010111 ]
can be assembled in at least six steps:
  • Join 0 with 1 to form C l ( 2 )   =   [ 01 ] , adding [ 01 ] to P;
  • Join C l ( 2 )   =   [ 01 ] with [ 01 ] taken from P to form C l ( 4 )   =   [ 0101 ] , adding [ 0101 ] to P;
  • Join 0 with 0 adding [ 00 ] to P;
  • Join C l ( 4 )   =   [ 0101 ] with [ 00 ] taken from P to form C l ( 6 )   =   [ 000101 ] , adding [ 000101 ] to P;
  • Join C l ( 6 )   =   [ 000101 ] with 1 to form C l ( 7 )   =   [ 0001011 ] , adding [ 0001011 ] to P;
  • Join C l ( 7 )   =   [ 0001011 ] with 1 to form C l ( 8 )   =   [ 00010111 ]
as only the doublet [ 01 ] can be reused from the pool. Therefore, bitstrings (5) and (6), despite having the same length, N   =   8 , Hamming weight, N 1   =   4 , and Shannon entropy (3), have respective assembly indices a ( 8 ) ( C k )   =   3 and a ( 8 ) ( C l )   =   6 that represent the lengths of their shortest assembly pathways, which, in turn, ensures that their assembly pools, P, are distinct sets for a given assembly pathway.
Table 1 and Table A5, Table A6, Table A7, Table A8, Table A8, Table A9, Table A10, Table A11 and Table A12 (Appendix C) show the distributions of the assembly indices among 2 N bitstrings for 4 N 12 , taking into account their Hamming weights, N 1 . The sums of columns are rows of Pascal’s triangle (OEIS A007318).
The following definition is commonly known, but we provide it here for clarity.
Definition 2.
A bitstring, B k ( N ) , is a balanced string if its Hamming weight N 1   =   N / 2 or N 1   =   N / 2 .
Without a loss of generality, we assume that, if N is odd, N 1 < N 0 (e.g., for  N   =   5 , N 1   =   2 , and  N 0   =   3 ). However, our results are equivalently applicable if we assume the opposite (i.e., a larger number of ones for an odd N). The number | B ( N ) | of balanced bitstrings among all 2 N bitstrings is as follows
| B ( N ) |   =   N N / 2   =   N N / 2 2 π N 2 N ,
where “ x ” is the floor function that yields the greatest integer less than or equal to x, and “ x ” is the ceiling function that yields the least integer greater than or equal to x. This is the OEIS A001405 sequence, the maximal number of subsets of an N-set, such that no one contains another, as asserted via Sperner’s theorem, and approximated using Stirling’s approximation for a large N. Balanced and even-length bitstrings, B k ( N ) , have natural binary entropies, (3) H ( B k ( N ) )   =   { 0 , 1 } . Conversely, non-balanced and/or odd-length bitstrings, C k ( N ) , have binary entropies, 0 < H ( C k ( N ) ) < 1 .
Theorem 1.
An N   =   4 -bit string is the shortest string with more than one bitstring assembly index.
Proof. 
The proof is trivial. For N   =   1 , the assembly index a ( 1 ) ( C )   =   0 , as all basis objects have a pathway assembly index of 0 [2] (they are not assembled). N   =   2 provides four available bitstrings with a ( 2 ) ( C )   =   1 . N   =   3 provides eight available bitstrings with a ( 3 ) ( C )   =   2 . Only N   =   4 provides sixteen bitstrings that include four stings with a ( 4 ) ( C )   =   2 and twelve bitstrings with a ( 4 ) ( C )   =   3 including | B ( 4 ) |   =   6 balanced bitstrings, as shown in Table 1 and Table 2. For example, to assemble the bitstring B 1   =   [ 0101 ] , we need to assemble the bitstring [ 01 ] and reuse it. Therefore, a ( N ) ( C k )   =   N 1 for 0 < N < 4 ,   k   =   { 1 , 2 , , 2 N } and min k { a ( N ) ( C k ) } < N 1 for N 4 , where { a ( N ) ( C k ) } denotes a set of assembly indices of all 2 N bitstrings.    □
Interestingly, Theorem 1 strengthens the meaning of N   =   4 as the minimum information capacity that provides a minimum thermodynamic black hole entropy [31,32,33]. There is no disorder or uncertainty in an object that can be assembled in the same number of steps, s < 3 .
The following definition, taking into account the cyclic order of bitstrings, is also provided for the sake of clarity.
Definition 3.
A bitstring, R k ( N ) , is a ringed bitstring if a ring formed with this string by joining its beginning with its end is unique among the rings formed from the other ringed strings, R l ( N ) , l k .
There are at least two and at most N forms of a ringed bitstring, R k ( N ) , that differ in the position of the starting bit. For example, for | B ( 4 ) |   =   6 balanced bitstrings, shown in Table 2, two augmented strings with a ( 4 )   =   2 correspond to each other if we change the starting bit:
[ 1   |   0101   |   0101   |   01 ]   =   [ 10   |   1010   |   1010   |   1 ] .
Similarly, four augmented bitstrings with a ( 4 )   =   3 correspond to each other
[   |   0110   |   0110   |   011 ]   =   [ 0   |   1100   |   1100   |   11 ]   =   [ 01   |   1001   |   1001   |   1 ]   =   [ 011   |   0011   |   0011   |   ] ,
after a change in the position of the starting bit. Thus, there are only two balanced, ringed bitstrings E k ( 4 ) . The number of ringed bitstrings | R k ( N ) | among all 2 N bitstrings is given via the OEIS sequence A000031. In general (for N 3 ), the number, | R k ( N ) | , of ringed bitstrings is much lower than the number, | B k ( N ) | , of balanced bitstrings.
By neglecting the notion of the beginning and end of a string, we focus on its length and content. In Yoda’s language,
“complete, no matter where it begins. A message is”.
The numbers of the balanced | B k ( N ) | , ringed | R k ( N ) | , and balanced, ringed | E k ( N ) | bitstrings are shown in Table 3 and Figure 1. | E k ( N ) | is close to OEIS A000014 sequence up to the eleventh term and its formula remains to be researched.
We note that, in general, the starting bit is relevant for the assembly index. Thus, different forms of a ringed bitstring may have different assembly indices. For example, for  N   =   7 balanced bitstrings, B 34 and B 35 , shown in Table A15, have a ( 7 )   =   6 . However, these bitstrings are not ringed since they correspond to each other and to the balanced bitstrings B 13 , B 18 , B 20 , B 28 , and  B 30 with a ( 7 )   =   5 . They all have the same triplet of adjoining ones.
Definition 4.
The assembly index of a ringed bitstring, R k ( N ) , is the smallest assembly index among all forms of this string.
Thus, if different forms of a ringed bitstring have different assembly indices, we assign the smallest assembly index to this string. In other words, we assume that the smallest number of steps
a ( N ) ( R k )   =   min l { a ( N ) ( R k ) l } , ( R k ) l R k ,
where ( R k ) l denotes a particular l th form of a ringed bitstring, R k , is the bitstring assembly index of this ringed string (cf., e.g., strings (19) and (20)). We assume that, if an object that can be represented with a ringed bitstring can be assembled in fewer steps, this procedure will be preferred in nature.
The distribution of the assembly indices of the balanced, ringed bitstrings, E k , is shown in Table 4.

3. Minimum Bitstring Assembly Index

In the following, we derive the tight lower bound of the set of different bitstring assembly indices.
Theorem 2
(Tight lower bound of the bitstring assembly index). The smallest bitstring assembly index, a ( N ) ( C min ) , as a function of N corresponds to the shortest addition chain for N (OEIS A003313).
Proof. 
Bitstrings, C min , for which a ( N ) ( C min )   =   min k { a ( N ) ( C k ) } , k   =   { 1 , 2 , , 2 N } can be formed in subsequent steps, s, by joining the longest bitstring assembled so far with itself until N   =   2 s is reached [1]. Therefore, if  N   =   2 s , then min k { a ( 2 s ) ( C k ) }   =   s   =   log 2 ( N ) . Only four bitstrings
C min 1 ( 2 s )   =   [ 00 ] , C min 2 ( 2 s )   =   [ 11 ] , C min 3 ( 2 s )   =   [ 0101 ] , and C min 4 ( 2 s )   =   [ 1010 ]
have such an assembly index in this case.
An addition chain for N N , having the shortest length s N (commonly denoted as l ( N ) ), is defined as a sequence, 1   =   b 0 < b 1 < < b s   =   N , of integers such that, for each j 1 , b j   =   b k + b l for l k < j . The first step in creating an addition chain for N is always b 1   =   1 + 1   =   2 , and this corresponds to assembling a doublet, [ ] , from the initial assembly pool, P. Thus, the lower bound for s of the addition chain for N, s log 2 ( N ) is achieved for N   =   2 s . In our case, bitstrings (11) achieve this bound. The second step in creating an addition chain can be b 2   =   1 + 1   =   2 or b 2   =   1 + 2   =   3 .
Thus, finding the shortest addition chain for N corresponds to finding an assembly index of a bitstring containing bits and/or doublets and/or triplets generated via these doublets for N 2 s since, due to Theorem 1, only they provide the same assembly indices { 0 , 1 , 2 } . Such strings correspond to linear molecules made of carbons (cf. supplementary material, S3.2 in [4]). □
The smallest assembly indices, a min ( N ) , are shown in Table 5 for 1 N 21 . Calculating the minimum length of the addition chain for N, as well as finding the shortest assembly pathway for a chemical molecule, has been shown to be at least as hard as NP-complete [4,34].

4. Degree of Causation for Minimum-Assembly-Index Bitstrings

Using the difference between the general AT lower bound (1) and the smallest bitstring assembly index (OEIS A003313), we can define the quantity
D C ( N ) : = 2 log 2 ( N ) a min ( N )   =   N 2 a min ( N ) ,
capturing a degree of causation [6] of assembling the bitstrings of length N with the smallest assembly index, as shown in Figure 2. For  N   =   2 s , the degree of causation, D C ( N )   =   1 , as all four bitstrings (11), can be assembled along a single pathway only; their assembly is entirely causal. However, for  N 2 s , D C ( N ) < 1 , since some bitstrings C min ( N ) can be assembled along different pathways. For example, there are two pathways for the bitstring [ 001 ] : (a) [ 00 ] + 1 and (b) 0 + [ 01 ] , leaving different subunits (respectively, [ 00 ] and [ 01 ] ) in their assembly pools and resulting in lower values of D C ( N ) .
Equation (12) naturally divides the set of natural numbers into sections, 2 s N < 2 s + 1 , and shows regularities that, for certain values of N, can be used to determine the smallest assembly index (i.e., the shortest addition chain for N) as a min ( N )   =   log 2 ( N ) log 2 ( D C ( N ) ) . For each N   =   2 s a min ( 2 s )   =   s and for each N ^ being the sum of two powers of 2 (OEIS A048645),
N ^ : = 2 s + 1 · 2 l , l   =   0 , 1 , , s 1 a min ( N ^ )   =   s + 1 ,
while, for the remaining N ˜ not being the sum of two powers of 2 (OEIS A072823),
2 s < N ˜ < 2 s + 1 , N ˜ N ^ a min ( N ˜ )   =   s + k ,   k 2 ,
where k   =   2 for N ˜   =   { 7 , 11 , 13 15 , 19 , 21 23 , 25 28 , } , while some N ˜ s generate exceptions to this general rule (cf. OEIS A230528). For example, k   =   3 for N ˜  = {29, 31, 47, 53, 55, 57–59, 61–63, …}, k   =   4 for N ˜  = {127, 191, 235, 237, 239, 247, 251, 253, 254, …}, etc. The first exception, k   =   3 , is for N ˜ 13   =   29 . The first double exception, k   =   4 , is for N ˜ 63   =   127 . However, in particular, for 
N ˜ 3   =   2 s + 3 · 2 l , l   =   0 , 1 , , s 2 a min ( N ˜ 3 )   =   s + 2 , and
N ˜ 7 , s 3   =   2 s + 7 · 2 s 3   =   { 15 , 30 , 60 , } , s 3 a min ( N ˜ 7 , s 3 )   =   s + 2 ,
so the number of Ns within each section, not included in the set of general rules 2 s , (13), (15), and (16), is | N ngr |   =   | 2 s 1 s ( s 1 ) 1 |   =   | 2 s 2 s 1 | . Furthermore,
lim s min D C ( N ^ ) = lim s 1 2 1 + 1 2 s   =   1 2 , lim s max D C ( N ˜ ) = D C ( N ˜ 7 , s 3 )   =   15 32 .
The shortest addition chain-sequence-generating factors for 1 s 5 are listed in Table 6, where the subsequent odd numbers of the form m k generate sequences N   =   2 s + m k · 2 l , where l   =   0 , 1 , , k 1 , while the m k numbers in red indicate that certain N ˜ s within the sequences they generate are exceptions to the general a min ( N ˜ )   =   s + 2 rule.
For example, if  s   =   4 , then a min ( 16 )   =   4 and
N ^   =   2 4 + 2 l   =   { 17 , 18 , 20 , 24 } a min ( N ^ )   =   4 + 1   =   5 , N ˜ 3   =   2 4 + 3 · 2 l   =   { 19 , 22 , 28 } a min ( N ˜ 3 )   =   4 + 2   =   6 , N ˜ 5   =   2 4 + 5 · 2 l   =   { 21 , 26 } a min ( N ˜ 5 )   =   4 + 2   =   6 , N ˜ 7   =   2 4 + 7 · 2 l   =   { 23 , 30 } a min ( N ˜ 7 )   =   4 + 2   =   6 , N ˜ 9   =   2 4 + 9 · 2 l   =   25 a min ( N ˜ 9 )   =   4 + 2   =   6 , N ˜ 11   =   2 4 + 11 · 2 l   =   27 a min ( N ˜ 11 )   =   4 + 2   =   6 , N ˜ 13   =   2 4 + 13 · 2 l   =   29 a min ( N ˜ 13 )   =   4 + 3   =   7 , N ˜ 15   =   2 4 + 15 · 2 l   =   31 a min ( N ˜ 15 )   =   4 + 3   =   7 ,
where the last two values, a min ( N ˜ ) , are higher than those given via the general rule. Based on the OEIS A003313 sequence for N 10 5 , we have determined the number of exceptions, that is | N exc | , such that a min ( N exc ) { s , s + 1 , s + 2 } for 0 s 15 , as shown in Table 7, where min m k is the minimal generating factor, m k , shown in Table 6 that generates the exceptional a min ( N exc ) . For all s 4 , max m k   =   2 s 1 . The fact that | N ngr | > | N exc | , s 3 hints at the existence of general rules other than 2 s , (13), (15), and (16).
Furthermore, for  4 s 6 , | N exc |   =   2 s s 2 + 2 and for s 7 , | N exc |   =   2 s s 2 + 1 (OEIS A024012). As shown in Figure 3a, for all s, | N exc | asymptotically approaches 2 s available in a given section as s . As shown in Figure 3b, for s   =   6 , the ratio | N exc | / 2 s has a deflection point, | N exc | / 2 6   =   15 / 32 .
Only living systems have been found to be capable of producing abundant molecules with an assembly index greater than an experimentally determined value of 15 steps [3,8]. The cut-off between 13 and 15 is sharp, which means that molecules made through random processes cannot have assembly indices exceeding 13 steps [3,8]. In particular, N   =   15 is the length of the shortest addition chain for N, which is smaller than the number of multiplications to compute N th power using the Chandah-sutra method (OEIS A014701, OEIS A371894). Furthermore, the values of the sequence A014701 are larger than the shortest addition chain for N 2 2 + 2 l . These values (OEIS A371894) are not given via Equation (15), but Equation (16) provides their subset. Their Hamming weight is at least four in binary representation. Furthermore, the exceptional a min ( N exc ) values bear similarity to the atomic numbers, Z, of chemical elements that violate the Aufbau rule [15], which correctly predicts the electron configurations of most elements. Only about twenty elements within 24 Z 103 (with only two non-doubleton sets of consecutive ones) violate the Aufbau rule.

5. Maximum Bitstring Assembly Index

In the following, we conjecture upon the form of the upper bound of the set of different bitstring assembly indices. In general, of all bitstrings, C k , having a given assembly index, shown in Table 1 and Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11 and Table A12 (Appendix C), most have N 1   =   N / 2 , though we have found a few exceptions, mostly for non-maximal assembly indices, namely for a ( 8 )   =   4 ( 4 < 8 ) and for a ( 8 )   =   6 ( 24 < 26 ), for a ( 10 )   =   4 ( 2 < 5 ) and for a ( 10 )   =   5 ( 32 < 33 ), and for a ( 12 )   =   4 ( 2 < 3 ). These observations allow us to restrict the search space of possible bitstrings with the largest assembly indices to balanced bitstrings only: with the exception of N   =   8 , of all bitstrings, C k ( N ) , having a largest assembly index, most are balanced. We can further restrict the search space to ringed bitstrings (Definition 3). If a bitstring, C min , for which a ( N ) ( C min )   =   min k { a ( N ) ( C k ) } is constructed from repeating patterns, then a bitstring, C max , for which a ( N ) ( C max )   =   max k { a ( N ) ( C k ) } must be the most patternless. The bitstring assembly index must be bounded from above, and a ( N ) ( C max ) must be a monotonically nondecreasing function of N that can increase at most by one between N and N + 1 . Certain heuristic rules apply in our binary case. For example,
  • For N   =   7 , we cannot avoid two doublets (e.g., 2 × [ 00 ] ) within a ringed bitstring, E 28 ( 7 )   =   [ 0011100 ] , and thus, a ( 7 ) ( C max )   =   5 < 6 ;
  • For N   =   8 , we cannot avoid two pairs of doublets (e.g., 2 × [ 00 ] and 2 × [ 11 ] ) within a ringed bitstring, E 7 ( 8 )   =   [ 00001111 ] , and thus, a ( 8 ) ( C max )   =   5 < 6 ;
  • For N   =   12 , we cannot avoid three pairs of doublets (e.g., 2 × [ 00 ] , 2 × [ 10 ] , and  2 × [ 11 ] ) within a ringed bitstring, E k ( 12 )   =   [ 111000101100 ] , and thus, a ( 12 ) ( C max )   =   8 < 9 ;
  • For N   =   14 , we cannot avoid two pairs of doublets and one doublet three times (e.g., 2 × [ 00 ] , 2 × [ 11 ] , and  3 × [ 01 ] , and thus, a ( 14 ) ( C max )   =   9 < 10 ;
  • etc.
Table 8 shows the exemplary balanced bitstrings, B max , having the largest assembly indices that we assembled (cf. also Appendix A). To determine the assembly index, a ( 18 )     =     11 , of the bitstring
E k ( 18 )   =   [ 1 ( 001 ) ( 11 ) ( 110 ) ( 110 ) ( 00 ) ( 001 ) 0 ] ,
for example, we look for the longest patterns that appear at least twice within the string, and we look for the largest number of these patterns. Here, we find that the two triplets [ 001 ] and [ 110 ] appear twice in E k ( 18 ) and are based on the doublets [ 00 ] and [ 11 ] also appearing in E k ( 18 ) . Thus, we start with the assembly pool { 1 , 0 , [ 00 ] , [ 001 ] , [ 11 ] , [ 110 ] } made in four steps and join the elements of the pool in the following seven steps to arrive at a ( 18 ) ( E k )   =   11 . On the other hand, another form of this balanced, ringed string,
E l ( 18 )   =   [ ( 01 ) ( 11 ) ( 110 ) ( 110 ) 00 ( 001 ) ( 01 ) 0 ] ,
has a ( 18 ) ( E l )   =   12 .
These results allow us to formulate the following conjecture.
Conjecture 1
(Tight upper bound on a bitstring assembly index). With exceptions for small N, the largest bitstring assembly index a ( N ) ( C max ) is given via a sequence formed by { + 1 , + 1 , k × 0 , + 1 , + 1 , k × 0 } for k N 0 , where + 1 denotes increasing a ( N ) ( C max ) by one, and 0 denotes maintaining it at the same level, and  a ( 0 )   =   1 .
However, at this moment, we cannot state whether this conjecture applies to ringed or non-ringed bitstrings. The assembly indices for N < 3 are the same for a given N, whereas the assembly indices for 4 N 10 were discussed above and are calculated in Appendix C for balanced and balanced, ringed bitstrings.
The conjectured sequence is shown in Figure 4 and Figure 5, starting with a ( 0 )   =   1 (we note in passing that n   =   1 is a dimension of the void, the empty set ∅, or (−1)-simplex). Subsequent terms are given via { 0 , 1 , 2 , 3 , 4 , 5 , 5 , 6 , 7 , 7 , 8 , 9 , 9 , 9 , 10 , } , which is periodic for N   =   k ( k + 3 ) and defines plateaus of a constant bitstring assembly index at a ( N ) ( C max )   =   4 k 3 , and  a ( N ) ( C max )   =   4 k 1 , k N , k > 1 .
This sequence can be generated using the procedure given by Listing 1.
Listing 1. Matlab code to generate the conjectured bitstring assembly index upper bound.
step = 1;% step flag
run = 1;% run flag
flat = 0;% flat~counter
  
Nk = 0;
aub = −1;% the upper bound
while Nk < N
          if step < 3
                    Nk = Nk + 1;% next Nk
                    aub= aub + 1;% increment the bound
          else% step==3
                    for k=1:flat
                              if flat > 0
                                        Nk = Nk + 1;% next Nk
                              end
                    end
                    run = run+1;% increment run flag
                    if run > 2
                              run  = 1;% reset run flag
                              flat = flat + 1;% increment flat counter
                    end
          end
          step = step+1;% increment step flag
          if step > 3
                    step = 1;% reset step flag
          end
end
We note the similarity of this bound to the monotonically nondecreasing Shannon entropy of chemical elements, including observable ones [15]. Perhaps the exceptions in the sequence of Conjecture 1 vanish as N increases.

6. Binputation

So far, we have assembled bitstrings “manually”. Now, we shall automatize this process using other bitstrings as assembling programs.
Definition 5.
The binary assembling program Q B is a bitstring of length s Q that acts on the assembly pool P and outputs the assembled bitstrings, adding them to the pool.
Definition 6.
The trivial assembling program Q is a binary assembling program with consecutive bits denoting the following commands:
0 
take the last element from P, join it with itself, and output;
1 
take the last two elements from P, join them with each other, and output.
As the assembly pool P is a distinct set to which bitstrings are added in subsequent assembly steps, only these two commands apply to the initial assembly pool, P   =   { 1 , 0 } , containing only two bits, regardless of the starting command.
Theorem 3.
If a bitstring, C min ( N ) , can be assembled via an elegant trivial program of length s Q   =   a ( N ) ( C min ) , then N is expressible as a product of Fibonacci numbers (OEIS A065108), and the length, s Q , of any trivial program Q is not shorter than the assembly index of the string that this trivial assembling program assembles.
Proof. 
An elegant program is the shortest program that produces a given output [35,36]. Furthermore, no program, P, shorter than an elegant program, Q, can find this elegant program, Q [35]. If it could, it could also generate Q’s output. But if P is shorter than Q, then Q would not be elegant, which leads to a contradiction.
The 1 st bit of the trivial assembling program Q is irrelevant, as Q   =   0 assembles C min 1 ( 2 )   =   [ 00 ] and Q   =   1 assembles C min 4 ( 2 )   =   [ 10 ] , so Q   =   assembles C min 1 , 4 ( 2 )   =   [ 0 ] . Then, the programs Q   =   0 0 assemble the 2 s Q -bit strings C min 1 , 4 ( 2 s )   =   [ 0 0 ] having the assembly index a min ( 2 s Q )   =   s Q , while bitstrings C min 2 , 3 2 s Q with the smallest assembly index, a min ( 2 s Q )   =   s Q , can be assembled with the same two programs starting with the reversed assembly pool P   =   { 0 , 1 } .
The remaining 2 s Q 1 2 programs will assemble some of the shorter bitstrings with the assembly index a min ( N )   =   s Q . In general, all programs, Q, assemble bitstrings that have lengths expressible as a product of Fibonacci numbers (OEIS A065108), as shown in Table A1 (Appendix B), wherein, out of 2 s Q 1 programs (cf. Table A1 and Table A4), the following applies:
  • 2 s Q 2 programs Q   =   0 assemble even-length, balanced bitstrings B   =   [ 0 0 ] having natural binary entropies (3) H ( C )   =   { 0 , 1 } , including bitstrings C min 1 , 4 ( 2 s Q ) (11);
  • 2 s Q 3 programs Q   =   10 assemble [ 0 00 0 ] bitstrings having lengths divisible by three and entropies H ( C ) { 0 , 0.9183 } ;
  • 2 s Q 4 programs Q   =   110 assemble [ 00 0 00 0 ] bitstrings having lengths divisible by five and entropies H ( C ) { 0 , 0.9710 } ;
  • 2 s Q 5 programs Q   =   1110 assemble [ 0 0 00 0 ] bitstrings having lengths divisible by eight, entropies H ( C ) { 0 , 0.9544 } , and assembly indices a ( N )   =   s Q 1 if   =   1 ;
  • …;
  • the program Q   =   1 0 joins the two shortest bitstrings assembled in a previous step into a bitstring of a length that is twice the Fibonacci sequence (OEIS A055389);
  • and finally, the program Q   =   1 1 assembles the shortest bitstring that has a length belonging to the set of Fibonacci numbers.
Thus, for   =   1 , binary assembling programs, Q, assemble subsequent 2 s Q 1   =   2 s Q 2 + 2 s Q 3 + + 2 0 + 1 Fibonacci words and their concatenations that have entropies (3) with ratios (4)
p 1 , m   =   F m F m + 2 and p 0 , m   =   F m + 1 F m + 2 ,
where m   =   { 1 , 2 , s Q } , and F is the Fibonacci sequence starting from 1. Ratios (21) rapidly converge to
lim s Q p 0 , m   =   φ 1 0.618033989 and lim s Q p 1 , m   =   2 φ 0.381966011
where φ is the golden ratio. Therefore, lim s Q H m 0.9594 is the binary entropy of the Fibonacci word limit. The Fibonacci sequence can be expressed through the golden ratio, which corresponds to the smallest Pythagorean triple { 3 , 4 , 5 } [37,38].
However, for s Q 4 , some of the programs are no longer elegant if   =   0 and some of the assembled bitstrings are not C min if   =   1 .
For s Q 4 , Q   =   111100 assembles a bitstring
C non - min ( 2 s Q 1 )   =   [ 01010010 ]
with an assembly index, a ( 2 s Q 1 )   =   s Q , which is not the minimum for this length of the bitstring. For example, the 4-bit program Q   =   111 assembles the bitstring C ( 8 )   =   [ 0 0 00 0 ] , but if   =   0 , this string can be assembled via a shorter 3-bit program, Q   =   00 , and if   =   1 , this string does not have the smallest assembly index, a ( 8 ) ( C min )   =   3 , but a ( 8 ) ( C non - min )   =   4 .
For s Q   =   { 4 , 7 } and s Q 10 , and for the shortest bitstring assembled via the program Q, the program Q is not elegant for   =   0 , and the shortest bitstring assembled via the program is not C min for   =   1 .
However, the length, s Q , of any program, Q, is not shorter than the assembly index of the bitstring that this program assembles. □
The trivial assembly programs, Q, and the bitstrings they assemble are listed in Table 9 and Table A2, Table A3 and Table A4 (Appendix B) for one version of the assembly pool and for 1 s Q 6 .
We note in passing that there are other mathematical results on bitstrings and the Fibonacci sequence. For example, it was shown [39] that having two concentric circles with radii { F n , F n + 2 } and drawing two pairs of parallel lines orthogonal to each other and tangent to the inner circle, one obtains an octagon defined by the points of intersection of those lines with the outer circle, which comes very close to the regular octagon with n . Furthermore, each of these octagons defines a Sturmian binary word (a cutting sequence for lines of irrational slope), except in the case of n   =   5 [39].
Perhaps the smallest assembly index given via Theorem 2 and the bitstrings of Theorem 3 are related to the Collatz conjecture, as the lengths of the strings (11) for N   =   2 2 k correspond to the numbers to which the Collatz conjecture converges, from N   =   ( 2 2 k 1 ) / 3 , k N (OEIS A002450).
Theorem 3 is also related to Gödel’s incompleteness theorems and the halting problem. N cases of the halting problem correspond only to log 2 ( N ) , not to N bits of information [40], and therefore, complexity is more fundamental to incompleteness than self-reference of Gödel’s sentence [41]. Any formal axiomatic system only enables provable theorems to be proved. If a theorem can be proved with an automatic theorem prover, the prover will halt after proving this theorem. Thus, proving a theorem equals halting. If we assume that the axioms of the trivial program given via Definition 6 define the formal axiomatic system, then the bitstrings that have lengths expressible as a product of Fibonacci numbers assembled through this program would represent provable theorems.
If we wanted to define a binary assembling program, Q B , that would use specific bitstrings other than the last one or two bitstrings in the assembly pool, we would have to index the bitstrings in the pool. However, at the beginning of the assembly process, we cannot predict in advance how many bitstrings will enter the assembly pool. Thus, we do not know how many bits will be needed to encode the indices of the strings in the pool. Therefore, we state the following conjecture.
Conjecture 2.
There is no binary assembling program (Definition 5) that has a length shorter than the length of the bitstring that has the largest assembly index that could assemble this string.
Theorem 3 would be violated if, in Definition 6, we specified the command “0”, e.g., as “take the last element from the assembly pool, join it with itself, join with what you have already assembled (say at ’the right’), and output”. Then, the 2-bit program “00” would produce the 6-bit string [ 000000 ] with the assembly index a ( 6 )   =   3 . However, such a one-step command would violate the axioms of assembly theory since it would perform two assembly steps in one program step. An elegant program to output the gigabyte bitstring of all zeros would take a few bits of code and would have a low Kolmogorov complexity [42]. However, such a bitstring would be outputted, not assembled. Furthermore, the length of such a program that outputs the bitstring [ 0 ] would be shorter than the length of the program that outputs the string [ 10 ] , while in AT, the lengths of these programs must be the same if the strings have the same assembly indices. Definitions 5 and 6 and Theorem 3 are about binputation, about bitstrings assembling other bitstrings.
In particular, Theorem 3 confirms that the assembly index is related to the amount of physical memory required to store the information to direct the assembly of an object (a bitstring in our case) and set a directionality in time from the simple to the complex [8]: s Q -bit long trivial assembling programs (i.e., with s Q -bits of memory) can assemble 2 s Q -bit strings with minimal assembly indices, s Q and, for s Q 4 , some shorter but more complex bitstrings with non-minimal assembly indices s Q . The memory defines the object [8].

7. Discussion and Conclusions

Consider the SARS-CoV-2 genome sequence defined by 29,903 nucleobases { A , C , G , T } , its initial version, MN908947 (Available online at https://www.ncbi.nlm.nih.gov/nuccore/MN908947, accessed on 16 May 2024), collected in December 2019 in Wuhan and its sample, OL351370 (Available online at https://www.ncbi.nlm.nih.gov/nuccore/OL351370, accessed on 16 May 2024), collected in Egypt nearly two years after the Wuhan outbreak, on 23 October 2021. In the MN version, the nucleobases are distributed as | A |   =   8954 , | C |   =   5492 , | G |   =   5863 , and | T |   =   9594 , and in the OL version, they are distributed as | A |   =   8954 , | C |   =   5470 , | G |   =   5856 , and | T |   =   9623 , following Chargaff’s parity rules with the same count of adenines. We can convert these sequences into bitstrings by assigning two bits per nucleobase. For such N = 59,806, not being the sum of two powers of 2, with the degree of causation [6] given via Equation (14), the assembly index is bounded by
21 a ( 59 , 806 ) ( C k ) 975 .
If a bitstring, C ( N ) , were to encode four DNA/RNA nucleobases, then the smallest assembly index bitstrings (as well as the strings generated via trivial assembly programs, Q, according to Definition 6), would not encode all nucleobases. For example, the bitstring C min ( 10 )   =   [ 1001010010 ] with a min ( 10 )   =   4 and encoding A = 00, C = 01, G = 10, and T = 11, cannot encode T = 11. Therefore, we increase the lower bound (24), given via Theorem 2, by one. The upper bound (24) was estimated by finding the smallest k     =     244 that satisfies k ( k + 3 ) N and using the relation a ( N ) ( C max )     =     4 k 1 of Conjecture 1. We do not know the actual assembly indices of the MN and OL sequences. Their determination is an NP-complete problem, as we conjecture. However, we note a relatively wide range of 954 assembly indices that nature provides for this genome sequence.
There are twelve possible assignments of two bits per nucleobase with twelve different Hamming weights
N 1 C M N ( 59,806 )   =   { 25,801 , 26,172 , 26,441 , 26,812 , 29,263 , 29,532 , 30,274 , 30,543 , 32,994 , 33,365 , 33,634 , 34,005 } , N 1 C O L ( 59,806 )   =   { 25,750 , 26,136 , 26,419 , 26,805 , 29,234 , 29,517 , 30,289 , 30,572 , 33,001 , 33,387 , 33670 , 34056 } ,
and six different Shannon entropies (3)
H C M N ( 59,806 )   =   { 0.9864 , 0.9887 , 0.9903 , 0.9923 , 0.9997 , 0.99989 } , H C O L ( 59,806 )   =   { 0.9860 , 0.9885 , 0.9902 , 0.9922 , 0.9996 , 0.99988 } .
Hamming weights (25) show that all sequences are almost balanced ( N / 2 = 29,903). However, the later OL versions are less balanced, producing lower Shannon entropies (26) and showcasing the existence of an entopic force that governs genetic mutations [25].
The bounds of Theorem 2 and Conjecture 1 are shown in Table 5 and Table 8 and are illustrated in Figure 4 and Figure 5. No bitstring can be assembled in a smaller number of steps than is given by a lower bound of Theorem 2. However, some bitstrings cannot be assembled in a smaller number of steps than given by an upper bound.
We found it much easier to determine the assembly index of a given bitstring, C k ( N ) , than to assemble a bitstring so that it would have the largest assembly index. Similarly, a trivial bitstring with the smallest assembly index for N can have the form C min 1 4 ( N )   =   [ ] (11) or the form of a Fibonacci word generated via the trivial assembling program (Definition 6). Therefore, we state the following conjecture.
Conjecture 3.
The problem of determining the assembly index of any bitstring, C k ( N ) , is NP-complete. The problem of assembling the bitstring so that it would have the largest assembly index for a large N is NP-hard. This corresponds to determining the largest assembly index value for a large N.
A proof of Conjecture 3 would also be the proof of the following known conjecture.
Conjecture 4.
P NP .
Every computable problem and every computable solution can be encoded as a finite bitstring. Here, determining whether the assembly index of a given bitstring has its known maximal value corresponds to checking the solution to a problem for correctness, whereas assembling such a bitstring corresponds to solving the problem. Thus, AT would solve the P versus NP problem in theoretical computer science. There is ample pragmatic justification for adding P NP as a new axiom [40]; rather than attempting to prove this conjecture, mathematicians should accept that it may not be provable and simply accept it as an axiom [43].
The bounds on the bitstring assembly index given via Theorem 2 and Conjecture 1, and the general bounds (1), and (2) on the assembly index [1], are illustrated in Figure 6 (adopted from [1] and modified; not to scale). The lower bound on the bitstring assembly index implies two paths of the evolution of information:
  • a creative path (slanting lines in Figure 6); and
  • an optimization path (vertical lines in Figure 6),
since for some bitstrings, C m , of length N > 3 , it admits the possible region of their assembly steps, a min ( N ) ( C m ) < s N 1 . For 1 N 3 , only the creative path is available, as there is nothing to optimize: a ( 1 N 3 )   =   N 1 . The 2nd path becomes available already at N   =   4 , where the suboptimal number of three steps used to assemble a bitstring, [ 0101 ] , can be optimized to a min ( 4 )   =   2 . The evolution becomes interesting for N 7 ( N > 7 for ringed strings, cf. Table 8), due to an upper bound on the bitstring assembly index. For each ( N 7 )-bit string, C m , suboptimally assembled in a ( N 7 ) ( C m ) < s N 1 steps, the search space is recursively explored to optimize the number of steps until the assembly index, a ( N 7 ) ( C m ) , of this bitstring is reached, where a min ( N 7 ) a ( N 7 ) ( C m ) a max ( N 7 ) .
We conjecture that, in general, the assembly of a novel, nontrivial bitstring, C m ( N + l ) , for l N , with a longer length, N + l , using the 1st path of evolution is NP-hard, requires access to noncomputability and, thus, is available only to dissipative structures, including living beings, such as humans. This path represents “true” creativity. However, once this new bitstring is assembled, it is unlikely that it will be assembled optimally in s steps corresponding to its assembly index. This implies the 2nd path of minimizing the number of steps, s, required to assemble this newly found, nontrivial bitstring, C m ( N + l ) , towards its assembly index, which is only NP-complete. The bitstring C m ( N + l ) is reassembled in a simpler way, but such a reassembly is no longer creative. The 2nd path represents “generative creativity” available both to dissipative structures and to artificial intelligence.
To illustrate this process, consider two examples: one from biological evolution (the emergence of amphibians from fish) and another from technological evolution (the invention of an airplane). The fish began to evolve around 541 million years ago, forming a plethora of fish species and exploring the available search space, optimizing the fish assembly index and increasing the information capacity within the range delimited by the same upper bound fish plateau (cf. Conjecture 1). Around 400 million years ago, some species of fish began using areas with fluctuating water levels, where occasionally water was scarce. The next amphibian plateau of a larger assembly index was within sight. Proto-lungs developed by groping [19], allowing fish to obtain oxygen from the air instead of water. The breakthrough was made, and amphibians were formed, exploring the subsequent amphibian plateau and optimizing this evolutionary gain. Many inventions led to the first airplane: the invention of the airfoil (George Cayley), its use in gliders (Otto Lilienthal), a propeller, etc. Again, the search space was well explored, and the airplane plateau of a larger assembly index was close. Finally, it was the Wright brothers, bicycle retailers who realized the importance of combining roll and yaw control in their first suboptimal Wright Flyer foreplane configuration. Once it was shown that it could be done, other people began to optimize this invention, minimizing the number of steps required to recreate it.
AT captures the notion of intelligence, understood as a degree of ability to reach the same goal through different means (assembly pathways) [44], where a fundamental aspect of intelligence is collective behavior [45]. Once the search space is saturated, the fish collectively explore it to develop lungs, just as humans, starting at least in the nineteenth century, began to think collectively about heavier-than-air flying machines. We assume that only dissipative structures can assemble novel structures of information and define living beings as dissipative structures provided with choice (the ability to select [6]) and humans as living dissipative structures provided with an abstract, modality-independent language. As shown in Figure 6, we predict a limit on complexity or a maximum assembly index, a H , achievable via non-human dissipative structures. These structures do not use an abstract, modality-independent language required for advanced human creativity. A human creative work also needs a certain minimum amount of information, N H . We take it for granted that, presently, only Homo sapiens has a gift of creativity that exceeds a H . Any creation is required to be shaped through the unique personality of its human creator(s) to such an extent that it is statistically one-time in nature [46]; it is an imprint of the author’s personality. Subsequent plateaus of a max ( N ) > a H can also be thought of as scientific paradigms [47] that define the basic concepts and research practices in science.
Any structure of information assembled via a dissipative structure in s steps can belong to one of the four regions shown in Figure 6:
1. N < N H and a max ( N ) < s < N 1 ,suboptimally assembled via dissipative structures
(green region),
2. a min ( N ) < s < max a max ( N ) , a H ,optimally assembled via dissipative structures,
3. N > N H and a H < s < a max ( N ) ,optimally assembled via humans, and
4. N > N H and a max ( N ) < s < N 1 ,suboptimally assembled via humans (red region).
We do not exclude the possibility that non-human dissipative structures are capable of suboptimally assembling structures C above a H , provided that their assembly indices satisfy a ( N ) ( C ) < a H . Thus, the optimization path shown in the white rectangle in Figure 6 is available only to humans.
The results reported here can be applied in the fields of cryptography, data compression methods, stream ciphers, approximation algorithms [48,49], reinforcement learning algorithms [50], information-theoretically secure algorithms, etc. Another possible application of the results of this study could be molecular physics and crystallography. Overall, the results reported here support the AT, emergent dimensionality [12,15,22,23,24,26,27,28,38], and the second law of infodynamics [25,29], and they invite further research.

Author Contributions

W.B.: conjecture concerning the diversification of bitstrings in Theorem 1; partitioning conjecture for N 16 resulting in the flattening of the maximum bitstring assembly index; observation that the maximum bitstring assembly index should be monotonically nondecreasing; prior-art search; numerous clarity corrections and improvements; S.Ł.: the remaining part of the study. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The public repository for the code written in the MATLAB computational environment is given under the link https://github.com/szluk/Evolution_of_Information (accessed on 16 May 2024).

Acknowledgments

The authors thank Piotr Masierak for his research on the general strategy for determining the bitstring assembly indices and creating the E k ( 18 ) bitstring (Peter’s rock), Andrzej Tomski for clarity corrections, Michael Orr for his note on the role of choice, and Mariola Bala for noting that “this is logical”. Szymon Łukaszyk thanks his wife, Magdalena Bartocha, for her unwavering support and motivation and his partner and friend, Renata Sobajda, for her prayers. Szymon Łukaszyk dedicates this work to the loving memory of his dad.

Conflicts of Interest

Authors Szymon Łukaszyk and Wawrzyniec Bieniawski were employed by the company Łukaszyk Patent Attorneys. The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ATassembly theory;
Nlength of a bitstring;
N 0 number of 0s in the bitstring;
N 1 binary Hamming weight of the bitstring;
C k ( N ) bitstring of length N;
B k ( N ) balanced bitstring of length N;
R k ( N ) ringed bitstring of length N;
E k ( N ) balanced, ringed bitstring of length N;
| C ( N ) | number of bitstrings of length N ( 2 N );
| B ( N ) | number of balanced bitstrings of length N (OEIS A001405);
| R ( N ) | number of ringed bitstrings of length N (OEIS A000031);
| E ( N ) | number of balanced, ringed bitstrings of length N;
a ( N ) assembly index of a bitstring of length N;
P   =   { 1 , 0 } initial assembly pool;
sassembly step;
Qbinary assembling program;
s Q length of the binary assembling program;
FFibonacci sequence.

Appendix A. Exemplary Maximal-Assembly-Index Bitstrings

For the exemplary balanced, ringed bitstrings, E max , shown in Table 8:
  • All forms of E k ( 4 )   =   [ 0011 ] have a ( 4 )   =   3 ;
  • All forms of E 6 ( 5 )   =   [ 00011 ] have a ( 5 )   =   4 ;
  • All forms of E 16 ( 6 )   =   [ 000111 ] have a ( 6 )   =   5 ;
  • The form E 28 ( 7 )   =   [ 0011100 ] has a ( 7 )   =   5 , but the form E 34 ( 7 )   =   [ 0001110 ] has a ( 7 )   =   6 ;
  • All forms of E 45 ( 8 )   =   [ 00010111 ] have a ( 8 )   =   6 ;
  • All forms of E 13 ( 9 )   =   [ 000011101 ] have a ( 9 )   =   7 ;
  • The form E 22 ( 10 )   =   [ 0000111101 ] has a ( 10 )   =   7 , but the form E l ( 10 )   =   [ 0111101000 ] has a ( 10 )   =   8 ;
  • All forms of E 7 ( 11 )   =   [ 00000101111 ] have a ( 11 )   =   8 ;
  • All forms of E 9 ( 12 )   =   [ 111000101100 ] have a ( 12 )   =   8 ;
  • All forms of E 8 ( 13 )   =   [ 0000001011111 ] have a ( 13 )   =   9 ;
  • All forms of E k ( 14 )   =   [ 00000101011111 ] have a ( 14 )   =   9 ;
  • All forms of E k ( 15 )   =   [ 000001010111110 ] have a ( 15 )   =   10 ;
  • All forms of E k ( 16 )   =   [ 1000000101011111 ] have a ( 16 )   =   10 ;
  • All forms of E k ( 17 )   =   [ 00000010101111110 ] have a ( 17 )   =   11 ;
  • All forms of E k ( 18 )   =   [ 000000101010111111 ] have a ( 18 )   =   11 ;
  • Some forms of E k ( 19 )   =   [ 1000010101001111101 ] have a ( 19 )   =   12 ;
  • Some forms of E k ( 20 )   =   [ 10100111110110000010 ] have a ( 20 )   =   13 .

Appendix B. Trivial Assembling Programs

Table A1 shows the lengths of the bitstrings assembled via the trivial assembling program introduced in Section 6 for 1 s Q 7 . The table is divided into sections corresponding to sets of assembled bitstrings that have the same form but different lengths. For example, thirty-two 7-bit programs in the bottom section assemble bitstrings C   =   [ 1 1 ] .
Table A1. Lengths of the bitstrings assembled via trivial assembly programs, Qs (OEIS A065108).
Table A1. Lengths of the bitstrings assembled via trivial assembly programs, Qs (OEIS A065108).
a ( N ) 01233, 44, 55, 66, 7
s Q 1234567
11235   1321 34    
||||| 42
|||| 0 2639
|||| 52
||| 0 16     24     40    
||| | 48    
||| 32     48    
||| 64    
|| 0 101525 40    
|| || 50
|| | 3045
|| | 60
|| 203050
|| | 60
|| 4060
|| 80
| 0 691524    39
| ||| 48    
| || 3045
| || 60
| | 182745
| | | 54
| | 3654
| | 72
| 121830 48    
| || 60
| | 3654
| | 72
| 243660
| | 72
| 4872
| 96
0 4610 16     2642
|||| 52
||| 32     48    
||| 64    
|| 203050
|| | 60
|| 4060
|| 80
| 121830 48    
| || 60
| | 3654
| | 72
| 243660
| | 72
| 4872
| 96
81220 32     52
||| 64    
|| 4060
|| 80
| 243660
| | 72
| 4872
| 96
162440 64    
|| 80
| 4872
| 96
324880
| 96
6496
128
. The program is not elegant. . The program is not elegant if   =   0 and the assembled bitstring is not C min if   =   1 . The boxed symbols denote program commands, not the bitstring lengths.
Table A2. Four-bit programs assembling bitstrings with a ( N )   =   { 3 , 4 } .
Table A2. Four-bit programs assembling bitstrings with a ( N )   =   { 3 , 4 } .
Q C   ( s   =   3 ) C   ( s Q   =   4 ) N
111 00 0 0 0 00 0 8  
110 00 0 00 0 10
101 0 0 0 0 9
100 0 0 0 0 12
011 0 0 10
010 0 0 12
001 0 0 12
000 0 0 16
. The program is not elegant if   =   0 and the assembled bitstring is not C min if   =   1 .
Table A3. Five-bit programs assembling bitstrings with a ( N )   =   { 4 , 5 } .
Table A3. Five-bit programs assembling bitstrings with a ( N )   =   { 4 , 5 } .
Q C   ( s   =   4 ) C   ( s Q   =   5 ) N
1111 0 0 00 0 00 00 0 00 0 13
1110 0 0 00 0 0 0 00 0 16  
1101 00 0 00 0 15
1100 00 0 00 0 20
1011 0 0 0 0 15
1010 0 0 0 0 18
1001 0 0 0 0 18
1000 0 0 0 0 24
0111 0 0 16  
0110 0 0 20
0101 0 0 18
0100 0 0 24
0011 0 0 20
0010 0 0 24
0001 0 0 24
0000 0 0 32
. The program is not elegant (the same bitstring can be assembled using the shorter 4-bit program 000 ). . The program is not elegant if   =   0 and the assembled bitstring is not C min if   =   1 .
Table A4. Six-bit programs assembling bitstrings with a ( N )   =   { 5 , 6 } .
Table A4. Six-bit programs assembling bitstrings with a ( N )   =   { 5 , 6 } .
Q C   ( s Q   =   6 ) N
11111 0 0 00 0 00 00 0 00 0 21
11110 00 00 0 00 0 26
1110 0 0 00 0 24   , 32  
110 00 0 25 , 30 , 40
10 0 0 24   , , 48
0 0 26 , 32   , , 64
. The program is not elegant. . The program is not elegant if   =   0 and the assembled bitstring is not C min if   =   1 .

Appendix C. Bitstrings and Their Assembly Indices

Table A1 show the lengths of the bitstrings assembled by programs Fs having the smallest assembly indices. Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11 and Table A12 show distributions of the assembly indices for 5 N 12 . Table A13, Table A14, Table A15, Table A16 and Table A17 show balanced bitstrings B ( N ) and their assembly indices for 5 N 8 . Table A18, Table A19, Table A20, Table A21, Table A22 and Table A23 show the balanced, ringed bitstrings E ( N ) and their assembly indices for 5 N 10 . Table A24, Table A25 and Table A26 show selected balanced, ringed bitstrings E ( N ) and their assembly indices for 11 N 13 .
Table A5. Distribution of the assembly indices for N   =   5 .
Table A5. Distribution of the assembly indices for N   =   5 .
N 1
a ( 5 ) ( C ) | a ( 5 ) ( C ) | 012345
318135531
414 2552
3215101051
Table A6. Distribution of the assembly indices for N   =   6 .
Table A6. Distribution of the assembly indices for N   =   6 .
N 1
a ( 6 ) ( C ) | a ( 6 ) ( C ) | 0123456
3101 323 1
444 61012106
510 262
641615201561
Table A7. Distribution of the assembly indices for N   =   7 .
Table A7. Distribution of the assembly indices for N   =   7 .
N 1
a ( 7 ) ( C ) | a ( 7 ) ( C ) | 01234567
4501571212751
574 2142121142
64 22
128172135352171
Table A8. Distribution of the assembly indices for N   =   8 .
Table A8. Distribution of the assembly indices for N   =   8 .
N 1
a ( 8 ) ( C ) | a ( 8 ) ( C ) | 012345678
341 2 1
438 98489
5132 817224022178
682 22624262
25618285670562881
Table A9. Distribution of the assembly indices for N   =   9 .
Table A9. Distribution of the assembly indices for N   =   9 .
N 1
a ( 9 ) ( C ) | a ( 9 ) ( C ) | 0123456789
42413 3553 31
5184 41735363635174
6248 21942616142192
756 424244
512193684126126843691
Table A10. Distribution of the assembly indices for N   =   10 .
Table A10. Distribution of the assembly indices for N   =   10 .
N 1
a ( 10 ) ( C ) | a ( 10 ) ( C ) | 012345678910
4201 3 525 3 1
5198 8222033323320228
6502 2186810811010868182
7288 232629662322
816 2122
1024110451202102522101204510
Table A11. Distribution of the assembly indices for N   =   11 .
Table A11. Distribution of the assembly indices for N   =   11 .
N 1
a ( 11 ) ( C ) | a ( 11 ) ( C ) | 01234567891011
518417142318292918231471
6686 4326910413413410469324
7970 969178229229178699
8208 4307070304
2048111551653304624623301655511
Table A12. Distribution of the assembly indices for N   =   12 .
Table A12. Distribution of the assembly indices for N   =   12 .
N 1
a ( 12 ) ( C ) | a ( 12 ) ( C ) | 0123456789101112
4101 3 2 3 1
594 1341012161210413
61034 124294141130196130141944212
71688 1110619635435435419610611
81180 1614328229828214316
990 21458142
40961126622049579292479249522066121
Table A13. | B ( 5 ) |   =   10 balanced bitstrings.
Table A13. | B ( 5 ) |   =   10 balanced bitstrings.
k B k ( 5 ) a ( 5 ) ( B k )
10(01)(01)3
2(01)0(01)3
3(01)(01)03
4(10)0(10)3
5(10)(10)03
6000114
7001104
8011004
9100014
10110004
Table A14. | B ( 6 ) |   =   20 balanced bitstrings.
Table A14. | B ( 6 ) |   =   20 balanced bitstrings.
k B k ( 6 ) a ( 6 ) ( B k )
1(01)(01)(01)3
2(10)(10)(10)3
30(01)(01)14
40(01)1(01)4
5(01)0(01)14
6(01)(01)104
7(01)10(01)4
8(01)1(01)04
9(10)0(10)14
10(10)01(10)4
11(10)(10)014
12(10)1(10)04
131(10)0(10)4
141(10)(10)04
150011105
160001115
170111005
181000115
191100015
201110005
Table A15. | B ( 7 ) |   =   35 balanced bitstrings.
Table A15. | B ( 7 ) |   =   35 balanced bitstrings.
k B k ( 7 ) a ( 7 ) ( B k )
10(01)(01)(01)4
2(01)(01)(01)04
3(10)(10)(10)04
4(01)(01)0(01)4
5(10)(10)0(10)4
6(01)0(01)(01)4
7(10)0(10)(10)4
8(100)(100)14
9(100)1(100)4
101(100)(100)4
11(001)1(001)4
12(001)(001)14
131(00)(00)115
14100(01)(01)5
15(10)001(10)5
16(10)(10)0015
17(10)1(10)005
1811(00)(00)15
191(10)(10)005
20111(00)(00)5
21(01)(01)1005
22(01)100(01)5
23(01)10(01)05
24(01)1(01)005
25(01)0(01)105
260(01)(01)105
270(01)1(01)05
28(00)111(00)5
29(01)00(01)15
30(00)(00)1115
3100(01)(01)15
3200(01)1(01)5
331(10)00(10)5
3400011106
3501110006
Table A16. | B ( 8 ) |   =   70 balanced bitstrings (1st part).
Table A16. | B ( 8 ) |   =   70 balanced bitstrings (1st part).
k B k ( 8 ) a ( 8 ) ( B k )
1((01)(01))((01)(01))3
2((10)(10))((10)(10))3
3((00)(11))((00)(11))4
4((01)(10))((01)(10))4
5((10)(01))((10)(01))4
6((11)(00))((11)(00))4
7(00)(00)(11)(11)5
8(001)(001)115
90(01)(01)(01)15
100(01)(01)1(01)5
110(01)1(01)(01)5
12(001)11(001)5
13(00)(11)(11)(00)5
14(01)0(01)(01)15
15(01)0(01)1(01)5
16(01)(01)0(01)15
17(01)(01)(01)105
18(01)(01)10(01)5
19(01)(01)1(01)05
20(011)00(011)5
21(01)10(01)(01)5
22(01)1(01)0(01)5
23(01)1(01)(01)05
24(011)(011)005
25(100)(100)115
2610(01)(01)(01)5
27(10)0(10)1(10)5
28(10)01(10)(10)5
29(100)11(100)5
30(101)00(101)5
31(10)(10)01(10)5
32(10)(10)(10)015
33(10)(10)1(10)05
34(10)1(10)0(10)5
35(10)1(10)(10)05
36(11)(00)(00)(11)5
37(110)00(110)5
3811(001)(001)5
39110010105
401(10)(10)0(10)5
41(110)(110)005
42110101005
4311(100)(100)5
44(11)(11)(00)(00)5
4500(011)(011)5
460(011)(011)05
Table A17. | B ( 8 ) |   =   70 balanced bitstrings (2nd part).
Table A17. | B ( 8 ) |   =   70 balanced bitstrings (2nd part).
k B k ( 8 ) a ( 8 ) ( B k )
4700(01)(01)116
4800(01)11(01)6
49000(11)(11)06
500(01)(01)1106
510011(10)(10)6
52(01)00(01)116
53(01)0(01)1106
54(01)(01)11006
55(01)1100(01)6
56(01)110(01)06
57(01)11(01)006
580(11)(11)0006
591(00)(00)1116
60(10)00(10)116
61100(01)1(01)6
62(10)0011(10)6
63(10)(10)00116
64(10)1(10)0016
65(10)11(10)006
661(10)00(10)16
6711(01)00(01)6
68111(00)(00)16
6911(10)00(10)6
7011(10)(10)006
Table A18. | E ( 5 ) |   =   2 balanced, ringed bitstrings.
Table A18. | E ( 5 ) |   =   2 balanced, ringed bitstrings.
k E k ( 5 ) a ( 5 ) ( E k )
10(01)(01)3
6000114
Table A19. | E ( 6 ) |   =   4 balanced, ringed bitstrings.
Table A19. | E ( 6 ) |   =   4 balanced, ringed bitstrings.
k E k ( 6 ) a ( 6 ) ( E k )
1(01)(01)(01)3
30(01)(01)14
40(01)1(01)4
160001115
Table A20. | E ( 7 ) |   =   5 balanced, ringed bitstrings.
Table A20. | E ( 7 ) |   =   5 balanced, ringed bitstrings.
k E k ( 7 ) a ( 7 ) ( E k )
10(01)(01)(01)4
12(001)(001)14
30(00)(00)1115
3100(01)(01)15
3200(01)1(01)5
Table A21. | E ( 8 ) |   =   10 balanced, ringed bitstrings.
Table A21. | E ( 8 ) |   =   10 balanced, ringed bitstrings.
k E k ( 8 ) a ( 8 ) ( E k )
1((01)(01))((01)(01))3
3(00)(11)(00)(11)4
7(00)(00)(11)(11)5
80(01)0(01)115
90(01)(01)(01)15
100(01)(01)1(01)5
110(01)1(01)(01)5
4600(011)(011)5
4500(01)(01)116
4700(01)11(01)6
Table A22. Selected balanced, ringed bitstrings | E ( 9 ) |   =   14 .
Table A22. Selected balanced, ringed bitstrings | E ( 9 ) |   =   14 .
k E k ( 9 ) a ( 9 ) ( E k )
10((01)(01))((01)(01))4
20((00)(11))((00)(11))5
3(0(01))(01)(001)15
4(0(01))(001)1(01)5
5(0(01))(001)(01)15
60(001)11(001)6
700(01)1(01)(01)6
800(01)(01)1(01)6
900(01)(01)(01)16
100(001)(001)116
11(00)(00)(11)0(11)6
120(00)(00)(11)(11)6
13(00)(00)111017
14(00)(00)101117
Table A23. | E ( 10 ) |   =   26 balanced, ringed bitstrings.
Table A23. | E ( 10 ) |   =   26 balanced, ringed bitstrings.
k E k ( 10 ) a ( 10 ) ( E k )
1((01)(01))((01)(01))(01)4
20((01)(01))((01)(01))15
3(01)(1(01)0)(1(01)0)5
4(0(01)1)(0011)(01)5
50((01)01)1(0101)5
60((10)10)1(1010)5
7(01)((01)10)(0110)5
8(0(01))(01)(001)116
9(0(01))(001)11(01)6
10(0(01))(001)1(01)16
11(0(01))(001)(01)116
140(0011)1(0011)6
1500((01)1)(011)(01)6
1600((01)1)(01)(011)6
170(0011)(0011)16
1900(01)((01)1)(011)6
12(00)0(11)(11)(00)17
1300(01)11(01)(01)7
1800(01)(01)11(01)7
2000(01)(01)(01)117
21(00)01(00)(11)(11)7
22(00)(00)(11)(11)017
23(00)(00)(11)10(11)7
24(00)(00)(11)0(11)17
25(00)(00)10(11)(11)7
26(00)(00)01(11)(11)7
Table A24. Selected balanced, ringed bitstrings E ( 11 ) .
Table A24. Selected balanced, ringed bitstrings E ( 11 ) .
k E k ( 11 ) a ( 11 ) ( E k )
10(01)((01))(01))(0101)5
2(0(01)(01))(00101)15
3(00)((00)11)(10011)6
4(0(01))(01)(01)(001)16
5(00)(00)(00)(11)(11)17
6(00)(110)1(00)(110)7
7(00)(00)(01)(01)1118
Table A25. Selected balanced, ringed bitstrings E ( 12 ) .
Table A25. Selected balanced, ringed bitstrings E ( 12 ) .
k E k ( 12 ) a ( 12 ) ( E k )
1((01)(01))(0101)(0101)4
2(0(01)1(01))(001101)5
3((01)1(0(01)))((01)1(001))5
4(0(01)1)(0011)(01)(01)6
5((01)0(01))(01001)116
6(001)(001)(001)1117
7(00)(00)(00)(11)(11)(11)7
8(00)(00)(11)(11)1(00)18
9(00)(10)(11)(00)(11)(10)8
10(11)(11)(01)(01)(00)(00)8
11(11)(11)(00)(00)(10)(10)8
Table A26. Selected balanced, ringed bitstrings E ( 13 ) .
Table A26. Selected balanced, ringed bitstrings E ( 13 ) .
k E k ( 13 ) a ( 13 ) ( E k )
10((01)(01))(0101)(0101)5
20((10)01(10))(100110)6
3(0((01)(01))(00101)(01)16
40(00)((00)(11))(0011)(11)7
5(00)((00)(11))(0011)0(11)7
6(00)(00)(00)0(11)(11)(11)8
7(00(01))(0001)(01)1118
8(00)(00)(00)10(11)(11)19

References

  1. Marshall, S.M.; Murray, A.R.G.; Cronin, L. A probabilistic framework for identifying biosignatures using Pathway Complexity. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2017, 375, 20160342. [Google Scholar] [CrossRef]
  2. Murray, A.; Marshall, S.; Cronin, L. Defining Pathway Assembly and Exploring its Applications. arXiv 2018, arXiv:1804.06972. [Google Scholar]
  3. Marshall, S.M.; Mathis, C.; Carrick, E.; Keenan, G.; Cooper, G.J.T.; Graham, H.; Craven, M.; Gromski, P.S.; Moore, D.G.; Walker, S.I.; et al. Identifying molecules as biosignatures with assembly theory and mass spectrometry. Nat. Commun. 2021, 12, 3033. [Google Scholar] [CrossRef]
  4. Liu, Y.; Mathis, C.; Bajczyk, M.D.; Marshall, S.M.; Wilbraham, L.; Cronin, L. Exploring and mapping chemical space with molecular assembly trees. Sci. Adv. 2021, 7, eabj2465. [Google Scholar] [CrossRef]
  5. Marshall, S.M.; Moore, D.G.; Murray, A.R.G.; Walker, S.I.; Cronin, L. Formalising the Pathways to Life Using Assembly Spaces. Entropy 2022, 24, 884. [Google Scholar] [CrossRef]
  6. Sharma, A.; Czégel, D.; Lachmann, M.; Kempes, C.P.; Walker, S.I.; Cronin, L. Assembly theory explains and quantifies selection and evolution. Nature 2023, 622, 321–328. [Google Scholar] [CrossRef]
  7. Jirasek, M.; Sharma, A.; Bame, J.R.; Mehr, S.H.M.; Bell, N.; Marshall, S.M.; Mathis, C.; MacLeod, A.; Cooper, G.J.T.; Swart, M.; et al. Investigating and Quantifying Molecular Complexity Using Assembly Theory and Spectroscopy. ACS Cent. Sci. 2024. [Google Scholar] [CrossRef]
  8. Walker, S.; Cronin, L. Time is an object (Not a backdrop, an illusion or an emergent phenomenon, time has a physical size that can be measured in laboratories). Am. Sci. 2023, 111, 302. [Google Scholar]
  9. Watanabe, S. Knowing and Guessing: A Quantitative Study of Inference and Information; Wiley: Hoboken, NJ, USA, 1969. [Google Scholar]
  10. Watanabe, S. Epistemological Relativity. Ann. Jpn. Assoc. Philos. Sci. 1986, 7, 1–14. [Google Scholar] [CrossRef]
  11. Prigogine, I.; Stengers, I. Order out of Chaos: Man’s New Dialogue with Nature; Bantam Books: New York, NY, USA, 1984. [Google Scholar]
  12. Łukaszyk, S. A No-go Theorem for Superposed Actions (Making Schrödinger’s Cat Quantum Nonlocal). In New Frontiers in Physical Science Research Vol. 3; Purenovic, D.J., Ed.; Book Publisher International (A part of SCIENCEDOMAIN International): London, UK, 2022; pp. 137–151. [Google Scholar] [CrossRef]
  13. Qian, K.; Wang, K.; Chen, L.; Hou, Z.; Krenn, M.; Zhu, S.; Ma, X. Multiphoton non-local quantum interference controlled by an undetected photon. Nat. Commun. 2023, 14, 1480. [Google Scholar] [CrossRef]
  14. Xue, P.; Xiao, L.; Ruffolo, G.; Mazzari, A.; Temistocles, T.; Cunha, M.T.; Rabelo, R. Synchronous Observation of Bell Nonlocality and State-Dependent Contextuality. Phys. Rev. Lett. 2023, 130, 040201. [Google Scholar] [CrossRef] [PubMed]
  15. Łukaszyk, S. Shannon Entropy of Chemical Elements. Eur. J. Appl. Sci. 2024, 11, 443–458. [Google Scholar] [CrossRef]
  16. Tran, D.M.; Nguyen, V.D.; Ho, L.B.; Nguyen, H.Q. Increased success probability in Hardy’s nonlocality: Theory and demonstration. Phys. Rev. A 2023, 107, 042210. [Google Scholar] [CrossRef]
  17. Colciaghi, P.; Li, Y.; Treutlein, P.; Zibold, T. Einstein-Podolsky-Rosen Experiment with Two Bose-Einstein Condensates. Phys. Rev. X 2023, 13, 021031. [Google Scholar] [CrossRef]
  18. Cronin, L. Lee Cronin: Controversial Nature Paper on Evolution of Life and Universe|Lex Fridman Podcast #404. 2023. Available online: https://lexfridman.com/lee-cronin-3-transcript/ (accessed on 16 May 2024).
  19. de Chardin, P.T. The Phenomenon of Man; Harper: New York, NY, USA, 1959. [Google Scholar]
  20. Melamede, R. Dissipative Structures and the Origins of Life; Unifying Themes in Complex Systems IV; Minai, A.A., Bar-Yam, Y., Eds.; Springer: Berlin/Heidelberg, Germany, 2008; pp. 80–87. [Google Scholar]
  21. Vedral, V. Decoding Reality: The Universe as Quantum Information; Oxford University Press: Oxford, UK, 2010. [Google Scholar] [CrossRef]
  22. Łukaszyk, S. Life as the Explanation of the Measurement Problem. J. Phys. Conf. Ser. 2024, 2701, 012124. [Google Scholar] [CrossRef]
  23. Łukaszyk, S. Black Hole Horizons as Patternless Binary Messages and Markers of Dimensionality. In Future Relativity, Gravitation, Cosmology; Nova Science Publishers: Hauppauge, NY, USA, 2023; Chapter 15; pp. 317–374. [Google Scholar] [CrossRef]
  24. Łukaszyk, S. Four Cubes. arXiv 2020, arXiv:2007.03782. [Google Scholar] [CrossRef]
  25. Vopson, M.M.; Lepadatu, S. Second law of information dynamics. AIP Adv. 2022, 12, 075310. [Google Scholar] [CrossRef]
  26. Łukaszyk, S. Novel Recurrence Relations for Volumes and Surfaces of n-Balls, Regular n-Simplices, and n-Orthoplices in Real Dimensions. Mathematics 2022, 10, 2212. [Google Scholar] [CrossRef]
  27. Łukaszyk, S.; Tomski, A. Omnidimensional Convex Polytopes. Symmetry 2023, 15, 755. [Google Scholar] [CrossRef]
  28. Łukaszyk, S. The Imaginary Universe. Phys. Sci. 2023, preprint. [Google Scholar]
  29. Vopson, M.M. The second law of infodynamics and its implications for the simulated universe hypothesis. AIP Adv. 2023, 13, 105308. [Google Scholar] [CrossRef]
  30. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  31. Bekenstein, J.D. Black holes and the second law. Lett. Al Nuovo Cimento Ser. 2 1972, 4, 737–740. [Google Scholar] [CrossRef]
  32. Bekenstein, J.D. Black Holes and Entropy. Phys. Rev. D 1973, 7, 2333–2346. [Google Scholar] [CrossRef]
  33. Hawking, S.W. Particle creation by black holes. Commun. Math. Phys. 1975, 43, 199–220. [Google Scholar] [CrossRef]
  34. Downey, P.; Leong, B.; Sethi, R. Computing Sequences with Addition Chains. SIAM J. Comput. 1981, 10, 638–646. [Google Scholar] [CrossRef]
  35. Chaitin, G.J. Randomness and Mathematical Proof. Sci. Am. 1975, 232, 47–52. [Google Scholar] [CrossRef]
  36. Chaitin, G.J. The unknowable; Springer Series in Discrete Mathematics and Theoretical Computer Sciencel; Springer: Singapore; New York, NY, USA, 1999. [Google Scholar]
  37. Rajput, C. Metallic Ratios in Primitive Pythagorean Triples: Metallic Means embedded in Pythagorean Triangles and other Right Triangles. J. Adv. Math. 2021, 20, 312–344. [Google Scholar] [CrossRef]
  38. Łukaszyk, S. Metallic Ratios and Angles of a Real Argument. IPI Lett. 2024, 26–33. [Google Scholar] [CrossRef]
  39. Caldarola, F.; d’Atri, G.; Maiolo, M.; Pirillo, G. New algebraic and geometric constructs arising from Fibonacci numbers: In honor of Masami Ito. Soft Comput. 2020, 24, 17497–17508. [Google Scholar] [CrossRef]
  40. Chaitin, G. From Philosophy to Program Size: Key Ideas and Methods: Lecture Notes on Algorithmic Information Theory from the 8th Estonian Winter School in Computer Science, EWSCS’03: [2–7 March]; Institute of Cybernetics at Tallinn Technical University: Tallinn, Estonia, 2003. [Google Scholar]
  41. Chaitin, G.J. Computational complexity and Gödel’s incompleteness theorem. ACM SIGACT News 1971, 11–12. [Google Scholar] [CrossRef]
  42. Kolmogorov, A. On tables of random numbers. Theor. Comput. Sci. 1998, 207, 387–395. [Google Scholar] [CrossRef]
  43. Chaitin, G. Omega and Why Maths Has No TOEs; University of Cambridge: Cambridge, UK, 2023. [Google Scholar]
  44. James, W. The Principles of Psychology; Henry Holt and Company: New York, NY, USA, 1890. [Google Scholar]
  45. McMillen, P.; Levin, M. Collective intelligence: A unifying concept for integrating biology across scales and substrates. Commun. Biol. 2024, 7, 378. [Google Scholar] [CrossRef] [PubMed]
  46. Barta, J.; Markiewicz, R. (Eds.) Prawo Autorskie: Przepisy, Orzecznictwo, Umowy Miedzynarodowe; 4. rozsz. i zaktualizowane, ed.; Dom Wydawniczy ABC: Warszawa, Poland, 2002. [Google Scholar]
  47. Kuhn, T.S. The Structure of Scientific Revolutions, 3rd ed.; University of Chicago Press: Chicago, IL, USA, 1996. [Google Scholar]
  48. Łukaszyk, S. A new concept of probability metric and its applications in approximation of scattered data sets. Comput. Mech. 2004, 33, 299–304. [Google Scholar] [CrossRef]
  49. Tomski, A.; Łukaszyk, S. Reply to “Various issues around the L1-norm distance”. IPI Lett. 2024, 2, 1–8. [Google Scholar] [CrossRef]
  50. Castro, P.S.; Kastner, T.; Panangaden, P.; Rowland, M. MICo: Improved representations via sampling-based state similarity for Markov decision processes. Adv. Neural Inf. Process. Syst. 2021, 34, 30113–30126. [Google Scholar] [CrossRef]
Figure 1. Numbers of all 2 N bitstrings (red), balanced bitstrings, | B k ( N ) | (green), ringed bitstrings, | R k ( N ) | (cyan), and balanced, ringed bitstrings, | E k ( N ) | (blue) as a function of the bitstring length, N.
Figure 1. Numbers of all 2 N bitstrings (red), balanced bitstrings, | B k ( N ) | (green), ringed bitstrings, | R k ( N ) | (cyan), and balanced, ringed bitstrings, | E k ( N ) | (blue) as a function of the bitstring length, N.
Mathematics 12 01600 g001
Figure 2. Degree of causation as a function of 1 N 10 5 .
Figure 2. Degree of causation as a function of 1 N 10 5 .
Mathematics 12 01600 g002
Figure 3. (a) Semi-log plot of 2 s (red), | N ngr | (green), and | N exc | (blue). (b) Fractions of | N ngr | (green) and | N exc | (blue) to 2 s , showing the deflection point for s   =   6 (see text for details).
Figure 3. (a) Semi-log plot of 2 s (red), | N ngr | (green), and | N exc | (blue). (b) Fractions of | N ngr | (green) and | N exc | (blue) to 2 s , showing the deflection point for s   =   6 (see text for details).
Mathematics 12 01600 g003
Figure 4. Lower bound on the bitstring assembly index given by Theorem 2 (red) and log 2 ( N ) (red, dash–dot), upper bound on the bitstring assembly index given by Conjecture 1 (green), factual values of the bitstring assembly index (blue) and the ringed bitstring assembly index (cyan), and N 1 (green, dash–dot) for the bitstring length 0 N 20 .
Figure 4. Lower bound on the bitstring assembly index given by Theorem 2 (red) and log 2 ( N ) (red, dash–dot), upper bound on the bitstring assembly index given by Conjecture 1 (green), factual values of the bitstring assembly index (blue) and the ringed bitstring assembly index (cyan), and N 1 (green, dash–dot) for the bitstring length 0 N 20 .
Mathematics 12 01600 g004
Figure 5. Lower bound on the bitstring assembly index (red), log 2 ( N ) (red, dash–dot), general rule a min ( { 2 s , N ^ , N ˜ } )   =   { s , s + 1 , s + 2 } (cyan), and OEIS A014701 (yellow), conjectured upper bound on the bitstring assembly index (green) and N 1 (green, dash–dot), and assembly indices of C non - min ( N ) bitstrings assembled through trivial assembling programs (blue) for the bitstring length 0 N 100 (see text for details).
Figure 5. Lower bound on the bitstring assembly index (red), log 2 ( N ) (red, dash–dot), general rule a min ( { 2 s , N ^ , N ˜ } )   =   { s , s + 1 , s + 2 } (cyan), and OEIS A014701 (yellow), conjectured upper bound on the bitstring assembly index (green) and N 1 (green, dash–dot), and assembly indices of C non - min ( N ) bitstrings assembled through trivial assembling programs (blue) for the bitstring length 0 N 100 (see text for details).
Mathematics 12 01600 g005
Figure 6. An illustrative graph of complexity against information capacity: the orange regions are impossible, as they are above or below the assembly index general bounds, the yellow region indicates the bitstring assembly index bounds, the green region contains structures that can be assembled via dissipative structures of nature, the red region contains structures that can only be assembled by humans, the blue circles and dots denote, respectively, the number of steps of suboptimally assembled bitstrings and their assembly indices, and the blue slanting and vertical lines denote, respectively, creative and optimization paths of the evolution of information (figure not to scale; see text for details).
Figure 6. An illustrative graph of complexity against information capacity: the orange regions are impossible, as they are above or below the assembly index general bounds, the yellow region indicates the bitstring assembly index bounds, the green region contains structures that can be assembled via dissipative structures of nature, the red region contains structures that can only be assembled by humans, the blue circles and dots denote, respectively, the number of steps of suboptimally assembled bitstrings and their assembly indices, and the blue slanting and vertical lines denote, respectively, creative and optimization paths of the evolution of information (figure not to scale; see text for details).
Mathematics 12 01600 g006
Table 1. Distribution of the assembly indices for N   =   4 .
Table 1. Distribution of the assembly indices for N   =   4 .
N 1
a ( 4 ) ( C ) | a ( 4 ) ( C ) | 01234
241 2 1
312 444
1614 | B ( 4 ) |   =   6 41
Table 2. | B ( 4 ) |   =   6 balanced bitstrings B k ( 4 ) .
Table 2. | B ( 4 ) |   =   6 balanced bitstrings B k ( 4 ) .
k B k ( 4 ) a ( 4 ) ( B k )
1(01)(01)2
2(10)(10)2
301103
411003
510013
600113
Table 3. Bitstring length: N; number of all bitstrings: 2 N ; number of balanced bitstrings: B k ( N ) ; number of ringed bitstrings: R k ( N ) ; and number of balanced, ringed bitstrings: E k ( N ) .
Table 3. Bitstring length: N; number of all bitstrings: 2 N ; number of balanced bitstrings: B k ( N ) ; number of ringed bitstrings: R k ( N ) ; and number of balanced, ringed bitstrings: E k ( N ) .
N 2 N | B k ( N ) | | R k ( N ) | | E k ( N ) | | B k ( N ) | / | E k ( N ) |
121211
242312
383413
4166623
53210825
664201445
7128352057
82567036107
951212660149
10102425210826 9.6923
1120484621884211
1240969243528011.55
138192171663213213
1416,38434321182246 13.9512
1532,7686435219242915
Table 4. Distribution of assembly indices among balanced, ringed bitstrings, E ( N ) , for 4 N 11 .
Table 4. Distribution of assembly indices among balanced, ringed bitstrings, E ( N ) , for 4 N 11 .
N | E ( N ) | a ( N )   =   2 a ( N )   =   3 a ( N )   =   4 a ( N )   =   5 a ( N )   =   6 a ( N )   =   7 a ( N )   =   8
4211
52 11
64 121
75 23
810 1162
914 1472
1026 16910
1142 214206
Table 5. The lower bound on the bitstring assembly index (OEIS A003313).
Table 5. The lower bound on the bitstring assembly index (OEIS A003313).
N123456789101112131415161718192021
a min ( N ) 012233434454555455656
Table 6. List of the shortest-addition chain-sequence-generating factors for 1 s 5 .
Table 6. List of the shortest-addition chain-sequence-generating factors for 1 s 5 .
s 2 s The Shortest-Addition Chain-Sequence-Generating Factors
12 1 1
24 1 2 3 1
38 1 3 3 2 5 1 7 1
416 1 4 3 3 5 2 7 2 9 1 11 1 13 1 15 1
532 1 5 3 4 5 3 7 3 9 2 11 2 13 2 15 2 17 1 19 1 21 1 23 1 25 1 27 1 29 1 31 1
Table 7. Number of exceptional a min ( N exc ) values | N exc | and the number, | N ngr | , of a min ( N ngr ) not generated via general rules for 0 s 15 .
Table 7. Number of exceptional a min ( N exc ) values | N exc | and the number, | N ngr | , of a min ( N ngr ) not generated via general rules for 0 s 15 .
s0123456789101112131415
| N exc | 000029308019343292519283953802416,18932,544
2 s 1248163264128256512102420484096819216,38432,768
| N ngr | 000172151113239493100320254071816516,35532,737
min m k 13137777777777
Table 8. Exemplary balanced bitstrings, B max ( N ) , that have a largest assembly index. Conjectured form of the largest assembly index ( a conj ( N ) ) and its factual values for ringed ( a rng ( N ) ) and non-ringed ( a nrng ( N ) ) bitstrings (red if below the conjectured value and green if above).
Table 8. Exemplary balanced bitstrings, B max ( N ) , that have a largest assembly index. Conjectured form of the largest assembly index ( a conj ( N ) ) and its factual values for ringed ( a rng ( N ) ) and non-ringed ( a nrng ( N ) ) bitstrings (red if below the conjectured value and green if above).
N B max ( N ) a conj ( N ) a rng ( N ) a nrng ( N )
10 000
210 111
3001 222
40011 333
500011 444
6000111 555
70011100 556
800010111 666
9000011101 777
100000111101 778
1100000101111 888
12111000101100 988
130000001011111 999
1400000101011111 999
15000001010111110 101010
161000000101011111 111010
1700000010101111110 111111
18100111110110000010 111112
191000010101001111101 121112
2010100111110110000010131213
Table 9. Three-bit elegant programs assembling bitstrings with a ( N )   =   3 .
Table 9. Three-bit elegant programs assembling bitstrings with a ( N )   =   3 .
Q C   ( s   =   1 ) C   ( s   =   2 ) C   ( s Q   =   3 ) N
11 0 0 0 00 0 5
10 0 0 0 0 00 0 6
01 0 0 0 0 0 0 6
00 0 0 0 0 0 0 0 8
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Łukaszyk, S.; Bieniawski, W. Assembly Theory of Binary Messages. Mathematics 2024, 12, 1600. https://doi.org/10.3390/math12101600

AMA Style

Łukaszyk S, Bieniawski W. Assembly Theory of Binary Messages. Mathematics. 2024; 12(10):1600. https://doi.org/10.3390/math12101600

Chicago/Turabian Style

Łukaszyk, Szymon, and Wawrzyniec Bieniawski. 2024. "Assembly Theory of Binary Messages" Mathematics 12, no. 10: 1600. https://doi.org/10.3390/math12101600

APA Style

Łukaszyk, S., & Bieniawski, W. (2024). Assembly Theory of Binary Messages. Mathematics, 12(10), 1600. https://doi.org/10.3390/math12101600

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop