Next Article in Journal
Special Type Routing Problems in Plane Graphs
Next Article in Special Issue
Compact Word-Serial Modular Multiplier Accelerator Structure for Cryptographic Processors in IoT Edge Nodes with Limited Resources
Previous Article in Journal
Problem Transformation as a Gateway to the Wider Use of Basic Computational Algorithms
Previous Article in Special Issue
A Formal Approach to Coercion Resistance and Its Application to E-Voting
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Efficient Algorithm to Compute the Linear Complexity of Binary Sequences

by
Amparo Fúster-Sabater
1,*,
Verónica Requena
2 and
Sara D. Cardell
3
1
Instituto de Tecnologías Físicas y de la Información, C.S.I.C., 28006 Madrid, Spain
2
Departament de Matemàtiques, University of Alicante, 03690 Alicante, Spain
3
Centro de Matemática, Computação e Cognição, Universidade Federal do ABC (UFABC), Santo André 09210-580, Brazil
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(5), 794; https://doi.org/10.3390/math10050794
Submission received: 27 January 2022 / Revised: 18 February 2022 / Accepted: 24 February 2022 / Published: 2 March 2022
(This article belongs to the Special Issue Mathematics Cryptography and Information Security 2021)

Abstract

:
Binary sequences are algebraic structures currently used as security elements in Internet of Things devices, sensor networks, e-commerce, and cryptography. In this work, a contribution to the evaluation of such sequences is introduced. In fact, we present a novel algorithm to compute a fundamental parameter for this kind of structure: the linear complexity, which is related to the predictability (or non-predictability) of the binary sequences. Our algorithm reduced the computation of the linear complexity to just the addition modulo two (XOR logic operation) of distinct terms of the sequence. The performance of this procedure was better than that of other algorithms found in the literature. In addition, the amount of required sequence to perform this computation was more realistic than in the rest of the algorithms analysed. Tables, figures, and numerical results complete the work.

1. Introduction

The Internet is rapidly evolving, making possible the connectivity among any kinds of devices and giving rise to the Internet of Things (IoT) [1,2]. The IoT is based on all the applications and possibilities provided by the devices of daily use such as mobile phones, televisions, refrigerators, etc., to improve people’s lives and business environments. Nevertheless, the need to secure the interconnection of all these devices soon arose. This task is not easy, since the vast majority of IoT devices are not manufactured nor designed with safety in mind. Moreover, these devices are very different from each other, and security must be adapted to distinct models, types, and technical characteristics.
The security of an IoT network involves different techniques, i.e., blockchain, trust management [3,4,5], as well as cryptographic mechanisms. Cryptographic security is needed to prevent users from losing data and to avoid risks related to the inappropriate use of passwords. Many designs providing cryptographic security are based on true random numbers, but their generation is a complex task [6,7]. Some popular random noise algorithms are somehow imperfect, showing defects that make them vulnerable and predictable, which in cryptographic terms is a real concern. Some weaknesses are never found, at least publicly, and create a false sense of security in the community of users. The devices in which faults are detected are those with the most flagrant glitches or those that are most popular, e.g.: (1) the algorithm A5 in GSM mobile communications cryptanalysed in [8,9]; (2) the J3Gen generator for low-cost passive RFID tags; (3) the generator RC4 for encrypting Internet traffic that was also cryptanalysed in [10,11]. In brief, it is difficult to design a true random number generator that can provide a strong cryptographic basis for system security, especially for IoT devices; see [12,13].
Pseudo-Random Number Generators (PRNGs) are deterministic algorithms [14,15] used to produce random number sequences for cryptographic applications, such as digital signatures, the generation of nonces or keys, etc. These applications require some statistical properties in their output sequences such as a large period and linear complexity, low auto-correlation, wide dimensional distribution, the uniformity of the distribution for large quantities of generated numbers, etc. The interested reader is referred to [16] (Chapter 2).
Binary sequences generated by maximal-period Linear Feedback Shift Registers (LFSRs), i.e., PN-sequences [17], have been extensively used in many and diverse applications such as e-commerce, mobile wireless communications, digital broadcasting, and cryptography (stream ciphers) [18,19]. In order to ensure their cryptographic use, it is required to remove the linearity inherent to the PN-sequences. LFSRs play an important part in the design of cryptographic PNRGs [20,21]. Among the most popular families of cryptographic sequence generators based on PN-sequences, we can highlight the irregular decimation-based generators. The method of irregularly decimating the output sequences of LFSRs [22] generates powerful PNRGs that produce, in turn, sequences with good cryptographic properties such as: long periods, large linear complexity, two-valued auto-correlation properties, good distribution of zeros and ones along the sequence, etc. One of the most important generators in this family is the Generalized Self-Shrinking Generator (GSSG) [23]. This generator is fast, easy to implement, and generates good cryptographic sequences [24], so it is appropriate for low-cost applications.
In this paper, we present a contribution to security whose significance is limited to cryptographic mechanisms. In fact, our proposal is an algorithm that computes the linear complexity of sequences with a period of a power of two. The technique herein presented uses Hadamard matrices [25] and the B-representation of sequences proposed in [26,27]. Our algorithm was much more efficient than the other ones proposed in the literature [26,28,29,30], and the required amount of sequence to perform this computation was also more realistic.
Furthermore, besides being faster and requiring less intercepted bits of the sequence, the importance of this algorithm relies on the fact that it is especially efficient for families of sequences with an upper bound on the parameter’s linear complexity, i.e, the sequences generated by the Generalized Self-Shrinking Generator (GSSG) [23].
Our contributions can be enumerated as follows: (1) some basic concepts needed to understand the rest of the paper are introduced (Section 2); (2) we recall the concept of B-representation and establish a relation between this representation of sequences and the Hadamard matrices (Section 3); (3) we propose a novel method of computing the linear complexity of sequences with a period of a power of two and apply it to generalized sequences in Section 4 and Section 5, respectively; (4) we discuss (Section 6) the advantages of our method when compared with the other ones; (5) the paper ends (Section 7) with some important remarks concerning our results.

2. Basic Concepts and Generalities

In this section, we introduce different concepts and structures that are used systematically throughout the work.

2.1. Linear Feedback Shift Registers

Let F 2 = { 0 , 1 } be the Galois field of two elements. Consider { u i } i 0 = { u 0 , u 1 , u 2 , } a binary sequence with u i F 2 , for i = 0 , 1 , 2 , We say that the sequence { u i } i 0 is periodic if there exists an integer T, called the period, such that u i + T = u i , for all i 0 . In the sequel, all the sequences considered are binary sequences, and the symbol + denotes the Exclusive-OR (XOR) logic operation.
Let r be a positive integer, and let a 1 , a 2 , a 3 , , a r be constant coefficients with a j F 2 , for j = 1 , 2 , , r . A binary sequence { u i } i 0 satisfying the relation:
u i + r = a 1 u i + r 1 + a 2 u i + r 2 + a 3 u i + r 3 + + a r 1 u i + 1 + a r u i , i 0 ,
is called an r-th order linear recurring sequence in F 2 . The terms { u 0 , u 1 , , u r 1 } are referred to as the initial terms and define uniquely the construction of the sequence.
A relation of the form given by Equation (1) is called an r-th order linear recurrence relationship.
The monic polynomial:
p ( x ) = x r + a 1 x r 1 + a 2 x r 2 + a 3 x r 3 + + a r 1 x + a r F 2 [ x ]
is called the characteristic polynomial of the linear recurring sequence and { u i } i 0 is said to be generated by p ( x ) .
We can obtain linear recurring sequences through Linear Feedback Shift Registers (LFSRs) [17]. In fact, an LFSR can be defined as an electronic device with r interconnected memory cells (or stages) with binary content. At every clock pulse, the binary element of each stage is shifted to the adjacent stage, as well as a new element is computed through the linear feedback to fill the empty stage (see Figure 1). We say that the LFSR has the maximal length if the characteristic polynomial of the linear recurring sequence is primitive. In this case, its output sequence is called the Pseudo-Noise sequence (PN-sequence), and its period is T = 2 r 1 with 2 r 1 ones and 2 r 1 1 zeros; see [17].
The linear complexity of a sequence { u i } i 0 , denoted by L C , is defined as the length of the shortest LFSR that generates such a sequence or, equivalently, as the lowest-order linear recurrence relationship that generates such a sequence.
In cryptographic applications, the linear complexity must be as large as possible. The expected value is approximately half the period L C T / 2 (see [31]).

2.2. The Generalized Self-Shrinking Generator

Consider a PN-sequence { u i } i 0 obtained from a maximal-length LFSR with L stages and an L-dimensional binary vector G = [ g 0 , g 1 , g 2 , . . . , g L 1 ] F 2 L , and let { v i } i 0 be the sequence defined as:
v i = g 0 u i + g 1 u i 1 + g 2 u i 2 + + g L 1 u i L + 1 for i 0 .
Next, we define a decimation rule to generate a new sequence { s j } j 0 as follows:
I f u i = 1 , then s j = v i ; I f u i = 0 , then v i is discarded .
The sequence { s j } j 0 , denoted by S ( G ) , is called the generalized self-shrunken sequence, GSS-sequence, or simply generalized sequence associated with G ; see [23]; the sequence generator is called the Generalized Self-Shrinking Generator (GSSG). The set of sequences S = S ( G ) | G F 2 L is called the family of generalized sequences based on the PN-sequence { u i } i 0 . It is worth noticing that the period of any generalized sequence is a divisor of 2 L 1 (as 2 L 1 is the number of ones in the PN-sequence), and the linear complexity satisfies [29]:
2 L 2 < L C 2 L 1 ( L 2 ) .
In order to analyse more properties of this generator, the interested reader is referred to [22,23,27,32].
From now on, we consider a sequence with the notation { u i } i 0 or { u i } , indistinctly.

2.3. Binomial Sequences

The binomial number n i represents the coefficient corresponding to the power x i in the algebraic expansion of the polynomial ( 1 + x ) n . For every integer n 0 , we know that n 0 = 1 , while n i = 0 for i > n . Now, the binomial sequences are introduced as follows.
Definition 1.
Given a fixed integer k 0 , the k-th binomial sequence is given by:
n k n 0 = 0 , i f n < k , n k m o d 2 , i f n k .
Table 1 shows the first eight binomial coefficients, as well as the first eight binomial sequences with their corresponding periods and linear complexities. Moreover, recall that the binomial sequences are just shifted versions (starting in the first one) of the successive diagonals of Sierpinski’s triangle reduced modulo two, as depicted in Figure 2.
In the following, we recall some results about the period and the structure of the binomial sequences.
Proposition 1
([26], Proposition 3). Given the binomial sequence n 2 L + k , with 0 k < 2 L , we have that:
(a) 
The period of such a sequence is T = 2 L + 1 ;
(b) 
The period of such a sequence has the following structure:
n 2 L + k 0 n < 2 L + 1   =   0 i f 0 n < 2 L + k , n k i f 2 L + k n < 2 L + 1 .
Corollary 1
([26], Corollary 4). The binomial sequences of the form n 2 L , ( L = 0 , 1 , 2 , ) have period T = 2 L + 1 and the following structure:
n 2 L 0 n < 2 L + 1   =   0 I f 0 n < 2 L ; 1 I f 2 L n < 2 L + 1 .
Corollary 2
([26], Corollary 5). The binomial sequences of the form n 2 L ( L = 0 , 1 , 2 , ) are balanced, i.e, they contain the same number of ones and zeros.
The proofs and more properties of such sequences can be found in [26].
Remark 1.
The binomial sequences have the following structure:
n 2 L : 0 0 0 2 L zeros 1 1 1 2 L ones n 2 L + k : 0 0 0 2 L zeros the first 2 L terms of n k
See the Figure 3 for more details. The following example illustrates the previous results.
Example 1.
The binomial sequence n 4 has period eight and is composed of four consecutive zeros followed by four consecutive ones:
n 4 : 0 0 0 0 1 1 1 1
The binomial sequence n 7 = n 4 + 3 has also period eight and is composed of four consecutive zeros followed by the first four bits of n 3 :
n 7 : 0 0 0 0 0 0 0 1 n 3
Theorem 1.
The linear complexity of the binomial sequence n k is k + 1 .
Notice that, as a consequence of Theorem 1, the linear complexity of any binomial sequence is immediate. Furthermore, in the following sections, we will see that any sequence of a period of a power of two can be expressed as a sum (modulo two) of binomial sequences. As a result, it is easy to compute the linear complexity of such sequences by just observing the binomial sequence of a greater degree in the binomial decomposition [26].

2.4. Construction of Binomial Matrices from Binomial Sequences

In this subsection, we introduce the concept of the binomial matrix, which is closely related with the binomial sequences and Sierpinski’s triangle.
Definition 2.
Let t be a non-negative integer. We define the binomial matrix H t as the binary Hadamard matrix of size 2 t × 2 t constructed as follows: H 0 = [ 1 ] , for t = 0 , and:
H t = H t 1 H t 1 0 t 1 H t 1 ,
for t > 0 , with 0 t 1 the 2 t 1 × 2 t 1 null matrix.
In general, any binomial matrix H t can be easily constructed from binomial sequences as follows:
  • Its rows correspond to the first 2 t bits of the first 2 t binomial sequences, that is,
    H t = n 0 n 1 n 2 t 2 n 2 t 1 = 1 1 1 1 1 1 1 1 1 1 0 1 0 1 0 1 0 1 0 1 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 1 ;
  • Its columns correspond to the first 2 t bits of shifted versions of the first 2 t binomial sequences starting in their first one, i.e., the columns of H t are the diagonals of Sierpinski’s triangle (see Figure 2):
    H t = n 2 t 1 * n 2 t 1 * n 1 * n 0 * = 1 1 1 1 0 1 0 1 0 0 1 1 0 0 0 1 0 0 1 1 0 0 0 1 .
In the previous expression, we denoted by n k * the shifted version of the binomial sequence n k starting in the first one ( k = 0 , , 2 t 1 ) .
As we said before, the diagonals of the Sierpinski triangle correspond to the columns of the Hadamard matrix, and at the same time, they are shifted versions of the binomial sequences, that is the rows of the same matrix. See, for example, Figure 4. The circled diagonal corresponds to the shifted version n 4 * of the binomial sequence n 4 . Notice that the sequence n 4 corresponds to the fourth row of H t , and the circled diagonal corresponds to the ( 2 t 5 ) -th column of H t , for t sufficiently large. In general, the sequence n k corresponds to the k-th row of H t , and the shifted version n k * (that is, the k-th diagonal of Sierpinski’s triangle) corresponds to the ( 2 t k 1 ) -th column of H t , for t sufficiently large.
In brief, there is a close relation among binomial sequences, diagonals of Sierpinski’s triangle, and binary Hadamard matrices.

3. B-Representation

The binomial representation (B-representation) of binary sequences was first introduced in [26]. In fact, every binary sequence { s n } whose period T is a power of two, that is T = 2 t , can be expressed as a linear combination of binomial sequences as follows:
{ s n } = i = 0 2 t 1 c i n i ,
where t is a non-negative integer, n i is the i-th binomial sequence, and the coefficients c i F 2 , for ( i = 0 , 1 , , 2 t 1 ) . The above equation is the B-representation of the sequence { s n } .
Let i m a x be an integer in the interval ( 0 i m a x 2 t 1 ) such that the coefficient c i m a x of the B-representation satisfies:
c i m a x 0 while c i = 0 for all index i in the range ( i m a x < i 2 t 1 ) .
The coefficient c i m a x and the B-representation provide us with information about two fundamental parameters of the sequence, the period and the linear complexity:
  • Period of { s n } in terms of the B-representation: As a consequence of Proposition 1, it is possible to prove that the period T of the sequence { s n } is the period of the binomial sequence n i m a x , since the period of the sequence is the greatest period of the binomial sequences included in its B-representation (see ([26], Proposition 3));
  • Linear complexity of { s n } in terms of the B-representation: As a consequence of Theorem 1, the linear complexity of the sequence { s n } is the linear complexity of the binomial sequence n i m a x (see ([26], Corollary 14)), that is:
    L C = i m a x + 1 .
Fixing a linear complexity L C , the period T of the corresponding sequence is uniquely determined. Nevertheless, fixing a period T, there exist distinct sequences with such a period, but with different values of their linear complexities, as we can observe in Table 1.
Due to the particular structure of the binomial sequences, we can reformulate the binomial representation of { s n } given in Equation (4) and convert it into a matrix equation using the binomial matrix H t .
Theorem 2.
Consider the B-representation of a sequence { s n } of period T = 2 t , with t a non-negative integer, and let H t be the binomial matrix of size 2 t × 2 t . Then,
( s 0 , s 1 , , s 2 t 1 ) = ( c 0 , c 1 , , c 2 t 1 ) · H t m o d 2 ,
where the vector ( s 0 , s 1 , , s 2 t 1 ) corresponds to the 2 t successive terms of the sequence { s n } and ( c 0 , c 1 , , c 2 t 1 ) are the coefficients that weight the binomial sequences in (4).
Proof. 
From the B-representation of { s n } given in Expression (4), we have that:
c 0 · { 1 1 1 1 1 1 1 1 1 1 1 1 } c 1 · { 0 1 0 1 0 1 0 1 0 1 0 1 } c 2 · { 0 0 1 1 0 0 1 1 0 0 1 1 } c 3 · { 0 0 0 1 0 0 0 1 0 0 0 1 } c 2 t 4 · { 0 0 0 0 0 0 0 0 1 1 1 1 } c 2 t 3 · { 0 0 0 0 0 0 0 0 0 1 0 1 } c 2 t 2 · { 0 0 0 0 0 0 0 0 0 0 1 1 } + c 2 t 1 · { 0 0 0 0 0 0 0 0 0 0 0 1 } { s n } = { s 0 s 1 s 2 s 3 s 4 s 5 s 6 s 7 s 2 t 4 s 2 t 3 s 2 t 2 s 2 t 1 }
where s k , the k-th term of the sequence { s n } , is the bitwise XOR operation of the k-th term of each binomial sequence, notated k i , multiplied by its corresponding coefficient c i ( i = 0 , 1 , , 2 t 1 ) , that is:
s k = i = 0 2 t 1 c i k i .
Thus, in matrix form, the previous representation can be expressed as:
( s 0 , s 1 , , s 2 t 1 ) = ( c 0 , c 1 , , c 2 t 1 ) · H t mod 2 .
An useful property of the Hadamard matrices is described as follows:
Lemma 1.
The inverse of a binomial matrix H t is the matrix itself, i.e., it is an idempotent matrix.
Proof. 
We proceed by induction. We have that H 2 2 = I 2 , where I 2 is the identity matrix of size 2 × 2 . Suppose that the axiom is true for t 1 , then we have that:
H t 2 = H t 1 2 0 t 1 0 t 1 H t 1 2 = I t 1 0 t 1 0 t 1 I t 1 = I t .
Making use of the previous results, the next theorem is introduced.
Theorem 3.
Consider the B-representation of { s n } , a sequence of period T = 2 t , with t a non-negative integer. Let H t be the binomial matrix of size 2 t × 2 t . Then,
( c 0 , c 1 , , c 2 t 1 ) = ( s 0 , s 1 , , s 2 t 1 ) · H t mod 2 .
Proof. 
The result is an immediate consequence of Theorem 2 and Lemma 1 multiplying Equation (6) by H t and rewriting it as indicated in (7). □
The previous expression allows us to compute the coefficients c i in terms of the binomial matrix H t and the elements of the sequence { s n } . The following example clarifies this construction.
Example 2.
Consider the sequence { s n } = { s 0 , s 1 , s 2 , , s 7 } of period T = 2 3 . According to Equation (4), we can write:
c 0 · { 1 1 1 1 1 1 1 1 } c 1 · { 0 1 0 1 0 1 0 1 } c 2 · { 0 0 1 1 0 0 1 1 } c 3 · { 0 0 0 1 0 0 0 1 } c 4 · { 0 0 0 0 1 1 1 1 } c 5 · { 0 0 0 0 0 1 0 1 } c 6 · { 0 0 0 0 0 0 1 1 } + c 7 · { 0 0 0 0 0 0 0 1 } { s n } = { s 0 s 1 s 2 s 3 s 4 s 5 s 6 s 7 }
Thus, the elements of the sequence { s n } can be expressed as:
s 0 = c 0 s 1 = c 0 + c 1 s 2 = c 0 + c 2 s 3 = c 0 + c 1 + c 2 + c 3 s 4 = c 0 + c 4 s 5 = c 0 + c 1 + c 4 + c 5 s 6 = c 0 + c 2 + c 4 + c 6 s 7 = c 0 + c 1 + c 2 + c 3 + c 4 + c 5 + c 6 + c 7 .
Therefore, the matrix equation is:
( s 0 , s 1 , , s 7 ) = ( c 0 , c 1 , , c 7 ) · H 3 m o d 2 ,
where the binomial matrix H 3 is:
H 3 = H 2 H 2 0 2 H 2 = 1 1 1 1 1 1 1 1 0 1 0 1 0 1 0 1 0 0 1 1 0 0 1 1 0 0 0 1 0 0 0 1 0 0 0 0 1 1 1 1 0 0 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 1 .
Now, multiplying both members of Equation (8) by the inverse matrix of H 3 , that is the matrix itself, we obtain:
( s 0 , s 1 , , s 7 ) · H 3 m o d 2 = ( c 0 , c 1 , , c 7 ) .
This expression allows us to compute the coefficients c i from the sequence { s n } and the binomial matrix H 3 .
Recall that, in the previous example, the rows of H 3 correspond to the first eight binomial sequences, while its columns read from right to left are the first eight diagonals of Sierpinski’s triangle (shifted versions of the corresponding binomial sequences).

4. Computation of LC by Means of the B-Representation

According to the previous section, Equation (7) introduces a simple method of computing the coefficient c i m a x and, consequently, the linear complexity and period of the corresponding sequence.

4.1. An Algorithm to Compute the L C of Sequences with a Period of a Power of Two

In this section, we present a fast algorithm, based on binomial matrices, that computes the L C of any binary sequence whose period T is a power of two.
Note that the matrix H t can be expressed in terms of its columns as:
H t = h 0 , h 1 , , h 2 t 1 .
Therefore, every binary coefficient c i ( i = 0 , 1 , , 2 t 1 ) is computed as the product modulo two of the row vector ( s 0 , s 1 , , s 2 t 1 ) (the 2 t bits of the sequence { s n } ) by the corresponding column vector h i . From now on, we represent in bold the column vectors of the binomial matrix H t . The computation starts with the coefficient c 2 t 1 and proceeds in reverse order until the first coefficient c i 0 is reached. In that case, the index i m a x = i and, consequently, the parameter L C are computed via Equation (5). Algorithm 1 illustrates this computation.
Algorithm 1: Computation of the L C of a given sequence.                                                                      
Input:
       s e q : sequence of period 2 t ,
       H t : the ( 2 t × 2 t ) binomial matrix
01:    i m a x = 1 ; i = l e n g t h ( s e q ) 1 ;
02:   while  i 0  do
03:       c i = ( s 0 , s 1 , , s 2 t 1 ) · h i ;
04:      if  c i 0  then
05:          i m a x = i ;
06:         Break;
07:      endif
08:        i = i 1 ;
09:   endwhile
Output:
       L C = i m a x + 1 : Linear complexity of the sequence.
From this computational method, two basic ideas can be drawn:
  • The algorithm that computes L C is reduced to products modulo two of binary vectors. Clearly, its computational complexity will be minimum compared with other algorithms found in the literature; see Section 6;
  • If the column h imax has many zeros and only a few ones, then only a few terms of the sequence { s n } will be required to compute its L C .
In the sequel, these features were analysed in detail when this procedure was applied to a particular family of cryptographic sequences, the generalized self-shrunken sequences (see Section 2.2 for more details).

4.2. Sequences with an Upper Bound on the Linear Complexity

The previous algorithm is particularly useful when we analyse sequences whose L C is upper bounded by a maximum value L C m a x . In that case, the computation of coefficients is simplified as c i = 0 for every coefficient in the range ( i m a x < i 2 t 1 ) . Then, Algorithm 1 starts with the index i = i m a x = L C m a x 1 and computes the coefficient:
c i m a x = ( s 0 , s 1 , , s 2 t 1 ) · h imax .
Two different situations can occur:
  • If c i m a x 0 , then the linear complexity of the sequence is L C m a x . Recall that, in this case, the computation of L C was reduced to the simple product of two binary vectors. For sequences with an upper bound on L C , this computation can be seen as a quick test to check whether the sequence exhibits maximum L C ;
  • If c i m a x = 0 , then the previous algorithm proceeds in decreasing order computing the remaining indices ( i m a x > i 0 ) until a coefficient c i 0 is reached. In that case, the linear complexity of the sequence satisfies L C < L C m a x .
In the next section, we applied the previous method to the generalized self-shrunken sequences.

5. Application of the Algorithm to Generalized Sequences

The generalized sequences seem to be the ideal candidates for the application of the previous algorithm. In fact, their period is a power of two, and as we saw in Section 2, their linear complexity is upper bounded by L C 2 L 1 ( L 2 ) , where L is the length of the LFSR that generates the family of generalized sequences. Therefore, we initialized Algorithm 1 with the value i = i m a x = 2 L 1 ( L 1 ) .
Now, we can determine the value of the coefficient c i m a x by multiplying the generalized sequence { s n } by the corresponding column h imax of the Hadamard matrix (or binomial matrix) H L 1 . Depending on the value of c i m a x , we can check whether the generalized sequence has or has not maximum complexity L C m a x . Indeed, c i m a x 0 implies that the generalized sequence complexity is L C = 2 L 1 ( L 2 ) , otherwise L C will take a lower value.
According to [29], half the sequences of the generalized family have maximum linear complexity. Therefore, half the coefficients c i m a x satisfy the inequality c i m a x 0 . Thus, the linear complexity of half the sequences of a generalized family can be simply determined by the computation of the coefficient c i m a x .
Let us see a particular example of the application of the defined algorithm in a generalized sequence.
Example 3.
Let { s n } = { 1110001001110100 } be a generalized sequence obtained from an LFSR of length L = 5 , the characteristic polynomial x 5 + x 3 + 1 , the initial state I S = ( 11111 ) , and G = [ 1 , 0 , 0 , 1 , 0 ] . The period of the generalized sequence is T = 2 L 1 = 2 4 . Then, the binomial matrix H L 1 = H 4 is a 2 4 × 2 4 -Hadamard matrix of the form:
H 4 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 0 1 1 0 0 1 1 0 0 1 1 0 0 1 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0 0 1 0 1 0 0 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 1 0 1 0 1 0 1 0 0 0 0 0 0 0 0 0 0 1 1 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 .
According to Equation (3), the maximum value of the complexity will be here L C m a x = 13 . Therefore, if the coefficient c i m a x = c 13 1 = c 12 = 1 , then the sequence { s n } will have maximum linear complexity. In this example, we multiplied the generalized sequence by the column h 12 of the matrix H 4 (the fourth column of H 4 read from right to left), giving rise to the coefficient c 12 = 1 .
At the same time, we realized that the column h 12 = ( 1000100010001000 ) only includes four ones (a fourth of the period, i.e., T 4 ), which means that only four terms of { s n } will determine whether the generalized sequence has L C m a x . Those terms are: s 0 , s 4 , s 8 , s 12 , which, in turn, determine the coefficient:
c 12 = s 0 + s 4 + s 8 + s 12 .
The remaining terms of the generalized sequence are redundant for this computation.
If the sequence { s n } was shifted, then the reasoning would be the same: one out of four consecutive digits of the sequence would be needed to check whether L C = L C m a x . In brief, according to the column h 12 , any set of four digits separated by four positions from each other is enough for this checking.
We remark that, for generalized sequences, Equation (5) can be rewritten as:
i m a x = L C m a x 1 = 2 L 1 ( L 2 ) 1 = 2 L 1 ( L 1 ) .
Thus, the column h imax corresponds to the ( L 1 ) -th column of the matrix H L 1 read from right to left.
All the previous results can be generalized to any value of L taking into account that the binomial matrix H L 1 is a Hadamard matrix. In the next subsections, we analysed the method described in the previous section. In fact, in order to obtain a bound on the number of required operations for this calculation, we computed the linear complexity of the generalized sequences coming from LFSRs with different lengths L.

5.1. Analysis of 3 L 5

In a similar way to that developed in Example 3, we applied this method to generalized sequences coming from LFSRs with characteristic polynomials of degrees L = 3 , 4 , 5 .
In the case L = 3 , the maximum value of the linear complexity of a generalized sequence is L C m a x = 3 . It corresponds to the column h 2 of the matrix:
H 2 = 1 1 1 0 1 1 0 0 1 0 0 1 ,
that is the second column of H 2 (read from right to left) circled in (11), which corresponds to the binomial sequence { n 1 } * . In Table 2, we computed, for L = 3 , 4 , 5 , the values of L C m a x , the index i m a x , the location of h imax , the binomial matrix H L 1 , and the binomial sequence associated with h imax .
We observed that for L = 4 , L C m a x = 6 , and using our algorithm, we obtained h imax = h 5 , that is the third column of H 3 (read from right to left) circled in ( 12 ) .
H 3 = 1 1 1 1 1 1 1 0 1 0 1 0 0 1 0 0 1 1 0 1 1 0 0 0 1 0 0 1 0 0 0 0 1 1 1 0 0 0 0 0 0 1 0 0 0 0 0 1 1 0 0 0 0 0 0 1 .
Due to the recursive structure of the Hadamard matrix, we can obtain the column h 5 of H 3 using the columns of H 2 . We only had to concatenate twice the sequence given in the third column of H 2 . In a similar way, for L = 5 , L C m a x = 13 , then h imax = h 12 , that is the fourth column of H 4 in Equation (10) read from right to left. It can also be obtained from the concatenation twice of the fourth column of H 3 or concatenating four times the sequence given in the fourth column of H 2 . In conclusion, we can construct the columns h imax , for the cases L = 3 , 4 , 5 , using the matrix H 2 , as we show in Table 3. We only needed to concatenate 2 L 3 times the corresponding column of h imax in H 2 to obtain the column in H L 1 . In the next subsection, we generalized this method of obtaining the binomial sequences associated with the column h imax , for different values of L.

5.2. Analysis of 5 L 17

In the previous subsection, we used the matrix H 2 as the reference matrix to obtain the columns of h imax for L = 3 , 4 , 5 . From now on, we fix H 4 as our reference matrix, called the 16-box. From this matrix, we determined the column h imax for the different cases of L, which allowed us to obtain the number of required bits to compute L C .
In a similar way to that developed in previous sections, we analysed the linear complexity of the generalized sequences coming from LFSRs with length L in the range L = 5 , , 17 .
The steps of this analysis can be enumerated as follows:
  • Step 1: Take the 2 4 × 2 4 -matrix H 4 in Equation (10) as the reference matrix, the 16-box, since the successive matrices H L 1 are made up of sub-matrices H 4 or 16-boxes;
  • Step 2: Divide the period T = 2 L 1 of the generalized sequence by 16 to determine the number of 16-boxes included in the ( 2 L 1 × 2 L 1 ) -matrix H L 1 ;
  • Step 3: Count the number of ones in the column h imax of the 16-box;
  • Step 4: Multiply this number by the number of 16-boxes in H L 1 in order to obtain the total number of ones in the column h imax of H L 1 . This number coincides with the number of bits necessary to check whether the generalized sequence has L C m a x . At the same time, the column h imax shows the distribution of the required bits all along the sequence { s n } .
This analysis was adapted to the successive n-boxes used in the following subsections. As far as L increases by one, in the frame of the box, the column h imax is shifted one position to the left.
The 16-box is depicted in Table 4. We can see the column h imax corresponding to each value of L (written at the bottom of the table) in the interval L [ 5 , 6 , , 17 ] , as well as the corresponding binomial sequence { n k } * . Indeed, for L = 5 , h imax is the fourth column of H 4 read from left to right. Now, for L = 6 , h imax is the fifth column of H 5 or also the fifth column of H 4 concatenated twice. Following a similar reasoning to that presented in the previous subsection, we have that the binomial sequence associated with the column h imax of H L 1 , for 5 L 17 , can be obtained from the concatenation of 2 L 4 times the column h imax of H 4 .
The parameters that describe this analysis are shown in Table 5 where its columns correspond to:
  • L = length of the LFSR generating the family of generalized sequences;
  • T = 2 L 1 period of the generalized sequence;
  • No. of 16-boxes: number of 16-boxes included in the binomial matrix H L 1 , i.e., T 16 ;
  • No. of ones/16-box: number of ones in the column h imax of the 16-box;
  • No. of required bits: number of one in the column h imax of the matrix H L 1 or, equivalently, number of required bits of the sequence { s n } ; this number is expressed as a fraction of the period T.
From Table 5, we realized that Algorithm 1 will never require the knowledge of the whole sequence { s n } to check whether its L C is maximum. At any rate, depending on the value of L, the amount of sequence needed will be greater or shorter.
In fact, as we show in the following results, we can give an upper and lower bound of the number of operations required to compute L C for some particular cases.
Theorem 4.
Let { s n } be a generalized sequence with period T = 2 L 1 . Consider that the column h imax of H L 1 corresponds to the binomial sequence n 2 m * , with m < L 1 a positive integer. Then, the maximum number of required bits to compute L C from Algorithm 1 is T 2 .
Proof. 
This result is immediate by Corollary 2. □
Theorem 5.
Let { s n } be a generalized sequence with period T = 2 L 1 . Assume that the column h imax of H L 1 corresponds to the binomial sequence n 2 m 1 * , with m < L 1 a positive integer. Then, the minimum number of required bits to compute L C from Algorithm 1 is T 2 m .
Proof. 
This result is immediate by Definition 1, since the binomial sequence n 2 m 1 * has a unique digit of one, while the remaining digits are zero. □
Note that these considerations can be generalized to any value of L. For L 5 , 17 , it is easy to see that L = 6 and L = 10 , corresponding to the binomial sequences n 4 * and n 8 * , respectively, are the least suitable cases since such values require the knowledge of half the sequence. Nevertheless, the value L = 17 , corresponding to the binomial n 15 * , requires only the knowledge of T 16 bits.
At the same time, the column h imax of the 16-box determines the distribution of the required terms along the sequence. For instance:
  • For L = 10 , h imax = ( 1111111100000000 ) , which means that eight out of sixteen bits of the sequence are needed to check whether L C = L C m a x . Moreover, the required bits must be consecutive;
  • For L = 17 , h imax = ( 1000000000000000 ) , which means that one out of sixteen bits of the sequence is needed to check whether the linear complexity is maximum;
  • For the remaining values of L, the number of required bits takes the values T 4 or T 8 , as shown in Table 5. In fact, it depends on the number of ones along the column h imax of the 16-box, in particular, T 8 for columns with two ones, e.g., L = 9 , 13 , 15 , 16 , and T 4 for columns with four ones, e.g., L = 5 , 7 , 8 , 11 , 12 , 14 .
The next subsections analyse these results for greater values of L.

5.3. Analysis of 18 L 33

The study was similar to that of the previous subsection, but we now used a 32-box as shown in Table 6, where H 4 is the 16-box defined above and 0 4 is the ( 2 4 × 2 4 ) -null matrix. Next, we divided the period T of the sequence by thirty-two and analysed the number of ones in the successive columns h imax of the 32-box when L takes values in the interval L [ 18 , 19 , , 33 ] . It can be noticed that for these values of L, the columns h imax include the corresponding ones of the 16-box plus sixteen zeros of 0 4 . See Table 6.
Table 7 shows in detail this study for the different values of L. According to the table, the least suitable case is L = 18 corresponding to the binomial sequence n 16 * with sixteen consecutive ones followed by sixteen consecutive zeros. On the other hand, the most suitable case is L = 33 corresponding to the binomial sequence n 31 * with a unique one followed by thirty-one zeros. See in Table 7 the amount of sequence needed in terms of the period T.

5.4. Analysis of 34 L 65

The study was similar to that of the previous subsections, but we now used a 64-box as shown in Table 8, where H 4 and 0 4 are defined as before and 0 5 is the ( 2 5 × 2 5 ) -null matrix. Now, we divided the period T of the sequence by sixty-four and analysed the number of ones in the successive columns h imax of the 64-box when L takes values in the interval L [ 34 , 35 , , 65 ] . It can be noticed that this range of values of L can be divided into two sub-intervals:
  • For L [ 34 , 35 , , 49 ] , in the the 64-box columns we will have the ones of two 16-boxes plus thirty-two consecutive zeros of 0 5 ;
  • For L [ 50 , 51 , , 65 ] , in the 64-box columns, we will have the ones of the 16-box plus 16 + 32 consecutive zeros coming from 0 4 and 0 5 , respectively. See Table 8.
Table 9 shows in detail this study for some values of L in each sub-interval. According to Table 9, the least suitable case is L = 34 , corresponding to the binomial sequence n 32 * with 16 + 16 consecutive ones followed by thirty-two consecutive zeros, which means that T / 2 bits are required to check the maximum complexity. On the other hand, the most suitable case is L = 65 corresponding to the binomial sequence n 63 * with a unique one followed by sixty-three zeros. Therefore, one out of sixty-four bits is needed to check L C m a x , that is only T / 64 bits of the sequence are needed.

5.5. Analysis of 66 L 129

The study was similar to that of the previous subsections, but we now used a 128-box as shown in Table A1, where H 4 , 0 4 , 0 5 are defined as before and 0 6 is the ( 2 6 × 2 6 ) -null matrix. Now we divided the period T of the sequence by one-hundred twenty-eight and analysed the number of ones in the successive columns h imax of the 128-box, when L takes values in the interval L [ 66 , 67 , , 129 ] . It can be noticed that this range of L values can be divided into four sub-intervals:
  • For L [ 66 , 67 , , 81 ] , in the the 128-box columns, we will have the ones of four 16-boxes plus sixty-four consecutive zeros of 0 6 ;
  • For L [ 82 , 83 , , 97 ] , in the 128-box columns, we will have the ones of the 16-box, then sixteen consecutive zeros coming from 0 4 , then the ones of the 16-box, and finally, sixteen consecutive zeros coming from 0 4 . See Table A1;
  • For L [ 98 , 99 , , 113 ] , in the 128-box columns, we will have the ones of two consecutive 16-boxes, then 32 + 64 consecutive zeros coming from 0 5 and 0 6 , respectively;
  • For L [ 114 , 115 , , 129 ] , in the the 128-box columns, we will have the ones of a unique 16-box, then 16 + 32 + 64 consecutive zeros coming from 0 4 , 0 5 , and 0 6 , respectively. See Table A1.
Table 10 shows in detail this study for some values of L in each sub-interval. According to Table 10, we notice that:
  • In the interval L [ 66 , 67 , , 81 ] , the least suitable case is L = 66 , corresponding to the binomial sequence n 64 * where 16 + 16 + 16 + 16 consecutive bits of the sequence are required followed by other 64 non-necessary bits, that is T 2 bits of the sequence are needed to check the maximum complexity;
  • In the interval L [ 82 , 83 , , 97 ] , the most suitable case is L = 97 where the distribution of the needed bits over the sequence is as follows: two bits separated thirty positions from each other followed by 15 + 16 + 64 redundant bits, which means that we need the knowledge of T 64 bits of the sequence;
  • In the interval L [ 98 , 99 , , 113 ] , the most suitable case is L = 113 , where the distribution of the needed bits over the sequence is as follows: two bits separated sixteen positions from each other followed by 15 + 32 + 64 redundant bits, which means that we need the knowledge of T 64 bits of the sequence;
  • In the interval L [ 114 , 115 , , 129 ] , the most suitable case is L = 129 , corresponding to the binomial sequence n 127 * . In that case, one out of one-hundred twenty-eight bits is required to check L C m a x .
In general, we employed:
  • The 16-box for L [ 5 , 6 , , 17 ] ;
  • The 32-box for L [ 18 , 19 , , 33 ] ;
  • The 64-box for L [ 34 , 35 , , 65 ] ;
  • The 128-box for L [ 66 , 67 , , 129 ] .
For greater values of L, the process goes on in the same way as the Hadamard structure of the binomial matrices is systematically repeated.
Remark 2.
With only four reference matrices, we easily obtained values of L in the range L > 128 , which is the cryptographic range with practical application.

6. Comparison with Other Algorithms

In this section, we compared our own method with other techniques to compute the L C found in the literature.
  • The Berlekamp–Massey algorithm [28] computes the L C of a sequence synthesizing the shortest LFSR that generates such a sequence. It is a sequential algorithm that needs to know and process at least 2 · L C consecutive bits of the sequence. In the case of application to, e.g., generalized sequences, this means the knowledge of more than one period of the sequence, which is clearly out of the application range. The characteristics of this algorithm are depicted in the first row of Table 11;
  • In [26], the authors proposed an algorithm, Binomial Sequence Decomposition (BSD-algorithm), that computes the B-representation of a sequence and, consequently, its L C via Equation (5). Thus, the BSD-algorithm computes the linear complexity, as the Berlekamp–Massey algorithm does, but after having processed only L C bits instead of 2 · L C . The complexity of the BSD-algorithm, which performs the sum of two sequences of T bits (T additions) for every binomial sequence, is O ( r × T ) , r being the number of binomial sequences with r T . Again, this method is, in practice, unrealistic. See the characteristics of this algorithm in the second row of Table 11;
  • The folding algorithm [30] is another technique to compute the L C of sequences with a period of a power of two, T = 2 t . It is based on successive foldings of the own sequence to locate the maximum binomial sequence and calculate L C . At every step, the folding algorithm sums the first half of the sequence with the second half to cancel common binomial sequences in both halves. The procedure ends when only one bit is left. In fact, at every step, the folding mechanism reduces the length of the studied sequence by two with a total of log T steps. Moreover, at each step, the folding algorithm performs T / 2 i ( i = 0 , , log T ) logic operations. This algorithm only performs logic operations, but it needs to handle the whole sequence. See the characteristics of such an algorithm in the third row of Table 11;
  • In the method proposed in this work, two main features must be enhanced:
    (a)
    The algorithm only performs bitwise XOR logic operations;
    (b)
    It will never need the whole sequence, as the Hadamard matrices always include null blocks corresponding to portions of the sequence whose knowledge is not necessary.
    When this algorithm is applied, there are more and less favorable cases. For an integer fixed m, the worst scenario corresponds to sequences whose column h imax in the binomial matrix is the binomial sequence n 2 m * , as we need the knowledge of half the sequence T 2 . Conversely, the most favorable scenario corresponds to sequences whose column h imax in the binomial matrix is the binomial sequence n 2 m + 1 1 * , as we need the knowledge of T 2 m bits of the sequence. See the characteristics of this algorithm in the fourth row of Table 11.
Table 11 summarizes the comparison, in terms of the computational complexity and length requirements, of our algorithm with the other ones mentioned above in this section. Notice that the r in the second row corresponds to the number of binomial sequences in the B-representation of the sequence under study.

7. Conclusions

In this work, we proposed a general technique based on the B-representation to compute the linear complexity of any binary sequence with a period of a power of two. We focused on the study of the L C of the generalized sequences just to distinguish generalized sequences with a maximum L C of value 2 L 1 ( L 2 ) .
Furthermore, this algorithm is particularly efficient for families with an upper bound on the value of the linear complexity.
The computation of L C m a x was only performed by means of the bitwise XOR operation of several bits of the sequence. At the same time, the fractal structure of the binomial matrix (the Hadamard matrix) was exploited.
Notice that we did not need the whole sequence to compute the linear complexity, but just partial knowledge of it. Depending on the value of L, more or less favorable cases can be found. It is worth mentioning that the only parameter we used in this method was L, which means that the algorithm did not depend on the characteristic polynomial of the original LFSR, but on its degree.
Finally, we want to emphasize that we can reach suitable cryptographic values of L only with four reference matrices (16-box, 32-box, 64-box, and 128-box), all of them made out of 16-boxes. Thanks to this method, it is possible to achieve values of L > 128 , just by using the same structure as the one employed in Table A1.

Author Contributions

A.F.-S., V.R. and S.D.C. contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Spanish State Research Agency (AEI) of the Ministry of Science and Innovation (MICINN), Project P2QProMeTe (PID2020-112586RB-I00/AEI/ 10.13039/501100011033), co-funded by the European Regional Development Fund (ERDF, EU). It is also supported by Comunidad de Madrid (Spain) under Project CYNAMON (P2018/TCS-4566), co-funded by FSE and European Union FEDER funds. The work of the second author was partially supported by Spanish Grant VIGROB-287 of the University of Alicante.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of the data; in the writing of the manuscript; nor in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
LFSRLinear Feedback Shift Register
PRNGPseudo-Random Number Generator
GSSGGeneralized Self-Shrinking Generator
L C Linear Complexity
IoTInternet of Things

Appendix A

Table A1. The 128-box to analyse generalized sequences with 66 L 129 .
Table A1. The 128-box to analyse generalized sequences with 66 L 129 .
H 4 H 4 H 4 H 4 H 4 H 4 H 4 H 4
0 4 H 4 0 4 H 4 0 4 H 4 0 4 H 4
0 5 H 4 H 4 0 5 H 4 H 4
0 4 H 4 0 4 H 4
0 6 H 4 H 4 H 4 H 4
0 4 H 4 0 4 H 4
0 5 H 4 H 4
0 4 H 4
L = 129114113989782816665 343318172

References

  1. Bouguettaya, A.; Sheng, Q.Z.; Benatallah, B.; Neiat, A.G.; Mistry, S.; Ghose, A.; Nepal, S.; Yao, L. An internet of things service roadmap. Commun. ACM 2021, 64, 86–95. [Google Scholar] [CrossRef]
  2. Zhang, W.; Sheng, Q.Z.; Mahmood, A.; Tran, D.; Zaib, M.; Hamad, S.; Aljubairy, A.; Alhazmi, A.F.; Sagar, S.; Ma, C. The 10 Research Topics in the Internet of Things. In Proceedings of the 2020 IEEE 6th International Conference on Collaboration and Internet Computing (CIC), Atlanta, GA, USA, 1–3 December 2020; IEEE Computer Society: Los Alamitos, CA, USA, 2020; pp. 34–43. [Google Scholar] [CrossRef]
  3. Khan, W.Z.; Arshad, Q.u.A.; Hakak, S.; Khan, M.K.; Saeed-Ur-Rehman. Trust Management in Social Internet of Things: Architectures, Recent Advancements, and Future Challenges. IEEE Internet Things J. 2021, 8, 7768–7788. [Google Scholar] [CrossRef]
  4. Xu, L.D.; Lu, Y.; Li, L. Embedding Blockchain Technology Into IoT for Security: A Survey. IEEE Internet Things J. 2021, 8, 10452–10473. [Google Scholar] [CrossRef]
  5. Mahmood, A.; Siddiqui, S.A.; Sheng, Q.Z.; Zhang, W.E.; Suzuki, H.; Ni, W. Trust on wheels: Towards secure and resource efficient IoV networks. Computing 2022. [Google Scholar] [CrossRef]
  6. Fischer, V. A Closer Look at Security in Random Number Generators Design. In Constructive Side-Channel Analysis and Secure Design, COSADE 2012; Schindler, W., Huss, S.A., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; Volume 7275, pp. 167–182. [Google Scholar] [CrossRef] [Green Version]
  7. Francillon, A.; Castelluccia, C. TinyRNG: A Cryptographic Random Number Generator for Wireless Sensors Network Nodes. In Proceedings of the 2007 5th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks and Workshops, Limassol, Cyprus, 16–20 April 2007; pp. 1–7. [Google Scholar] [CrossRef] [Green Version]
  8. Biryukov, A.; Shamir, A.; Wagner, D. Real Time Cryptanalysis of A5/1 on a PC. In Proceedings of the Fast Software Encryption 2000, New York, NY, USA, 10–12 April 2000; Goos, G., Hartmanis, J., Van Leeuwen, J., Schneier, B., Eds.; Springer: Berlin/Heidelberg, Germany, 2001; Volume 1978, pp. 1–18. [Google Scholar] [CrossRef] [Green Version]
  9. Petrovic, S.; Fúster-Sabater, A. Cryptanalysis of the A5/2 Algorithm. IACR Cryptol. EPrint Arch. 2000, 2000, 52. [Google Scholar]
  10. Peinado, A.; Munilla, J.; Fúster-Sabater, A. EPCGen2 Pseudorandom Number Generators: Analysis of J3Gen. Sensors 2014, 14, 6500–6515. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Paul, G.; Maitra, S. RC4 Stream Cipher and Its Variants; CRC Press, Taylor and Francis Group: Boca Raton, FL, USA, 2012. [Google Scholar]
  12. Dutta, I.K.; Ghosh, B.; Bayoumi, M. Lightweight Cryptography for Internet of Insecure Things: A Survey. In Proceedings of the 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 7–9 January 2019; pp. 475–481. [Google Scholar] [CrossRef]
  13. Philip, M.A.; Vaithiyanathan. A survey on lightweight ciphers for IoT devices. In Proceedings of the 2017 International Conference on Technological Advancements in Power and Energy ( TAP Energy), Kollam, India, 21–23 December 2017; pp. 1–4. [CrossRef]
  14. Dubrova, E.; Hell, M. Espresso: A stream cipher for 5G wireless communication systems. Cryptogr. Commun. 2017, 9, 273–289. [Google Scholar] [CrossRef] [Green Version]
  15. Orúe López, A.B.; Hernández Encinas, L.; Montoya Vitini, F. Trifork, a new Pseudorandom Number Generator Based on Lagged Fibonacci Maps. J. Comput. Sci. Eng. 2010, 2, 46–51. [Google Scholar]
  16. Paar, C.; Pelzl, J. Understanding Cryptography; Springer: Berlin, Germany, 2010. [Google Scholar]
  17. Golomb, S.W. Shift Register-Sequences; Aegean Park Press: Laguna Hill, CA, USA, 1982. [Google Scholar]
  18. Biryukov, A.; Perrin, L. State of the Art in Lightweight Symmetric Cryptography. IACR Cryptol. EPrint Arch. 2017, 2017, 511. [Google Scholar]
  19. Orúe López, A.B.; Hernández Encinas, L.; Martín Muñoz, A.; Montoya Vitini, F. A Lightweight Pseudorandom Number Generator for Securing the Internet of Things. IEEE Access 2017, 5, 27800–27806. [Google Scholar] [CrossRef]
  20. Shabbir Hassan, M.U.B. Design of Pseudo Random Number Generator using Linear Feedback Shift Register. Int. J. Eng. Adv. Technol. IJEAT 2019, 9, 1956–1965. [Google Scholar] [CrossRef]
  21. Rahimov, H.; Babaei, M.; Farhadi, M. Cryptographic PRNG based on combination of LFSR and chaotic logistic map. Appl. Math. 2011, 2, 1531–1534. [Google Scholar] [CrossRef] [Green Version]
  22. Díaz Cardell, S.; Fúster-Sabater, A. Cryptography with Shrinking Generators: Fundamentals and Applications of Keystream Sequence Generators Based on Irregular Decimation; Springer Briefs in Mathematics; Springer International Publishing: Berlin/Heidelberg, Germany, 2019. [Google Scholar] [CrossRef]
  23. Hu, Y.; Xiao, G. Generalized Self-Shrinking Generator. IEEE Trans Inf. Theory 2004, 50, 714–719. [Google Scholar] [CrossRef]
  24. Cardell, S.D.; Requena, V.; Fúster-Sabater, A.; Orúe, A.B. Randomness Analysis for the Generalized Self-Shrinking Sequences. Symmetry 2019, 11, 1460. [Google Scholar] [CrossRef] [Green Version]
  25. Seberry, J.; Yamada, M. Hadamard matrices, Sequences, and Block Designs. In Contemporary Design Theory—A Collection of Surveys; Stinson, D.J., Dinitz, J., Eds.; John Wiley and Sons: Chichester, UK, 1992; pp. 431–560. [Google Scholar]
  26. Cardell, S.D.; Fúster-Sabater, A. Binomial Representation of Cryptographic Binary Sequences and Its Relation to Cellular Automata. Complexity 2019, 2019, 2108014. [Google Scholar] [CrossRef] [Green Version]
  27. Cardell, S.D.; Climent, J.J.; Fúster-Sabater, A.; Requena, V. Representations of Generalized Self-Shrunken Sequences. Mathematics 2020, 8, 1006. [Google Scholar] [CrossRef]
  28. Massey, J.L. Shift-register synthesis and BCH decoding. IEEE Trans. Inf. Theory 1969, 15, 122–127. [Google Scholar] [CrossRef] [Green Version]
  29. Fúster-Sabater, A.; Cardell, S. Linear complexity of generalized sequences by comparison of PN-sequences. Rev. Real Acad. Cienc. Exactas Físicas Y Nat. Ser. A Mat. RACSAM 2020, 114, 79–97. [Google Scholar] [CrossRef]
  30. Martin-Navarro, J.L.; Fúster-Sabater, A. Folding-BSD Algorithm for Binary Sequence Decomposition. Computers 2020, 9, 100. [Google Scholar] [CrossRef]
  31. Rueppel, R.A. Linear Complexity and Random Sequences. In Advances in Cryptology — EUROCRYPT 85, Workshop on the Theory and Application of of Cryptographic Techniques; Lecture Notes in Computer Science; Pichler, F., Ed.; Springer: Berlin/Heidelberg, Germany, 1986; Volume 219, pp. 167–188. [Google Scholar] [CrossRef]
  32. Cardell, S.D.; Fúster-Sabater, A. Discrete linear models for the generalized self-shrunken sequences. Finite Fields Their Appl. 2017, 47, 222–241. [Google Scholar] [CrossRef]
Figure 1. LFSR of length r or LFSR with r stages.
Figure 1. LFSR of length r or LFSR with r stages.
Mathematics 10 00794 g001
Figure 2. Two representations of Sierpinski’s triangle.
Figure 2. Two representations of Sierpinski’s triangle.
Mathematics 10 00794 g002
Figure 3. Structure of the binomial sequences n k and n 2 L + k .
Figure 3. Structure of the binomial sequences n k and n 2 L + k .
Mathematics 10 00794 g003
Figure 4. Binary Sierpinski’s triangle where the circled bits correspond to the shifted version of the binomial sequence n 4 .
Figure 4. Binary Sierpinski’s triangle where the circled bits correspond to the shifted version of the binomial sequence n 4 .
Mathematics 10 00794 g004
Table 1. The first 8 binomial coefficients n k ( 0 k < 8 ) and their binomial sequences n k , periods, and linear complexities.
Table 1. The first 8 binomial coefficients n k ( 0 k < 8 ) and their binomial sequences n k , periods, and linear complexities.
Binomial Coeff.Binomial Sequences n k PeriodLC
n 0 11111111 11
n 1 01010101 22
n 2 00110011 43
n 3 00010001 44
n 4 00001111 85
n 5 00000101 86
n 6 00000011 87
n 7 00000001 88
Table 2. Table of the values of the upper bound of L C , i m a x , and h imax for L = 3 , 4 , 5 .
Table 2. Table of the values of the upper bound of L C , i m a x , and h imax for L = 3 , 4 , 5 .
L LC max imax h imax H L 1 { n k } *
3322 H 2 { n 1 } *
4653 H 3 { n 2 } *
513124 H 4 { n 3 } *
Table 3. The 4-box to analyse generalized sequences with L = 3 , 4 , 5 .
Table 3. The 4-box to analyse generalized sequences with L = 3 , 4 , 5 .
1111
0101
0011
0001
L =543
{ n k } * = { n 3 } * { n 2 } * { n 1 } *
Table 4. The 16-box to analyse generalized sequences with 5 L 17 .
Table 4. The 16-box to analyse generalized sequences with 5 L 17 .
1111111111111111
0101010101010101
0011001100110011
0001000100010001
0000111100001111
0000010100000101
0000001100000011
0000000100000001
0000000011111111
0000000001010101
0000000000110011
0000000000010001
0000000000001111
0000000000000101
0000000000000011
0000000000000001
L =17109 5
{ n k } * = { n 15 } * { n 8 } * { n 7 } * { n 3 } *
Table 5. Basic parameters for generalized sequences with 5 L 17 .
Table 5. Basic parameters for generalized sequences with 5 L 17 .
LTNo. of 16-BoxesNo. of 1’s/16-BoxNo. of Required Bits
5 2 4 2 0 2 2 2 2 = T / 4
6 2 5 2 1 2 3 2 4 = T / 2
7 2 6 2 2 2 2 2 4 = T / 4
8 2 7 2 3 2 2 2 5 = T / 4
9 2 8 2 4 2 1 2 5 = T / 8
10 2 9 2 5 2 3 2 8 = T / 2
11 2 10 2 6 2 2 2 8 = T / 4
12 2 11 2 7 2 2 2 9 = T / 4
13 2 12 2 8 2 1 2 9 = T / 8
14 2 13 2 9 2 2 2 11 = T / 4
15 2 14 2 10 2 1 2 11 = T / 8
16 2 15 2 11 2 1 2 12 = T / 8
17 2 16 2 12 2 0 2 12 = T / 16
Table 6. The 32-box to analyse generalized sequences with 18 L 33 .
Table 6. The 32-box to analyse generalized sequences with 18 L 33 .
H 4 H 4
0 4 H 4
L = 33262518171092
Table 7. Basic parameters for generalized sequences with 18 L 33 .
Table 7. Basic parameters for generalized sequences with 18 L 33 .
LTNo. of 32-BoxesNo. of 1’s/32-BoxNo. of Required Bits
18 2 17 2 12 2 4 2 16 = T / 2
19 2 18 2 13 2 3 2 16 = T / 4
20 2 19 2 14 2 3 2 17 = T / 4
21 2 20 2 15 2 2 2 17 = T / 8
22 2 21 2 16 2 3 2 19 = T / 4
23 2 22 2 17 2 2 2 19 = T / 8
24 2 23 2 18 2 2 2 20 = T / 8
25 2 24 2 19 2 1 2 20 = T / 16
26 2 25 2 20 2 3 2 23 = T / 4
27 2 26 2 21 2 2 2 23 = T / 8
28 2 27 2 22 2 2 2 24 = T / 8
29 2 28 2 23 2 1 2 24 = T / 16
30 2 29 2 24 2 2 2 26 = T / 8
31 2 30 2 25 2 1 2 26 = T / 16
32 2 31 2 26 2 1 2 27 = T / 16
33 2 32 2 27 2 0 2 27 = T / 32
Table 8. The 64-box to analyse generalized sequences with 34 L 65 .
Table 8. The 64-box to analyse generalized sequences with 34 L 65 .
H 4 H 4 H 4 H 4
0 4 H 4 0 4 H 4
0 5 H 4 H 4
0 4 H 4
L =655049343318172
Table 9. Basic parameters for generalized sequences with 34 L 65 .
Table 9. Basic parameters for generalized sequences with 34 L 65 .
LTNo. of 64-BoxesNo. of 1’s/64-BoxNo. of Required Bits
34 2 33 2 27 2 5 2 32 = T / 2
38 2 37 2 31 2 4 2 35 = T / 4
39 2 38 2 32 2 3 2 35 = T / 8
41 2 40 2 34 2 2 2 36 = T / 16
45 2 44 2 38 2 2 2 40 = T / 16
47 2 46 2 40 2 2 2 42 = T / 16
48 2 47 2 41 2 2 2 43 = T / 16
49 2 48 2 42 2 0 2 43 = T / 32
50 2 49 2 43 2 4 2 47 = T / 4
54 2 53 2 47 2 3 2 50 = T / 8
55 2 54 2 48 2 2 2 50 = T / 16
57 2 56 2 50 2 1 2 51 = T / 32
61 2 60 2 54 2 1 2 55 = T / 32
63 2 62 2 56 2 1 2 57 = T / 32
64 2 63 2 57 2 1 2 58 = T / 32
65 2 64 2 58 2 0 2 58 = T / 64
Table 10. Basic parameters for generalized sequences with 66 L 129 .
Table 10. Basic parameters for generalized sequences with 66 L 129 .
LTNo. of 128-BoxesNo. of 1’s/128-BoxNo. of Required Bits
66 2 65 2 58 2 6 2 64 = T / 2
67 2 66 2 59 2 5 2 64 = T / 4
69 2 68 2 61 2 4 2 65 = T / 8
73 2 72 2 65 2 3 2 68 = T / 16
78 2 77 2 70 2 4 2 74 = T / 8
79 2 78 2 71 2 3 2 74 = T / 16
80 2 79 2 72 2 3 2 75 = T / 16
81 2 80 2 73 2 2 2 75 = T / 32
82 2 81 2 74 2 5 2 79 = T / 4
83 2 82 2 75 2 4 2 79 = T / 8
85 2 84 2 77 2 3 2 80 = T / 16
89 2 88 2 81 2 2 2 83 = T / 32
94 2 93 2 86 2 3 2 89 = T / 16
95 2 94 2 87 2 2 2 89 = T / 32
96 2 95 2 88 2 2 2 90 = T / 32
97 2 96 2 89 2 1 2 90 = T / 64
98 2 97 2 90 2 5 2 95 = T / 4
99 2 98 2 91 2 4 2 95 = T / 8
101 2 100 2 93 2 3 2 96 = T / 16
105 2 104 2 97 2 2 2 99 = T / 32
110 2 109 2 102 2 3 2 105 = T / 16
111 2 110 2 103 2 2 2 105 = T / 32
112 2 111 2 104 2 2 2 106 = T / 32
113 2 112 2 105 2 1 2 106 = T / 64
114 2 113 2 106 2 4 2 110 = T / 8
115 2 114 2 107 2 3 2 110 = T / 16
117 2 116 2 109 2 2 2 111 = T / 32
121 2 120 2 113 2 1 2 114 = T / 64
126 2 125 2 118 2 2 2 120 = T / 32
127 2 126 2 119 2 1 2 120 = T / 64
128 2 127 2 120 2 1 2 121 = T / 64
129 2 128 2 121 2 0 2 121 = T / 128
Table 11. Comparison among the algorithms.
Table 11. Comparison among the algorithms.
AlgorithmType of SequenceLength RequirementsComplexity
Berlekamp–Massey Alg. [28]Any sequence 2 T O ( T 2 )
BSD-Algorithm [26]Period T = 2 t T O ( T × r )
Folding Algorithm [30]Period T = 2 t T O ( T )
B-representationPeriod T = 2 t T / 2 (Worst case) O ( T / 2 )
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fúster-Sabater, A.; Requena, V.; Cardell, S.D. An Efficient Algorithm to Compute the Linear Complexity of Binary Sequences. Mathematics 2022, 10, 794. https://doi.org/10.3390/math10050794

AMA Style

Fúster-Sabater A, Requena V, Cardell SD. An Efficient Algorithm to Compute the Linear Complexity of Binary Sequences. Mathematics. 2022; 10(5):794. https://doi.org/10.3390/math10050794

Chicago/Turabian Style

Fúster-Sabater, Amparo, Verónica Requena, and Sara D. Cardell. 2022. "An Efficient Algorithm to Compute the Linear Complexity of Binary Sequences" Mathematics 10, no. 5: 794. https://doi.org/10.3390/math10050794

APA Style

Fúster-Sabater, A., Requena, V., & Cardell, S. D. (2022). An Efficient Algorithm to Compute the Linear Complexity of Binary Sequences. Mathematics, 10(5), 794. https://doi.org/10.3390/math10050794

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop