On the Effects of Kernel Configuration in Multi-Kernel Polar Codes

Saha, Souradip; Maßny, Luis; Adrat, Marc; Jax, Peter

doi:10.3390/e24101457

Open AccessArticle

On the Effects of Kernel Configuration in Multi-Kernel Polar Codes^†

by

Souradip Saha

^1,*,

Luis Maßny

²,

Marc Adrat

^1,* and

Peter Jax

²

¹

Fraunhofer Institute for Communication, Information Processing and Ergonomics, Fraunhoferstraße 20, 53343 Wachtberg, Germany

²

Institute of Communication Systems, RWTH Aachen University, Muffeter Weg 3a, 52074 Aachen, Germany

^*

Authors to whom correspondence should be addressed.

^†

This article is a follow-on work of the following paper: Saha, S.; Adrat, M.; Maßny, L.; Schrammen, M.; Jax, P. Novel Recursive Kernel Construction for Polar Codes with Practical Codeword Lengths. In Proceedings of the 15th International Conference on Signal Processing and Communication Systems (ICSPCS), Online, 13–15 December 2021; pp. 1–10.

Entropy 2022, 24(10), 1457; https://doi.org/10.3390/e24101457

Submission received: 31 August 2022 / Revised: 28 September 2022 / Accepted: 11 October 2022 / Published: 12 October 2022

(This article belongs to the Section Information Theory, Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

Polar codes are a relatively new family of linear block codes which have garnered a lot of attention from the scientific community, owing to their low-complexity implementation and provably capacity achieving capability. They have been proposed to be used for encoding information on the control channels in 5G wireless networks due to their robustness for short codeword lengths. The basic approach introduced by Arikan can only be used to generate polar codes of length

N = 2^{n}

,

\forall n \in N

. To overcome this limitation, polarization kernels of size larger than

2 \times 2

(like

3 \times 3

,

4 \times 4

, and so on), have already been proposed in the literature. Additionally, kernels of different sizes can also be combined together to generate multi-kernel polar codes, further improving the flexibility of codeword lengths. These techniques undoubtedly improve the usability of polar codes for various practical implementations. However, with the availability of so many design options and parameters, designing polar codes that are optimally tuned to specific underlying system requirements becomes extremely challenging, since a variation in system parameters can result in a different choice of polarization kernel. This necessitates a structured design technique for optimal polarization circuits. We developed the DTS-parameter to quantify the best rate-matched polar codes. Thereafter, we developed and formalized a recursive technique to design polarization kernels of higher order from component smaller order. A scaled version of the DTS-parameter, namely SDTS-parameter (denoted by the symbol

ζ

in this article) was used for the analytical assessment of this construction technique and validated for single-kernel polar codes. In this paper, we aim to extend the analysis of the aforementioned SDTS parameter for multi-kernel polar codes and validate their applicability in this domain as well.

Keywords:

polar codes; multi-kernel; 5G

1. Introduction

Polar codes offer many advantages for channel coding of information bits to be transmitted across a noisy channel, such as low complexity, provably capacity achieving capability and robust characteristics for short codeword lengths. Therefore, they are selected to be used for 5G wireless control channels [1]. Additionally, there are multiple decoding techniques that can be used for the same polar codes, such as Successive Cancellation (SC) [2] and SC-based List [3] or Flip [4] decoding. For a Soft-Input-Soft-Output (SISO) system, Belief Propagation (BP) [5] decoding can also be used. However, on the other hand, certain limitations of polar code design exist as well. This includes the rigid nature of codeword length of the form

N = 2^{n}

due to the originally proposed

2 \times 2

kernel circuit structure in [2]. To address this limitation, well-known rate-matching techniques such as upsizing (by repetition) or downsizing (either by puncturing or shortening) exist. However, this brings in considerable computational overhead, especially if the number of upsized/downsized bits is high. The concept of channel polarization has been generalized to higher polarization kernel sizes, for example, using

3 \times 3

kernel circuits to generate

M = 3^{m}

long codewords as shown in [6,7,8,9]. A combination of multiple kernels of different sizes can be concatenated within the same polarization circuit to design multi-kernel polar codes as provided in [10,11,12,13,14,15,16]. A performance comparison analysis of single kernel polar codes generated by

2 \times 2

and

3 \times 3

kernels and their corresponding multi-kernel counterparts have been performed in [17,18]. It was observed that depending on underlying system requirements, design optimality can vary and in some cases a non-downsized code can outperform a downsized code, therefore being beneficial from both complexity and error rate performance perspectives.

The redundancy of bits in polar codes is implemented by segregating them in two sets, one to encode information bits and the other to encode frozen bits. Ideally, the bits that are polarized to high capacities should be used to encode information bits and the bits with lower capacities for frozen bits. However, different parameters such as channel condition, coderate and codeword length, can create a difference in the optimal choice of the sets of information and frozen bits. A wrong choice could lead to sub-optimal utilization of available channel capacity and consequently sub-optimal error rate performance. By including different kernel circuit designs and multi-kernel layouts in this paradigm, the optimal choice of information and frozen bits is complicated even further. Additionally, for higher kernel sizes, there exist a higher number of possibilities for valid polarization circuits and therefore a higher number of configurations of the corresponding multi-kernel designs. The increment of possible circuit designs to kernel sizes is exponential as shown in [19].

This necessitates a technique to determine the optimal design configuration of polarizing circuits by taking into account all the aforementioned degrees of freedom. It is necessary to determine optimal code design before hardware-based implementation. Additionally, simulating Bit Error Rate (BER) or Block Error Rate (BLER) curves for all possible design configurations is an exceedingly time-consuming process due to an enormous number of design possibilities and the choice of frozen-information bits. Therefore, a simple analytical parameter that can predict polar code design optimality based on system requirements would significantly expand the usability of polar codes for many applications. To address this artifact,

ζ

parameter was proposed in [19], which in turn was developed from the Downsizing Type-Selection (DTS) parameter from [17]. It was used to quantify the optimality of a specific single kernel design and determine the best choice of kernel circuit over given channel conditions. Since

ζ

is a scaled version of the DTS parameter, it will be referred to as scaled-DTS or SDTS parameter for the remainder of this paper. The conclusions have been promising since SDTS has proved to be a good estimator of the performance behavior of polarization kernel circuits, validated by corresponding BLER curves. In this paper, we intend to validate the applicability of the SDTS parameter in a multi-kernel polar code circuit design as well.

The rest of the paper is organized as follows. In Section 2, some preliminary information on polarization circuits and multi-kernel designs is provided. The analysis of a multi-kernel configuration using

ζ

is provided in Section 3. In Section 4, BLER curves are provided to validate the optimality predictions from Section 3. The outlook of the research from the scope of this paper is provided in Section 5. Finally, some concluding remarks are provided in Section 6.

2. Preliminaries

2.1. $2 \times 2$ Kernel Circuit

The idea of channel polarization was proposed in [2]. This involves using a

2 \times 2

polarization circuit, as shown in Figure 1, to polarize 2-bit channels of equal capacities to 2-bit channels of unequal capacities. This method of channel polarization can be recursively used to design polarization circuits and thus generate polar codes of length

N = 2^{n}

(for

n \in N

).

To quantify the channel polarization effect, the z-parameter values are obtained as follows [2],

\begin{matrix} Z_{1} & = 2 Z (W) - Z {(W)}^{2} \end{matrix}

(1)

\begin{matrix} Z_{2} & = Z {(W)}^{2} \end{matrix}

(2)

where

Z (W)

denotes the z-parameter value of the real channel W and

Z_{1}

and

Z_{2}

denote the z-parameter values of the effective virtual channels. Since,

Z_{1} + Z_{2} = 2 Z (W)

, net channel capacity is conserved after polarization. Additionally, as

Z_{1} \geq Z (W)

and

Z_{2} \leq Z (W)

, one virtual channel has higher capacity than the real channel while the other one has lower capacity, creating virtual bit channels with polarized capacities. Although the equality in Equations (1) and (2) is valid just for Binary Erasure Channels (BECs), the concept of channel transformation to polarize their effective capacity holds true for any binary memoryless channel model.

2.2. $3 \times 3$ Kernel Circuit

One need not be limited to using just a

2 \times 2

polarization circuit to generate polar codes. The idea of channel polarization can be extended to using any

l \times l

polarization circuit, where

l \in N

, to generate polar codes of length

l^{n}

. This extends the range of possible codeword lengths with the need for resizing. Using

l = 3

, one can design polar codes of length

N = 3^{n}

. An example of a

3 \times 3

polarization circuit is shown in Figure 2, which is same as circuit

C_{3 / 1 d}

from [19].

The corresponding, z-parameter values are as follows,

\begin{matrix} Z_{1} & = 3 Z (W) - 3 Z {(W)}^{2} + Z {(W)}^{3} \end{matrix}

(3)

\begin{matrix} Z_{2} & = 2 Z {(W)}^{2} - Z {(W)}^{3} \end{matrix}

(4)

\begin{matrix} Z_{3} & = Z {(W)}^{2} \end{matrix}

(5)

There have been multiple proposals in the literature to how to polarize 3-bit channels (or how to design the

3 \times 3

polarization circuit) like in the set of articles [6,7,8,9]. In this paper, the focus would be on the

3 \times 3

polarization circuits

C_{3 / 1 d}

and

C_{3 / 2 x}

from [19], to analyze the effect of ordering and placement within a multi-kernel circuit layout.

2.3. Multi-Kernel Circuit

For channel polarization, it is not necessary to limit the recursive usage of a single kernel circuit to polarize bit channels. The polarization effect can also be achieved by using multiple kernel sizes simultaneously within the same encoding/decoding circuits, called multi-kernel polar codes. This extends the scope of codeword lengths even further to increase the versatility. The concept of multi-kernel polar codes is a well-established technique and the corresponding theoretical analysis of channel polarization aspects has been performed in the series of papers [10,11,12,13,14,15,16]. An example of using

2 \times 2

and

3 \times 3

circuits to generate a polar code of length 6 is shown in Figure 3.

The corresponding, z-parameter values are given as follows [2],

\begin{matrix} Z_{1} & = 6 Z (W) - 15 Z {(W)}^{2} + 20 Z {(W)}^{3} - 15 Z {(W)}^{4} + 6 Z {(W)}^{5} - Z {(W)}^{6} \end{matrix}

(6)

\begin{matrix} Z_{2} & = 8 Z {(W)}^{2} - 16 Z {(W)}^{3} + 14 Z {(W)}^{4} - 6 Z {(W)}^{5} + Z {(W)}^{6} \end{matrix}

(7)

\begin{matrix} Z_{3} & = 4 Z {(W)}^{2} - 4 Z {(W)}^{3} + Z {(W)}^{4} \end{matrix}

(8)

\begin{matrix} Z_{4} & = 3 Z {(W)}^{2} - 3 Z {(W)}^{4} + Z {(W)}^{6} \end{matrix}

(9)

\begin{matrix} Z_{5} & = 2 Z {(W)}^{4} - Z {(W)}^{6} \end{matrix}

(10)

\begin{matrix} Z_{6} & = Z {(W)}^{4} \end{matrix}

(11)

Using a different kernel size in each stage or a different order of kernel sizes in different stages or even different kernel designs of the same size, one can generate different polarization circuits for the same codeword length

K = l^{n 1} \times m^{n 2} \times \dots \times p^{n s}

for

l, m, p, n 1, n 2, n s \in N

. In this article, we focus just on such multi-kernel polarization circuits of the form

K = 2^{n 1} \times 3^{n 2}

using

3 \times 3

circuits

C_{3 / 1 d}

and

C_{3 / 2 x}

from [19]. This would be the baseline for our analysis of kernel placement and ordering in multi-kernel polarization circuits in the subsequent sections.

3. Analysis of Kernel Configuration

Multi-kernel polar codes not only provide higher flexibility in the choice of polarization kernel, but they also offer the opportunity to arrange the kernels in any arbitrary order. Due to the non-commutativity of the Kronecker product, different kernel orders yield different polarization circuits and generator matrices, i.e., different polar codes of the same length. The effect of kernel order has not been extensively discussed in the literature and a systematic approach to the optimization of the kernel order is near non-existent. In this paper, we investigate the effect of the kernel order on practical codeword lengths by assessing the polarization effect using the z-parameter. Analogously to [19],

ζ

(

= \sum_{i \in I} Z_{i}

), the upper bound of (7) from [19], is used as BLER estimate.

3.1. Effect of Kernel Order on z Parameter

Similar to the approach used in [19], in this paper, we would use the SDTS parameter to analyze the polarization behavior and correspondingly determine the optimal circuit configurations for a given set of system parameters. Therefore, the effect of different kernel orders can be studied by z and effectively SDTS parameters. The purpose of this section is to formalize the connection between the z-parameter behavior for different kernel orders.

Let

f (Z)

(

f {(Z)}_{C_{n}}^{(i)} : [0, 1] \to [0, 1]

) denote the evolution of z-parameter values, where n denotes kernel size and identification of circuit

C_{n}

and i denotes the bit index. The functional values are exemplified by (1) and (2) for kernel size 2, where

f {(Z)}_{C_{2}}^{(1)} \equiv (1)

and

f {(Z)}_{C_{2}}^{(2)} \equiv (2)

. Similarly, since Figure 2 corresponds to circuit

C_{3} / 1 d

, (3)–(5) provide the

f {(Z)}_{C_{3 / 1 d}}

values, i.e.,

f {(Z)}_{C_{3 / 1 d}}^{(1)} \equiv (3)

,

f {(Z)}_{C_{3 / 1 d}}^{(2)} \equiv (4)

and

f {(Z)}_{C_{3 / 1 d}}^{(3)} \equiv (5)

. As given in [19], the generator matrices corresponding to polarization circuits

C_{n 1}

and

C_{n 2}

, are denoted as

G_{n 1}

and

G_{n 2}

. Considering a simple example of the two-stage concatenation of kernel circuits of two different sizes, the resultant generator matrix is,

G_{n} = G_{n 1} \otimes G_{n 2}

(12)

where

n = n 1 \cdot n 2

. There are

n / n 1 = n 2

blocks of

C_{n 1}

circuits in stage 1 and

n / n 2 = n 1

blocks of

C_{n 2}

circuits in stage 2. The jth bit channel of the ith block in stage 1 is connected to the ith bit channel of the jth kernel in stage 2. This is the structural makeup for multi-kernel polar codes. Thus, the z-parameter value of output bit channels of jth kernel block from stage 1 to stage 2 is

f {(Z)}_{C_{n 1}}^{(j)}

. Since the kernel block

C_{n 2}

is repeated

n 1

times in stage 2, and each of them employ the same recursive z-parameter formulae, in stage 2,

f {(Z)}_{C_{n 2}}^{(k)}

is applied to the bits of first kernel block (

k = 1, \dots, n 2

),

f {(Z)}_{C_{n 2}}^{(k - n 2)}

for the second kernel block (

k = n 2 + 1, \dots, 2 \cdot n 2

) and so on, until the bit

n = n 1 \cdot n 2

. Effectively, the z-parameter value

Z_{k}

for the kth input bit channel can be recursively computed as,

Z_{k} = \{\begin{matrix} f {(Z)}_{C_{n 2}}^{(k)} (f {(Z)}_{C_{n 1}}^{(1)}) & k = 1, \dots, n 2 \\ f {(Z)}_{C_{n 2}}^{(k - n 2)} (f {(Z)}_{C_{n 1}}^{(2)}) & k = n 2 + 1, \dots, 2 \cdot n 2 \\ . \\ . \\ . \\ f {(Z)}_{C_{n 2}}^{(k - (n 1 - 1) \cdot n 2)} (f {(Z)}_{C_{n 1}}^{(n 2)}) & k = (n 1 - 1) \cdot n 2, \dots, n 1 \cdot n 2 \end{matrix}

(13)

The aforementioned (13) can be generalized as,

Z_{k} = f {(Z)}_{C_{n 2}}^{(1 + ((k - 1) m o d n 2))} (f {(Z (W))}_{C_{n 1}}^{((k - 1) d i v n 2)})

(14)

where W denotes the real bit channel. At the final output stage, for simplicity, we denote (14) as

f {(Z)}_{C_{n 2}}^{(k 2)} \circ f {(Z)}_{C_{n 1}}^{(k 1)}

, for

k l = 1, \dots, n l

. Generalizing it to a concatenation of a arbitrary number kernels

C_{n 1}, C_{n 2}, \dots, C_{n s}

, the resulting z-parameter values at the circuit output is given as

f {(Z)}_{C_{n s}}^{(k s)} \circ \dots \circ f {(Z)}_{C_{n 2}}^{(k 2)} \circ f {(Z)}_{C_{n 1}}^{(k 1)}

. Therefore, the task of optimizing the kernel order can be denoted by the following optimization problem,

f {(Z)}_{C_{n π_{s}}}^{(k π_{s})} \circ \dots \circ f {(Z)}_{C_{n π_{2}}}^{(k π_{2})} \circ f {(Z)}_{C_{n π_{1}}}^{(k π_{1})} {Z (W)} = min ζ

(15)

where

π

is used to denote the permutation of ordering the kernel circuits such that kernel

C_{n i}

is assigned to stage i, i.e.,

π = {π_{1}, π_{2}, \dots, π_{s}}

with

π_{1} \in {n 1, n 2, \dots, n s}

,

π_{2} \in {n 1, n 2, \dots, n s} {π_{1}}

,

π_{3} \in {n 1, n 2, \dots, n s} {π_{1}, π_{2}}

and so on. This gives the optimal kernel order for a given codeword length and the required number of information bits.

Limiting the analysis for two different kernels and sizes, we divide the aforementioned optimization problem into the following degrees of freedom to be tuned:

The order in which the sets of two kernels would be concatenated;
The position (stage #) at which the second kernel is to be placed if used just once.

Case 1 involves determining whether a kernel is used either in the initial or in the latter stages. Case 2 is used to determine at which intermediate stage would it be optimal to place a different kernel. The goal is to identify regularities that can be generalized to any kernel.

3.2. Evolution of SDTS Parameter for Variation of Kernel Order

In this section, we analyze the first case of kernel ordering mentioned in Section 3.1. Here, two different kernels (of different sizes) are concatenated to design the polarization circuit. The different kernels are arranged in separate blocks of stages and are not nested. Since only two kernels are considered, the assessment involves placing a set of kernels on either the encoder input or output side of the polarization circuit. For the rest of this paper, the encoder output would be denoted as the first stage (stages close to the encoder input are initial stages) and the encoder input as the last stage (stages close to the encoder output are latter stages). In this section, we will focus only on kernels of sizes 2 and 3. In [19], four polarization circuits have been developed with different polarization characteristics, amongst which groups of two shared similar behavior with each other. We will use circuits

C_{3 / 1 d}

and

C_{3 / 2 x}

for the analysis in this section as they have shown to depict contrary behavior in polarization circuit designs. Simulations over different values of the total number of stages have shown that the polarization exponents (a metric proposed in [20] to quantify the strength of polarization for near infinite codeword length) of either

C_{n 1}

or

C_{n 2}

have no effect on the optimal kernel order. Since we would like to investigate codewords of practical codeword (finite) lengths, the exponent would not be the most reliable indicator of design optimality anyway.

Using

n 1 = 2

and

n 2 = 3

with three stages each (

N = 2^{3} \cdot 3^{3} = 216

), the evolution of values of

ζ

is shown in Figure 4 and Figure 5. In Figure 4, circuit

C_{3 / 1 d}

is used as the

C_{3}

circuit whereas in Figure 5 circuit

C_{3 / 2 x}

is used as the

C_{3}

circuit. In both figures, the blue curves denote the circuits where

C_{3}

kernels are used in the initial stages of the polarization circuit. On the other hand, the red curves denote the circuits where

C_{3}

kernels are used in the latter stages of the polarization circuit. It has been observed that for high values of

ζ

(typically for high coderate and high value of

Z (W)

), the curves are too close to each other. Therefore, the range of

ζ

values can be ignored. On the other hand, the comparison of

ζ

for very low values would not be accurate since the difference in polarization quantified by

ζ

would be negligible. Hence, the range of very low

ζ

values would also be ignored.

In Figure 4, it is observed that

C_{3 / 1 d}

is more effective when placed at the latter stages. There is a clear gap between the

ζ

curves of the two kernel order permutations at low coderate (

R_{d} = 0.2

) and half coderate (

R_{d} = 0.5

) as seen in Figure 4. At high coderate, however, the

ζ

curves of the different kernel orders are very close to each other. For high coderate, the

ζ

curves interchange their order, with the blue one being slightly better for high

ζ

values and the red one being slightly better for low

ζ

values. Due to the close proximity of

ζ

curves, the effectiveness of

ζ

needs to be verified by BLER simulations, as to whether the order is truly irrelevant when using this combination at high coderates.

In Figure 5, it is observed that it is preferable to use kernel

C_{3 / 2 x}

in the initial stages for low coderates (

R_{d} = 0.2

) and half coderate (

R_{d} = 0.5

). However, it is advantageous to use this kernel in the latter stages for very high coderates. It was shown in [19] that

C_{3 / 2 x}

is a good choice just for low coderates. Therefore, the best circuit order involving

C_{3 / 2 x}

for high coderates is of low practical importance. Although the analysis was performed in a single-kernel paradigm, the evaluation of the SDTS parameter is the same in a multi-kernel paradigm as well.

Comparing Figure 4 to Figure 5, the SDTS parameter performance of multi-kernel polar codes based on kernel

C_{3 / 1 d}

is worse than that of multi-kernel polar codes based on kernel

C_{3 / 2 x}

for low coderates (

R_{d} = 0.2

and

R_{d} = 0.5

). On the other hand, the SDTS parameter performance of multi-kernel polar codes based on kernel

C_{3 / 1 d}

is better than that of multi-kernel polar codes based on kernel

C_{3 / 2 x}

for higher coderates (

R_{d} = 0.8

). Hence, when

C_{3 / 1 d}

is used in multi-kernel circuits, the performance can be optimized by placing them at the latter stages of the polarization circuit. Similarly, when using

C_{3 / 2 x}

, the performance can be optimized by placing them at the initial stages of the polarization circuit.

Overall, a general best kernel order, that fits all design parameters, is difficult to determine, since the behavior of the SDTS parameter depends on the coderate, the particular kernel used and the corresponding kernel order. Nevertheless, based on desired system parameters, optimized circuit design and kernel choices can be determined and vice-versa.

3.3. Evolution of SDTS Parameter for Variation of Kernel Position

In this section, we analyze the second case of kernel ordering mentioned in Section 3.1. Here, the effect of different positions of a single kernel inside a multi-kernel polar code is analyzed. To simplify the analysis, we will consider only one stage of the

C_{3}

kernel and the remaining

n - 1

stages of the

C_{2}

kernel, for n being the total number of stages in the polarization circuit. The

C_{3}

kernel is placed in one of the intermediate or the first or last stages. Depicting position number by

p o s

,

p o s \in [1, n]

. The resulting codeword length would be of the form

N = 2^{(n - 1)} \cdot 3

. As in Section 3.2, we will only use kernel circuits

C_{3 / 1 d}

and

C_{3 / 2 x}

for

C_{3}

kernel circuits. To further simplify the analysis, in this sub-section

ζ

would be assessed just for half coderate values (

R_{d} = 0.5

).

Figure 6 depicts the evolution of the SDTS parameter, by placing one kernel

C_{3 / 1 d}

at different intermediate stages of a polarization circuit composed along with

(n - 1) = {6, 8}

stages of

C_{2}

, i.e., in total

n = {7, 9}

stages or

N = {192, 768}

. The position that yields the best

ζ

performance differs for varying n. For the

n = 7

, placing

C_{3 / 1 d}

at the last stage leads to the best performance. For

n = 9

, the optimal position of

C_{3 / 1 d}

is barely distinguishable. Therefore, as a generalization, it would be safe to conclude that placing kernel

C_{3 / 1 d}

at the latter stage yields near-optimal performance. A justification for such behavior is that since

C_{3 / 1 d}

provides a high number of relatively good bit channels, it is best suited for the last stages, where the bit channels are already polarized to some extent and the overall polarization needs to be improved further. Since the

ζ

curves are very close, however, the exact performance comparison would require verification with BLER simulations.

The same analysis is presented for different positions of

C_{3 / 2 x}

in Figure 7. For the

n = 7

(

N = 192

), placing

C_{3 / 2 x}

at the first stage leads to the best performance, whereas, placing

C_{3 / 2 x}

on the last stage yields the worst performance. For

n = 9

(

N = 768

), placing

C_{3 / 2 x}

at the third position yields best performance. Thus, there is a general tendency that

C_{3 / 2 x}

yields a better

ζ

performance at the initial rather than at the latter stages. A justification for such behavior is due to the availability of fewer good bit channels in

C_{3 / 2 x}

. This is less effective at the latter stages because the quality of the majority of the already polarized bit channels is not significantly improved. Employing this kernel in the initial stages, however, generates some highly polarized bit channels, if not many. These highly reliable channels can be used in the succeeding stages to further improve the reliability of other channels.

A general observation is that performance optimality based on kernel position gets gradually less relevant with the increasing number of stages. This pattern is as expected since, with an increasing number of stages, the polarization effect from one specific stage gets diluted and becomes less significant within the overall polarization circuit. Therefore, it is more important to optimize the kernel order for a smaller number of stages, i.e., small finite codeword lengths. Additionally,

C_{3 / 1 d}

yields better performance when placed at the latter stages, whereas

C_{3 / 2 x}

is more effective at the initial stages. Such behavioral differences of

C_{3 / 1 d}

and

C_{3 / 2 x}

with respect to coderates have already been observed in [19]. Hence, the kernel order of a multi-kernel polar code has to be optimized individually for each kernel implementation.

In order to validate this hypothesis, a profile of the polarization over the position of

C_{3}

kernel at various stages for a seven stage circuit (

n = 7

) and

R_{d} = 0.5

is presented in Figure 8 under different channel conditions (values of

Z (W)

). This polarization profile shows how

ζ

varies when

C_{3}

circuits (

C_{3 / 1 d}

or

C_{3 / 2 x}

) are placed at one of the intermediate or first or last stages of the circuit.

From each of the four subplots, one can observe that it is beneficial to place

C_{3 / 2 x}

in the initial stages of the circuit as it results in a smaller

ζ

value. Eventually, for the latter when placed in the latter stages,

C_{3 / 1 d}

offers a smaller

ζ

value, therefore being the better choice of

C_{3}

. This observation holds true for any channel condition (multiple instances of

Z (W)

). An interesting observation from Figure 8 is that the curve corresponding to

C_{3 / 1 d}

is predominantly monotonically decreasing by a small factor over subsequent stages. This is because

C_{3 / 1 d}

can improve the overall polarization in subsequent stages owing to the presence of a high number of relatively good bit channels. On the other hand, the curve corresponding to

C_{3 / 2 x}

tends to have a contradictory behavior due to fewer reliable bit channels of this kernel. Only when applied at the first stage, the succeeding stages can exploit the few highly reliable bit channels to eventually improve other bit channels.

4. Error Rate Performance

In Section 3, we used the SDTS parameter to analyze the optimality of the order and position of different kernel circuits within a multi-kernel polar code configuration. It was observed that there does not exist a universal configuration of optimal polar code design that performs best for any given codeword length or coderate. However, SDTS provides an indicator of the trends in the polarization of a certain circuit design and how it would perform for a given set of network parameter requirements.

Based on Section 3, the best position of a kernel within a multi-kernel polar code depends on the coderate on the one hand, and on the kernel implementation on the other hand. For

C_{3}

kernels, the preferable position of

C_{3 / 1 d}

within a multi-kernel polar code is one of the latter stages, while the preferable position of kernel

C_{3 / 2 x}

is one of the initial stages. These

ζ

-based predictions would be verified using block error rate (BLER) curves in the following sub-sections. All simulations have been performed over an Additive White Gaussian Noise Channel (AWGNC), Binary Phase Shift Keying (BPSK) is the underlying modulation scheme and SC decoder is the polar decoding technique. The complexity of SC decoding [2] depends only on the codeword length and is independent of the kernel configuration and ordering. In Section 4.1, the observations from Section 3.2 are validated and in Section 4.2 the observations from Section 3.3 are validated.

4.1. BLER Performance for Variation of Kernel Order

In this section, the BLER results are presented for the case in Section 3.2, where the transformation network consists of two groups of different kernels. The error rate performance corresponding to Figure 4 are presented in Figure 9 and the error rate performance corresponding to and Figure 5 are presented in Figure 10. As per the system configuration in Section 3.2,

N = 2^{3} \cdot 3^{3} = 216

, implying three stages of

C_{2}

and

C_{3}

kernels each. The blue BLER curves correspond to the

C_{3}

kernels being used in the initial stages of the polarization circuit, whereas the red BLER curves correspond to the

C_{3}

kernels being used in the latter stages of the polarization circuit.

In Figure 9, it is observed that BLER performance is best when kernel

C_{3 / 1 d}

is used at the latter stages for any given coderate. However, for low coderate, the performance difference is substantially high, approximately

0.2

dB for

R_{d} = 0.2

. For half or high coderates, the performance gap is negligible with positioning

C_{3 / 1 d}

at the latter stages slightly outperforming the positioning at the initial stages. For high coderate (

R_{d} = 0.8

) too, the performance gap is almost negligible with a flip in performance curves at low and high BLER ranges. This behavior corresponds exactly to the predicted behavior from Figure 4, hence proving the effectiveness of using the SDTS parameter to predict BLER performance.

In Figure 10, it is observed that the BLER performance is best when kernel

C_{3 / 2 x}

is used at the initial stages for any given coderate, with approximately

0.7

dB for

R_{d} = 0.5

. One exception is for high coderate at high BLER values when placing

C_{3 / 2 x}

at the latter stages is a better option, which is predicted by Figure 5. However, since low BLER is not of high interest for performance analysis, this observation can be ignored.

Comparing Figure 9 to Figure 10, clearly for low coderate (

R_{d} = 0.2

), the BLER performance is better when

C_{3 / 2 x}

is used over

C_{3 / 1 d}

. On the other hand, the BLER performance is better when

C_{3 / 1 d}

is used over

C_{3 / 2 x}

at high coderate (

R_{d} = 0.8

). This further validates the prediction of optimality of

C_{3 / 2 x}

for low coderate and

C_{3 / 1 d}

for high coderate mentioned previously as well as determined in [19]. Most importantly, the BLER performance concurs with the observations from Section 3.2. The SDTS parameter performance predicted the best kernel order correctly. The only wrong prediction was made for kernel

C_{3 / 2 x}

for high coderate value (

R_{d} \approx 0.8

) at low BLER. As

C_{3 / 2 x}

has been deemed unsuitable for high coderates anyway, this inconsistency is not highly relevant for practical use cases, as

C_{3 / 1 d}

would be the preferred kernel choice for higher coderates.

4.2. BLER Performance for Variation of Kernel Position

In addition to the group-wise kernel order permutation, the SDTS parameter performance of different positions for a

C_{3}

kernel in combination with multiple stages of

C_{2}

kernels was analyzed in Section 3.3. The error rate performance corresponding to Figure 6 are presented in Figure 11 and the error rate performance corresponding to and Figure 7 are presented in Figure 12.

Figure 11 shows that placing the kernel

C_{3 / 1 d}

, at the last stage yields the best BLER performance, while placing it at the third stage provides the worst performance for both

n = 7

and

n = 9

polarization circuits. This complies with the predictions from the SDTS parameter in Section 6 that kernel

C_{3 / 1 d}

is most effective when placed on the last stages. Comparing Figure 11a to Figure 11b, the performance difference is smaller (more negligible) for a higher number of stages, i.e., for

n = 9

compared to

n = 7

, which corresponds to the observation (Figure 6a compared to Figure 6b) that for a polarization circuit with a higher number of stages, the performance dependency on kernel position for a single stage is substantially diluted.

Figure 12 shows that placing the kernel

C_{3 / 2 x}

, at the first stage for

n = 7

and at the third stage for

n = 9

yields the best BLER performance, while placing

C_{3 / 2 x}

at the last stage provides worst performance. This complies exactly with the observations from Section 5. It is beneficial to apply kernel

C_{3 / 2 x}

in the initial stages.

The authors of [11] presented a similar approach to kernel positioning. They concluded that the kernel considered therein (

G_{3}

with polarization exponent the same as

C_{3 / 1 d}

) is preferable in the latter stages for half coderate. As already conjectured above, kernels with the same or a similar exponent may have the same optimal kernel order. Overall, the optimization of kernel order turns out to be a complex problem. One challenge is the high number of parameters that affect the performance of a kernel order, such as the particular kernel design, codeword length and the coderate. Another issue is the lack of ability to generalize the observations since the optimization of kernel order proved to be a problem specific to the considered code design. Nevertheless, the behavioral tendency of a particular kernel implementation could be verified, for example, whether it tends to be advantageous either in the initial or in the latter stages of the polarization circuit. This has been demonstrated for the kernels

C_{3 / 1 d}

and

C_{3 / 2 x}

.

5. Future Work

The effectiveness of the SDTS parameter has been successfully investigated for polarization kernels of sizes 2, 3 and 4 in the single kernel and sizes 2 and 3 in the multi-kernel paradigm. Naturally, an extension of this research would include identifying how useful

ζ

could be for arbitrarily higher kernel sizes and their multi-kernel implementations. Since, the number of valid kernels increases exponentially with increasing kernel size, there exist many possible candidates and consequently their combinations. Hence, it becomes increasingly complex to determine optimal configurations of polarization circuit designs. The availability of such a parameter to quantify the strength of polarization for finite codeword lengths without error rate simulations is quite useful. Since the SDTS parameter is aimed for the same purpose as the polarization exponent, identifying a relationship between these two parameters could be an interesting future research work that would significantly improve generalized polar code design for any codeword length.

6. Conclusions

The performance characteristics of higher ordered kernels have been identified by the SDTS parameter and its utility for optimization of polarization circuits to generate polar codes has been validated for single kernel polar codes in [19] and multi-kernel polar codes in this paper. Polarization kernels of higher order are desirable to improve the applicability of polar codes since a wider range of codeword lengths can be achieved without the need for downsizing techniques such as puncturing or shortening. The SDTS parameter has been able to accurately predict the preferable choice of kernel configuration as well as ordering and placement within the circuit structure to provide the best error rate performance of finite length codewords for a given set of system parameter settings. This offers a much simpler and quicker approach for polar code design without the need for BER/BLER simulations to determine the optimality of polar code design. Although the BLER validation has been performed using only the SC decoder, we performed some baseline simulations using SCL (Successive Cancellation List) decoding with list sizes 2 and 8, well as the corresponding SCL with CRC (Cyclic Redundancy Check) of 8 bits. The observed behavior was the same as for the SC decoder, i.e., the SDTS parameter can make accurate error rate performance prediction over all those SC-based polar decoding techniques.

For the

3 \times 3

kernels investigated, it has been determined that

C_{3 / 1 d}

is better at the latter and

C_{3 / 2 x}

is better at the initial stages. Additionally, the performance difference for

C_{3 / 1 d}

curves is small and for

C_{3 / 2 x}

is large. This is because

C_{3 / 2 x}

has few highly polarized channels and therefore a change in their placement within the polarization circuit has a higher effect on the performance. On the other hand, since

C_{3 / 1 d}

has a higher number of less polarized bit channels, the effect of a change in their placement within the polarization circuit has a lower effect on the performance. All these effects are predicted by the SDTS parameter and thus it could provide insights into kernel behavior within a polarization circuit. This aids in generalizing the optimal design of polarization circuits for a wide range of codeword lengths which is one of the key challenges associated with the implementation of polar codes.

Author Contributions

Conceptualization, S.S. and L.M.; methodology, L.M. and S.S.; software, L.M. and S.S.; validation, S.S. and L.M.; formal analysis, S.S.; investigation, L.M. and S.S.; resources, M.A. and P.J.; data curation, S.S. and L.M.; writing—original draft preparation, S.S.; writing—review and editing, S.S. and M.A.; visualization, S.S., L.M. and M.A.; supervision, M.A. and P.J.; project administration, M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data available on request and not publicly accessible due to ownership restrictions.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AWGNC	Additive White Gaussian Noise Channel
BEC	Binary Erasure Channel
BER	Bit Error Rate
BLER	Block Error Rate
BP	Belief Propagation
BPSK	Binary Phase Shift Keying
DTS	Downsizing Type Selection
SDTS	Scaled-DTS
SISO	Soft Input Soft Output
SC	Successive Cancellation

References

3GPP TS 38.212 V17.2.0. Technical Specification, 3rd Generation Partnership Project; Technical Specification Group Radio Access Network; NR; Multiplexing and Channel Coding (Release 17). Available online: https://www.3gpp.org/ftp/Specs/archive/38_series/38.212/ (accessed on 20 July 2022).
Arikan, E. Channel Polarization: A Method for Constructing Capacity-Achieving Codes for Symmetric Binary-Input Memoryless Channels. IEEE Trans. Inf. Theory 2009, 55, 3051–3073. [Google Scholar] [CrossRef]
Tal, I.; Vardy, A. List Decoding of Polar Codes. IEEE Trans. Inf. Theory 2015, 61, 2213–2226. [Google Scholar] [CrossRef]
Afisiadis, O.; Balatsoukas-Stimming, A.; Burg, A. A low-complexity improved successive cancellation decoder for polar codes. In Proceedings of the 48th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 2–5 November 2014; pp. 2116–2120. [Google Scholar]
Arikan, E. Polar codes: A pipelined implementation. In Proceedings of the 4th ISBC, Malacca, Malaysia, 11–14 July 2010. [Google Scholar]
Mahdavifar, H.; El-Khamy, M.; Lee, J.; Kang, I. Polar Coding for Bit-Interleaved Coded Modulation. IEEE Trans. Veh. Technol. 2016, 65, 3115–3127. [Google Scholar] [CrossRef]
Zhang, L.; Zhang, Z.; Wang, X. Polar code with block-length N = 3ⁿ. In Proceedings of the International Conference on Wireless Communications and Signal Processing (WCSP), Huangshan, China, 25–27 October 2012; pp. 1–6. [Google Scholar]
Cheng, L.; Zhou, W.; Zhang, L. Hybrid Multi-Kernel Construction of Polar Codes. In Proceedings of the IEEE 89th Vehicular Technology Conference (VTC2019-Spring), Kuala Lumpur, Malaysia, 28 April–1 May 2019; pp. 1–5. [Google Scholar]
Xia, C.; Tsui, C.-Y.; Fan, Y. Construction of Multi-Kernel Polar Codes With Kernel Substitution. IEEE Wirel. Commun. Lett. 2020, 9, 1879–1883. [Google Scholar] [CrossRef]
Gabry, F.; Bioglio, V.; Land, I.; Belfiore, J. Multi-kernel construction of polar codes. In Proceedings of the IEEE International Conference on Communication (ICC), Paris, France, 21–25 May 2017; pp. 761–765. [Google Scholar]
Bioglio, V.; Gabry, F.; Land, I.; Belfiore, J. Multi-Kernel Polar Codes: Concept and Design Principles. IEEE Trans. Commun. 2020, 68, 5350–5362. [Google Scholar] [CrossRef]
Benammar, M.; Bioglio, V.; Gabry, F.; Land, I. Multi-Kernel Polar Codes: Proof of Polarization and Error Exponents. In Proceedings of the IEEE Information Theory Workshop (ITW), Kaohsiung, Taiwan, 6–10 November 2017; pp. 101–105. [Google Scholar]
Bioglio, V.; Gabry, F.; Land, I.; Belfiore, J. Minimum-Distance Based Construction of Multi-Kernel Polar Codes. In Proceedings of the IEEE Global Communications Conference (GLOBECOM), Singapore, 4–8 December 2017; pp. 1–6. [Google Scholar]
Bioglio, V.; Land, I.; Gabry, F.; Belfiore, J. Flexible design of Multi-Kernel Polar Codes by reliability and distance properties. In Proceedings of the IEEE 10th International Symposium on Turbo Codes and Iterative Information Processing (ISTC), Hong Kong, China, 3–7 December 2018. [Google Scholar]
Bioglio, V.; Land, I. On the Marginalization of Polarizing Kernels. In Proceedings of the IEEE 10th International Symposium on Turbo Codes and Iterative Information Processing (ISTC), Hong Kong, China, 3–7 December 2018. [Google Scholar]
Bioglio, V.; Land, I.; Condo, C. Improved Hybrid Design of Polar Codes and Multi-Kernel Polar Codes. In Proceedings of the IEEE International Symposium on Information Theory (ISIT), Paris, France, 7–12 July 2019; pp. 1947–1951. [Google Scholar]
Saha, S.; Adrat, M. Novel Multi-Parameter based Rate-Matching of Polar Codes. In Proceedings of the International Conference on Military Communications and Information Systems (ICMCIS), Oeiras, Portugal, 4–5 May 2021. [Google Scholar]
Saha, S.; Adrat, M. Multi-Kernel Polar Codes versus Classical Designs with Different Rate-Matching Approaches. Electronics 2021, 10, 1717. [Google Scholar] [CrossRef]
Saha, S.; Adrat, M.; Maßny, L.; Schrammen, M.; Jax, P. Novel Recursive Kernel Construction for Polar Codes with Practical Codeword Lengths. In Proceedings of the 15th International Conference on Signal Processing and Communication Systems (ICSPCS), Online, 13–15 December 2021; pp. 1–10. [Google Scholar]
Korada, S.B.; Sasoglu, E.; Urbanke, R. Polar codes: Characterization of exponent, bounds, and constructions. IEEE Trans. Inf. Theory 2010, 56, 6253–6264. [Google Scholar] [CrossRef]

Figure 1.

2 \times 2

polarization circuit.

Figure 1.

2 \times 2

polarization circuit.

Figure 2. Example of

3 \times 3

polarization circuit [19].

Figure 2. Example of

3 \times 3

polarization circuit [19].

Figure 3. Example of

6 \times 6

polarization circuit using

2 \times 2

and

3 \times 3

polarization circuits [18].

Figure 3. Example of

6 \times 6

polarization circuit using

2 \times 2

and

3 \times 3

polarization circuits [18].

Figure 4. Evolution of SDTS parameter using

n 1 = 2

and

n 2 = 3

with circuit

C_{3 / 1 d}

for

N = 216

.

Figure 4. Evolution of SDTS parameter using

n 1 = 2

and

n 2 = 3

with circuit

C_{3 / 1 d}

for

N = 216

.

Figure 5. Evolution of SDTS parameter using

n 1 = 2

and

n 2 = 3

with circuit

C_{3 / 2 x}

for

N = 216

.

Figure 5. Evolution of SDTS parameter using

n 1 = 2

and

n 2 = 3

with circuit

C_{3 / 2 x}

for

N = 216

.

Figure 6. Evolution of SDTS parameter for

N = 192

and

N = 768

with

n - 1 = 6

stages (a) or

n - 1 = 8

stages (b) of

C_{2}

circuits and one intermediate stage of

C_{3 / 1 d}

circuit with

p o s

being its stage number.

Figure 6. Evolution of SDTS parameter for

N = 192

and

N = 768

with

n - 1 = 6

stages (a) or

n - 1 = 8

stages (b) of

C_{2}

circuits and one intermediate stage of

C_{3 / 1 d}

circuit with

p o s

being its stage number.

Figure 7. Evolution of SDTS parameter for

N = 192

and

N = 768

with

n - 1 = 6

stages (a) or

n - 1 = 8

(b) of

C_{2}

circuits and one intermediate stage of

C_{3 / 2 x}

circuit with

p o s

being its stage number.

Figure 7. Evolution of SDTS parameter for

N = 192

and

N = 768

with

n - 1 = 6

stages (a) or

n - 1 = 8

(b) of

C_{2}

circuits and one intermediate stage of

C_{3 / 2 x}

circuit with

p o s

being its stage number.

Figure 8. Polarization profile, using SDTS parameter, when

C_{3 / 1 d}

and

C_{3 / 2 x}

are placed at first or last stages of a polarization circuit with

n = 7

.

Figure 8. Polarization profile, using SDTS parameter, when

C_{3 / 1 d}

and

C_{3 / 2 x}

are placed at first or last stages of a polarization circuit with

n = 7

.

Figure 9. BLER performance comparison using

C_{3 / 1 d}

in the initial or latter stages for

N = 216

.

Figure 9. BLER performance comparison using

C_{3 / 1 d}

in the initial or latter stages for

N = 216

.

Figure 10. BLER performance comparison using

C_{3 / 2 x}

in the initial or latter stages for

N = 216

.

Figure 10. BLER performance comparison using

C_{3 / 2 x}

in the initial or latter stages for

N = 216

.

Figure 11. BLER performance comparison using

C_{3 / 1 d}

in the initial or latter stages for

n = 7

stages (a) and

n = 9

stages (b).

Figure 11. BLER performance comparison using

C_{3 / 1 d}

in the initial or latter stages for

n = 7

stages (a) and

n = 9

stages (b).

Figure 12. BLER performance comparison using

C_{3 / 2 x}

in the initial or latter stages for

n = 7

stages (a) and

n = 9

stages (b).

Figure 12. BLER performance comparison using

C_{3 / 2 x}

in the initial or latter stages for

n = 7

stages (a) and

n = 9

stages (b).

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Saha, S.; Maßny, L.; Adrat, M.; Jax, P. On the Effects of Kernel Configuration in Multi-Kernel Polar Codes. Entropy 2022, 24, 1457. https://doi.org/10.3390/e24101457

AMA Style

Saha S, Maßny L, Adrat M, Jax P. On the Effects of Kernel Configuration in Multi-Kernel Polar Codes. Entropy. 2022; 24(10):1457. https://doi.org/10.3390/e24101457

Chicago/Turabian Style

Saha, Souradip, Luis Maßny, Marc Adrat, and Peter Jax. 2022. "On the Effects of Kernel Configuration in Multi-Kernel Polar Codes" Entropy 24, no. 10: 1457. https://doi.org/10.3390/e24101457

APA Style

Saha, S., Maßny, L., Adrat, M., & Jax, P. (2022). On the Effects of Kernel Configuration in Multi-Kernel Polar Codes. Entropy, 24(10), 1457. https://doi.org/10.3390/e24101457

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On the Effects of Kernel Configuration in Multi-Kernel Polar Codes^†

Abstract

1. Introduction

2. Preliminaries

2.1. $2 \times 2$ Kernel Circuit

2.2. $3 \times 3$ Kernel Circuit

2.3. Multi-Kernel Circuit

3. Analysis of Kernel Configuration

3.1. Effect of Kernel Order on z Parameter

3.2. Evolution of SDTS Parameter for Variation of Kernel Order

3.3. Evolution of SDTS Parameter for Variation of Kernel Position

4. Error Rate Performance

4.1. BLER Performance for Variation of Kernel Order

4.2. BLER Performance for Variation of Kernel Position

5. Future Work

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

On the Effects of Kernel Configuration in Multi-Kernel Polar Codes †

Abstract

1. Introduction

2. Preliminaries

2.1. 2 × 2 Kernel Circuit

2.2. 3 × 3 Kernel Circuit

2.3. Multi-Kernel Circuit

3. Analysis of Kernel Configuration

3.1. Effect of Kernel Order on z Parameter

3.2. Evolution of SDTS Parameter for Variation of Kernel Order

3.3. Evolution of SDTS Parameter for Variation of Kernel Position

4. Error Rate Performance

4.1. BLER Performance for Variation of Kernel Order

4.2. BLER Performance for Variation of Kernel Position

5. Future Work

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

On the Effects of Kernel Configuration in Multi-Kernel Polar Codes^†

2.1. $2 \times 2$ Kernel Circuit

2.2. $3 \times 3$ Kernel Circuit