1. Introduction
In the passage from classical to quantum mechanics, the position $r$ was promoted to the operator $\hat{r}$, conjugate to the momentum $p$ through the non-trivial commutation relation $[\hat{r},\hat{p}]=i\hbar$, which leads to uncertainty in their values [1,2]. As such, $\hat{r}$ appears as a differential operator acting on wavefunctions coordinated by $p$, with crucial consequences whether $\hat{r}$ is explicitly evaluated (e.g., for transport) or is trivially constant (e.g., for an atomic Hamiltonian) while energies/eigenstates are to be solved [1].
The familiar form of $\hat{r}$ belongs to wave mechanics, while the matrix form of $\hat{r}$ is rarely seen. This disproportion is due to the lack of convergence of the $r$-matrix: the diagonals diverge in plane-wave bases, Bloch bases, etc. [3,4,5,6,7,8]. Given the autonomy of matrix mechanics and its equivalence with wave mechanics [2] (Chapter 3), it is puzzling that the diagonals of a physical operator (i.e., its expectation values) all diverge. This undermines preservation of the spectrum of the matrix, obstructs obtaining an equivalent $r$-matrix by basis transformation, and even casts doubt on the convergence of the off-diagonals, given the holistic nature of a matrix. We elaborate on this misbehavior of the $r$-matrix in Section 2.
In contrast, the spin matrix can readily be found by solving $[\hat{S}_i,\hat{S}_j]=i\hbar\epsilon_{ijk}\hat{S}_k$. Pauli matrices provide the simplest solutions, and many others exist, either reducible or irreducible. Formally, representations of Lie algebras are involved [9]. In physics, we consider $\mathfrak{su}(2)$, as it stands for spin. Generally, systematic construction and classification of Lie algebras have been achieved for finite dimensions (Cartan matrices [9,10]) and infinite dimensions (e.g., Kac-Moody algebras [10]). It is tempting to try the same for the $r$-matrix by solving $[\hat{r},\hat{p}]=i\hbar$: the operator on the right-hand side is now replaced by a complex number, leading to Weyl algebras [10,11]. Unfortunately, this is doomed to fail, because Weyl algebras admit no matrix representations (Section 2). Does this mean that an $r$-matrix cannot exist? Given that $[\hat{r},\hat{p}]=i\hbar$ cannot be solved with matrices, could any matrix be assigned to $\hat{r}$?
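The contrast between the two cases can be checked numerically. Below is a minimal sketch (plain NumPy; the convention $S_i=\sigma_i/2$ with $\hbar=1$ is our own choice) verifying that the Pauli matrices solve the spin commutation relations:

```python
import numpy as np

# Pauli matrices; S_i = sigma_i / 2 in units with hbar = 1 (our convention).
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
S = [s / 2 for s in (sx, sy, sz)]

def comm(a, b):
    """Commutator [a, b] of two matrices."""
    return a @ b - b @ a

# Totally antisymmetric symbol eps_ijk.
eps = np.zeros((3, 3, 3))
eps[0, 1, 2] = eps[1, 2, 0] = eps[2, 0, 1] = 1
eps[0, 2, 1] = eps[2, 1, 0] = eps[1, 0, 2] = -1

# Check [S_i, S_j] = i * eps_ijk * S_k for all index pairs.
for i in range(3):
    for j in range(3):
        rhs = sum(1j * eps[i, j, k] * S[k] for k in range(3))
        assert np.allclose(comm(S[i], S[j]), rhs)
print("Pauli matrices satisfy the spin commutation relations.")
```

No analogous finite matrices exist for $[\hat{r},\hat{p}]=i\hbar$, as discussed in Section 2.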
A major goal of this work is to derive convergent $r$-matrices (CRMs) in arbitrary finite dimensions by introducing the $N$-th Weyl algebra $A_N$ (Section 2). This can be viewed as a procedure for encoding $\hat{r}$, an infinite-dimensional operator of continuous spectrum, using matrices of finite dimension. Theoretically, it is convenient to have such a formal conversion between dimensions, and such a matrix description of a differential operator [12]. However, we stress that the $r$-matrices do not yield representations. In particular, we must carefully distinguish the terms "matrix" and "matrix representation." In fact, the known $r$-matrices [3,4,5,8], called divergent $r$-matrices (DRMs) in this context, do not yield representations either, a fact which has somehow been concealed by the divergence. The DRMs derive from the first Weyl algebra $A_1$ (just the familiar substitution of $\hat{r}$ by a derivative); we analyze the divergence and find that expanding $A_1$ to $A_N$ can fix it. Here, $N$ depends on the dimension of the Bloch space. Although $A_1$ is the special case of $A_N$ with $N=1$, the DRMs are not 1-D CRMs. In fact, the CRMs converge for arbitrary dimensions and are thus distinct from the DRMs, rather than including them as a special case (Section 4).
Why are $r$-matrices important? A short answer is that they are involved in both transport [4,5,6,7,8,13,14,15,16,17,18] and topology [19,20,21,22,23,24,25,26,27] in crystals (e.g., under Bloch bases). For example, when band electrons are exposed to light [28,29,30,31,32,33,34] and undergo resonant inter-band transitions [4,5,7,8,13,18], the charge center changes, leading to a shift current [8,18]. In Equation (1), the hopping rates multiply the position shift associated with each transition. Here, the shift vector is obtained by subtracting diagonals between bands $m$ and $n$, while a complementary term (involving the imaginary part of the off-diagonals) ensures gauge invariance [4,5,18]. As such, $r$-matrices enter through the distance shifted during hopping.
The $r$-matrices make another entrance in connection with the hopping rates. The light field is usually modelled with a dipole-type coupling to $\hat{r}$ [4]. The modification of quantum states is encoded in a matrix which forms the basic building unit for $n$th-order perturbation theory [4,5,8]. For instance, consider the linear response known as the Fermi Golden Rule, derived via time-dependent perturbation theory. The $\delta$-function reflects energy conservation, and the energy difference between bands is defined below. Combined with Equation (1), the DC component of the second-order response to an external driving field of frequency $\omega$ is found in [8], where $f_{mn}$ is the difference of the Fermi distributions (at the respective band energies) between two bands. Comparing the two expressions, we recognize the hopping rate. Clearly, the hopping rate relies on the $r$-matrix. When higher-order perturbations are counted, the response involves higher-order products of $r$-matrices [4,5,8,26].
Simply speaking, the matrix components originate from the band labels $m$, $n$. The $r$-matrix is linked to observables in various forms [15,16,35,36,37,38,39,40,41]. In view of these consequences of the $r$-matrix, the dichotomy between wave and matrix mechanics and their mutual replaceability deserve second thought. We should not naively attribute the $r$-matrix solely to matrix mechanics, since evaluation and transformation of the $r$-matrix inevitably involve the nature of $\hat{r}$ as a differential operator, sometimes in implicit ways. Nor should we think that using the differential form makes the $r$-matrix redundant. Matrices and differential operators are extensively discussed in Section 5.
We suppress Cartesian indices in Equations (1)–(6). In general, optical conductivities are tensors. Nevertheless, the simple fact is that the $r$-matrix appears in optical responses. More importantly, its role is clear: since the $r$-matrix takes the form of a connection, a differential-geometric notion, it opens the door to quantum geometry [35,36,37,38,39,40,41,42]. A notable success is the linkage of the Berry connection with the Wannier center [23,27], leading to the quantization of adiabatic charge pumping [22] and the Berry phase theory of polarization [27]. In this vein, given appropriate coupling forms or scenarios (potentially ignoring certain degrees of freedom [38]), more geometric interpretations appear, such as curvature [19], a quantum metric (a distance defined between quantum states) [36], and tangent space [38]. It is fascinating that these geometric notions enter diverse phenomena which seem mutually unrelated.
Although substantial advances have been made in the geometric interpretation of optical transitions [17,18,35,36,37,38,39,40,41], the issue of divergence has not yet been dealt with. The $r$-matrix is still based on DRMs, the divergence arising from the diagonal terms. The issue was raised by Blount in the 1950s [3]. Since then, no essential progress has been made on resolving the divergence, or on its origin and significance. Thus, the diagonal terms in Equation (1) are not evaluated with the Bloch functions $\psi_{nk}$ but with the periodic functions $u_{nk}$. In arguing for such a substitution, resort is made to a heuristic: "The Bloch wave does not work, so something else should be used." However, it remains unaddressed whether $u_{nk}$ is the only possible choice, and whether using $u_{nk}$ can be attributed to a general principle. Moreover, the implications of employing $u_{nk}$ and integrating over $k$ to yield observables are disturbing, because observables are then no longer taken from the diagonals of a physical operator, contrary to the orthodoxy of matrix mechanics [2]. There has not yet been any comprehensive justification of this procedure, which seems quite remarkable given the maturity of quantum mechanics [2].
One can ensure that the diagonal entries appear in quarantined form, just like cutting off the rotten parts of an apple, but it is unclear whether converging entries in a diverging matrix are still meaningful, at the very least in the absence of a renormalization protocol. Another concern is that the geometry often relies on perturbation series [4,38,40], which incurs risks: strong interactions or gap closing might undermine the perturbative treatment; the geometric interpretation could be sensitive to the order of truncation; and it is hard to recognize a geometric effect when it is confounded with other effects [40]. Any of these possibilities could diminish the fundamentality and elegance of a geometric formula. Additionally, hopping, which should be a continuous process, is usually interpreted in terms of a pair of states (initial and final), while the geometric interpretation requires the intermediate states to be accounted for [38,40]. In short, the present situation is not satisfactory, and worry arises from the logical gap: the $r$-matrix does not stand on a solid foundation, and observable extraction is clearly incompatible with the basic rules for matrices, yet observables based on DRMs are continually being proposed [8,17,18,35,36,37,38,39,40,41].
Our aim in this work is two-fold. At the conceptual level, we resolve the vagueness in using $\psi_{nk}$ or $u_{nk}$ by introducing the "$r$-matrix" and the "reduced $r$-matrix". The former is evaluated with $\psi_{nk}$, and the latter with $u_{nk}$. Both are defined in convergent fashion, regarded as different facets of the CRM, and their relations are deduced. The vector space spanned by the $u_{nk}$ is recognized, and its dimension and relation with the Bloch space (spanned by the $\psi_{nk}$) are clarified. With the CRM, the difficulty in using either diagonals or off-diagonals disappears. Moreover, we recall the non-existence of a matrix representation for a Weyl algebra, and show that the Bloch waves are incomplete for $\hat{r}$. As a consequence, the principles for deducing the $r$-matrix must differ from, for instance, those for the spin operator. For spin, a complete "total space" serves as a Hilbert space which affords a Lie algebra representation. For $\hat{r}$ and the Weyl algebra, it is a quotient space of a total space (or the fiber space of a bundle in bundle theory [43,44]) which serves as the Hilbert space. The algebraic procedure for this expands $A_1$ to $A_N$.
At the application level, our main focus is on transport, which, by definition, means position change of charge carriers. Thus, it ultimately concerns the expectation value of $\hat{r}$. The analysis of diverse transport mechanisms, such as injection currents [8,28], shift currents [8,18], and adiabatic currents [22,27], currently involves vagueness or arbitrariness in the extraction of the observable position. We leave the development of a unified transport theory based on the CRM for the future. Here, we concentrate on the issue of observable extraction. Since the definition of the $r$-matrix is subject to different principles, distinct transformation behaviors and gauge issues emerge. The methods for extracting expectation values also vary. All these phenomena suggest that $\hat{r}$ is not the same type of operator as spin. To unify the differing concepts, we introduce the notion of a ribbon. Another focus is on designation systems, and we point out the risks of using bra/ket notation when differential operators are involved. The organization, major results, and innovative features of the paper are summarized in Table 1.
2. Position Operators and Weyl Algebras
We introduce the three important vector spaces involved. The first is the space spanned by the eigenstates of the position operator $\hat{r}$ (or of $\hat{p}$; the two sets of eigenstates are equivalent bases linked by Fourier transformation). The second is the Bloch space, spanned by the bases $\psi_{nk}$. The Bloch space is isomorphic to the space spanned by the Wannier functions [27]. The third vector space is defined shortly, as a quotient space of the Bloch space (i.e., the Bloch space can be expressed as a product involving another vector space $E$). We show how to bring $\hat{r}$ down from the position space, on which it is originally defined, to a matrix defined on a finite-dimensional quotient space.
Let us first introduce the position space. In general, the identity of a vector space is characterized by its dimension and its inner product. If and only if both aspects are the same are two vector spaces considered identical. For finite-dimensional spaces, if there exists an invertible (one-to-one) map between two spaces and the inner product remains unchanged under the map (formally, such a map is called an inner-product-preserving or structure-preserving map), the two spaces are said to be isomorphic [45].
The dimension of the position space is evidently infinite, as the eigenvalue $r$ takes all possible real values. To be more accurate, the dimension is uncountably infinite, as detailed shortly. The inner product defined for a vector space (often said to be "equipped on the space") is formally a map $V\times V\to\mathbb{C}$: the inputs (on the left of the arrow) are two elements (vectors) of the space, and the output is a complex number. On top of the inner product, one can say that the space is complete (such that it qualifies as a Hilbert space), referring to the following fact: each $r$ corresponds to a distinct eigenstate; thus, these eigenstates are as numerous as the real numbers. More rigorously, the cardinality (a term characterizing the population of an infinite set) of the set of eigenstates is equal to that of $\mathbb{R}$. The set $\mathbb{R}$ is known as "uncountably infinite", meaning its entries cannot be listed in one-to-one correspondence with the set of natural numbers $\mathbb{N}$, which is countably infinite. In other words, $\mathbb{R}$ is "larger" than $\mathbb{N}$, although both are infinite. Therefore, if another space has countably infinite dimension, it is smaller than the position space, and thus cannot be isomorphic to it [46].
When the dimension rises to infinity, some fundamental changes take place. For example, it is possible to write a finite-dimensional operator $M$ in matrix form; with respect to a basis of eigenstates, the matrix $M$ is diagonal. It might be tempting to think that $\hat{r}$ could be such a diagonal matrix, with the eigenvalues $r$, $r'$ regarded as row/column labels, except that the matrix now becomes infinitely large to host the infinite number of elements. However, this is incorrect. Firstly, the row/column labels take values in the set of (positive) natural numbers $\mathbb{N}$, which is countably infinite. This procedure does not apply to uncountably infinite sets such as $\mathbb{R}$. Intuitively speaking, a matrix with countably infinitely many rows/columns is still "not big enough." Secondly, the matrix formalism stipulates that contraction of an index $i$ sums over all its possible values: $\sum_i$. When this is extended to $r$, it becomes an uncountably infinite sum, which in general diverges [47].
Another issue arising at infinite dimension is the lack of converging norms. The norm is the "length" of a vector, which should be positive definite and which physically gives the probability density of a particular eigenstate. If we simply apply the normalization for finite-dimensional vectors to the eigenstates of $\hat{r}$, we find that the norm is infinite. Roughly speaking, this divergence is reflected by the "infinite spike" of the $\delta$-function. The diverging norm makes derivatives of states ill-defined, and thus, in this space, a Berry connection is also ill-defined. This is understandable, since an arbitrarily small deviation $r\to r'$ makes Equation (7) jump from infinity to zero (i.e., $\langle r|r\rangle=\delta(0)\to\infty$, while $\langle r|r'\rangle=0$ for $r'\neq r$), evidently not differentiable. In fact, the following aspects are interrelated: (1) the norm of a space; (2) the dimension of a space; (3) the differentiability and derivatives of vector states; (4) geometric notions, such as Berry connections and curvatures. To define notions such as Berry curvatures, we must reduce the dimension of the space, and the four aspects above must be modified in parallel.
In addition, the average position of an extensive state, e.g., a plane wave, also diverges. This is not a concern for scattering problems, where normalization is not required and only the relative amplitudes of the incoming/outgoing beams matter; nor when only localized states are involved and the average position is fixed, as for atomic Hamiltonians or harmonic oscillators [1]. However, for transport (e.g., shift currents [8,18]), a diverging position could be fatal, destroying any attempt at a meaningful definition of transport.
In the position space, the operator $\hat{r}$ is needed in the commutation relation $[\hat{r},\hat{p}]=i\hbar$ but never stands alone; it is always paired with its conjugate, the momentum $\hat{p}$ [1,2]. By linearity, one may absorb the $i$ into the operator to obtain the alternative convention of Equation (10). With Equation (10) as the generator, one obtains an infinite set of operators forming a ring. A ring is an algebra equipped with two operations, addition and multiplication [48] (division is not required). Addition must be Abelian and invertible; multiplication is not required to be commutative or invertible. The most familiar ring is the set of integers $\mathbb{Z}$. Addition and multiplication are defined among integers; most importantly, the sum and the product of two integers give another integer. This requirement is known as closure. Thus, in this case, the ring is the set of integers combined with the operations defined on them (or said to be equipped on them); it is thus more than just a set.
In general, the elements of a ring could be anything. Here, we are concerned with a ring composed of polynomials and derivatives, as in Equation (11) (using the Einstein convention), where the polynomials serve as the "coefficients" of the partial derivatives. One is at liberty to select either $r$ or $p$ as the variable. The ring generated by Equation (11) is called a Weyl algebra [10,11]. In fact, we encounter many Weyl algebras in quantum mechanics. Consider the formalism $\int dr\,\psi^*(r)\,\hat{p}\,\psi(r)$ for yielding the expectation value of momentum. It involves the multiplication of derivatives with functions, where the generic functions $\psi^*$ and $\psi$ can be approximated by Taylor expansion, with polynomials serving as coefficients. The integration over $r$ arises from the addition operation equipped on the ring. Hence, defining the position operator, which is a major goal of this work, boils down to its mathematical role in constructing generators of Weyl algebras (Appendix F).
In quantum mechanics, we tend to interpret $\partial_r$ as a derivative "acting" on a function of $r$. In other words, $\partial_r$ is an operation, and $f(r)$ is a function for $\partial_r$ to act on; the two are not on an equal footing. In the ring framework, this is equivalently interpreted as an abstract multiplication of $\partial_r$ with $f(r)$, such that the two are on an equal footing as elements of a ring. For example, if $f=r^2$, the result of the multiplication is $\partial_r\circ r^2=r^2\partial_r+2r$, which corresponds in Equation (11) to a derivative term with polynomial coefficient plus a pure polynomial term. In abstract algebra, such a multiplication is no different from a "normal" multiplication like $2\times 3$, as long as the closure (definition) of the multiplication is respected. The closure determines the range of the ring. Note that the conjugate pair $\hat{r}$ and $\hat{p}$ are Weyl algebra generators; the full Weyl algebra contains all possible orders of polynomials for multiplication closure.
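This ring multiplication can be sketched concretely. The encoding below is our own minimal bookkeeping (not from the text): an element of $A_1$ is a dictionary mapping $(m,n)$ to the coefficient of $x^m\partial^n$, and products are normal-ordered with the identity $\partial^b x^c=\sum_k\binom{b}{k}\binom{c}{k}k!\,x^{c-k}\partial^{b-k}$, which follows from repeatedly applying $\partial x=x\partial+1$:

```python
from math import comb, factorial
from collections import defaultdict

# An element of the first Weyl algebra A_1 is a dict {(m, n): coeff}
# standing for a sum of coeff * x^m * d^n (d = d/dx), in normal order.

def mul(A, B):
    """Multiply two normally ordered elements using
    d^b x^c = sum_k C(b,k) C(c,k) k! x^(c-k) d^(b-k)."""
    out = defaultdict(int)
    for (a, b), ca in A.items():
        for (c, e), cb in B.items():
            for k in range(min(b, c) + 1):
                coef = ca * cb * comb(b, k) * comb(c, k) * factorial(k)
                out[(a + c - k, b + e - k)] += coef
    return {key: v for key, v in out.items() if v != 0}

x = {(1, 0): 1}   # the generator x
d = {(0, 1): 1}   # the generator d/dx

# d * x^2 = x^2 d + 2x, matching the worked example in the text.
print(mul(d, mul(x, x)))   # {(2, 1): 1, (1, 0): 2}

# The defining relation: d*x - x*d = 1.
commutator = defaultdict(int)
for key, v in mul(d, x).items():
    commutator[key] += v
for key, v in mul(x, d).items():
    commutator[key] -= v
print({k: v for k, v in commutator.items() if v != 0})  # {(0, 0): 1}
```

The first print reproduces the worked example $\partial\circ r^2=r^2\partial+2r$; the second confirms that the commutator of the generators is the constant 1, i.e., closure under multiplication is maintained entirely within polynomials and derivatives.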
It is instructive to compare the Weyl algebra with a Lie algebra; the latter is a vector space $V$ (whose generators serve as bases), over $\mathbb{R}$ or $\mathbb{C}$ (real or complex numbers as coefficients multiplying the bases), equipped with Lie brackets [44], the bracket being just the commutator "$[\cdot,\cdot]$". Consider the spin operators $\hat{S}_i$ with $[\hat{S}_i,\hat{S}_j]=i\hbar\epsilon_{ijk}\hat{S}_k$. In intuitive language, the bracket turns the two operators plugged into it into a single operator on the right-hand side. Formally, the bracket is a binary map $V\times V\to V$. Finding a spin representation is just looking for mathematical objects (matrices or any other well-defined terms) that reproduce the relations described by the brackets. For a Weyl algebra, the crucial difference is that Equation (10) replaces the operator on the right-hand side by a complex number, as a map $V\times V\to\mathbb{C}$. This subtle difference leads to significant consequences: finite-dimensional matrix representations of Weyl algebras do not exist. In other words, Weyl algebras cannot be represented by matrices. If they could, taking the trace of $[\hat{r},\hat{p}]=i\hbar\,\mathbb{1}_N$ would give $0=i\hbar N$, whose only solution is the trivial zero-dimensional matrix.
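The non-existence argument amounts to a trace identity, which a short sketch (our own illustration with random matrices) makes tangible:

```python
import numpy as np

rng = np.random.default_rng(0)

# For ANY finite matrices, tr(XP) = tr(PX), so tr([X, P]) = 0 always.
for n in (2, 5, 20):
    X = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    P = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    assert abs(np.trace(X @ P - P @ X)) < 1e-9

# But [X, P] = i*hbar*I would require tr([X, P]) = i*hbar*n, nonzero for n >= 1.
# Hence no n-dimensional matrices satisfy the Weyl relation unless n = 0.
print("tr[X, P] = 0 for all finite matrices, ruling out [X, P] = i*hbar*I")
```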
This is why Equations (11) and (12) take the form of polynomials and differential operators rather than matrices. In general, they belong to a Weyl algebra (Appendix F). The realization in Equation (14) yields the first Weyl algebra $A_1$. Lie algebras (such as the spin algebra above) can be realized as subalgebras of $A_1$ [11]. We can also consider multiple pairs of variables, yielding the $N$th Weyl algebra $A_N$. Consequently, the polynomials may involve multiple variables, as in Equation (15). The $N$th Weyl algebra $A_N$ (Equation (15)) is used to construct an $r$-matrix in Section 4 (see also Appendix F).
3. Structure of Bloch Space and Its Quotients
In this section, we introduce the other two spaces: the Bloch space and its quotient space (in some literature, a quotient space is also called a factor space). We first point out some "bad features" of the Bloch space. Then, we construct a product space which is isomorphic to the Bloch space (the definition of isomorphism is given in Section 2). The product space is well behaved, and the $r$-matrix is defined on it instead of on the Bloch space. Since the Bloch space is isomorphic to a product of spaces, each factor is also a quotient space of the Bloch space.
Some features of the Bloch space make it unsuitable to serve as a Hilbert space. We give two examples. Firstly, a Hilbert space should be a Banach space (a complete normed vector space) [2,44]. "Normed" means that a vector can be normalized to unity, such that a physical probability can be recognized. The norm, defined by the integral in Equation (16), is required to be finite. In physics, Equation (16) says that the total probability (or the total number of particles) in the space should be finite. In the above, we use Bloch's theorem: $\psi_{nk}(r)=e^{ikr}u_{nk}(r)$, where $u_{nk}$ is a periodic function with the lattice constant $a$. Evidently, the norm of a Bloch wave diverges, i.e., the integral in Equation (16) diverges. (It suffices to consider the special case where $u_{nk}$ is a constant function.)
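A toy numerical check (with a hypothetical periodic part $u(r)=1+0.3\cos(2\pi r/a)$ and $a=1$, both our own choices) illustrates the divergence: the norm integral grows linearly with the number of cells included, so the limit over all space is infinite.

```python
import numpy as np

a = 1.0                      # lattice constant (assumed)
k = 0.7                      # crystal momentum (arbitrary)
u = lambda r: 1.0 + 0.3 * np.cos(2 * np.pi * r / a)   # toy periodic part

def norm_sq(cells, pts_per_cell=400):
    """Integrate |psi_k|^2 = |u(r)|^2 over `cells` unit cells."""
    r = np.linspace(0, cells * a, cells * pts_per_cell, endpoint=False)
    dr = r[1] - r[0]
    psi = np.exp(1j * k * r) * u(r)
    return np.sum(np.abs(psi) ** 2) * dr

vals = [norm_sq(m) for m in (10, 100, 1000)]
print(vals)   # grows linearly with the number of cells: no finite norm
```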
The norm can be induced from the inner product: the self-product of a vector yields its norm. Thus, defining the norm boils down to defining inner products with continuous indices. Definitions like Equation (16) arise from the analogy of Equation (18), where the continuous $r$ assumes the role of the discrete index $j$, the sum over $j$ becoming an integration. Equation (18) is an obvious transition from the discrete to the continuous case, but it is not the only one, and may not be the proper one. It is modified later (Section 4), as a key step in casting $\hat{r}$ onto discrete bases. As a convention, discrete variables are denoted as subscripts, e.g., $v_j$; continuous variables are denoted in brackets, e.g., $v(r)$. With Equation (19), the self-product of a Bloch wave produces an infinitely "long" vector, meaning an infinite probability. In addition, it also makes the derivative $\partial_k\psi_{nk}$ diverge, a problem similar to that inherent in Equation (9).
As the second example of bad behavior, we observe that the Bloch basis is incomplete for $\hat{r}$, which is a serious concern since transport arises from position change. The matrix of the position operator is expressed as Equation (20), which unfortunately diverges. This is easily seen by translating the diagonal terms, as in Equation (21), where $T_a$ is the translation operator by $a$ (here $a$ is the lattice constant). Plugging Equation (20) into the translated expression, we obtain the contradiction that a diagonal term equals itself shifted by $a$. Here, we apply Equation (23): a Bloch wave acquires only a phase under $T_a$, so the diagonal of $T_a^\dagger\,\hat{r}\,T_a$ equals that of $\hat{r}+a$. The periodic function $u_{nk}$ is a function of the crystal momentum $k$ and the band $n$, transcribed from $\psi_{nk}$. The translated function is obtained by shifting $u_{nk}$ by $a$ in the negative direction; thus, the integrand in Equation (21) is shifted by $a$. The inconsistency between Equations (21) and (22) indicates that the integral in Equation (20) cannot converge, for otherwise the contradiction $\langle r\rangle=\langle r\rangle+a$ would be obtained. This divergence is genuine and inevitable. It arises from the fact that it is impossible to pin down the "center" of an infinitely extensive wave function. One may attribute this divergence to the infinite dimension of the Bloch space (the infinite dimension arising from the infinite number of possible values of $(n,k)$, where $k$ may be either discretely infinite or continuous), since a sum over a finite number of terms can never diverge. Note that each distinct $(n,k)$ corresponds to a linearly independent basis vector. Note also the ambiguous meaning of "bases": the bases span the Bloch space and are labeled by the pair $(n,k)$, so two bases may share the same $k$ yet remain distinct.
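The ill-defined "center" can be sketched numerically (a toy plane-wave example of our own): the windowed average position tracks the center of the integration window for any window size, so no window-independent value exists.

```python
import numpy as np

# For an extended state with |psi(r)|^2 = const (a plane wave), the windowed
# average <r> equals the window center c for ANY window width L, so the
# "center" never converges as L grows.
def windowed_mean_position(center, L, npts=200001):
    r = np.linspace(center - L / 2, center + L / 2, npts)
    w = np.ones_like(r)          # |psi|^2 for a plane wave
    return float((r * w).sum() / w.sum())

for L in (10.0, 1000.0, 100000.0):
    m0 = windowed_mean_position(0.0, L)
    m5 = windowed_mean_position(5.0, L)
    assert abs(m0 - 0.0) < 1e-6 and abs(m5 - 5.0) < 1e-6
print("Shifting the window by a shifts <r> by a: the center never converges.")
```

This is the same contradiction as that between Equations (21) and (22), expressed through a regulator that refuses to go away.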
To resolve these problems, we next construct a well-behaved product space related to the Bloch space by an isomorphism; the CRM is established on this well-behaved space. We begin by introducing a well-defined inner product, on the basis of which procedures associated with vectors, operators, etc., may be defined [44]. We adopt the following procedure to force the convergence of Equation (20). For simplicity, consider a 1D atomic chain of $N$ sites with lattice constant $a$. The sum over lattice sites is independent of the in-cell coordinate $r$, and thus can be factored out: the infinite integration is reduced to a finite one, as in Equation (24). If $k$ and $k'$ are restricted to the first Brillouin zone (BZ) by convention, the lattice sum reduces to a single $\delta$-function, Equation (25). For the other term, the in-cell integration, we require Equation (26) for the case of distinct bands. On the other hand, for equal bands but distinct $k$-values, Equation (26) need not vanish, because it is Equation (25) which governs the orthogonality for distinct $k$-values. In that case, Equation (26) plays the role of a normalization factor, as discussed in connection with Remark 3 below.
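The reduction of the lattice sum to a $\delta$-function can be verified directly (a small sketch with assumed parameters $N=8$ and $a=1$): on the Brillouin-zone grid $k_m=2\pi m/(Na)$, the sum gives $N$ for $k=k'$ and 0 otherwise.

```python
import numpy as np

N, a = 8, 1.0
ks = 2 * np.pi * np.arange(N) / (N * a)      # BZ grid k_m = 2*pi*m/(N*a)

def lattice_sum(k, kp):
    """sum_j exp(i (k - k') j a) over the N lattice sites."""
    j = np.arange(N)
    return np.sum(np.exp(1j * (k - kp) * j * a))

S = np.array([[lattice_sum(k, kp) for kp in ks] for k in ks])
assert np.allclose(S, N * np.eye(N))         # = N * delta_{kk'}
print("lattice sum reduces to N * delta_{kk'} on the BZ grid")
```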
We express the inner product of Equation (24) as a product of two $\delta$-functions. This leads one to think that the Bloch space can be isomorphic to a tensor product space. We invoke the $\delta$-function of Equation (25) to generate one quotient space $E$ of dimension $N$ (the number of possible values of $k$), and invoke the in-cell factor to generate a second quotient space of dimension equal to the number of bands, yielding the isomorphism of Equation (27). Here, both "→" and "↦" indicate a map. The difference is that "→" connects two sets, from the map's domain to its co-domain, while "↦" connects elements belonging to the two sets. Thus, "→" and "↦" in Equation (27) stand for two conventions for denoting a map. These notations are frequently used in this paper, especially in Section 5 and Section 6. An isomorphism is a linear map which preserves the inner product (the value of the inner product is invariant under the map), Equation (28), for all pairs of vectors. Intuitively speaking, we seek a replacement for each original Bloch vector such that, after the replacement, the inner product values remain unchanged, as indicated by Equation (28).
To facilitate the analysis, we introduce the maps of Equations (29)–(31), whose images are expressed in the bases of the quotient spaces. Since each basic Bloch vector is characterized by a band $n$ and a crystal momentum $k$, one can write these maps in terms of $(n,k)$. The existence of these maps is constrained, as discussed in Appendix E. The maps are then defined with Equations (34) and (35) as the corresponding inner product rules. Since Equation (36) holds, the conjectured map preserves inner products. Theoretically, the projection maps formalize the idea that geometric quantities, such as the Berry connection defined on the periodic part of a Bloch function, are in fact defined on an abstract quotient space; the projection maps bridge the Hilbert space to its quotient. Consequently, they can be used in broader physical scenarios: not only for single-particle (orbital) Bloch waves, but also for spin degrees of freedom, correlated systems without translation symmetry, etc. Physically speaking, the two projections are closely related to two physical processes. One projects to the quotient space involving different bands, and is thus associated with (elastic) inter-band transitions; the other involves distinct $k$-values, and is thus associated with inelastic $k$-transport.
Remark 1. Recall the main goal of this section: building a well-behaved isomorphic space to substitute for the Bloch space. The goal is realized by the map Π constructed with Equations (27)–(36). The map Π reads as Equation (37), circumventing the integration over $r$ and facilitating the evaluation of the $r$-matrix and other expressions (Section 4).

Remark 2. Roughly speaking, Π creates a new basis equivalent to the Bloch basis by switching the representation. But in fact, Π does not map vectors to vectors. In the ad hoc terminology introduced in Section 5, it maps vectors to ribbon bands, or ribbons. Within the framework of bundle theory [43,44], a ribbon is a feature of the bundle space, where $K$ is the Brillouin zone and the Bloch space is the fiber space (see Section 5). Precisely speaking, Π maps $N$ mutually orthogonal vectors to a ribbon (Section 5), on which the vectors at each $k$ comprise a set of basis vectors isomorphic with the original set. In plain but less accurate words, this means that the components of the vectors (Equation (37)) should be functions of $k$ instead of constant values. For example, the map of Equation (38) also defines an isomorphism but does not work. The crucial difference is that in Equation (37) the components are non-trivial functions of $k$, whereas the components in Equation (38) are constants. If the $k$-independent Equation (38) interacts with partial derivatives, the $r$-matrix vanishes. Thus, Π cannot be chosen as an arbitrary isomorphism.

Remark 3. How should we visualize the map Π? The wave functions are the eigenstates of the translations $T_a$. Thus, we seek representations of the translation group on the Bloch space and its quotients. It is conceivable that one part of the space affords a trivial representation, while the rest affords a non-trivial one characterized by $k$. Spaces can be described by orthogonal bases: the $k$-independent factors in Equation (26) seem appropriate for the trivial representation, with the $k$-dependent factors for the non-trivial part. Thus, we introduce one projection whose image reproduces the in-cell inner product, while the other projects to the remaining quotient space. Constructing a function space requires more than just expressing the functions: the inner product must also be specified. Thus, we move simultaneously through Equations (24)–(36).

We can rewrite the image of a Bloch vector under Π from Equation (37) as a single Kronecker product column, Equation (39), so that the inner product of Equation (36) returns to its familiar row-times-column form. In the conjugate-transpose row of the column of Equation (39), each element is the component of the $n$th Bloch wave at the point $k$ projected to band $i$ and local site $j$. There are two pairs of conjugated variables: the band pair and the site pair. Note that $k$ is not conjugated with $r$, since $r$ merely provides the normalization factor of Equation (26). The translation group acts trivially on the band quotient space, i.e., translations do not change those vectors; this is why $k$ is involved only in the quotient space $E$.
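The Kronecker-product bookkeeping can be sketched as follows (random small factors of our own choosing): packing a band-sector factor and a cell-sector factor into one column with np.kron, the ordinary row-times-column product of two such columns factorizes into the product of the two sector inner products.

```python
import numpy as np

rng = np.random.default_rng(1)
n_band, n_cell = 3, 4          # dims of the two quotient-space factors

# Two states, each given by a band-sector factor and a cell-sector factor.
b1, c1 = rng.standard_normal(n_band) + 0j, rng.standard_normal(n_cell) + 0j
b2, c2 = rng.standard_normal(n_band) + 0j, rng.standard_normal(n_cell) + 0j

v1 = np.kron(b1, c1)           # single column of length n_band * n_cell
v2 = np.kron(b2, c2)

lhs = np.vdot(v1, v2)                         # row times column
rhs = np.vdot(b1, b2) * np.vdot(c1, c2)       # product of sector products
assert np.isclose(lhs, rhs)
print("inner product of Kronecker columns = product of sector inner products")
```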
Bearing in mind that the components in the column vector of Equation (39) must correspond to certain inner products, what basis vectors are chosen for the Bloch waves to be projected onto? We consider the map as exhibited in terms of basis vectors by Equation (36) of Remark 1. The basis elements can be regarded as generalized Wannier functions, Equation (40). In the appropriate special case, we recover the normal definition of Wannier functions. While Equation (40) is not the standard Fourier transformation, it is still invertible. The space spanned by the generalized Wannier functions is denoted by $W$. Evidently, the Bloch space and $W$ are isomorphic.
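The invertibility claim can be illustrated with the standard discrete Fourier kernel (a minimal stand-in for Equation (40), with our own $N$ and random "Bloch" coefficients): the forward and inverse transforms compose to the identity, so the change of basis loses nothing.

```python
import numpy as np

N = 6
j, m = np.meshgrid(np.arange(N), np.arange(N), indexing="ij")
F = np.exp(-2j * np.pi * j * m / N) / np.sqrt(N)   # unitary DFT kernel

# Unitary => invertible: F^dagger F = I, so Bloch <-> Wannier loses nothing.
assert np.allclose(F.conj().T @ F, np.eye(N))

rng = np.random.default_rng(2)
bloch = rng.standard_normal(N) + 1j * rng.standard_normal(N)
wannier = F @ bloch
assert np.allclose(F.conj().T @ wannier, bloch)    # round trip recovers input
print("the (generalized) Wannier transform is invertible")
```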
The isomorphism (Equation (37)) is a major result of this work, which allows us to switch from the Bloch basis to the new basis and avoid the continuous coordinate $r$ that appears in $\psi_{nk}$ and $u_{nk}$. Inner products like Equations (34) and (35) sum over discrete indices, without referring to an integral over a unit-cell volume. The pre-factor arising from that integration is also avoided.
We should be cautious about designations such as
These denotations are often seen in literatures but not perfectly accurate. Since
,
, and
(later we show
). Rigorously speaking, inner products are illegal to be defined between vectors in different spaces, such as
and
. Given Equation (
41) is accepted, one obtains
If
is taken away from the left and only ket is kept, we have the following expressions:
Equation (
43) is about using a unitary operator
to link two vectors
and
. In general, a unitary operator is invertible and may only connect spaces of the same dimension. Remember
and
are belonging to spaces
and
, which are of different dimensions. The concern is that
promotes a lower-dimension vector
to a higher-dimension one
; its inverse
degrades
to
. Having the same dimension is a necessary and sufficient condition for two vector spaces to be isomorphic.
(or
) would suggest
. However,
is merely a quotient space of
. Similarly, an expression like
seen in the literature (e.g., Equation (2) of [
49]) deserves special attention. The dimensions of operators (
,
, etc.) are summarized in
Section 7.
Another point we should be aware of is that the orthogonality below is false (whether
k is discrete or continuous)
Otherwise, if equality in Equation (
44) holds,
is complete for
. The set of functions
can expand arbitrary functions. However, this is false. To demonstrate the incompleteness, we construct counterexamples in
Appendix B, i.e., functions unachievable by superposition of
.
In band contexts, is the Hilbert space and thus should be complete; this idea has been taken for granted. However, although is complete for operators defined within , it is incomplete for operators defined beyond , such as . Evidence includes:
- (1)
The space spanned by Bloch waves has a lower dimension than the space spanned by eigenstates of the position operator; that is, the populations (cardinalities) of the two basis sets and are unequal.
- (2)
The matrix of the position operator (seen shortly in
Section 4) is divergent.
- (3)
is false (Equation (
44)), and functions that cannot be achieved by superposition of Bloch waves
are constructed (
Appendix B).
A common conviction is that different quantum bases are equivalent. Thus, one vaguely believes that and are equivalent, and that continuous functions (e.g., ) and discrete bases are equivalent, imagining that these bases could be linked by unitary transformations. However, on second thought, how could a continuously infinite basis possibly be linked to a discrete one? In fact, these bases are not equivalent.
We conclude this section by emphasizing some important points:
- (1)
The space is complete for operators defined within , but it is incomplete for the operator, which is defined in .
- (2)
The sets and are bases of different dimensions (different cardinality); they cannot be linked by a unitary transformation, and one cannot be obtained from the other by a change-of-basis transformation.
- (3)
The respective spaces spanned by and are not isomorphic.
Nevertheless, the map
(Equation (
27)) is a precise tool for introducing the
Nth Weyl algebra (
Section 4) and obtaining convergent matrices for the position operator. Moreover, geometrical quantities, such as Berry connections and curvatures, are defined unambiguously as operators on the quotient space
of
.
4. Matrices of the Position Operator
In this section, we identify converging
r-matrices (CRMs). We start by examining
r-matrices in
as they have appeared in previous work [
3]. These matrices inevitably contain divergent terms. The matrix elements are defined by integration of the Bloch function
over an infinite range. For reasons discussed in
Section 5, we avoid basis-free designations for the position operator, such as
. For the moment, we consider the matrix element
given by the original integration. Making the substitution
in Bloch’s Theorem (Equation (
17)), we obtain
The matrix elements then take the form
on plugging Equation (
46) back into Equation (
45). Note that the first term in Equation (
47) becomes
. Recall that for the continuous
-function, arguments appear in brackets, e.g.,
, and
. On the other hand, for the discrete
-function, the arguments are in the subscripts, e.g.,
, and
. The second term in Equation (
47) is
For a general function
, we have
Thus, Equation (
48) becomes
We define
yielding
Except for a sign difference in the first term, our derivation is consistent with previous work [
3], the second term differing by a normalization convention. Due to the appearance of
, the diagonal terms evidently diverge. Thus, the matrix elements defined by Equation (
52) are merely formal.
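The diagonal divergence can be illustrated numerically with a toy model of our own (not taken from [3]): on an N-site chain, the diagonal element of the position matrix in a normalized plane-wave basis equals the mean position of the chain, which grows without bound as N increases.

```python
import numpy as np

def bloch_position_diagonal(N, a=1.0):
    """Diagonal element <k|x|k> of the position matrix in a normalized
    plane-wave basis on an N-site chain with lattice constant a.
    It is k-independent and equals the mean position a*(N-1)/2."""
    x = a * np.arange(N)                   # site positions
    k = 2 * np.pi / (N * a)                # any allowed wavevector
    psi = np.exp(1j * k * x) / np.sqrt(N)  # normalized plane wave
    return np.vdot(psi, x * psi).real      # <psi| x |psi>

for N in (10, 100, 1000):
    print(N, bloch_position_diagonal(N))   # grows linearly with N
```

Since the growth is linear in N, no phase convention for the basis can tame the diagonal in the infinite-size limit.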
Now, instead of working directly with
, we construct a Weyl algebra on its isomorphic copy
, obtaining a non-singular matrix which converges at both the diagonal and off-diagonal entries. The basic rule for the differential operator is the coproduct
in the tensor algebra [
44,
48]. The partial derivative acts on each of the two tensor factors
and
E. Thus,
For momentum and position, we have
The commutator becomes
To satisfy Equation (
56), we may choose the position operator
The operator of Equation (
57) is the generator of the first Weyl algebra
, as there is a single variable
k.
4.1. The Nth Weyl Algebra
The choice
made in Equation (
57) is not the only solution of the commutator Equation (
56). Instead, we may consider
N variables
, thereby obtaining the
Nth Weyl algebra [
10]. In this algebra, take the new position operator
in place of Equation (
57). The commutator Equation (
56) is then solved as
We set the number of variables involved in the Weyl algebra equal to the dimension
N of the quotient space
E. It is absolutely essential for
N to be finite, to ensure that the partial derivatives are well defined without recourse to any infinite limits. Note that Equation (
59) is a concrete realization of Equation (
15) in
Section 2. The state vector is parameterized by
, assuming the role of
in Equation (
15) (
Appendix F).
Remark 4. At this stage of the construction, we do not endow or with any physical significance, such as taking the as quantum numbers of “one-particle” or “many-particle” states. Thus, is not yet interpreted as the crystal momentum. Currently, we are working at the purely algebraic level, just making the substitutions and .
Remark 5. Equation (59) is at the same fundamental level as Equation (57): both model the Weyl algebra relation of Equation (56).

Remark 6. It is tempting to think that the variable k appearing in must be continuous, because otherwise the derivative would not be defined. This misconception is based on the narrow calculus definition of a derivative as . This definition requires extraneous apparatus, such as a division operation, a limit process, and so on. In fact, it suffices to work with the formal definition by , which only requires multiplication and addition (mathematically, a ring structure [48]). Using power series, one can extend the action of ’s to generic analytic functions . As an example, taken from [50], we have

where c and are fermion operators with , the function f is analytic, and is the vacuum state, i.e., . Equation (60) is an example of a derivative appearing in an operator acting on a term which is not required to be a continuous numerical function.

We now interpret the action of the
Nth Weyl algebra
on the space
as it appears in basis form in Equation (
37), obtaining the matrix of the position operator
with respect to the Bloch basis. We have
The second term can be expressed as
with
as the average position
of the
N-site chain. This constant, independent of
k, is the mass center of the crystal. The term
depends on the particular forms of
, and we shortly evaluate
in a concrete two-band model. In general,
cannot be reduced to a
-function in terms of either
and
, or
m and
n. Recall that
N must be finite to have the
Nth Weyl algebra defined. Thus, CRMs are
always based on finite
N. On the other hand, if we consider
, will the CRMs approach the DRM? Or, if we let
N be finite, will the DRM become a CRM? The answer is no! The CRMs are fundamentally distinct from the DRM, and one cannot relate them. A more detailed comparison appears later.
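The purely formal derivative invoked in Remark 6 is easy to make concrete: on polynomial coefficient lists, D(k^n) = n k^(n-1) requires only multiplication and addition, with no limit process. A minimal sketch of our own:

```python
def formal_derivative(coeffs):
    """Formal derivative of a polynomial given as a coefficient list
    [c0, c1, c2, ...] meaning c0 + c1*k + c2*k**2 + ...
    Uses only multiplication and addition -- no limits involved."""
    return [n * c for n, c in enumerate(coeffs)][1:] or [0]

# d/dk (1 + 2k + 3k^2) = 2 + 6k
print(formal_derivative([1, 2, 3]))  # [2, 6]
```

The same rule extends term by term to formal power series, which is all that the algebraic construction of the derivative needs.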
Evidently, the matrix
of Equation (
61) is well defined, with both diagonal and off-diagonal terms converging. Note
if
. We call
the
convergent r-matrix (CRM). It is an
-dimensional square matrix. To emphasize this point, we can write the matrix as
. In this context, we call the matrix of Equation (
52) a
divergent r-matrix (DRM). Although DRMs frequently appear in the literature, their dimensions have not explicitly been stated [
3,
27].
4.2. Geometry Defined on the Quotient Space
The first term in Equation (
61) is the Berry connection, which naturally emerges when
and
. Comparing with Equation (
52), we obtain the correspondence
This is a map
Equation (
64) indicates that the
map injectively to vectors
in the quotient space
. In precise language, the inner product space spanned by the
is isomorphic to
. In other words, there exists a map between the two spaces which preserves the inner product. Note that
is an
-dimensional vector, and
is its
jth component.
Remark 7. Equation (64) builds a vector space associated with the functions . Rigorously speaking, the do not yet provide a function space. For that purpose, we should have δ-functions as extra structure (just like a norm or Lie brackets on vector spaces). The answer to the question “what is the dimension of the space containing the ” is indeterminate [51]. Thus, we cannot set . Consider the false implication

While the hypothesis of ⇒ is correct, the conclusion is not. Unfortunately, and are often used indiscriminately [4,5,27]. It is tempting to interpret and as basis elements, and then convert these basis elements from functions into bra/ket forms. But that is incorrect. Equating and may lead to vagueness and misconceptions. For example, if we (mistakenly) infer from the Bloch Theorem, we might be led to think that there is a linear relation between and , and further that every Bloch basis element corresponds linearly to a basis element , such that their two spans have the same dimension. In addition, we would have difficulty with the boundary conditions. Bloch waves should be continuous over the B.Z., i.e., . Given the (mistaken) assumption that , there is no way to make continuous, since for .
In fact, there is no phase correlation between
and
, because when
is first introduced, the definition (Equation (
64)) merely considers inner products, allowing the freedom to adjust phases. It might be more accurate to adopt a different vector notation (e.g.,
) for
to avoid confusion between
and
. (Note that
is a function of
r, while
is not labeled with
r.) However, given the wide use of
in the literature, we adopt this notation.
It is often stated that the Berry connection is defined on the periodic part of the wave function
, instead of on the Bloch functions
[
27]. Now we have the accurate statement that the Berry connection is defined on the
, and the map Equation (
64) establishes the identity of the
-space as the quotient space
of
.
We may ask why the Berry connection is defined in terms of the
, rather than in terms of the
. This is commonly explained by showing that
does not work. Consider a discrete formalism for the Berry phase
: the system goes through a series of discrete states
[
27]. Then
If we plug in Bloch waves
, orthogonality will force
. Thus,
make
trivially zero for arbitrary band structures. But this only precludes
from appearing in
, and does not show that
must be the case.
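The discrete Berry-phase formula can be tried on a standard example (our own illustration with hypothetical spin-1/2 states sweeping the equator of the Bloch sphere): the accumulated phase of the overlap product is non-trivial, whereas mutually orthogonal states at neighboring points would force every overlap, and with it the phase information, to vanish.

```python
import numpy as np

def discrete_berry_phase(states):
    """Berry phase of a closed discrete loop of normalized states:
    gamma = -Im ln prod_j <u_j | u_{j+1}>, with cyclic indices."""
    prod = 1.0 + 0j
    M = len(states)
    for j in range(M):
        prod *= np.vdot(states[j], states[(j + 1) % M])
    return -np.angle(prod)

# States |u(phi)> = (1, e^{i phi})/sqrt(2), phi from 0 to 2*pi,
# i.e., a loop around the equator of the Bloch sphere.
M = 400
phis = 2 * np.pi * np.arange(M) / M
loop = [np.array([1.0, np.exp(1j * p)]) / np.sqrt(2) for p in phis]
print(discrete_berry_phase(loop))  # magnitude pi: half the solid angle
```

A loop of identical states gives zero phase, confirming that only the k-dependence of the states contributes.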
Consideration of the DRM does at least show that the Berry connection
is contained in the matrix [
3,
4]. However, there are at least three shortcomings. Firstly, the DRM itself is ill defined. In particular, the divergence on the diagonals directly influences displacement and transport. Secondly, the substitution
is not well justified. Naively, one may argue that
is the fundamental form of
, as quantum mechanics suggests. However, the quantum oracle merely suggests the commutation
, and
is not the unique solution. The operator
might also take the form of Equation (
59), for instance. In fact, both Equations (
57) and (
59) can serve as appropriate forms for the operator
. Thirdly, the Berry connection matrix
belongs to the space spanned by
, but the dimension of this space is left uncertain. One is led to (mistakenly) consider the continuous parameter
r as the index for the basis elements, under the vague impression that “the space is infinite-dimensional,” which hinders a comparison with
.
With the CRM and the Nth Weyl algebra, we obtain all the Berry connection signatures contained in the r-matrix, as indicated by the DRM and the first Weyl algebra; moreover, the divergence disappears, and the matrix is well defined. The constant term is precisely cancelled by subtraction, establishing a rigorous link between the diagonal terms and transport. In the absence of a well-defined renormalization protocol, one cannot simply drop or cancel two diverging terms in the DRM. We stress that the DRM cannot be connected to the CRMs by a limiting process. Secondly, compared with the DRM, construction of the CRM took two steps:
- (1)
Express with the Nth Weyl algebra in ;
- (2)
Reduce it to the first Weyl algebra established on the lower-dimensional quotient space ,
as summarized in
Table 2. It is risky to directly replace
with
, without referring to the hosting space. Thirdly, the dimension of the space spanned by the
is specified, identifying it as the quotient space
of
. However, this issue has been concealed by the vague idea that both spaces are infinite-dimensional, which also hides the method to make the Berry phase
non-zero. Now, we understand the procedure of obtaining
by “folding”
into a product space, and defining the Berry connection (and other geometric objects) on the quotient space
, instead of on
. The dimension of
is finite (and the norm can be defined). Usually, it is equal to the number
of bands, which might either be given when
is first introduced, or obtained by a truncation.
4.3. How Is the Convergence Achieved?
Let us revisit the divergent expressions
Here, the divergence arises from the use of either of the unbounded coordinates
r or
p. Since they are conjugate variables, use of one is no better than the other. Recall that
is different from
, because
k is the quantum number for Bloch vectors, while
p is the real momentum. Contrasting the conjugate pairs,
is conjugate with
p, while it is
which is conjugate with
k according to Equation (
25).
Convergence has now been achieved thanks to two modifications:
- (i)
The choice of the operator in the Weyl algebra ;
- (ii)
Application of the isomorphism
i.e.,
is replaced by tensor product space
.
Rethinking both the operator
and the wave functions
reveals that the Weyl algebra
, as a ring, involves not only the differential operators, but also the functions
on which they act, as seen in the definition of Equation (
11). Physically, the functions correspond to the wave functions. Intuitively speaking, after making the replacement (i), one still needs the replacement (ii) to specify the arguments on which these differential operators act, in order to complete the representation of the Weyl algebra.
From a different perspective, we are specifying a new pair of conjugate variables. The position variable r is paired with the crystal momentum k, rather than with real momentum p. Although (i) and (ii) do not have the explicit form of a declaration of conjugate variables, the declaration is implicit within them. It would not be enough merely to state that (or ) is the expression of , since that would only invoke modification (i), not (ii).
It is incomplete and misleading to assert that
is “equal to”
. For example, it is easily seen that
One may encounter attempts in the literature to use
to suggest a form like
for the
r-matrix. However, in other situations, if we replace
by
r, we may obtain the contradiction
suggesting that
would be an eigenstate of
.
4.4. Can the DRM Be a Limit of CRMs?
We now show that letting
does not produce the DRM as a limit of the CRMs. Recall that
and
are conjugate variables. Thus, increasing the population
N of
-values corresponds to making the
-values denser, bringing us to the limit where
k is continuous. The first term of Equation (
61) approaches
coinciding with the Berry connection
in Equation (
52). However, the second terms cannot match. The CRM does not invoke
when
, in contradiction to the separation of the factor
in Equation (
52).
This can be seen in a concrete example with
, a two-band model. In this case, the quotient space
is two-dimensional, spanned by two basis elements
and
(following the notation of Equation (
33)). We take
where
and
are functions of
k. Given Equation (
69), we find
The off-diagonals are
In general,
is non-vanishing. Equation (
71) shows that the
r-matrix does not vanish when
. The entries that are off-diagonal in
k mark a clear distinction from the DRM.
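The non-vanishing off-diagonal entries can be checked numerically with a hypothetical two-band frame of our own choosing (not the paper's specific model), evaluating the Berry connection matrix A_mn(k) = i⟨u_m|∂_k u_n⟩ (up to sign convention) by central differences:

```python
import numpy as np

def berry_connection_matrix(u_of_k, k, dk=1e-6):
    """A_{mn}(k) = i <u_m(k)| d/dk |u_n(k)> via central differences.
    u_of_k(k) returns the frame with u_m(k) as its columns."""
    U = u_of_k(k)
    dU = (u_of_k(k + dk) - u_of_k(k - dk)) / (2 * dk)
    return 1j * U.conj().T @ dU

# Hypothetical two-band frame: a rotation by the angle theta(k) = k/2.
def frame(k):
    t = k / 2
    return np.array([[np.cos(t), -np.sin(t)],
                     [np.sin(t),  np.cos(t)]])

A = berry_connection_matrix(frame, 0.7)
print(np.round(A, 6))  # Hermitian, with non-zero off-diagonal entries
```

For this frame the off-diagonal entries are of magnitude 1/2 at every k, so the matrix is nowhere diagonal in the band indices.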
In practice, the Hamiltonian
mainly focuses on the diagonal terms
. In terms of observables, we are interested in a subset of the elements of the
r-matrix, but this does not mean that the
r-matrix is block diagonal in
k. We have
Recall that
and
are functions of
k. In a particular model of graphene, for instance, they take the specific forms
where the
are the position vectors of the three nearest neighbor (NN) carbon atoms, arg denotes argument of a complex number, and
a is the carbon bond length [
52]. The two components physically represent the two bands due to the mutual independence of the
atoms in a primitive cell of graphene. How can we interpret that
is independent of
? The conduction band and valence band are formed with
bonding with equal weights from the
orbitals, which requires
to make the magnitudes of the two components equal.
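For concreteness, a minimal sketch of the graphene structure factor under one conventional choice of nearest-neighbor vectors (the exact convention in [52] may differ):

```python
import numpy as np

a = 1.42  # carbon bond length in angstroms (a common value)
# One standard choice of the three nearest-neighbor vectors:
deltas = a * np.array([[1.0, 0.0],
                       [-0.5,  np.sqrt(3) / 2],
                       [-0.5, -np.sqrt(3) / 2]])

def f(k):
    """Tight-binding structure factor f(k) = sum_i exp(i k . delta_i)."""
    return np.exp(1j * (deltas @ k)).sum()

k = np.array([0.3, 0.2])
z = f(k)
print(abs(z), np.angle(z))  # magnitude and argument of f(k)
```

It is the argument (phase) of this complex number that enters the relative phase between the two components, while the equal magnitudes reflect the equal-weight bonding of the two sublattice orbitals.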
We may summarize the logic of our process as follows:
- (1)
Employ the Weyl algebra to define and p;
- (2)
Based on these physical operators, extract bases to build a vector space to serve as Hilbert space;
- (3)
The first (unsuccessful) attempt (left arrow below) built the space with - (or p-) eigenstates as bases; however, the eigenvalues of (or p) cover all of , whose cardinality is uncountably infinite, leading to a diverging norm, unsuitable for a Hilbert space;
- (4)
The second (successful) attempt (right arrow below) recognizes the bases differently.
We add the index
m to label the bases, while
r serves as a “parameter”; this differs from the first attempt (the left arrow in Equation (
74)), in which
r was taken as the label of the bases and was discretized into finite intervals. In the second attempt, with the CRM, the dimension of the vector space depends on the label
m instead of
r. Therefore, the CRM represents a modified means (compared with the DRM) of assigning a vector space to the Weyl algebra, such that the constructed vector space is equipped with a convergent norm. This addresses the question raised by Equation (
18) in
Section 3, representing a different route for the discrete crossover to continuous situations. The DRM also arises from the Weyl algebra; however, the resultant space has a divergent norm, unsuitable for a Hilbert space.
From a mathematical point of view, the Weyl algebra is defined as a ring (an algebra equipped with addition and multiplication). A vector space is not an intrinsic notion in the definition of a ring; thus, it is an art to associate a vector space with the ring. If this is improperly performed, one ends up with a space of divergent norm (e.g., a space of infinite dimensions), which hinders evaluating , a fatal issue for transport. Our scheme is that the dimension of the vector space should be characterized neither by r (the eigenvalues of ) nor by its conjugate variable p, but by the -dimensional vector space on which the Nth Weyl algebra acts, resulting in an -dimensional r-matrix. Since is arbitrary, this represents a generic approach of projecting to arbitrary finite dimensions. In previous derivations of the r-matrix, the implicit belief that continuity of k is indispensable for the partial derivative has prevented the extension to the Nth Weyl algebra .
-matrix , reduced -matrix , and Berry connection matrix . The CRM represents a way of mapping to a finite-dimensional Hermitian matrix (though the matrix does not form a representation of the Weyl algebra). Next, we sharpen the terminology “matrix”.
The
r-matrix is originally introduced on Bloch bases; thus, its dimension is equal to the dimension of Bloch waves:
.
We further introduce “reduced
r-matrix”
(or
for short), i.e., we project the
dimensional
to the quotient space
of dimension
by setting
:
where
is the Berry connection matrix of
dimension. Noteworthy is that
in the diagonal term of
(also the reduced
) has a clear physical meaning: the mass center of the crystal, which is a
k-independent constant. That means the term
is the same for all bands and will be cancelled (in evaluating displacement, one takes the difference of two diagonal terms, so
is exactly cancelled). This converts a problem defined in
to its quotient space
.
Strictly speaking,
(also
) is not a single matrix, but “a continuous series of matrices” for variable
k. Formally, it is a map,
where
represents Hermitian matrices on quotient space
(not
). Berry connection matrix
is also such a map. Equation (
72) gives an example of
with
. It is not that the CRM reduces an operator of infinite dimension to one of two dimensions, which would immediately raise the concern of how the information could be encoded into such a small matrix. Instead, a single matrix of higher dimension is mapped to a series of lower-dimensional matrices. Formally speaking, the higher-dimensional matrix is mapped to a map whose codomain elements are two-dimensional matrices.
In terms of bundle theory [
43,
44], one may interpret that the reduced
r-matrix
transforms the problem originally defined in high-dimension vector space
to a bundle whose fiber space is a lower dimensional space
. It is incorrect to regard
as an
-dimensional matrix, nor as the lower-dimensional counterpart of the DRM.
The CRM does not form a representation of the Weyl algebra (i.e.,
). We point out that the CRM does not satisfy the commutation relation, whether
N is finite or
. (In fact, the DRM does not satisfy it either, which is concealed by its divergence.) The matrix for
k can be found in a fashion similar to Equation (
72).
Projecting it to the quotient space, we have
The commutation yields
The commutation does not yield the expected
i. Thus, the
r-matrix together with the
k-matrix does not form a generator of the Weyl algebra. This differs from the spin matrices, which are meant to preserve the Lie algebra (Lie brackets). Therefore, new principles are involved in defining the
r-matrix, which we discuss further in
Section 7. The way matrices are defined hinges on the way observables are extracted. In addition, when we work with the Berry connection in the quotient space, a vector does not correspond one-to-one to a physical state if a vector in
represents a physical state.
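The impossibility of satisfying the Weyl relation with finite matrices can be checked by a one-line trace argument (a standard fact, illustrated here with our own sketch): the trace of any commutator of finite matrices vanishes, while the trace of i times the identity does not.

```python
import numpy as np

# For ANY finite N x N matrices K and R, tr(K@R - R@K) = 0 exactly,
# while tr(i * I) = i*N != 0. Hence no finite-dimensional matrices
# can satisfy the Weyl relation [K, R] = i*I.
N = 8
rng = np.random.default_rng(0)
K = rng.normal(size=(N, N))
R = rng.normal(size=(N, N))
comm = K @ R - R @ K
print(np.trace(comm))            # zero up to rounding
print(np.trace(1j * np.eye(N)))  # i*N, clearly non-zero
```

The same argument applies band by band to the CRM, which is why preserving the commutator cannot be the defining principle of the r-matrix.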
Remark 8. We stress a few points about CRM and DRM:
- (1)
The 1D CRM is still convergent, unlike the DRM; thus, the DRM is not the 1D special case of the CRM.
- (2)
The continuous limit of CRM does not approach DRM.
- (3)
The CRM is not an inferior or approximate form of ; it is not obtained by representing in a subspace of .
- (4)
The matrix does not reproduce the commutator; thus, cannot serve as the principle for defining the form of the r-matrix.
5. Properties of the Position Operator and r-Matrix
In
Section 5 and
Section 6, the language of maps is used to illustrate concepts, especially the
map built earlier, and the paper is organized with a few progressive definitions. This may cause some discomfort, but we try to keep it to a minimum. In addition, in view of mistakes made by the authors themselves, it seems necessary to underscore certain algebraic rules. Although these parts of the discussion might appear not quite “physical”, they are needed to make the concepts raised and the algebraic derivations unambiguous.
One is familiar with the bases used to represent a spin (Lie algebra) and with notions such as basis transformations. It is natural to wonder about the counterparts for (Weyl algebra). Because the DRM contains divergences, these issues have been left open, since one cannot perform any calculation in the presence of “∞”. With the CRM, we find that the notion of a “ribbon band”, on which the r-matrix is defined, is the analog of the “bases” on which spin is defined. Physically, the ribbon band is related to the description of electronic states in crystals.
On top of ribbons, is handled like a matrix when it interacts with ribbons or other matrix operators, without reference to its origin. Procedures associated with , such as its effectual range and its one-sided action to the right, are embodied in matrix multiplication. Intuitively speaking, we disguise a differential operator as a matrix as much as possible; however, a differential operator can never really become a matrix, due to the distinctions in their underlying algebras. Therefore, the cost one must pay is that the matrix of a differential operator follows distinctive transformation rules, which is exactly where the gauge transformation enters, as articulated in the next section. In this context, the terms “differential operator” and “matrix of the differential operator” should be distinguished.
Next, following the logic of introducing bases, we introduce ribbon bands (Definition 1) and associated concepts such as the inner product of ribbons (Definition 2), orthogonal ribbons (Definition 3), components of ribbons (Definition 4), ribbon transformations (Definition 5), etc.
Definition 1. A ribbon band (or “ribbon” for short) over a smooth manifold K to a vector space V is a map

In the band context, K is the B.Z., topologically an n-dimensional torus ; V is a vector space, e.g., the quotient space of . The rank is defined as the dimension of V. If continuity is globally satisfied for the map , it is called a continuous ribbon band; otherwise, we say the ribbon band is discontinuous at the point . For example, if degeneracy exists, the eigenstate of specifies a ribbon which is discontinuous at the degenerate k. A ribbon band can be induced by
maps (Equation (
37)). For example
and
Equation (
82) is a ribbon
of rank
; Equation (
83) is another ribbon
of rank
. These ribbons are defined for either the product space or the quotient space. From Equations (
82) and (
83), we notice that given
or
maps,
branches of ribbon bands are induced (for there are
-fold eigenstates).
Definition 2. The inner product between ribbons is defined as a linear binary map

where f is a map

For two arbitrary ribbons and , commonly defined over K to a vector space V, the inner product of the two ribbons can be induced by the inner product for vectors in V.

Definition 3. Orthogonal ribbons are two ribbons whose inner product (Definition 2) is constantly zero. Consider a set of ribbons with the number of elements equal to the rank of these ribbons. If the ribbons in are mutually orthogonal, is a set of ribbon bases.

Just as a vector is characterized by its dimension and can be represented by a set of bases of the same dimension, a ribbon can be characterized by its rank and represented by a set of orthogonal ribbons. We may denote a general ribbon over K as, for example, with an extra parameter , in analogy to a general vector . We use , , etc., to denote different elements in the same set of ribbon bases , i.e., the , elements in . When different ribbon bases are involved, we add primes .
Definition 4. In analogy to arbitrary vectors being expressed in components on a set of orthogonal bases, we define the components of a ribbon projected onto ribbon bases as and are with respect to a set of ribbons, instead of a set of bases of V. Thus,

where is a set of bases for V, which could also be viewed as k-independent ribbon bases. is likely to be different from . Thus, the k label should not be discarded.

Definition 5. In analogy to a basis transformation, one may introduce a ribbon transformation

where stands for a ribbon space that consists of ribbon bands . transforms a ribbon space just like a basis transformation transforms a vector space. turns a ribbon band into another , which can be realized by a rotation of the vector space V at a local k.

where is the component of a ribbon. Thus, a ribbon transformation can be written in the equivalent form

where stands for the automorphism group. Automorphisms refer to invertible self-maps () that preserve the inner product . In other contexts, may preserve structures on V other than inner products. This requires U to be a unitary transformation. Then the information of is fully encoded in a unitary matrix indexed by .

Definition 6. The matrix of a matrix operator (e.g., spin) defined on a ribbon space , spanned by orthogonal ribbons , is a matrix function of k that commits to the following ribbon transformations:

Definition 7. The matrix of a differential operator on a ribbon space spanned by ribbon bases is defined as a matrix function of k subject to the following ribbon transformation:

Note that in the last term, the effectual range of is limited to and does not act all the way to the right. For the rules about , refer to Appendix D. In this work, we adopt a convention: the matrix of a matrix operator is denoted with or ; the matrix of a differential operator is .

Remark 9. A matrix is the denotation of an operator on a specific space.
Equation (93) generalizes such a denotation for a matrix operator: from the vector space V to the ribbon space . This generalization is equivalent to introducing independent replicas (labelled by k) of the operator. Transformation at different k is separate, in principle, not requiring to be continuous or smooth with k. On the other hand, Equation (94) defines a matrix denotation for a differential operator. Transformation of at different k is not independent but requires neighborhood knowledge of k (due to the term ), such that the global topology begins to enter.

Remark 10. The differential operator is the motivation for introducing the ribbon band (the generalization is trivial for matrix operators). Nonetheless, the ribbon allows the two types of operators to be examined on a common ground. It is not that the ribbon band is solely associated with differential operators, nor that “bases” are solely associated with matrix operators.
Remark 11. Regarding linear maps: is a linear map at a k point, because and . Although is often referred to as a linear operation, because of and , it is not a linear map at a local k. In other words, it is a linear map ( is the first space introduced in Section 2), but not for the vector space V. This can be seen from acting on a vector ,

producing an extra term , where c is a function of k. In addition, we see that the matrix of is subject to a different transformation rule, Equations (94) and (95) [53].
Next, we underscore a fortuitous finding made while clarifying the fundamentals of the CRM and the ribbon space. The fact that position is Hermitian is indicated, in the matrix context, by ; in the operator context, it is denoted as ; the two are usually considered identical. However, has two implicit connotations which turn out to be stronger claims: (1) is associative, i.e., it acts two-sidedly; (2) being free of indices indicates basis invariance.
Our argument is that position is a Hermitian operator and the
r-matrix is a Hermitian matrix, but this fact cannot be expressed with an associative operator in basis-independent form, because (1)
is not associative, and (2) basis-free denotation should not be taken for granted, due to the distinct transformation properties of the
r-matrix. This idea can be compactly expressed as
Noteworthy,
is a property of a matrix, which involves a particular set of bases, while
is a basis-free designation, usually applied to bras/kets.
Basis-free designation means “it works for arbitrary bases”; therefore, it is implicitly conditioned on invariance under basis (or ribbon) transformations. For example, a ket state works for arbitrary bases , , etc. Thus, we erase the subscripts and denote it as . The same idea works for , which has no subscripts associated with particular bases. Note that basis-free designation is not always justified: it holds for matrix operators, but it may lead to mistakes for differential operators.
Matrix operator
is an example of a ribbon-invariant map. Accordingly, one develops the notion that elements in the domain or co-domain sets are objects whose identities are independent of bases, endowed by the following invariance under ribbon transformation:
A unitary matrix leads to
Thus, it is invariant with
In Equation (
97),
is used to replace the vector and the operator with their counterparts under the updated ribbon bases, which are given by Definitions 5 and 6. The idea is that the components of vectors and operators are alterable, while the inner product Equation (
99) is invariant. For this, one can introduce the denotation below, ignoring the indices associated with specific ribbons
In the above, basis-free designations such as
,
, etc. (belonging to Dirac’s ket/bra symbolism), are not subject to specific ribbons nor attached to subscripts
m,
n, etc. One can interpret
as representing not a single vector but a class of equivalent vectors linked by the ribbon transformation Equation (
100). Since the equivalence class covers all possible choices of orthogonal ribbons, the class becomes “ribbon independent”. Then, one may designate it without explicitly referring to the choice of ribbon bases. This device is commonly used in defining coordinate-independent fiber bundles, origin-free spaces (affine spaces), etc. [
43].
On the other hand, if the invariance fails, as in the case of the differential operator below, basis-free designations should
not be taken for granted.
In performing ribbon transformation Equation (
101), we substitute
with
in Equation (
99) and apply the transformation rule of the differential operator, Equation (
94). The basis-free designations become problematic: obviously, there is an extra term, and the invariance is lost. That is why in Equation (
94) we define the operator purely in terms of matrix components, without referring to basis-free designations such as bras or kets.
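The extra term can be made concrete in a numerical sketch (hypothetical two-band toy data; the rule A′ = U†AU + iU†∂ₖU is our assumed sign/factor convention for the transformation of a differential-operator matrix, cf. Equation (94)). The transformed matrix differs from the pure similarity form by a nonvanishing inhomogeneous piece:

```python
import numpy as np

# Discretized 1D Brillouin zone (periodic grid), 2 bands.
Nk, dk = 400, 2 * np.pi / 400
k = np.arange(Nk) * dk

# A smooth, periodic, diagonal unitary field U(k) (hypothetical choice).
U = np.zeros((Nk, 2, 2), dtype=complex)
U[:, 0, 0] = np.exp(1j * k)
U[:, 1, 1] = np.exp(-2j * k)

# A toy Berry-connection-like Hermitian matrix field A(k).
A = np.zeros((Nk, 2, 2), dtype=complex)
A[:, 0, 0] = np.cos(k)
A[:, 1, 1] = np.sin(k)
A[:, 0, 1] = 0.3 * np.exp(1j * k)
A[:, 1, 0] = A[:, 0, 1].conj()

# Assumed transformation rule: A'(k) = U† A U + i U† dU/dk.
dU = (np.roll(U, -1, axis=0) - np.roll(U, 1, axis=0)) / (2 * dk)
Ud = U.conj().transpose(0, 2, 1)
A_sim = Ud @ A @ U            # similarity part only
A_new = A_sim + 1j * Ud @ dU  # full rule with the inhomogeneous term

# The extra term is nonzero: the similarity form alone is not the right rule.
extra = A_new - A_sim
print(np.max(np.abs(extra)))  # clearly nonzero
```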
We give an example of bra/ket notation causing problems in handling complex conjugation. For a matrix operator
, i.e.,
stands for a matrix as a whole, just like
O, and † is not acting on specific matrix elements such as
,
This is true for a generic matrix operator, without requiring
to be Hermitian. If
is a Hermitian operator, we further have
Since a matrix operator is invariant under ribbon/basis transformation, one may employ basis-independent notation
Then, we evaluate the expectation value of Hermitian
When taking the complex conjugation, we employ formulas in
Appendix D.
Equation (
105) is true for matrix operator
, but not for differential operator
. If we plug in
and replace
, we achieve a celebrated result (Chapter 4 of [
27])
The derivative of displacement is linked to the Berry curvature defined in the
space, which is the kernel for developing the Berry-phase formalism of electric polarization. Equation (
106) is also essential for the path-independent formulation of the polarization field
P.
Noteworthy is the fact that the validity of the elegant Equation (
106) [
27] relies on implicit preconditions. Compare it with a second way of handling it: make the replacement
in the first place.
In order to yield a result consistent with Equation (
106), the following equality must be true:
Plugging in Equation (
108) to Equation (
107), we get the Berry curvature results Equation (
106). However, the derivation is based on treating “
” in Equation (
108) as an equality, which is only conditionally true if
It is straightforward to show that Equation (
109) does not hold
locally in general. Thus, we distinguish
. Such an equivalence holds only for matrix operators.
As a consequence, the equality in Equation (
106) does not hold for local
k. Only if one integrates the left and right sides of Equation (
106) over a closed manifold, such as the B.Z., will the two total integrals be equal, although each local
k may contribute differently. As such, the celebrated Berry curvature formula for adiabatic currents relies on a closed topology.
It appears that the adiabatic current is infinitely fragile: missing (or adding) even a single particle breaks the closed topology. Since thermal excitations exist even at low temperature, it seems necessary to examine the stability of Equation (
106) in the presence of excitations, although Equation (
106) is intended for the adiabatic limit. This will be addressed in a separate work. Nonetheless, the finding of this work shows that the substitution in Equation (
106) is false in a local sense.
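The local-versus-global distinction can be illustrated with a toy Abelian example (hypothetical data; the transformed connection A′ = A + ∂ₖθ is our assumed sign convention for a U(1) phase change). Locally the two connections differ, yet their integrals over the closed B.Z. agree, because the total derivative of a single-valued θ integrates to zero on the torus:

```python
import numpy as np

# Discretized closed 1D Brillouin zone.
Nk, dk = 1000, 2 * np.pi / 1000
k = np.arange(Nk) * dk

# A toy Berry-connection-like quantity A(k).
A = 0.5 + 0.3 * np.cos(k)

# A smooth, single-valued U(1) phase theta(k) (zero winding on the torus).
theta = np.sin(2 * k)
A_new = A + np.gradient(theta, dk)  # assumed sign convention

# Locally, the two connections clearly differ...
local_diff = np.max(np.abs(A_new - A))

# ...but the loop integrals over the closed B.Z. agree.
I_old = np.sum(A) * dk
I_new = np.sum(A_new) * dk
print(local_diff, I_old, I_new)
```

Only the integral over the closed manifold is stable; the local (pointwise) values are convention-dependent, in line with the statement that the substitution holds globally but not locally.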
In short, the position operator is a Hermitian (differential) operator (all its eigenvalues are
) and
r-matrix is a Hermitian matrix (the left side of Equation (
96)); but the fact that the position is a Hermitian operator does not guarantee that it behaves like a two-sided associative operator, as the basis-free designation
alludes to. Given that the formal invariance of Equation (
99) is absent, Equation (
96) is an example of a mistake caused by basis-free notations.
To avoid the problem, one may either replace the ket/bra designation system with the matrix-component formalism, as Weinberg does [
54], or keep using it but with special attention paid when
is involved.
The difference is clear:
has effectual range (in this case, confined to
) and acts on one side (the right);
is interpreted as associative (acting on both sides) without an effectual range. It is never an equivalent replacement
. Thus, we do not directly inherit the designation designed for matrix operators and apply it to
. Recall that in expressing
r-matrix elements in
Section 4, we adopted the original integral form, Equation (
45) instead of using
. Equation (
110) is exactly the reason.
Next, we summarize the algebraic rules (a)–(e) for the matrix of a differential operator, in comparison with the matrix of a matrix operator.
(a) Matrix elements.
operator is directly defined by matrix elements.
In contrast, a matrix operator has
A matrix operator may use basis-free designation
in the midst of
, while for the reason listed above, we should avoid using
. Additionally,
may intrinsically depend on
k, not just due to ribbons being
k-dependent; thus, the
k label in
should
not be ignored.
(b) Complex conjugation.
where
For the matrix operator,
Note that
cannot take the position of
, and “†” should not be attached to
since
is ill defined. Given that
coexists with
, the algebra rule is, for instance,
The following could be used to express the fact that
is a Hermitian operator (i.e.,
r-matrix is a Hermitian matrix)
However,
For matrix operators,
The difference between Equations (
118) and (
119) is due to differential operators lacking the basis-free designation. For
acting on generic vectors,
For matrix operators,
A mistaken expression is
which leads to mistakes
Obviously, Equation (
123) contradicts the correct result of Equation (
113).
(c) Dimensions and effectual range. A differential operator does not have a fixed matrix dimension. It can be seen that might act on both and on . In contrast, a matrix operator is associated with a fixed dimension when first introduced.
Another feature of
is effectual range.
should always be specified with its effectual range, which is denoted by
. For example,
Thus, we distinguish
from
because
has
’s effect restricted to
;
, and it affects every term all the way to the right (
Appendix D).
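The role of the effectual range can be checked numerically. In the sketch below (toy periodic functions f and g on a discretized B.Z.; all choices are hypothetical), a derivative acting all the way to the right obeys the product rule, while a derivative whose effectual range is restricted to f does not; the two differ by f ∂ₖg, so the effectual range must always be specified:

```python
import numpy as np

# Periodic grid on the 1D Brillouin zone.
Nk, dk = 1000, 2 * np.pi / 1000
k = np.arange(Nk) * dk

f = np.exp(1j * k)   # toy periodic function
g = np.sin(2 * k)    # another toy periodic function

def D(h, dk):
    # periodic central-difference derivative d/dk
    return (np.roll(h, -1) - np.roll(h, 1)) / (2 * dk)

# d/dk acting all the way to the right (no effectual-range restriction):
unrestricted = D(f * g, dk)   # product rule: f' g + f g'
# d/dk with its effectual range restricted to f alone:
restricted = D(f, dk) * g     # f' g only

# The difference is f g' (up to discretization error).
print(np.max(np.abs(unrestricted - restricted)))  # ≈ max|f g'|
```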
(d) Inner product. For a differential operator (Einstein convention)
That is,
In contrast, a matrix operator has
That is,
Compared with Equation (
128), the inner product of a differential operator features an inhomogeneous term. Only with
k-independent ribbons does Equation (
126) reduce to the same form as for a matrix operator.
(e) Ribbon band transformations. The transformation rules for matrix and differential operators are specified by Definitions 6 and 7. The unitary matrix is stipulated as
where
.
and
are two sets of orthogonal ribbon bases, and
and
. (
and
mean the
elements in sets
and
, respectively.)
Note that the rule for differential operators (Definition 7) is specified in matrix forms; unfortunately, the rule is virtually incompatible with the basis-independent designation. Neither of the following is proper:
These two expressions are motivated by an analogy with
, i.e.,
takes the position of
in the middle and transformation takes a similarity form. The difference is merely about the effectual range. In the first line of Equation (
130),
acts all the way to the right. Consider an inner product with two arbitrary ribbons under ribbon transformations
That is,
is invariant under the ribbon transformation (due to
canceling with
). Such invariance contradicts the ribbon transformation defined in matrix form (Definition 7, Equation (
94)), which gives an extra term
. The invariance of Equation (
131) also contradicts the common knowledge that a Berry-connection-like quantity should be variant under transformation.
On the other hand, such a designation is not a total failure, as
may correctly deduce matrix forms of ribbon transformation when
is “isolated”, i.e., it does not act upon other ribbons (detailed in
Section 7). That is why denotations like
have been adopted in some of the literature. However, one may encounter difficulty when the two parts interact, as demonstrated by Equation (
131). Such a designation cannot consistently remain in harmony with itself, nor yield results consistent with the matrix forms (unfortunately, these issues often escape notice).
How about using , restricting the effectual range to ? In that case, becomes a pure matrix operator (one may simply regard as a matrix, and becomes a product of two matrices, which yields another matrix). Then, a differential operator degenerates into a matrix operator, which is obviously incorrect.
Ribbon transformation is defined on the matrix of the differential operator rather than on the differential operator itself. The fundamental mistake in Equation (
130) is that we try to find a denotation expressed directly with the differential operator; instead, we should first introduce the matrix of
, and define the transformation on the matrix elements. In other words, this implies that the analogy between
and
is improper, although people tend to call both of them operators. Thus, we see a second example of the deep incompatibility between a differential operator and basis-free designations, adding to the earlier issue of complex conjugation (Equation (
118)).
We compare the two designations in
Table 3, showing that the basis-free designation occasionally encounters problems; the matrix designation appears advantageous. That is why Definitions 6 and 7, and those that follow, are given in matrix form rather than in basis-free form. Algebraic rules for incorporating differential operators with ket/bra designations are summarized in
Appendix D.
6. Gauge Transformation, Ribbon Transformation, and Basis Transformation
Although gauge invariance is common in constructing transport theory [
8,
18,
27], there is still vagueness in the concepts and in the relationships between different proposals. Here, we particularly focus on gauge transformation’s relation with CRM and differential operators. We address in which cases one should be concerned with gauge issues. We define gauge transformation unambiguously (Definition 8) as the frame for characterizing various gauge transformations. We emphasize that gauge invariance could have quite different meanings in different contexts [
8,
18,
27]. This will shed light on understanding different transport theories [
8,
18,
22,
27,
28].
In the classical case, gauge transformation arises from modifying vector potential
but preserving magnetic field
B (
B is considered a physical reality). The generic expression is to add a curl-free field
[
57]
The only notion involved is the (differential) vector field. In quantum mechanics (especially in the context of geometric phases), however, an (Abelian) gauge transformation refers to a “phase shift” of an eigenstate
[
15,
24],
It is more like a convention change, not involving preservation of physical quantities. Moreover, Equation (
133) relies on notions absent in the classical Equation (
132), such as eigenstates and complex phase
, which require a vector space established on
; in contrast, the classical
and
are built on
.
Why are Equations (
132) and (
133) both referred to as gauge transformations despite these distinctions? In this section, we define gauge transformation and clarify its relations with ribbon and basis transformations, as well as its physical implications.
Definition 8. Gauge transformation associated with manifold K is defined as a matrix map of a particular form, where is a matrix whose elements are indexed by m, n. and , and . is a unitary matrix, with k being a shorthand for the coordinates . can be denoted in a generic form, where , i.e., a direct sum of a series of operator spaces , with the number of summands depending on . , i.e., elements in are matrices.

Remark 12. Consider a concrete case (3D B.Z.) and where and is defined on . In that case, and . Thus, . Thus, at a local k, is equivalent to a 3D vector transformation. Matrix reduces to a real number and is commutative, and the unitary matrices become complex phases that bypass and cancel their conjugates. Manifold K refers to the space that hosts , thus , 3D real space. This is exactly the situation of Equation (132): the classical gauge transformation corresponds to Abelian . Note that is merely a means to specify map f, not indispensable for the gauge definition.

Remark 13. Consider the non-Abelian case (1D B.Z.). In a band model, , i.e., the dimension of the matrices in is equal to the dimension of the quotient space of the Bloch space (the band number). In fact, gauge transformation is exactly the transformation rule of reduced r-matrices under ribbon transformation. Thus, the Abelian Equation (133) is a special case of .

Remark 14. can be viewed as a modified form of matrix rotation, which would otherwise have obeyed a similarity form. Evidently, has an extra term with ; when , reduces to the transformation behavior of a matrix operator.

Remark 15. is not an arbitrary transformation from K to f, but one that conforms to the particular form specified by Equation (134). Why is this form chosen? Because it is the transformation of the r-matrix, or, more generally, the transformation response of matrices of differential operators.
That means that gauge transformation is closely tied to the differential operator, i.e., the transformation form is determined by the differential operator’s matrix behavior. On the other hand, if no differential operator is involved, for example, for spin operators (or other matrix operators), gauge transformation is trivial, as it is identical to many independent copies of basis transformation.

Remark 16. is not defined for a single point but for (thus, it is associated with K). In band scenarios, K is the B.Z. In terms of topological space, the B.Z. is a torus .
Remark 17. The map Equation (134) is defined directly with matrices, without involving the matrices’ identities or natures. An matrix can denote a linear operator (or map) in an n-dimensional vector space V, but only once its response under basis rotation is specified. Intuitively speaking, an operator is a matrix that is hinged to a basis transformation: for an operator, its matrix might have to change to a different “version”, say , under new bases, while a pure matrix is just “free” when it is not associated with vector spaces or bases, being just a set of numbers arranged in a rectangular box. At the level of defining , the target is just a pure matrix, without concern for whether the entries in the domain form linear operators or not. (In fact, these matrices are not linear operators, as the non-linear term in violates linearity.)

Remark 18. Note that is not a linear transformation, as , . On the other hand, it has the property .
Remark 19. There are multiple spaces involved in ’s definition; thus, there are multiple dimensions associated with . The first is the dimension of the manifold K, which corresponds to the B.Z. in this context. Next is the dimension of matrix M, which, in band models, is set equal to , i.e., the band number. Then there is the dimension of the components of , which is equal to .
Remark 20. is defined as an abstract map, not associated with classical or quantum physics nor with Hilbert space.
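Definition 8 and Remark 18 can be illustrated with a small numerical sketch (hypothetical two-band data; the map f(M) = U†MU + iU†∂ₖU is our assumed sign/factor convention for the form in Equation (134)). The inhomogeneous term makes the map manifestly non-linear:

```python
import numpy as np

def gauge_transform(M, U, dk):
    """Sketch of the gauge map of Definition 8 (assumed convention):
    f: M(k) -> U†(k) M(k) U(k) + i U†(k) dU/dk."""
    dU = (np.roll(U, -1, axis=0) - np.roll(U, 1, axis=0)) / (2 * dk)
    Ud = U.conj().transpose(0, 2, 1)
    return Ud @ M @ U + 1j * Ud @ dU

# Periodic grid over the 1D B.Z. and a smooth diagonal unitary field U(k).
Nk, dk = 256, 2 * np.pi / 256
k = np.arange(Nk) * dk
U = np.zeros((Nk, 2, 2), dtype=complex)
U[:, 0, 0] = np.exp(1j * k)
U[:, 1, 1] = np.exp(-1j * k)

# Two random matrix fields (hypothetical toy data).
rng = np.random.default_rng(1)
M1 = rng.normal(size=(Nk, 2, 2)) + 0j
M2 = rng.normal(size=(Nk, 2, 2)) + 0j

# Non-linearity (Remark 18): f(M1 + M2) != f(M1) + f(M2), because the
# inhomogeneous term i U† dU enters once on the left and twice on the right.
lhs = gauge_transform(M1 + M2, U, dk)
rhs = gauge_transform(M1, U, dk) + gauge_transform(M2, U, dk)
print(np.allclose(lhs, rhs))  # False
```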
Relation between gauge transformation and ribbon transformation . Ribbon transformation
(Equation (
92)) is associated with vector space
V and manifold
K; gauge transformation
(Equation (
135)) is only defined with
K, not involving any vector space.
is used to transform a ribbon (which is a map) while
is employed to transform a matrix—distinct transformation targets. In other words, the domains (also the co-domains) of the two transformations are different: the domain of
is ribbon space
, while
’s domain is a matrix set. Additionally,
is a linear map, while
is not linear.
Despite these conceptual distinctions,
and
are closely related: they are both established on manifold
K; moreover, the entire information about
is encoded in unitary matrix
. This is easily seen from Equation (
134): given the unitary matrix
, the output
is determined, which is exactly the matrix designation for a ribbon transformation. Therefore,
can induce
. In other words, a correspondence exists between
via
matrix.
Additionally, both
and
can be classified by unitary group U(
N), where
N refers to the highest dimension of the group’s irreducible representation (IR) on
V. Thus, we may utilize a superscript “
N” in
(or
) to denote U(
N) gauge (ribbon) transformations. The dimension of U(
N) can differ from the dimension of the matrices. (The dimension of a group is a property of a set of matrices [
44], while the dimension of a matrix concerns a single matrix.) For example, consider
on two bands, i.e.,
(
). Aut(
V) takes a form of
diagonal matrix as below:
The matrix above does not represent a single matrix, but a set of matrices parameterized by
k that forms a 2D reducible representation of the U(1) group, in which the highest IR is 1D, i.e.,
. Thus,
is 1D, while the matrix (or the vector space) is 2D.
Back to the question raised earlier: why are Equations (
132) and (
133) both regarded as gauge transformations? Accurately speaking, based on Definition 8 (Equation (
135)), only Equation (
132) is
, while Equation (
133) transforms a ribbon with a unitary matrix
. Equation (
133) being considered a gauge transformation requires an additional step: the correspondence
, i.e.,
in Equation (
133) contains all the information needed to deduce the gauge transformation. Forgetting this might lead to conceptual vagueness and confusion. For example, one may mistakenly believe that
is established on the notions of eigenstates and complex phase shifts. In fact,
can be defined as an abstract map, without referring to eigenvectors.
Next, we extend the familiar notion “basis invariance” to another concept, “gauge invariance”. Basis invariance is associated with a specific function of a matrix operator. For example,
The basis invariance refers to
The basis transformation
means that we need to find the updated components for the vector
,
and for matrix operator
Then,
Then, function
F is said to be invariant under basis transformation
. This invariance is associated with vector space
V and function
F.
In the same vein, we try to define invariance for ribbon transformation. In this case, function
F needs to be replaced by a functional
about matrix
. A functional can be viewed as a generalized function whose variable is a map.
is the map
(from manifold
K to matrix (operator) space
) that serves as the functional’s variable; thus, the functional is denoted as
. Additionally, we need to replace
by
.
for which
is given by Definitions 6 and 7. That means
relies on whether matrix
belongs to the matrix or the differential type of operator: the two are subject to distinct behaviors under
. The ribbon band is a common platform for both types of operators, but only when it is defined for the matrix operator
may effectively reduce to the
since transformations at different
k are independent.
On the other hand,
applies non-trivially to the matrix of differential operators. Consider an example of
belonging to a differential operator:
Plug in Equation (
145) with Equation (
94):
Using the facts that Tr and ∮ may be exchanged and that a similarity transformation preserves the trace, Equation (
146) becomes
A generic fact for unitary matrices is
where
H is a Hermitian matrix.
Combined with Equation (
147), we have
The last term in Equation (
150) vanishes due to the continuity of
on the torus. Thus,
is invariant under ribbon transformation, i.e.,
. On the other hand, if the integration is not over a closed manifold, or the trace Tr is absent, ribbon invariance fails.
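The ribbon invariance of the traced loop integral can be verified numerically. In the sketch below (hypothetical two-band data; the transformation rule A′ = U†AU + iU†∂ₖU is our assumed convention for Equation (94)), U(k) = e^{iH(k)} with a periodic Hermitian H(k), following the generic form of Equation (148); the functional Tr ∮ A dk is unchanged because the boundary term vanishes on the torus:

```python
import numpy as np

Nk, dk = 400, 2 * np.pi / 400
k = np.arange(Nk) * dk

# Periodic Hermitian field H(k); the ribbon transformation is U(k) = e^{iH(k)}.
H = np.zeros((Nk, 2, 2), dtype=complex)
H[:, 0, 0] = np.cos(k)
H[:, 1, 1] = -np.cos(k)
H[:, 0, 1] = 0.4 * np.exp(1j * k)
H[:, 1, 0] = H[:, 0, 1].conj()

U = np.zeros_like(H)
for i in range(Nk):
    w, V = np.linalg.eigh(H[i])
    U[i] = V @ np.diag(np.exp(1j * w)) @ V.conj().T

# A toy Berry-connection-like matrix field A(k).
A = np.zeros_like(H)
A[:, 0, 0] = 1 + np.sin(k)
A[:, 1, 1] = np.cos(k)
A[:, 0, 1] = 0.2 * np.exp(2j * k)
A[:, 1, 0] = A[:, 0, 1].conj()

# Assumed transformation rule: A' = U† A U + i U† dU/dk.
dU = (np.roll(U, -1, axis=0) - np.roll(U, 1, axis=0)) / (2 * dk)
Ud = U.conj().transpose(0, 2, 1)
A_new = Ud @ A @ U + 1j * Ud @ dU

# Functional F[A] = Tr ∮ A dk: invariant under the ribbon transformation.
F_old = np.trace(A.sum(axis=0) * dk)
F_new = np.trace(A_new.sum(axis=0) * dk)
print(abs(F_old - F_new))  # ~0
```

Dropping either the trace or the closed-loop integration breaks this invariance, as the surrounding text notes.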
Since gauge transformation is induced by ribbon transformation , we may introduce the notions of gauge invariance and gauge symmetry along lines similar to ribbon invariance.
Definition 9. Gauge invariance is the following property associated with functional or : for which holds. is given by Definition 8. If Equation (151) is fulfilled, functional about matrix is said to be invariant under U(N) gauge transformation, or to have U(N) gauge symmetry. Obviously, if U(N) is the gauge symmetry of , the subgroups of U(N) are also gauge symmetries of .

Remark 21. for functional generalizes for matrix (Definition 8). Then, gauge invariance is a notion established on for . Since the transformation of is “fixed” (by Definition 8), whether it is gauge invariant depends entirely on the form of .
Remark 22. Since (Definition 8) is defined for , the transformation of the matrix of a differential operator (Definition 7) follows; thus, gauge invariance is more pertinent to the matrix of differential operators (e.g., ). Ribbon invariance is a more general notion in this context, which works for both differential and matrix operators.
Remark 23. Gauge invariance is a notion subject to functional and matrix . It is also implicitly subject to the manifold K and vector space V, which are ingredients in the definition of matrix , since is a map , where it is a linear self-map .
Remark 24. For the correspondence , gauge invariance and gauge symmetry can be characterized by the U(N) group.
Remark 25. “Gauge invariance” might vary slightly in its meanings and emphasis in different contexts, thus showing different facets. These distinctions can all be attributed to specific constructions of functional .
Consider an
without gauge invariance.
and
In fact,
is no more than a functional construction of the Berry connection; it is not gauge invariant, as is well known. Note that all the above invariances are defined with matrices and matrix transformation behavior, without reference to whether these matrices arise from a differential operator or from bras/kets. In other words, one does not need to refer to
, but only the matrix transformation.
Consider an
respecting gauge invariance.
It can be shown that
The inhomogeneous term
vanishes. Equation (
155) is invariant when
commutes with
; however, in general, Equation (
155) is not invariant. Thus, such a construction of
has U(1) gauge symmetry but not U(
N).
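The distinction between U(1) and U(N) gauge symmetry can be illustrated numerically (hypothetical two-band data; the transformation rule A′ = U†AU + iU†∂ₖU is our assumed convention). A per-band loop-integral functional is invariant under diagonal, winding-free U(1) phases, but not under a band-mixing U(2) rotation; only the traced sum survives the mixing:

```python
import numpy as np

Nk, dk = 400, 2 * np.pi / 400
k = np.arange(Nk) * dk

def transform(A, U, dk):
    # assumed transformation rule of a differential-operator matrix
    dU = (np.roll(U, -1, axis=0) - np.roll(U, 1, axis=0)) / (2 * dk)
    Ud = U.conj().transpose(0, 2, 1)
    return Ud @ A @ U + 1j * Ud @ dU

def band_integrals(A, dk):
    # per-band functional: loop integral of each diagonal element
    return np.einsum('knn->n', A).real * dk

# Toy two-band Berry-connection-like field A(k).
A = np.zeros((Nk, 2, 2), dtype=complex)
A[:, 0, 0] = 1 + np.sin(k)
A[:, 1, 1] = np.cos(k)
A[:, 0, 1] = 0.2 * np.exp(2j * k)
A[:, 1, 0] = A[:, 0, 1].conj()

# (i) Diagonal U(1)xU(1) phases with zero winding: functional invariant.
U1 = np.zeros((Nk, 2, 2), dtype=complex)
U1[:, 0, 0] = np.exp(1j * np.sin(k))
U1[:, 1, 1] = np.exp(1j * np.sin(2 * k))
G0 = band_integrals(A, dk)
G1 = band_integrals(transform(A, U1, dk), dk)

# (ii) A constant band-mixing U(2) rotation: functional NOT invariant.
alpha = 0.7
R = np.array([[np.cos(alpha), -np.sin(alpha)],
              [np.sin(alpha), np.cos(alpha)]], dtype=complex)
U2 = np.tile(R, (Nk, 1, 1))
G2 = band_integrals(transform(A, U2, dk), dk)
print(G0, G1, G2)
```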
In short, gauge transformation reflects the response of the matrix of a differential operator under ribbon transformation . Thus, can be induced by . Both can be characterized by the unitary group U(N) involved in the ribbon transformation. In certain cases, the two may be interchangeable, but their definitions as maps are not the same.
Relation between gauge transformation and basis transformation . Basis is a notion associated with a vector space
V. Basis transformation
affects vectors and operators defined in space
V. It is expressed as a map:
Conceptually, one can interpret
as changing the components
of a vector under the updated bases without altering the vector; alternatively, one may interpret
as altering the vector, sending it to another vector. The two interpretations differ only in the “reference” and are actually equivalent.
and
are notions associated with product space
(technically,
can be called a bundle space with
K being base space and
V being fiber space;
is locally like a tensor product of spaces, but not necessarily globally [
43]). Roughly speaking,
and
are defined for a “bigger” space. Then, could we view
and
as “basis transformation” for the bigger space? This view could be mistaken, since
may not form a vector space (e.g., when
K is B.Z.), but only a topological space.
and
are different maps from
as summarized in
Table 4.
and
affect ribbons and operators defined in
, just like
modifies vectors and operators in
V. Quantum operators are originally defined in a Hilbert space, which is a vector space; in treating transport, as mentioned in
Section 2, we encounter diverging norms and transcribe operators into non-vector spaces, wherein
and
are defined. In the band context,
V is quotient space
of
, and
is a map
. Thus, one can practically understand that
is a “single”
-dimensional matrix at
k, while
(also
) involves
and thus it is a matrix field.
It is inaccurate to regard merely as “phase shifts”, as can also produce these; consider, for example, a 2D vector space spanned by , (). Consider a of a phase shift, . Is this phase shift a ? No, because it does not involve manifold K, i.e., it is not a field over k. One might call a “special case” of , since a general allows mixing of bases, e.g., , while constrains and . This view is inaccurate for the same reason: is not a field over k. On the other hand, at a local k, can be viewed as a constrained ; if one relaxes the constraint and allows mixing of orthogonal bases, is achieved. Thus, one may use the dimension of the local to classify and .
We see that invariance is an important criterion for constructing observables. One may wonder whether the invariance of should be tested, too. Fortunately, associative operators (i.e., matrix operators) defined in a vector space are automatically endowed with U(1) symmetry. Thus, gauge invariance is only pertinent to differential operators, such as position operators, the Berry connection, etc.
In other words, if an observable involves
, etc., one should test whether this quantity is invariant under
; if gauge invariance holds, the quantity could, in principle, be detectable. On the other hand, for matrix operators serving as observables (e.g., spin), there is no such issue of gauge invariance. For operators other than observables, e.g., the propagation operator, should we worry about basis or gauge transformations? Fortunately, the answer is again no, but for a different reason. Consider an evolution operator (
is time-ordered):
We notice that the evolution operator is composed of a product of matrix operators and their sums (expansion of exponential functions under time ordering), which again form a matrix operator. Therefore, when we evaluate a spin operator and its evolution, we do not face a gauge issue.
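This argument can be checked directly: products and sums of matrix operators transform by the same similarity form, so no inhomogeneous (gauge) term can appear. A minimal numerical sketch with hypothetical random matrices:

```python
import numpy as np

rng = np.random.default_rng(3)

# Two random matrix operators and a random unitary basis change.
A = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
B = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
U, _ = np.linalg.qr(rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3)))
Ud = U.conj().T

# Products and sums of matrix operators transform by the same similarity
# form, because the inner U U† factors cancel pairwise.
lhs = Ud @ (A @ B + A) @ U
rhs = (Ud @ A @ U) @ (Ud @ B @ U) + Ud @ A @ U
print(np.allclose(lhs, rhs))  # True: no extra (gauge) term appears
```

The same cancellation applies to every term in the time-ordered exponential, which is why the evolution of a matrix operator such as spin raises no gauge issue.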
Lastly, we remark that
or
are not only “transforming” the ribbon but also affect the matrices defined in the ribbon space, just as a basis transformation also affects the operators defined in the vector space. Thus, ribbon transformation is defined as a map of ribbons while, as shown by Equation (
92), the operators are also affected.
Extracting observables. From
Figure 1, we realize
is a general transformation applicable to both matrix and differential operators. When it is applied to a differential operator, it induces gauge transformation
; gauge invariance is a special case of
invariance, as
is applied to differential operators.
By reviewing the observable of a matrix operator, we find that it features U(1) symmetry, i.e., it is invariant under
. It corresponds to a function
F, namely
the observable function, which links an observable with the diagonal terms of matrix
.
Function
F is invariant under U(1) ribbon transformation
a property not shared by the off-diagonal terms (thus, the observable is linked to the diagonal rather than the off-diagonal terms). In the case of
, we generally have
, i.e.,
F does not enjoy U(
N) symmetry. U(1) symmetry is believed to be indispensable for observables. In band scenarios, the ground state has a fixed occupancy (all states below the Fermi level), but the phases are flexible due to dynamic evolution
. That means that even without disturbance each quasi-particle keeps evolving, and the ground state is composed of a collection of quasi-particles with random phases. Thus, robustness to phase fluctuations, i.e., U(1) symmetry, ensures that a quantity is stable over time (given that the occupancy is unchanged) and thus detectable during a measurement. Since our “vision” depends on measurement conditions, time/spatial scales, etc., the meaning of and criteria for observables might vary from case to case. In non-Abelian gauge theory, observables might have higher symmetries. Nonetheless, U(1) symmetry should be of outstanding importance.
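The U(1) robustness of diagonal elements can be seen in a minimal numerical sketch (hypothetical random Hermitian matrix): attaching an independent phase to each basis state leaves the diagonal matrix elements untouched, while the off-diagonal elements acquire phases and are therefore unstable against phase fluctuations:

```python
import numpy as np

rng = np.random.default_rng(2)

# A random Hermitian matrix operator (toy stand-in for an observable).
O = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
O = O + O.conj().T

# Independent U(1) phases attached to the three basis states.
theta = rng.uniform(0, 2 * np.pi, size=3)
U = np.diag(np.exp(1j * theta))

O_new = U.conj().T @ O @ U

# Diagonal elements (candidate observables) are unchanged by the phases...
print(np.allclose(np.diag(O_new), np.diag(O)))  # True
# ...while off-diagonal elements pick up phase factors.
print(np.allclose(O_new, O))  # False
```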
We try to extend the gauge symmetry principle for observables to differential operators. Evidently, for the matrix of differential operators,
is not invariant under
(i.e.,
).
That means function
F is unsuitable for observables associated with differential operators. Thus, we reconstruct a form that ensures U(1) symmetry. Consider the following:
which satisfies the requirement with minimal modification of
F (and thus maximal elegance).
still involves the diagonal terms
but adds an integration over
k. This form has emerged in different fields of physics [
15,
16,
24,
27] and is privileged with U(1) symmetry.
and
represent two distinct ways of obtaining observables. The dichotomy seems strange: one has to follow separate principles. We now argue that the two are linked by a common principle, U(1) symmetry, with the common physical origin of “stability against dynamical phases”. Taking U(1) symmetry as the principle for observables (at least in condensed-matter scenarios), the distinct ways of yielding observables,
and
, can be unified.
Conventionally, observables are evaluated by inner products with the observable’s operator, i.e., diagonal terms of
(or their superposition), expressed by
, which is a necessary result if the following are true: (i) every observable has a corresponding (Hermitian) operator; (ii) the corresponding operator is an associative operator (i.e., a matrix operator), which ensures U(1) symmetry. However, counterexamples are now known for both (i) and (ii). In non-relativistic quantum mechanics, time does not have such a corresponding operator; in the relativistic scope, the boson lacks a position operator [
54]. Thus, observable–operator correspondence is not guaranteed. Moreover, when the corresponding operator exists (e.g.,
as discussed throughout
Section 3), it might not be associative; that is why we see the divergence in the DRM and the incompleteness of the vector space for
, which has motivated our seeking CRM with
Nth Weyl algebra
, leading to distinct transformation behaviors for the
r-matrix. Historically, based on the mistaken presumptions (i) and (ii), von Neumann “proved” the impossibility of local hidden variables in quantum mechanics [
2].
The traditional view is that
F is the generic form of generating observables, subject to which
should belong to the frame of
F. However, the difficulty is that there is no counterpart of the integration over
k in
F. In other words,
F is for local
k, while
is for global
K: employing
to determine the value of an observable, one must have knowledge of
all over the B.Z., while
F concerns only a vector at a single
k. This issue was noticed in Vanderbilt’s book (Chapter 4) [
27]. The essential argument is that electric polarization cannot be expressed as the expectation value of a quantum operator, as is the case for most observables; instead, it is related to Berry phases, which are defined by global means.
For the new principle, we first establish matrix and differential operators on a common ground: ribbon space; on top of it, U(
N) symmetry classifies both of them. Then, we argue that
is no more fundamental than
. Instead, we take the U(1) symmetry as the major principle and seek the robust forms for each type of operator (
Figure 2).
Recall that an important reason for
to be an observable is its U(1) symmetry; off-diagonal terms are unqualified as observables in view of the absence of U(1). In a similar fashion, if we find another U(1)-invariant form
, which works for differential operators,
should be treated on an equal footing with
. That is,
is not subject to
F, nor deduced by
F. U(1) could be extended to a U(
N)-invariant form, such as Equation (
145), by including a trace.
We call attention to the following interlinked aspects:
- (1)
Differential operators are not associative, fundamentally distinct from matrix operators;
- (2)
Distinct transformation rules for matrices of the two types of operators;
- (3)
Different meanings of U(1) symmetry;
- (4)
Means of extracting observables.
A differential operator is not associative but “one-sided”, and is never equivalent to a matrix operator. Consequently, the matrix of a differential operator features a distinct transformation rule with an extra term (Equation (
94)). The inhomogeneous transformation gives U(1) invariance a different meaning, since invariance is a notion (Definition 9) associated with specific functionals. Although both matrix and differential operators have U(1) symmetry, their functionals differ, and so the meaning of U(1) is not the same. An observable is about constructing a U(1) gauge; since the functionals differ, the two operators have different ways of extracting observables.
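As a concrete illustration of the inhomogeneous U(1) behavior, consider the diagonal Berry-type connection A(k) = Re[i⟨u(k)|∂_k u(k)⟩]: under a gauge transformation u → e^{iθ(k)}u it shifts by the derivative of the phase (with this sign convention, A → A − ∂_kθ; the opposite convention flips the sign). The model state u(k) below is hypothetical and purely illustrative, not Equation (94) itself:

```python
import cmath, math

def u(k):
    # A hypothetical normalized two-component state; illustrative only.
    n = math.sqrt(2.0)
    return [cmath.exp(1j * k) / n, cmath.exp(-2j * k) / n]

def theta(k):
    return 0.3 * math.sin(k)  # an arbitrary smooth U(1) phase

def u_gauged(k):
    ph = cmath.exp(1j * theta(k))
    return [ph * c for c in u(k)]

def connection(vec, k, h=1e-6):
    # Finite-difference A(k) = Re[ i <u(k)| d/dk u(k)> ]
    v0, v1 = vec(k), vec(k + h)
    inner = sum(a.conjugate() * (b - a) / h for a, b in zip(v0, v1))
    return (1j * inner).real

k0 = 0.7
shift = connection(u_gauged, k0) - connection(u, k0)
# The connection is NOT invariant: it picks up the inhomogeneous term -theta'(k0).
print(abs(shift + 0.3 * math.cos(k0)) < 1e-4)  # True
```

Only gauge-invariant combinations built from such a connection qualify as observables, consistent with the functional-dependent meaning of U(1) discussed above.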
Thus,
is implicitly linked to behaviors of
and its transformation symmetries, and the way of extracting the observable, Equation (
161) is a generic consequence for differential operators, not limited to
.
7. Discussion and Outlook
The spaces related to the r-matrix. In this work, we transcribe
, originally defined as a linear infinite-dimensional operator in space
, to a ribbon space
, in which
presents as a finite-dimensional matrix, namely CRM. Along the way, several spaces are involved, as summarized in
Table 5.
CRM arises from
, yet it is not a representation of the Weyl algebra (
Section 2) and thus not a representation of generator
. CRM should be viewed as a matrix incarnation of
that encodes the information to evaluate
. CRM is hosted by a “ribbon space” spanned by ribbon bases
(Definition 3) just as a vector is hosted by a vector space spanned by vector bases. The ribbon basis is a product space
(which may or may not form a vector space depending on whether
K is a vector space). In
, notions such as ribbon
and gauge transformations
emerge.
CRM is an array of matrices parameterized by continuous k. Formally speaking, it is a map , which involves two spaces K and . Thus, there are two dimension-like quantities associated with CRM: its dimension dim(K) and its rank . Accordingly, the Hamiltonian H (originally defined in ) is incarnated by  defined in the ribbon space. Rigorously speaking,  is not a representation of H, since the spaces  and  are not of the same dimension.
Although
is a linear map in
, its incarnation CRM in
loses linearity, reflected in CRM’s transformation featuring an inhomogeneous term (Definition 7). That is, the transcription does not maintain linearity, precisely because the transcription procedure differs from a basis transformation. Representations of linear operators under different bases are linked by reversible transformations, which preserve the dimension of the spaces. However, the dimension decreases from
∞ in
to
in
, and finally to dim(
K)
in
. Consider two bands in a 1D B.Z. A vector in
is denoted with
components
, while in
, a point is denoted by
, which is three-dimensional. Continuity is crucial in reducing the dimension from
to dim(
K)
(
Appendix E). One cannot simultaneously maintain linearity and achieve convergence. Whether DRM or CRM is adopted, it does not form a representation of the Weyl algebra, owing to the non-existence result mentioned in
Section 2.
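The non-existence of finite-dimensional representations can be checked with the standard trace argument (consistent with Section 2): for any N×N matrices X and P, Tr[X, P] = Tr(XP) − Tr(PX) = 0, whereas Tr(iħ·I) = iħN ≠ 0, so [X, P] = iħI admits no finite-matrix solution. A minimal numerical sketch in plain Python, with arbitrary (hypothetical) random matrices:

```python
import random

N = 4
X = [[random.uniform(-1, 1) for _ in range(N)] for _ in range(N)]
P = [[random.uniform(-1, 1) for _ in range(N)] for _ in range(N)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]

def trace(A):
    return sum(A[i][i] for i in range(N))

XP, PX = matmul(X, P), matmul(P, X)
comm = [[XP[i][j] - PX[i][j] for j in range(N)] for i in range(N)]

# Tr(XP) = Tr(PX) for all finite matrices, so Tr[X, P] = 0,
# which can never equal Tr(i*hbar*identity) = i*hbar*N.
print(abs(trace(comm)) < 1e-9)  # True for any choice of X and P
```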
The principles of defining the r-matrix. Matrices are usually defined as representations of operators, for which the interchangeable terms “matrix” and “operator” are often acceptable. To be representations, matrices should reproduce the defining algebra. For example, Pauli matrices are representations of spin (a matrix operator) that satisfy the Lie algebra. Formally speaking, we seek maps from operators to matrices that preserve Lie brackets. Such a structure-preserving map is named an isomorphism [
44], which is the principle in defining the matrix for operators (
Table 6).
Another well-known matrix isomorphism is group representation, which describes abstract groups in terms of bijective linear transformations of a vector space to itself (i.e., vector space automorphisms); in particular, they can be used to represent group elements as invertible matrices so that the group operation can be represented by matrix multiplication.
However, a matrix for
(a differential operator) invokes distinct defining principles, since no matrix isomorphism exists for the Weyl algebra (
Section 2). We still build a map from
to Hermitian matrices, and this map is not meant to preserve the algebra (that is why we find
in Equation (
80), while Weyl algebra has
); however, this does not prevent the operator’s information from being encoded in the matrix. That is, via the matrix one can still deduce the observable, except that the means of extracting the observable differs from that of matrix operators, since the manner of encoding has changed.
For the
r-matrix, the transformation rule is inhomogeneous (Equation (
94)), which makes it impossible to preserve the commutator
. In other words, the commutator is not invariant under
. This is a significant difference between Lie algebra and Weyl algebra. Despite the lack of matrix isomorphism, we may still define a matrix for differential operators, and it interacts with ribbons in a similar fashion to a “genuine” matrix.
Two types of operators. We discriminate two types of operators: matrix operators (e.g., spin) and differential operators (e.g.,
). Their properties and designations are compared in
Section 5. The two operators are expressed by matrices equipped with distinct transformation rules (see Definitions 6 and 7). Differential operators “appear like a matrix” until a ribbon transformation is performed, just as the coefficients of a vector are no more than “a column of numbers” until a basis transformation is performed, at which point covariance and contravariance manifest. Thus, an operator is defined not just by its matrix elements but also by their transformation rules.
Then why do the two matrices follow the rules specified in Definitions 6 and 7? Why are these particular forms of transformation chosen? This arises from the basic algebras of the two operators. The matrix operator is associative, i.e.,
. One is at liberty to act
B on the left or on the right first and end up with the same outcome. Then, the matrix elements are
This is exactly the expression for Equation (
93) in Definition 6. The form of Equation (
93) is interpreted thus: the transformed matrix is simply the operator’s matrix against the updated bases.
However, we do not take the above procedure for granted. It is only true when
is associative. Otherwise, e.g.,
, which acts on only one side (conventionally the right), when we try to repeat the procedure as in Equation (
163), we encounter a different situation.
Obviously, there is an extra term on the right side, which arises from non-associativity and from obeying , namely the Leibniz rule. If we write the above terms in matrix form, it is exactly Definition 7. As such, the algebraic difference in associativity is incarnated in the matrix’s transformation. In terms of abstract algebra, the particular form of transformation introduced in Definition 7 stems from the Leibniz rule imposed on .
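The step can be sketched schematically (our reconstruction; the sign and ordering conventions of Equation (94) are assumed rather than reproduced): acting ∂_k on a transformed ribbon U(k)ψ(k) yields, via the Leibniz rule,

```latex
\partial_k\bigl(U\psi\bigr) \;=\; U\,\partial_k\psi \;+\; \bigl(\partial_k U\bigr)\psi
\quad\Longrightarrow\quad
D' \;=\; U^{\dagger} D\, U \;+\; U^{\dagger}\,\partial_k U ,
```

where D denotes the matrix of the differential operator. The second, inhomogeneous term is absent for a matrix operator, whose transformation remains the homogeneous U†MU.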
One may wonder: what if one could define a generalized differential operator that acts on both sides, such that the associative rule would be recovered for
? Unfortunately, this is unachievable due to the intrinsic properties of
. Consider two-sided actions:
acting on the right and
acting on the left. Clearly,
where
c is a
k-independent constant. That means
acting on the left or the right side first leads to distinct results, and thus the associative rule is violated. Hence, the differential operator is incompatible with the matrix operator at the level of its underlying algebra. It is impossible to turn a differential operator into a matrix. Therefore, one should distinguish the terms “operators” and “matrices of operators”, a distinction that is trivial when only matrix operators are involved.
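The obstruction above can be sketched as follows (a schematic reconstruction consistent with the surrounding argument, assuming the inner product ⟨f|g⟩ equals a k-independent constant c, as stated in the text):

```latex
0 \;=\; \partial_k c \;=\; \partial_k \langle f \,|\, g \rangle
\;=\; \langle \partial_k f \,|\, g \rangle + \langle f \,|\, \partial_k g \rangle
\quad\Longrightarrow\quad
\langle f \,|\, \partial_k g \rangle \;=\; -\,\langle \partial_k f \,|\, g \rangle .
```

Left-first and right-first actions thus differ (here by a sign), so (f∂_k)g ≠ f(∂_k g), and associativity cannot be restored.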
Basis-free designation vs. matrix designation. The bra/ket designation can be entirely replaced by matrix designations such as
,
, etc. Gauge transformation
is directly defined with Equation (
134) on the matrix, without reference to first multiplying
with a phase
(or a unitary operator) and then deducing
’s form. Thus, Definitions 5–9 are directly established on matrices, not referring to “matrix definition” in terms of bra/ket.
The bra/ket may lead to mistakes when differential operators are present (
Section 5). Differential operators commonly appear in transport problems, relativistic quantum mechanics, gauge field theory, etc., wherein matrices can be a better designation [
54]. However, bra/ket designation is elegant for matrix operators and broadly used. Thus, in
Section 5, we list the “translation”: the matrix “definition” in terms of bra/ket inner products. However, it is never suggested that one must refer to the “internal structures” of matrices (the definition begins with matrix elements; there is no need to refer to the “origin” of these matrices). Gauge transformations, basis transformations, and extracting observables can all be handled with pure matrix designations.
Thus, we switch to matrices for a common denotation for both the matrix operator and the differential operator. Accordingly, the matrix is established on ribbon bases in the ribbon space (or the bundle space) as a generalization of bases of vector space. Because of the fundamental algebraic distinctions, the matrices of the two types of operators follow different rules of transformation under different ribbons.
Inelastic processes. While non-conserved k is the less common case, a valid formulation should be able to handle it (at least in principle); doing so serves as an indicator of how promising the method is.
Formulating inelastic processes involves two tiers. The first, the main topic of this work, is evaluating the CRM or the observable given a generic ribbon band as input; this tier does not distinguish elastic, inelastic, or other modes. The second tier concerns how a particular state (described by a ribbon) is achieved; this involves the specific form of an inelastic evolution operator.
The generic evolution of a state in
can be expressed as
where
and
are unitary operators defined in quotient spaces
and
E. That is,
In the elastic case (
k is preserved), we adopt
as the identity. Physically, that means the evolution does not “mix” different
states. In this condition, the fiber space can be reduced to lower dimensions: from
to
.
In the inelastic case,
, and this increases the dimension of the fiber space. In terms of the observable
, we find that the evolution operator behaves like a ribbon transformation. Based on Equation (
94), the displacement may contain extra terms
due to inelastic
.
Thus, inter-band transitions and inelastic
k transitions are understood in common through the “twist” of ribbons, albeit in two distinct quotient spaces
and
E.
At this point, the formulation appears to have the ports needed to build in inelastic transport; on the other hand, a full theory of inelastic transport, as well as comparison with existing theory and experiment, is beyond the scope of a single paper.
Application and Outlook. Why can the fundamentals about the
r-matrix give insights for a transport theory? We give two examples, each deserving extensive discussion in separate work. The first regards two arguments established earlier: (i) isomorphism
(just the
map in
Section 3); (ii) one-to-one correspondence between
and vectors in the quotient space
of
(space spanned by
in
Section 4). Argument (i) links
with
as its quotient space, and argument (ii) relates space spanned by
with
by isomorphism such that the vector space spanned by
can be transcribed to the original physical space
. Although such linkage does not influence the evaluation of, for instance, Berry connection or curvature, it provides counting for vectors in
and
spaces to shed light on the essence of the phenomenon.
To be concrete, consider the adiabatic current
evaluated by integration of Berry connection [
23,
27]
What is the input needed for evaluating
? Equation (
169) relies on
over the entire B.Z. Remember
is the reduced
r-matrix evaluated with
in quotient space
. Thus, to understand what
physically represents, one ought to apply
inversely (called
map temporarily) to yield their pre-images in the original physical space
. It can be proved that
always corresponds to
N mutually orthogonal vectors in
(even when
is constant with
k). Since the dimension of a subspace in
corresponds to particle number (orthogonality being due to the exclusion principle),
it indicates that the adiabatic current
is an
N-particle phenomenon. Because of that, given full knowledge of a single-particle wave function
, it is still insufficient to determine
with Equation (
169). In the previous formulation,
is evaluated with wavefunction
coordinated by
r. Thus, one tends to (mistakenly) attribute
to a single-particle effect, since
can be extracted from a single-particle Bloch wave
. As such, the many-body feature in
is concealed.
Note that counting physical states must be established on a vector space (in terms of the dimensions of a subspace). It is ill-defined to ask how many particles are involved for functions . That is why arguments (i) and (ii), which provide a counting of vectors, are important. In the adiabatic limit, the correspondence of  to N vectors has a simple interpretation: simply the N particles occupying a band (N being the number of unit cells). In general, the correspondence to N orthogonal states may apply out of equilibrium, whether or not inter-band hopping is involved and whether or not k is preserved. As such, we realize that the N particles form a bound “unit”, even when some of them are excited to a different band or a different k. This N-particle picture gives insight into the description of electronic states in crystals.
Moreover, arguments (i) and (ii) concern the stability of adiabatic currents, as
loses gauge invariance even with a single particle missing. In other words, gauge invariance is fragile with particle number variation, e.g.,
. Then, an intriguing implication emerges: the
N-particle
is correlated, although these
N particles are non-interacting; that is, correlation still exists even when interactions are entirely removed. In contrast, the traditional wisdom is that correlation arises exclusively from interaction, and that a free particle is uncorrelated. It is known that the Berry-phase theory of polarization [
23] links
and its transported charges with global topology. Now with arguments (i) and (ii), we reveal that the physical meaning of global topology is the
N-body correlated effects.
As the second example, we show why the variable extraction from the
r-matrix (functional
in
Section 6) is linked to transport theory. One is often overwhelmed by the diverse transport mechanisms: shift current [
4,
8,
18], injection current [
4,
8,
28], adiabatic current [
22]; one has to resort to case studies to recognize which mechanism is in effect. This situation owes its origin to the lack of an unambiguous way of determining the current as an observable. In contrast, spin’s expectation value is
which is independent of whether
changes slowly or fast. However, such a generic definition is lacking for the current because the divergence of the
r-matrix makes
ill-defined. Thus, the divergence of the
r-matrix leads to vagueness in extracting the observable for
, leading to ill-defined
J with
, then to the diverse transport mechanisms that involve different approximations or physical intuitions (e.g., electronic hopping being fast or slow, inter-band or intra-band), and then to the inconsistency among different mechanisms from the point of view of being a single-particle effect or an
N-particle one, being correlated or not. Thus, finding the converging matrix to recover the original definition
is crucial for developing a unified transport theory.
To be concrete, consider adiabatic current
and shift current
. We realize there is a “gulf”. For
, it is evaluated by Equation (
4), in which the domain of integration ∫ is arbitrary, because gauge invariance of
remains whether the integration domain is closed or not, connected or not, even on a single
k point. In other words,
only depends on the initial and final states of hopping, which are two discrete points along the evolution wave function
, such that information of
, within the
formulation, is fully encoded in
and thus is a single-particle phenomenon. On the other hand,
requires stringently closed ∮ and presents as an
N-particle phenomenon, as the preceding discussion suggests, fundamentally different from
.
Remember there is only “one” current in crystals. Adiabatic current
, shift current
, etc., are merely artificial classifications.
and
are to handle the slow- and fast-changing Hamiltonians, respectively, characterized by external driving frequencies
. By tuning
, the current should gradually switch from one formulation to the other. But how can a single-particle
cross the gulf to continuously connect with an
N-particle
? Note that both
and
are non-interacting, irrelevant to interaction causing emergent collective states for electrons. Namely, the gulf is that at high
, information of the current is encoded in the wavefunction of a single particle, while at low
one has to know the states of all
N particles (without missing any of them) to determine the current—no known mechanism can realize this. Moreover, it is hard to imagine the transition regime when
is intermediate. The situation is reminiscent of the inconsistency between quantum theory and relativity in their fundamental descriptions of space and events: whether local or non-local laws are paramount in the universe. The central question there is how to reconcile a non-local theory at quantum scales with a local theory at larger scales describing the same universe. Here, we consider a much more modest question: how to reconcile two transport theories that describe the same non-interacting current, one in terms of a single particle and the other in terms of
N particles. Therefore, removing divergence of the
r-matrix and finding the way of extracting observables from CRM will help judge which picture is correct. Our theoretical framework is poised to enhance understanding of the photocurrent and phonon responses exhibited by topological materials, which are currently the focus of active exploration in THz and ultrafast experiments [
58].
A physical prediction in a superconductor scenario is detailed in a separate work [
21] using this formulation. With the application of
Nth Weyl algebra, charge position and movement in superconductor and insulator (two extremes of conductivity) can be unified by one formula [
21]. It improves our knowledge by uncovering a missing current component (termed
) entirely due to the correlation change of Cooper pairs, carrying velocity but no momentum. Simulation and experimental verification are outlined by Figure 2 of [
21].
In short, we should not be content merely with an evaluable formulation for currents; we should also examine (a) whether the formulation is generic and unique, (b) whether formulations for different limiting situations are compatible and can cross over to each other, and (c) whether the formulation is stable. For example,  requires ∮, which bears on whether the formulation is robust against particles missing in the B.Z. Addressing these issues boils down to understanding the r-matrix, the relations among the spaces involved, and the way of extracting observables.
8. Summary and Conclusions
This work surveys the definitions of the operator and the r-matrix, addressing why, although no matrix may satisfy the commutation relation , a matrix can still be assigned to . This involves a fundamental question: what is the defining principle of the r-matrix? Subject to that principle, we further ask whether one could find a CRM to substitute for the well-known diverging DRM, motivated by a belief: the matrix of a physical operator should not diverge. In the CRM to be derived, every element is finite, and the dimensions of the CRM are finite and arbitrary.
In
Section 2, we first introduce the math involved in defining
: Weyl algebra, which is characterized by the number of conjugated variable pairs, denoted as
. Then, we demonstrate that DRM does not satisfy
; indeed, no matrix can satisfy the commutation equation. Thus, a different principle for defining
the r-matrix is needed.
In
Section 3, we first show that first Weyl algebra
(the familiar substitution
) inevitably leads to divergence in the
r-matrix. Then, we show that the divergence could be resolved by
Nth Weyl algebra
to substitute for
. A key modification is (Equation (
59))
Note that
is merely a substitution of the generators of the Weyl algebra, not the entire modification, since the space for the generators to act upon must adjust, too (
Appendix F). For that, we introduce three spaces
,
, and
, on top of which a space
(
ribbon space, as declared later) for
to act on is constructed before we derive CRM. The construction is essentially about
map
, with which we are able to show, rigorously, that some often-used denotations, such as
,
are not accurate, because
,
,
and inner products cannot be defined between vectors belonging to different vector spaces.
In
Section 4, by acting
Nth Weyl algebra on the product space
, we obtain the explicit forms of CRM as Equation (
61). As two facets of CRM, the “
r-matrix” and the “reduced
r-matrix” are introduced separately, linked to Bloch space
and its quotient space
, respectively (Equations (
75) and (
76)). A corollary is achieved that geometric quantities (e.g., Berry connection) are associated with the quotient space
instead of Bloch space
. CRM and DRM are discussed with regard to what causes the divergence, how the divergence is resolved, why DRM is not a special case of CRM, etc.
In
Section 5, we show that matrices defined for position and spin operators display different properties in transformation and other aspects. As a consequence, two types of operators are recognized: a matrix operator (Definition 6) and a differential operator (Definition 7), which can be unified on a common platform, the “ribbon space”. The unification leads to fortuitous discoveries. For example, we show
, which subtly affects the well-known Berry curvature formula for polarization (Equations (
96) and (
109)). A designation system must adjust to suit the different transformation properties. We find that the ket/bra designations (perfectly workable for matrix operators) may encounter ambiguity for differential operators in certain situations (
Table 3).
In
Section 6, we extensively discuss the space that hosts CRM: ribbon space, in which ribbon transformation
is introduced in analogy with basis transformation
in a vector space. Particularly, we show how gauge transformation
and gauge symmetry make their entrance as a natural consequence of
. We give a formal definition for
(Definition 8) associated with a manifold
K and gauge symmetry U(
N). Noteworthy is the fact that, although labeled with “gauge invariance”, formulations can vary significantly in meaning depending on the distinct
K and associated gauge symmetries. We further show that
owes its origin to differential operators; on the other hand,
is only trivially defined for a matrix operator. We explain the relationships among ribbon transformation
, gauge transformation
, and basis transformation
. We address why U(1) gauge symmetry is necessary for an observable, and whether U(
N) gauge symmetry is necessary, too.
In
Section 7, we review the journey: from definitions of the position operator, to the various spaces involved, to the principles of defining CRM on these spaces, to the designation and symbolism, to observable extraction. Remarkably, setting out from the basic definition of
, a series of concepts (involving geometry, transport, gauge, etc.) emerge and become intertwined. We reveal two pathways with hinged aspects:
Accepting one pathway probably means one must accept the related aspects. Accurately speaking, we are not denying the diverging nature of DRM in the original space
but rather discover a potentially unique way of resolving the ambiguity (arising from this divergence) in defining the
r-matrix and in obtaining observable
r. This approach aligns harmoniously with existing arguments regarding the Berry connection, Wannier centers, and other related concepts, and will provide more.
Last but not least, we look at how the CRM, a seemingly abstract notion, relates to concrete applications in transport. The divergence of the r-matrix necessitates alternative ways of yielding based on different approximations, which unfortunately turn out to be non-unique, leading to diverse transport formalisms. On the other hand, resolving the divergence of the r-matrix gives a logically unique way of yielding and thus a unique direction for building transport theory. We stress that this does not indicate that the existing transport formalisms are incorrect, in view of the success each has achieved. Rather, they might correspond to different expansion limits of a certain unified theory. Moreover, understanding how one transport formalism crosses over to another should bring deeper physical insight, such as into the description of electronic states in crystals.