LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability

Mantela, Marilena; Simserides, Constantinos; Di Felice, Rosa

doi:10.3390/ma14174930

Open AccessArticle

LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability

by

Marilena Mantela

¹

,

Constantinos Simserides

^1,*

and

Rosa Di Felice

^2,3,*

¹

Department of Physics, National and Kapodistrian University of Athens, Panepistimiopolis, Zografos, GR-15784 Athens, Greece

²

Department of Physics and Astronomy and Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA

³

CNR-NANO Modena, I-41125 Modena, Italy

^*

Authors to whom correspondence should be addressed.

Materials 2021, 14(17), 4930; https://doi.org/10.3390/ma14174930

Submission received: 22 July 2021 / Revised: 22 August 2021 / Accepted: 23 August 2021 / Published: 30 August 2021

(This article belongs to the Special Issue Computational Modeling and Simulation of Polymers and Biopolymers)

Download

Browse Figures

Versions Notes

Abstract

To describe the molecular electronic structure of nucleic acid bases and other heterocycles, we employ the Linear Combination of Atomic Orbitals (LCAO) method, considering the molecular wave function as a linear combination of all valence orbitals, i.e., 2s, 2p

_{x}

, 2p

_{y}

, 2p

_{z}

orbitals for C, N, and O atoms and 1s orbital for H atoms. Regarding the diagonal matrix elements (also known as on-site energies), we introduce a novel parameterization. For the non-diagonal matrix elements referring to neighboring atoms, we employ the Slater–Koster two-center interaction transfer integrals. We use Harrison-type expressions with factors slightly modified relative to the original. We compare our LCAO predictions for the ionization and excitation energies of heterocycles with those obtained from Ionization Potential Equation of Motion Coupled Cluster with Singles and Doubles (IP-EOMCCSD)/aug-cc-pVDZ level of theory and Completely Normalized Equation of Motion Coupled Cluster with Singles, Doubles, and non-iterative Triples (CR-EOMCCSD(T))/aug-cc-pVDZ level of theory, respectively, (vertical values), as well as with available experimental data. Similarly, we calculate the transfer integrals between subsequent base pairs, to be used for a Tight-Binding (TB) wire model description of charge transfer and transport along ideal or deformed B-DNA. Taking into account all valence orbitals, we are in the position to treat deflection from the planar geometry, e.g., DNA structural variability, a task impossible for the plane Hückel approach (i.e., using only 2p

_{z}

orbitals). We show the effects of structural deformations utilizing a 20mer evolved by Molecular Dynamics.

Keywords:

charge transfer; DNA; nucleic acids; Linear Combination of Atomic Orbitals (LCAO); Molecular Dynamics (MD); Tight Binding (TB); heterocycles

1. Introduction

The study of the electronic structure of organic heterocyclic molecules has been of interest for the scientific community for decades, especially since the establishment of investigation methods based on quantum mechanics. This includes the electronic structure and properties of nucleic acid oligomers and polymers, DNA and RNA. The sequence of bases, adenine (A), thymine (T) or uracil (U), guanine (G), cytosine (C), is where genetic information is stored and transferred in all living organisms. The understanding of its electronic structure and charge transfer [1] properties is a crucial issue in biology, involved in functions such as damage and repair, carcinogenesis and mutagenesis [2,3,4], mutations and diseases [5,6,7,8] and is also important for novel applications in nanotechnology [9,10].

The last two decades have witnessed a surge of studies of DNA as the basis for molecular wires and molecular electronics devices/circuits, based on self-assembly and specific base hybridization [11,12,13,14,15]. The prospect of using DNA in materials science stems from exploiting its properties of molecular recognition, assembly, and processing information [11] as well as its ability to transfer or transport charge. Among other theoretical and experimental attempts, the electronic structure of single DNA molecules has been resolved by transverse scanning tunneling spectroscopy and assigned to groups of orbitals originating from the molecular entities, i.e., nucleobases, backbone, counterions [12]. Properties of long-range charge transport in DNA and DNA-mediated charge transfer and mechanisms have been studied a for a long time now [13]. Furthermore, currents in the range of 10–100 pA have been measured in G4-DNA over distances in the range of 10–100 nm [14]. Today, DNA plays an increasingly important role in molecular electronics due to its structural and molecular recognition properties [15].

In this work, we calculate the ionization and excitation energies of nucleic acid bases and similar molecules as well as assemblies of DNA bases using a semi-empirical Linear Combination of Atomic Orbitals (LCAO) method that includes all valence orbitals with a novel parameterization developed by us. Additionally, using this approach, we obtain electronic parameters for charge (electron or hole) transfer along DNA, which can be employed to model electron and hole conductivity. We investigate the electronic structure of the four DNA bases A, T, G, C and of the two Watson–Crick H-bonded pairs A-T and G-C. We focus on the HOMO (Highest Occupied Molecular Orbital) and LUMO (Lowest Unoccupied Molecular Orbital) wave functions and energies. With the new LCAO parameterization developed by us in this work, we calculate the transfer matrix elements between stacking base pairs, for all possible combinations between them, for both electrons and holes, aiming at parameterizing a Tight-Binding (TB) wire model. We calculate the transfer matrix elements for ideal geometries, namely for planar bases and base pairs separated and twisted approximately by 3.4 Å and 36

^{\circ}

, respectively, relative to the double helix growth axis. Our results are compared with published experimental and computational (from first principles and simpler TB models) data for the HOMO and LUMO energies. Finally, the deformed base pairs pruned from several snapshots of a 500 ns Molecular Dynamics (MD) trajectory of a 20mer [16] are used in order to address the effects of structural variability in the electronic structure and charge transfer properties of B-DNA within the LCAO approach.

The rest of this article is organized in the following way: In Section 2, we develop the novel LCAO parameterization that includes all valence orbitals for nucleic acid bases (Section 2.1) and base pairs (Section 2.2). This methodology is not limited to these specific molecular systems but can be applied to similar heterocycles. Next, we obtain the TB parameters that are relevant for a wire model description of charge transfer and transport along B-DNA (Section 2.3). We also describe non ideal bases and base pairs obtained by MD (Section 2.4). In Section 3, we present our results on ionization and excitation energies of various heterocyclic planar molecules, including isolated DNA bases (Section 3.1). The on-site energies of base pairs and transfer integrals between stacked base pairs are presented in Section 3.2. We study the effects of structural variability on the electronic structure and charge transfer properties of B-DNA in Section 3.3. Finally, Section 4, contains our overall conclusions.

2. Theory

2.1. LCAO with All Valence Orbitals for Nucleic Acid Bases or Similar Molecules

We consider the state

│ β 〉

of a nucleic acid base, or a similar molecule, as a linear combination of all valence orbital states

{│ ϕ}_{i ν} 〉

, i.e., 2s, 2p

_{x}

, 2p

_{y}

, 2p

_{z}

for C, N, and O atoms, and 1s for H atoms:

│ β 〉 = \sum_{ν = 1}^{N} \sum_{i = 1}^{I} c_{i ν} {│ ϕ}_{i ν} 〉 .

(1)

The index

ν

runs among all N atoms of the molecule and the index i runs among all I orbital states of each atom, respectively.

│ β 〉

obeys the Schrödinger equation

{\hat{H}}_{B} │ β 〉 = E_{B} │ β 〉 .

(2)

{\hat{H}}_{B}

is the Hamiltonian of the base (or other molecule), with eigenvalues

E_{B, k}

and eigenvectors

{│ β 〉}_{k}

. Taking the bracket, using

{〈 ϕ}_{j μ} │

, Equation (2) gives the linear system of equations

\sum_{ν = 1}^{N} \sum_{i = 1}^{I} [(H_{B, j μ i ν} - E_{B} S_{j μ i ν}) c_{i ν}] = 0, μ = 1, \dots, N, j = 1, \dots, I .

(3)

The Hamiltonian matrix elements

H_{B, j μ i ν}

are given by

H_{B, j μ i ν} = 〈 ϕ_{j μ} │ {\hat{H}}_{B} {│ ϕ}_{i ν} 〉

(4)

and the overlap matrix elements are

S_{j μ i ν} = 〈 ϕ_{j μ} {│ ϕ}_{i ν} 〉 \approx δ_{j μ i ν} .

(5)

We notice that we have approximated

S_{j μ i ν}

by

δ_{j μ i ν}

. The system of Equation (3) is solved by numerical diagonalisation, giving the eigenenergies

E_{B k}

and eigenvectors

│ β_{k} 〉 = [\begin{matrix} c_{11 k} \\ c_{12 k} \\ ⋮ \\ c_{i ν k} \\ ⋮ \\ c_{I N k} \end{matrix}] .

(6)

To this end we need the values of the Hamiltonian matrix elements,

H_{B, j μ i ν}

. Regarding the diagonal matrix elements

H_{B, i ν i ν}

—also known as on-site energies—we utilize a novel parameterization, namely:

E_{H (1 s)} = - 13.64

eV for H 1s orbitals,

E_{C (2 s)} = - 13.18

eV for C 2s orbitals,

E_{C (2 p)} = - 6.70

eV for C 2p orbitals,

E_{N (2 s)} = - 14.51

eV for N 2s orbitals,

E_{N (2 p)} = - 9.55

eV for N 2p orbitals,

E_{O (2 s)} = - 15.03

eV for O 2s orbitals,

E_{O (2 p)} = - 11.52

eV for O 2p orbitals. As for the nondiagonal matrix elements

H_{B, j μ i ν} (μ \neq ν)

referring to neighboring atoms, we utilize the Slater–Koster two-center interaction transfer integrals [17]

\begin{matrix} V_{ss} & = & V_{ss σ}, \end{matrix}

(7)

\begin{matrix} V_{sx} & = & ξ_{1} V_{sp σ}, \end{matrix}

(8)

\begin{matrix} V_{xx} & = & ξ_{1}^{2} V_{pp σ} + (1 - ξ_{1}^{2}) V_{pp π}, \end{matrix}

(9)

\begin{matrix} V_{xy} & = & ξ_{1} ξ_{2} (V_{pp σ} - V_{pp π}), \end{matrix}

(10)

with

ξ_{1}

,

ξ_{2}

being the directional cosines of

\vec{d} = \vec{j i}

which points from atom i to atom j. Concerning the values of

V_{ss σ}, V_{sp σ}, V_{pp σ}

,

V_{pp π}

, we use the relevant expressions proposed by Harrison [18,19], of the form:

V_{χ} = χ \frac{ℏ^{2}}{m d^{2}},

(11)

with m being the electron mass and d being the two-center distance. The

χ

values that we propose here are:

χ_{ss σ} = - 1.32

,

χ_{sp σ} = - 1.42

,

χ_{pp π} = - 0.73

(slightly modified relative to the original Harrison constant),

χ_{pp σ} = 2.22

. For each H orbital, the interactions are multiplied by a factor

b = 0.70

that resulted from the optimization. We arrived at the above parameterization after careful optimization by fitting the LCAO numerical results with the experimental values for the excitation and the ionization energies of nucleic acid bases A, G, T, C, and U. To do so, we used the Nelder–Mead algorithm as implemented in Matlab software. All other nondiagonal matrix elements, referring to non-neighboring atoms, are assumed equal to zero,

H_{B, j μ i ν} = 0

. In Table 1 and Table 2 we summarize our LCAO parameters.

From the numerical diagonalization of the Hamiltonian matrix, one obtains the energy eigenvalues corresponding to the electronic spectrum of molecular orbitals. The occupied and unoccupied orbitals—and thus the HOMO and LUMO—can be found by counting all valence electrons contributed by the atoms of the molecule and arranging them successively in couples of different spin in accordance with the Pauli principle. The same treatment developed for DNA bases is applicable to other purines, pyrimidines, and similar molecules.

2.2. LCAO with All Valence Orbitals for B-DNA Base Pairs

Likewise, we obtain the HOMO and LUMO states of a B-DNA base pair or monomer. Let us call

N_{1}

,

N_{2}

the number of atoms making up the two bases of the base pair. We consider the base pair or monomer state

│ α 〉

as a linear combination of all valence orbital states

{│ ϕ}_{i ν} 〉

, i.e., 2s, 2p

_{x}

, 2p

_{y}

, 2p

_{z}

for C, N and O atoms and 1s for H atoms:

│ α 〉 = \sum_{ν = 1}^{N_{1} + N_{2}} \sum_{i = 1}^{I} c_{i ν} {│ ϕ}_{i ν} 〉 .

(12)

The indexes

ν

and i run among the

N_{1} + N_{2}

atoms of the base pair and the I orbitals of each atom, respectively.

│ α 〉

obeys the Schrödinger Equation

{\hat{H}}_{A} │ α 〉 = E_{A} │ α 〉 .

(13)

│ α 〉

and

E_{A}

are the eigenvectors and eigenenergies of the monomer or base pair Hamiltonian

{\hat{H}}_{A}

. By taking the bracket, using

{〈ϕ}_{j μ} │

, Equation (13) gives the linear system of equations

\sum_{ν = 1}^{N_{1} + N_{2}} \sum_{i = 1}^{I} [(H_{A, j μ i ν} - E_{A} S_{j μ i ν}) c_{i ν}] = 0, μ = 1, \dots, N_{1} + N_{2}, j = 1, \dots, I .

(14)

The system of Equation (14) is solved by numerical diagonalisation, as well, giving the eigenenergies

E_{A k}

and eigenvectors

│ α_{k} 〉 = [\begin{matrix} c_{α 1 k} \\ c_{12 k} \\ ⋮ \\ c_{i ν k} \\ ⋮ \\ c_{I (N_{1} + N_{2}) k} \end{matrix}] .

(15)

In this case, the values of the Hamiltonian matrix elements,

H_{A, j μ i ν}

, are expressed slightly differently. The matrix elements

H_{A, j μ i ν}

with (a)

1 \leq ν \leq N_{1}

and

1 \leq μ \leq N_{1}

, and (b)

N_{1} + 1 \leq ν \leq N_{1} + N_{2}

and

N_{1} + 1 \leq μ \leq N_{1} + N_{2}

, are expressed in the same way as previously described for molecules. For the remaining matrix elements, we employ the Slater–Koster two-center interaction transfer integrals of Equations (7), (8), (9), (10) but in this case, the values of

V_{s s σ}, V_{s p σ}, V_{p p σ}, V_{p p π}

are of the form

V_{χ} = χ \frac{ℏ^{2}}{m d_{0}^{2}} e^{- \frac{2}{d_{0}} (d - d_{0})},

(16)

where

d_{0} = 1.35

Å is a typical covalent bond distance within a base. This difference stems from the fact that Harrison’s relations are valid for interatomic distances of the size of covalent bonds. However, the B-DNA bases (A and T, or G and C) are connected with noncovalent hydrogen bonds to form a base pair. The length of hydrogen bonds is longer than the typical length

d_{0}

of the covalent bond connecting neighboring atoms within a base. Thus, when dealing with interatomic distances of the size of hydrogen bonds and longer, Harrison’s expressions of Equation (11) are replaced with the appropriate exponentially decaying expressions of the form of Equation (16) [20,21,22].

From the aforementioned diagonalization of the Hamiltonian matrix, we obtain the energy eigenvalues

E_{A}

—including HOMO and LUMO—of the electronic spectrum, as well as the corresponding eigenvectors (coefficients)

c_{i ν}

of a base pair.

2.3. Coherent Charge Transfer and Transport Parameters for a TB Wire Model

2.3.1. Eigenstates

The HOMO or LUMO state of a DNA segment, made up of

N

monomers, can be expressed as

│ DNA 〉 = \sum_{α = 1}^{N} v_{α} │ α 〉 .

(17)

│ α 〉

is the HOMO or LUMO state of monomer (base pair)

α

and

v_{α}

are time-independent quantities. The Hamiltonian, in second quantization notation, in this TB wire model approach, can be written as

{\hat{H}}_{DNA} = \sum_{α = 1}^{N} E_{α} │ α 〉 〈α│ + \sum_{α = 1}^{N - 1} t_{α, α + 1} │ α 〉 〈α + 1 │ + \sum_{α = 2}^{N} t_{α, α - 1} │ α 〉 〈α - 1 │ .

(18)

E_{α}

is the HOMO or LUMO on-site energy of monomer

α

, and

t_{α, γ}

is the transfer integral between monomers

α

and

γ

. By substituting Equations (17) and (18) into the time-independent Schrödinger equation

{\hat{H}}_{DNA} │ DNA 〉 = E_{DNA} │ DNA 〉,

(19)

we arrive to a system of

N

coupled equations

E_{α} v_{α} + t_{α, α + 1} v_{α + 1} + t_{α, α - 1} v_{α - 1} = E_{DNA} v_{α} .

(20)

Equation (20) is equivalent to the eigenvalue-eigenvector problem

H_{DNA} \vec{v} = E_{D N A} \vec{v} .

(21)

H_{DNA}

is the Hamiltonian matrix of order

N

composed of the TB parameters (on-site energies and transfer integrals) and

\vec{v}

is the vector matrix composed of the coefficients

v_{j}

. The diagonalization of

H_{DNA}

leads to the determination of the HOMO or LUMO eigenenergy spectra (eigenspectra),

{E_{k}}

,

k = 1, 2, \dots, N

and of the occupation probabilities for each eigenstate,

| v_{j k} |^{2}

, where

v_{j k}

is the j-th component of the k-th eigenvector.

2.3.2. Coherent Charge Transfer

To describe charge transfer between stacked base pairs of double-stranded DNA, we suppose that an extra inserted electron travels through LUMOs, while an extra inserted hole travels through HOMOs. The time-dependent HOMO or LUMO state of the whole B-DNA segment,

│ DNA (t) 〉

, is considered as a linear combination of base-pair HOMO or LUMO states with time-dependent coefficients

│ DNA (t) 〉 = \sum_{α} A_{α} (t) │ α 〉,

(22)

where

│ α 〉

is the HOMO or LUMO state of the

α

-th monomer and the sum is extended over all monomers of the B-DNA segment. Substituting Equations (18) and (22) to the time-dependent Schrödinger equation

i ℏ \frac{d │ DNA (t) 〉}{d t} = {\hat{H}}_{DNA} │ DNA (t) 〉,

(23)

we obtain the system of

N

coupled differential equations:

i ℏ \frac{A_{α}}{d t} = E_{α} A_{α} + t_{α, α - 1} A_{α - 1} + t_{α, α + 1} A_{α + 1} .

(24)

Equation (24) is equivalent to a first-order matrix differential equation, which can be solved with the eigenvalue method.

2.3.3. Coherent Charge Transport

To handle coherent charge transport in a TB approach, we also need the TB parameters (on-site energies and transfer integrals) described above. This can be done, e.g., with a transfer matrix approach [23].

2.3.4. TB Parameters for a Wire Model Description

The TB parameters for a wire model description of charge transfer or transport can be obtained as follows. The transfer integral between monomers

│ λ 〉

and

│ λ^{'} 〉

t_{λ, λ^{'}} = 〈 λ │ {\hat{H}}_{D N A} │ λ^{'} 〉,

(25)

can be analyzed as

t_{λ, λ^{'}} = \sum_{ν = 1}^{N_{λ}} \sum_{i = 1}^{I_{λ}} \sum_{μ = 1}^{N_{λ^{'}}} \sum_{j = 1}^{I_{λ^{'}}} c_{i ν (λ)}^{*} V_{i ν j μ} c_{j μ (λ^{'})},

(26)

where

V_{i ν j μ} = 〈 ϕ_{i ν (λ)} │ {\hat{H}}_{DNA} │ ϕ_{j μ (λ^{'})} 〉 .

(27)

The matrix elements

V_{i ν j μ}

are given by the Slater–Koster two-center interaction transfer integrals of Equations (7)–(8) with the values of

V_{ss σ}, V_{sp σ}, V_{pp σ}, V_{pp π}

being of the form of Equation (16). The tight-binding parameters

E_{λ}

and

t_{λ, λ^{'}}

computed in this work could be used to treat charge transfer (Section 2.3.2) and transport (Section 2.3.3) along a B-DNA segment.

Finally, we obtain the maximum transfer percentage of the carrier from one base pair to another. This refers to the maximum probability to find the extra hole or electron at the site where it was not placed at initially. The maximum transfer percentage reads

p = \frac{{(2 t)}^{2}}{{(2 t)}^{2} + Δ^{2}}

(28)

where t is the transfer parameter between the two base pairs and

Δ

is the difference between the HOMO or LUMO energies of the two base pairs.

2.4. DNA Fragments Generated by MD

In order to study the effects of structural variability on the electronic structure and charge transfer parameters in B-DNA, we used multiple instances of AA and GG dimers. These instances were pruned from representative structures of the 500 ns MD trajectory of the 20mer

5^{'} -

CGAAAAGGGGAAAAGGGGAT

- 3^{'}

at constant temperature

T = 300

K and constant pressure

P = 1

bar. Specifically, we considered the centroid structures of the two most populated clusters, accounting for 35% (cl1) and 12% (cl2) of the whole trajectory. More details are given elsewhere [16]. From the two most representative 20mers we extracted all the possible AA and GG dimers (two stacked H-bonded base pairs), excluding the dimers of the edges. These dimers are denoted as: A4A5_cl1, A4A5_cl2, A5A6_cl1, A5A6_cl2, G7G8_cl1, G7G8_cl2, G8G9_cl1, G8G9_cl2, G9G10_cl1, G9G10_cl2, A11A12_cl1, A11A12_cl2, A12A13_cl1, A12A13_cl2, A13A14_cl1, A13A14_cl2, G15G16_cl1, G15G16_cl2, G16G17_cl1, G16G17_cl2. We denote the corresponding monomers as: A4_cl1, A4_cl2, A5_cl1, A5_cl2, A6_cl1, A6_cl2, G7_cl1, G7_cl2, G8_cl1, G8_cl2, G9_cl1, G9_cl2, G10_cl1, G10_cl2, A11_cl1, A11_cl2, A12_cl1, A12_cl2, A13_cl1, A13_cl2, A14_cl1, A14_cl2, G15_cl1, G15_cl2, G16_cl1, G16_cl2, G17_cl1, G17_cl2.

Local complementary base-pair parameters are employed in order to define the base pair structure and its variability. The parameters describing the relative translations in all axes, involving two bases of a Watson–Crick pair, are shear (

S x

), stretch (

S y

), and stagger (

S z

), while the corresponding rotations around x, y, and z axes are buckle (

κ

), propeller twist (

π

), and opening (

σ

) [24]. Figure 1 depicts the definitions of these translation and rotation parameters involving two bases of a Watson–Crick pair.

Figure 2 sketches the translation and rotation parameters for each one of the studied monomers. The parameters were computed using the web interface 3DNA. Dashed lines denote the mean value of each parameter, that is:

0.03

Å (shear),

- 0.03

Å (stretch),

0.04

Å (stagger),

6 . 53^{\circ}

(buckle),

- 10 . 40^{\circ}

(propeller twist),

1 . 06^{\circ}

(opening) for A-T monomers and

- 0.09

Å (shear),

- 0.04

Å (stretch),

0.01

Å (stagger),

0 . 55^{\circ}

(buckle),

- 1 . 13^{\circ}

(propeller twist),

- 0 . 66^{\circ}

(opening) for G-C monomers. These values together with values found in the literature are listed in Table 3.

3. Results and Discussion

3.1. Heterocyclic Planar Molecules including Nucleic Acid Bases

The theoretical scheme described in Section 2 was employed to calculate the HOMO and LUMO eigenenergies for a variety of heterocyclic planar organic molecules. We make the convenient simplifying assumption that the HOMO absolute value expresses the ionization energy, and the HOMO–LUMO gap expresses the excitation energy (in most cases the first

π

-

π^{*}

transition). Below, the ionization energies are of

π

molecular orbital character and the excitation energies are

π

-

π^{*}

transitions, unless otherwise stated. We studied the following groups of molecules: adenine and isomers; guanine and isomers; purine and isomers; thymine, cytosine, uracil, and isomers; pyrimidine and isomers; and other planar heterocyclic molecules. Table 4 summarizes our LCAO results using all valence orbitals, along with relevant experimental values.

I_{CC}

and

E_{CC}

are calculations of the vertical ionization energies at the Ionization Potential Equation of Motion Coupled Cluster with Singles and Doubles (IP-EOMCCSD)/aug-cc-pVDZ level of theory and vertical excitation energies at the Completely Renormalised Equation of Motion Coupled Cluster with Singles, Doubles, and non-iterative Triples (CR-EOMCCSD(T))/aug-cc-pVDZ level of theory, respectively, ref. [29].

Table 4 also includes transition oscillator strengths f that we calculated in a simplistic approximation, considering point contribution of the corresponding orbitals; i.e., the transition dipole moment

\vec{d}

was approximated as

\begin{matrix} \vec{d} & = (- e) 〈L│ \vec{r} │ H 〉 = (- e) (\sum_{ν = 1}^{N} \sum_{i = 1}^{I} c_{i ν L}^{*} {〈ϕ}_{i ν} │) \vec{r} (\sum_{μ = 1}^{N} \sum_{j = 1}^{I} c_{j μ H} {│ ϕ}_{j μ} 〉) \\ = (- e) \sum_{ν = 1}^{N} \sum_{i = 1}^{I} \sum_{μ = 1}^{N} \sum_{j = 1}^{I} c_{i ν L}^{*} c_{j μ H} {〈
ϕ}_{i ν} │ \vec{r} │ ϕ_{j μ 〉} ≃ (- e) \sum_{ν = 1}^{N} \sum_{i = 1}^{I} c_{i ν L}^{*} \vec{r_{i}} c_{i ν H}, \end{matrix}

(29)

where |L〉 (|H〉) is the LUMO (HOMO) state. The oscillator strength is [30]

f = \frac{2}{3} \frac{m}{e^{2} ℏ^{2}} E d^{2} .

(30)

E is the excitation energy. The results are illustrated in Figure 3, Figure 4 and Figure 5.

Regarding the ionization energy, the LCAO obtained results are in very good agreement with both the experimental data and the CC results, although there are some deviations. The Root Mean Square Percentage Error (RMSPE), with respect to the experimental values, is

3.65 %

. Differences in tautomer ionization energies are as expected negligible, that is

0.12

eV for purine tautomers and

0.01

eV for indazole tautomers. As for the excitation energies of the

π

-

π^{*}

transition, the RMSPE, with respect to the experimental values, is

6.49 %

. Both purine and indazole tautomers have a negligible

0.03

eV difference in their excitation energies. Based on the presented data and reported comments about individual bases, we note that the LCAO method used in this work, though not exact, is capable of producing results in a good agreement with experimental data, when choosing the suitable set of parameters. This outcome has motivated the use of the same method for all other systems of interest, whose computational results are presented in the remainder of this article. Vertical ionization energies of nucleic acid bases in the gas phase with different electronic structure methods are, generally, in agreement with our results, cf. Reference [51] and references therein.

3.2. B-DNA Base Pairs

In this subsection, we present our results for the B-DNA base pairs. In Table 5, we show the HOMO, LUMO, and HOMO–LUMO gap energies of the two B-DNA base pairs (Adenine (A)-Thymine (T) and Guanine (G)-Cytosine (C)), according to the procedure described in Section 2.3 using LCAO with all valence orbitals, along with the corresponding energies found in Ref. [52] using only 2p

_{z}

orbitals. At this point, we should state that the bases making up the base pairs are slightly deformed in comparison to their structure when isolated (cf. Section 3.1), so the corresponding HOMO and LUMO energies for these two cases may differ. Thus, Table 5 also contains the HOMO, LUMO, and HOMO–LUMO gap energies of the distorted bases. The HOMO (LUMO) energies are of

π

(

π^{*}

) molecular orbital character and the HOMO–LUMO gap energies are

π

-

π^{*}

transitions, unless otherwise stated.

The energy values for the bases are slightly different from those in Table 4, as expected. In addition, based on Table 5, one can assume that the HOMO energy of a particular base pair is very close to the largest of the HOMO energies of the two bases of the base pair, while the LUMO energy of the base pair is closer to the lowest of the two LUMO energies.

In Figure 6 and Figure 7 we represent the occupation probabilities of holes and electrons on each atomic orbital of bases and base pairs, calculating the squared coefficients

| c_{i ν} |^{2}

(cf. Equations (1) and (12)) of the corresponding states (HOMO for holes, LUMO for electrons). We observe that our calculated HOMO state for the base pair A-T (G-C) is localized almost totally in Adenine (Guanine), while the corresponding LUMO wave function is localized in Thymine (Cytosine), in accordance to results from ab initio techniques of References [53,54], which locate the HOMO of a base pair in purine and the LUMO in pyrimidine. This is due to the higher HOMO energy of Adenine (Guanine) and lower LUMO energy of Thymine (Cytosine) and the large values of these differences compared to the transfer integrals (see Table 6). We calculate the first transition character of A, T, A-T, and G to be

π

-

π^{*}

, while C and G-C have

π

-

σ^{*}

transition character.

We obtain the charge transfer parameters between two successive base pairs by calculating the corresponding overlap integrals from Equation (26). We denote by XY two successive base pairs, X-X

_{compl}

and Y-Y

_{compl}

. The bases X and Y are located at the same strand in the direction

5^{^{'}}

-

3^{^{'}}

, while X

_{compl}

and Y

_{compl}

, respectively, are their complementary bases on the other strand. In the most common B-DNA conformation, X-X

_{compl}

and Y-Y

_{compl}

are approximately separated by 3.4 Å and twisted by 36

^{\circ}

.

Table 6 summarizes our LCAO results using all valence orbitals for the transfer parameters, for all possible combinations of successive base pairs and close-to-ideal geometrical conformations. The Table also contains comparisons with other methods.

In Figure 8, we illustrate the absolute values of transfer parameters for all possible combinations of successive base pairs for holes and for electrons. The figure contains the transfer parameters obtained from our LCAO calculations using all valence orbitals, along with the corresponding parameters found in Ref. [55] (where various estimations from bibliography had been taken into account). Furthermore, those from Ref. [29], where only 2p

_{z}

orbitals had been used, and finally, electron transfer parameters from Ref. [52], where only 2p

_{z}

orbitals had been used. Peluso et al. [56], based on electrochemical and time-dependent spectroscopic measurements, find for GG a transfer integral ≈ 0.1 eV, which is very close to our results, while, for AA, they report a value ≈ 0.3 eV, which seems large compared to the parametrization reported here taking into account all valence orbitals as well as to the parametrization in Reference [55], which takes into account, for holes, the works [52,57,58,59,60,61].

In Figure 9, we depict the maximum transfer percentage of Equation (28) obtained by our LCAO calculations using all valence orbitals, compared to the values using parameters from Reference [55] for holes (an estimation from various articles from bibliography). Furthermore, from Reference [29] for electrons and holes as well as from Reference [52] for electrons (where only 2p

_{z}

orbitals had been used). For ideal B-DNA geometries and for dimers made of identical monomers, the maximum transfer percentage is 1, while in the case of different monomers, p is smaller than 1, both for holes and for electrons. Both for t and p, we observe that the current LCAO using all valence orbitals is closer to the results from Reference [55] for holes (where various estimations from bibliography of different origin had been taken into account). For electrons, as far as we know this current LCAO calculation is the only one beyond simple Hückel models, using only 2p

_{z}

orbitals.

3.3. Effects of Structural Variability

In this subsection, we analyze the effects of structural variability on the electronic structure and charge transfer properties of B-DNA using the fragments derived from MD, as detailed in Section 2.4. In Figure 10, we present the absolute values of the parameters

Δ

(difference between the HOMO eigenenergies of the two base pairs of each studied dimer) and t (transfer integral between the two base pairs’ HOMOs of each studied dimer), as well as the maximum transfer percentages p as calculated via Equation (28). The values of

| t |

and p can also be found in Reference [16] in comparison with results obtained by Density Functional Theory (DFT) techniques.

From Equation (28) it is expected that ideal dimers (made up of ideal monomers) should have a maximum transfer percentage equal to 1. However, by observing Figure 10, one can notice that not all AA and GG dimers have

p = 1

. Specifically, dimers with a p considerably different from unit (and a

Δ

different than zero) are: A11A12_cl2, A12A13_cl1, A121A13_cl2, A13A14_cl2, G15G16_cl1, and G16G17_cl1. This is expected because the studied monomers are not ideal, which means their consisting bases have relative translations and rotations (Figure 1) as depicted in Figure 2. More specifically, a small p value is related to a large

Δ

value, in accordance with Equation (28). Thus, it is expected that the structural parameters (shear, stretch, stagger, buckle, propeller twist, opening) have a reasonable effect on the HOMO (and LUMO) base-pair energy values and consequently on the values of

Δ

and p. As for the contribution of transfer integrals t to the above discussion, it is documented in Reference [16].

4. Conclusions and Outlook

In this work, we computed the tight-binding parameters that are necessary for a wire-model description of longitudinal (axial) charge transfer through B-DNA. We took into account structural variability by carrying out these computations for multiple structures resulting from a classical trajectory.

We initially calculated the lowest ionization and excitation energies of various “ideal” (frozen) heterocyclic organic molecules with a biological function, including the DNA and RNA bases and isomers. We did so employing the LCAO approximation in a new parameterization that accounts for all valence orbitals, i.e., 2s, 2p

_{x}

, 2p

_{y}

, 2p

_{z}

orbitals for C, N and O atoms and 1s orbital for H atoms. This LCAO approach is more suitable than the standard LCAO parameterization to investigate non-planar geometries. We predict ionization and excitation energies with RMSPE

3.65 %

and

6.49 %

, respectively, compared to the experimental values. Based on these errors, we infer that the proposed computational strategy is an adequate tool for a quick and relatively accurate estimation of the electronic structure for a variety of organic molecules.

Using the computed energies of the HOMO and LUMO within the proposed LCAO method, we then evaluated the energy levels of DNA base pairs (A-T, G-C) and the transfer integrals between stacked base pairs. Our results are in good agreement with reference data. The obtained transfer integrals can be used in further studies of charge transfer/transport in DNA oligomers and polymers.

Finally, we addressed the impact of structural flexibility (dynamics) on the electronic structure and charge transfer ability of B-DNA. To this end, we applied our LCAO method to 20 AA and GG dimers, extracted from representative structures in a classical MD trajectory of a 20mer evolved for 500 ns. For all these systems, we calculated the parameters

Δ

and t, as well as the maximum transfer percentage between the two monomers of a dimer p. We found that the values of

Δ

and p are significantly affected by geometrical changes. Nevertheless, in the vast majority of the studied dimers, the maximum transfer percentage is very close to unity.

We suggest that the proposed methodology can be used in a high-throughput manner to characterize dynamical effects on charge transfer in organic polymers constituted of heterocyclic building blocks.

Our cost-effective simple method is suitable for very fast computations of electronic structure and transfer integrals. It can greatly facilitate charge transfer and transport calculations in sequences of arbitrary geometry taken, e.g., by MD simulations, as far as purines, pyrimidines, and similar molecules are the constituents. Although we took only valence orbitals for carbon, nitrogen, oxygen, and hydrogen into account, this approach can be generalized to include other atomic species and orbitals.

Author Contributions

Conceptualization, C.S. and R.D.F.; methodology, M.M. and C.S. and R.D.F.; software, M.M., C.S. and R.D.F.; validation, C.S. and R.D.F.; formal analysis, M.M.; investigation, M.M.; resources, M.M. and C.S. and R.D.F.; data curation, M.M.; writing—original draft preparation, M.M.; writing—review and editing, M.M. and C.S. and R.D.F.; visualization, M.M.; supervision, C.S. and R.D.F.; project administration, C.S. and R.D.F.; funding acquisition, M.M. and C.S. and R.D.F. All authors have read and agreed to the published version of the manuscript.

Funding

M.M. wishes to thank the State Scholarships Foundation (IKY). This research is co-financed by Greece and the European Union (European Social Fund-ESF) through the operational programme “Human Resources Development, Education and Lifelong Learning” in the context of the project “Strengthening Human Resources Research Potential via Doctorate Research” (MIS-5000432), implemented by the State Scholarships Foundation (IKY). R.D.F. acknowledges the Center for Advanced Research Computing (CARC) for computing resources, the Zumberge fund, and the Women in Science and Engineering program at the University of Southern California. The APC is expected to be funded partially by C.S. referee vouchers and partially by the National and Kapodistrian University of Athens open access program.

Data Availability Statement

Data available upon reasonable request.

Acknowledgments

We thank K. Lambropoulos and A. Morphis for useful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

LCAO	Linear Combination of Atomic Orbitals
MD	Molecular Dynamics
DNA	Deoxyribonucleic Acid
TB	Tight Binding
IP-EOMCCSD	Ionization Potential Equation of Motion
	Coupled Cluster with Singles and Doubles
CR-EOMCCSD(T)	Completely Renormalized Equation Of Motion
	Coupled Cluster with Singles, Doubles, and non-Iterative Triples
RMSPE	Root Mean Square Percentage Error
HOMO	Highest Occupied Molecular Orbital
LUMO	Lowest Unoccupied Molecular Orbital

References

Kawai, K.; Majima, T. Hole Transfer Kinetics of DNA. Acc. Chem. Res. 2013, 46, 2616–2625. [Google Scholar] [CrossRef]
Dandliker, P.J.; Holmlin, R.E.; Barton, J.K. Oxidative Thymine Dimer Repair in the DNA Helix. Science 1997, 275, 1465–1468. [Google Scholar] [CrossRef]
Rajski, S.R.; Jackson, B.A.; Barton, J.K. DNA repair: Models for damage and mismatch recognition. Mutat. Res. 2000, 447, 49–72. [Google Scholar] [CrossRef]
Giese, B. Electron transfer through DNA and peptides. Bioorgan. Med. Chem. 2006, 14, 6139–6143. [Google Scholar] [CrossRef] [PubMed]
Shih, C.T.; Cheng, Y.Y.; Wells, S.A.; Hsu, C.L.; Römer, R.A. Charge transport in cancer-related genes and early carcinogenesis. Comput. Phys. Commun. 2011, 182, 36–38. [Google Scholar] [CrossRef]
Páez, C.J.; Schulz, P.A.; Wilson, N.R.; Römer, R.A. Robust signatures in the current–voltage characteristics of DNA molecules oriented between two graphene nanoribbon electrodes. New J. Phys. 2012, 14, 093049. [Google Scholar] [CrossRef]
Shih, C.T.; Roche, S.; Römer, R.A. Point-Mutation Effects on Charge-Transport Properties of the Tumor-Suppressor Gene p53. Phys. Rev. Lett. 2008, 100, 018105. [Google Scholar] [CrossRef]
Oliveira, J.I.N.; Albuquerque, E.L.; Fulco, U.L.; Mauriz, P.W.; Sarmento, R.G.; Caetano, E.W.S.; Freire, V.N. Conductance of single microRNAs chains related to the autism spectrum disorder. Europhys. Lett. 2014, 107, 68006. [Google Scholar] [CrossRef]
Wohlgamuth, C.H.; McWilliams, M.A.; Slinker, J.D. DNA as a Molecular Wire: Distance and Sequence Dependence. Anal. Chem. 2013, 85, 8634–8640. [Google Scholar] [CrossRef]
Lewis, F.D.; Wasielewski, M.R. Dynamics and efficiency of photoinduced charge transport in DNA: Toward the elusive molecular wire. Pure Appl. Chem. 2013, 85, 1379–1387. [Google Scholar] [CrossRef]
Seeman, N.C. DNA in a material world. Nat. Nanotechnol. 2003, 421, 427–431. [Google Scholar] [CrossRef]
Shapir, E.; Cohen, H.; Calzolari, A.; Cavazzoni, C.; Ryndik, D.; Cuniberti, G.; Kotlyar, A.; Di Felice, R.; Porath, D. Electronic structure of single DNA molecules resolved by transverse scanning tunnelling spectroscopy. Nat. Mater. 2008, 7, 68–74. [Google Scholar] [CrossRef] [PubMed]
Genereux, J.C.; Barton, J.K. Mechanisms for DNA Charge Transport. Chem. Rev. 2010, 110, 1642–1662. [Google Scholar] [CrossRef] [PubMed]
Livshits, G.; Stern, A.; Rotem, D.; Borovok, N.; Eidelshtein, G.; Migliore, A.; Penzo, E.; Wind, S.; Di Felice, R.; Skourtis, S.; et al. Long-range charge transport in single G-quadruplex DNA molecules. Nat. Nanotechnol. 2014, 9, 1040–1046. [Google Scholar] [CrossRef]
Wang, K. DNA-Based Single-Molecule Electronics: From Concept to Function. J. Funct. Biomater. 2018, 9, 8. [Google Scholar] [CrossRef]
Mantela, M.; Morphis, A.; Lambropoulos, K.; Simserides, C.; Di Felice, R. Effects of Structural Dynamics on Charge Carrier Transfer in B-DNA: A Combined MD and RT-TDDFT Study. J. Phys. Chem. B 2021, 125, 3986–4003. [Google Scholar] [CrossRef] [PubMed]
Slater, J.C.; Koster, G.F. Simplified LCAO Method for the Periodic Potential Problem. Phys. Rev. 1954, 94, 1498–1524. [Google Scholar] [CrossRef]
Harrison, W.A. Electronic Structure and the Properties of Solids: The Physics of the Chemical Bond, 2nd ed.; Dover: New York, NY, USA, 1989. [Google Scholar]
Harrison, W.A. Elementary Electronic Structure; World Scientific: River Edge, NJ, USA, 1999. [Google Scholar]
Menon, M.; Allen, R.E. Simulations of atomic processes at semiconductor surfaces: General method and chemisorption on GaAs(110). Phys. Rev. B 1988, 38, 6196–6205. [Google Scholar] [CrossRef] [PubMed]
Menon, M.; Subbaswamy, K.R. Nonorthogonal tight-binding molecular-dynamics study of silicon clusters. Phys. Rev. B 1993, 47, 12754–12759. [Google Scholar] [CrossRef] [PubMed]
Menon, M.; Connolly, J.; Lathiotakis, N.; Andriotis, A. Tight-binding molecular-dynamics study of transition-metal clusters. Phys. Rev. B 1994, 50, 8903–8906. [Google Scholar] [CrossRef]
Lambropoulos, K.; Simserides, C. Periodic, quasiperiodic, fractal, Kolakoski, and random binary polymers: Energy structure and carrier transport. Phys. Rev. E 2019, 99, 032415. [Google Scholar] [CrossRef] [PubMed]
Dickerson, R. Definitions and nomenclature of nucleic acid structure components. Nucleic Acids Res. 1989, 17, 1797–1803. [Google Scholar] [CrossRef]
Olson, W.K.; Bansal, M.; Burley, S.K.; Dickerson, R.E.; Gerstein, M.; Harvey, S.C.; Heinemann, U.; Lu, X.J.; Neidle, S.; Shakked, Z.; et al. A Standard Reference Frame for the Description of Nucleic Acid Base-pair Geometry. J. Mol. Biol. 2001, 313, 229–237. [Google Scholar] [CrossRef] [PubMed]
Lavery, R.; Moakher, M.; Maddocks, J.H.; Petkeviciute, D.; Zakrzewska, K. Conformational analysis of nucleic acids revisited: Curves+. Nucleic Acids Res. 2009, 37, 5917–5929. [Google Scholar] [CrossRef]
Mazur, J.; Jernigan, R.L. Comparison of Rotation Models for Describing DNA Conformations: Application to Static and Polymorphic Forms. Biophys. J. 1995, 68, 1472–1489. [Google Scholar] [CrossRef][Green Version]
Ussery, D. DNA Structure: A-, B- and Z-DNA Helix Families. Encycl. Life Sci. 2002, 1–7. [Google Scholar] [CrossRef]
Mantela, M.; Morphis, A.; Tassi, M.; Simserides, C. Lowest ionisation and excitation energies of biologically important heterocyclic planar molecules. Mol. Phys. 2016, 114, 709–718. [Google Scholar] [CrossRef]
Hilborn, R.C. Einstein coefficients, cross sections, f values, dipole moments, and all that. Am. J. Phys. 1982, 50, 982–986. [Google Scholar] [CrossRef]
Hush, N.; Cheung, A.S. Ionization potentials and donor properties of nucleic acid bases and related compounds. Chem. Phys. Lett. 1975, 34, 11–13. [Google Scholar] [CrossRef]
Clark, L.B.; Peschel, G.G.; Tinoco, I. Vapor Spectra and Heats of Vaporization of Some Purine and Pyrimidine Bases. J. Phys. Chem. 1965, 69, 3615–3618. [Google Scholar] [CrossRef]
Voet, D.; Gratzer, W.B.; Cox, R.A.; Doty, P. Absorption spectra of nucleotides, polynucleotides, and nucleic acids in the far ultraviolet. Biopolymers 1963, 1, 193–208. [Google Scholar] [CrossRef]
Santhosh, C.; Mishra, P. Electronic spectra of 2-aminopurine and 2,6-diaminopurine: Phototautomerism and fluorescence reabsorption. Spectrochim. Acta Part A Mol. Spectrosc. 1991, 47, 1685–1693. [Google Scholar] [CrossRef]
Clark, L.B.; Tinoco, I. Correlations in the Ultraviolet Spectra of the Purine and Pyrimidine Bases1. J. Am. Chem. Soc. 1965, 87, 11–15. [Google Scholar] [CrossRef]
Maier, J.P.; Muller, J.F.; Kubota, T. Ionisation Energies and the Electronic Structures of the N-oxides of diazabenzenes. Helv. Chim. Acta 1975, 58, 1634–1640. [Google Scholar] [CrossRef]
Kubota, T. Electronic Spectra and Electronic Structures of Some Basic Heterocyclic N-Oxides. Bull. Chem. Soc. Jpn. 1962, 35, 946–955. [Google Scholar] [CrossRef]
Gleiter, R.; Heilbronner, E.; Hornung, V. Photoelectron Spectra of Azabenzenes and Azanaphthalenes: I. Pyridine, diazines, s-triazine and s-tetrazine. Helv. Chim. Acta 1972, 55, 255–274. [Google Scholar] [CrossRef]
Pisanias, M.N.; Christophorou, L.G.; Carter, J.G.; McCorkle, D.L. Compound-negative-ion resonance states and threshold-electron excitation spectra of N-heterocyclic molecules: Pyridine, pyridazine, pyrimidine, pyrazine, and sym-triazine. J. Chem. Phys. 1973, 58, 2110–2124. [Google Scholar] [CrossRef]
Bolovinos, A.; Tsekeris, P.; Philis, J.; Pantos, E.; Andritsopoulos, G. Absolute vacuum ultraviolet absorption spectra of some gaseous azabenzenes. J. Mol. Spectrosc. 1984, 103, 240–256. [Google Scholar] [CrossRef]
Halverson, F.; Hirt, R.C. Near Ultraviolet Solution Spectra of the Diazines. J. Chem. Phys. 1951, 19, 711–718. [Google Scholar] [CrossRef]
Ramsey, B.G. Substituent effects on imidazole basicity and photoelectron spectroscopy determined ionization energies. J. Org. Chem. 1979, 44, 2093–2097. [Google Scholar] [CrossRef]
Caswell, D.S.; Spiro, T.G. Ultraviolet resonance Raman spectroscopy of imidazole, histidine, and Cu(imidazole)42+: Implications for protein studies. J. Am. Chem. Soc. 1986, 108, 6470–6477. [Google Scholar] [CrossRef]
Lichtenberger, D.L.; Copenhaver, A.S. Ionization band profile analysis in valence photoelectron spectroscopy. J. Electron Spectrosc. Relat. Phenom. 1990, 50, 335–352. [Google Scholar] [CrossRef]
Noyce, D.S.; Ryder, E.; Walker, B.H. The ultraviolet absorption spectra of substituted pyrazoles. J. Org. Chem. 1955, 20, 1681–1686. [Google Scholar] [CrossRef]
Gordon, R.D.; Yang, R.F. Vapor absorption spectra of benzoxazole, benzimidazole, and benzothiazole near 2850 Å. Can. J. Chem. 1970, 48, 1722–1729. [Google Scholar] [CrossRef]
Kovać, B.; Klasinc, L.; Stanovnik, B.; Tišler, M. Photoelectron spectroscopy of heterocycles. Azaindenes and azaindolizines. J. Heterocycl. Chem. 1980, 17, 689–694. [Google Scholar] [CrossRef]
Cané, E.; Trombetti, A.; Velino, B.; Caminati, W. Assignment of the 290-nm electronic band system of indazole [1,2-benzodiazole] as π^*-π by rotational band contour analysis. J. Mol. Spectrosc. 1992, 155, 307–314. [Google Scholar] [CrossRef]
Fuke, K.; Yoshiuchi, H.; Kaya, K.; Achiba, Y.; Sato, K.; Kimura, K. Multiphoton ionization photoelectron spectroscopy and two-color multiphoton ionization threshold spectroscopy on the hydrogen bonded phenol and 7-azaindole in a supersonic jet. Chem. Phys. Lett. 1984, 108, 179–184. [Google Scholar] [CrossRef]
Ilich, P. 7-Azaindole: The low-temperature near-UV/vis spectra and electronic structure. J. Mol. Struct. 1995, 354, 37–47. [Google Scholar] [CrossRef]
Pluhařová, E.; Slavíček, P.; Jungwirth, P. Modeling Photoionization of Aqueous DNA and Its Components. Acc. Chem. Res. 2015, 48, 1209–1217. [Google Scholar] [CrossRef]
Hawke, L.G.D.; Kalosakas, G.; Simserides, C. Electronic parameters for charge transfer along DNA. Eur. Phys. J. E 2010, 32, 291–305. [Google Scholar] [CrossRef] [PubMed]
Varsano, D.; Di Felice, R.; Marques, M.; Rubio, A. A TDDFT Study of the Excited States of DNA Bases and Their Assemblies. J. Phys. Chem. B 2006, 110, 7129–7138. [Google Scholar] [CrossRef]
Mallajosyula, S.S.; Datta, A.; Pati, S.K. Structure and electronic properties of the Watson–Crick base pairs: Role of hydrogen bonding. Synth. Met. 2005, 155, 398–401. [Google Scholar] [CrossRef]
Simserides, C. A systematic study of electron or hole transfer along DNA dimers, trimers and polymers. Chem. Phys. 2014, 440, 31–41. [Google Scholar] [CrossRef]
Peluso, A.; Caruso, T.; Landi, A.; Capobianco, A. The Dynamics of Hole Transfer in DNA. Molecules 2019, 24, 4044. [Google Scholar] [CrossRef]
Endres, R.G.; Cox, D.; Singh, R.R.P. Colloquium: The quest for high-conductance DNA. Rev. Mod. Phys. 2004, 76, 195–214. [Google Scholar] [CrossRef]
Voityuk, A.A.; Jortner, J.; Bixon, M.; Rösch, N. Electronic coupling between Watson–Crick pairs for hole transfer and transport in desoxyribonucleic acid. J. Chem. Phys. 2001, 114, 5614–5620. [Google Scholar] [CrossRef]
Migliore, A.; Corni, S.; Varsano, D.; Klein, M.L.; Di Felice, R. First Principles Effective Electronic Couplings for Hole Transfer in Natural and Size-Expanded DNA. J. Phys. Chem. B 2009, 113, 9402–9415. [Google Scholar] [CrossRef]
Kubař, T.; Woiczikowski, P.B.; Cuniberti, G.; Elstner, M. Efficient Calculation of Charge-Transfer Matrix Elements for Hole Transfer in DNA. J. Phys. Chem. B 2008, 112, 7937–7947. [Google Scholar] [CrossRef]
Ivanova, A.; Shushkov, P.; Rösch, N. Systematic Study of the Influence of Base-Step Parameters on the Electronic Coupling between Base-Pair Dimers: Comparison of A-DNA and B-DNA Forms. J. Phys. Chem. A 2008, 112, 7106–7114. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Definitions of translation parameters (top row) and rotation parameters (bottom row) involving two bases of a base pair.

Figure 2. Translation (shear, stretch, stagger) and rotation (buckle, propeller twist, opening) parameters for all studied monomers. Dashed lines denote the mean value of each parameter.

Figure 3. First

π

ionization energy and first

π

-

π^{*}

excitation energy of purines calculated via our LCAO method using all valence orbitals, along with results at the IP-EOMCCSD/aug-cc-pVDZ (vertical ionization energies) and CR-EOMCCSD(T)/aug-cc-pVDZ (vertical excitation energies) level of theory [29], as well as available experimental data. Different isomers are specified in Table 1.

Figure 3. First

π

ionization energy and first

π

-

π^{*}

excitation energy of purines calculated via our LCAO method using all valence orbitals, along with results at the IP-EOMCCSD/aug-cc-pVDZ (vertical ionization energies) and CR-EOMCCSD(T)/aug-cc-pVDZ (vertical excitation energies) level of theory [29], as well as available experimental data. Different isomers are specified in Table 1.

Figure 4. First

π

ionization energy and first

π

-

π^{*}

excitation energy of pyrimidines calculated via our LCAO method using all valence orbitals, along with results at the IP-EOMCCSD/aug-cc-pVDZ (vertical ionization energies) and CR-EOMCCSD(T)/aug-cc-pVDZ (vertical excitation energies) level of theory [29], as well as available experimental data.

Figure 4. First

π

ionization energy and first

π

-

π^{*}

excitation energy of pyrimidines calculated via our LCAO method using all valence orbitals, along with results at the IP-EOMCCSD/aug-cc-pVDZ (vertical ionization energies) and CR-EOMCCSD(T)/aug-cc-pVDZ (vertical excitation energies) level of theory [29], as well as available experimental data.

Figure 5. First

π

ionization energy and first

π

-

π^{*}

excitation energy of other planar heterocyclic molecules calculated via our LCAO method using all valence orbitals, along with results calculated at the IP-EOMCCSD/aug-cc-pVDZ (vertical ionization energies) and CR-EOMCCSD(T)/aug-cc-pVDZ (vertical excitation energies) level of theory [29], as well as available experimental data.

Figure 5. First

π

ionization energy and first

π

-

π^{*}

excitation energy of other planar heterocyclic molecules calculated via our LCAO method using all valence orbitals, along with results calculated at the IP-EOMCCSD/aug-cc-pVDZ (vertical ionization energies) and CR-EOMCCSD(T)/aug-cc-pVDZ (vertical excitation energies) level of theory [29], as well as available experimental data.

Figure 6. Occupation probabilities of each atomic orbital,

| c_{i ν} |^{2}

(cf. Equation (1)), for the HOMO (left) and LUMO (right) states of A and T bases into an A-T base pair (top), along with the corresponding probabilities (cf. Equation (12)) for the HOMO and LUMO states of the A-T base pair (bottom).

Figure 6. Occupation probabilities of each atomic orbital,

| c_{i ν} |^{2}

(cf. Equation (1)), for the HOMO (left) and LUMO (right) states of A and T bases into an A-T base pair (top), along with the corresponding probabilities (cf. Equation (12)) for the HOMO and LUMO states of the A-T base pair (bottom).

Figure 7. Occupation probabilities of each atomic orbital,

| c_{i ν} |^{2}

(cf. Equation (1)), for the HOMO (left) and LUMO (right) states of G and C bases into a G-C base pair (top), along with the corresponding probabilities (cf. Equation (12)) for the HOMO and LUMO states of the G-C base pair (bottom).

Figure 7. Occupation probabilities of each atomic orbital,

| c_{i ν} |^{2}

(cf. Equation (1)), for the HOMO (left) and LUMO (right) states of G and C bases into a G-C base pair (top), along with the corresponding probabilities (cf. Equation (12)) for the HOMO and LUMO states of the G-C base pair (bottom).

Figure 8. The absolute values of transfer parameters for all possible combinations of successive base pairs for holes (left) and for electrons (right). We show the transfer parameters obtained from our LCAO calculations using all valence orbitals, as well as the corresponding transfer parameters found in Reference [55] (for holes, estimation from various articles in bibliography), in Reference [29] (using only 2p

_{z}

orbitals) and in Reference [52] (for electrons, using only 2p

_{z}

orbitals).

Figure 8. The absolute values of transfer parameters for all possible combinations of successive base pairs for holes (left) and for electrons (right). We show the transfer parameters obtained from our LCAO calculations using all valence orbitals, as well as the corresponding transfer parameters found in Reference [55] (for holes, estimation from various articles in bibliography), in Reference [29] (using only 2p

_{z}

orbitals) and in Reference [52] (for electrons, using only 2p

_{z}

orbitals).

Figure 9. Comparison of the maximum transfer percentage p obtained by our LCAO method using all valence orbitals, with the p values extracted from other sources: obtained from parameters found in Reference [55] (for holes, estimation from various articles in bibliography), in Reference [29] (using only 2p

_{z}

orbitals) and in Reference [52] (for electrons, using only 2p

_{z}

orbitals). Left panel for holes, right panel for electrons.

Figure 9. Comparison of the maximum transfer percentage p obtained by our LCAO method using all valence orbitals, with the p values extracted from other sources: obtained from parameters found in Reference [55] (for holes, estimation from various articles in bibliography), in Reference [29] (using only 2p

_{z}

orbitals) and in Reference [52] (for electrons, using only 2p

_{z}

orbitals). Left panel for holes, right panel for electrons.

Figure 10. The parameters

| Δ |

and

| t |

, as well as the maximum transfer percentage p for all the dimers of the MD oligomer.

Figure 10. The parameters

| Δ |

and

| t |

, as well as the maximum transfer percentage p for all the dimers of the MD oligomer.

Table 1. Diagonal matrix elements also known as on-site energies, in our LCAO parameterization (eV).

$E_{H (1 s)}$	$E_{C (2 s)}$	$E_{C (2 p)}$	$E_{N (2 s)}$	$E_{N (2 p)}$	$E_{O (2 s)}$	$E_{O (2 p)}$
$- 13.64$	$- 13.18$	$- 6.70$	$- 14.51$	$- 9.55$	$- 15.03$	$- 11.52$

Table 2.

χ

values of Harrison-type expressions for nondiagonal matrix elements, utilizing Slater–Koster two-center interaction transfer integrals, and the correction factor for interactions involving H atoms, in our LCAO parameterization.

Table 2.

χ

values of Harrison-type expressions for nondiagonal matrix elements, utilizing Slater–Koster two-center interaction transfer integrals, and the correction factor for interactions involving H atoms, in our LCAO parameterization.

$χ_{ss σ}$	$χ_{sp σ}$	$χ_{pp π}$	$χ_{pp σ}$	b
$- 1.32$	$- 1.42$	$- 0.73$	$2.22$	$0.70$

Table 3. The second and third column contain mean values of translation and rotation parameters for monomers A-T and G-C, as studied in the present work. Other columns list values from bibliography.

Parameter	A-T	G-C	[25]	[26]	[27]	[28]
shear (Å)	$0.03$	$- 0.09$	$0.00$	$- 0.04$
stretch (Å)	$- 0.03$	$- 0.04$	$- 0.15$	$- 0.17$
stagger (Å)	$0.04$	$0.01$	$0.09$	$0.21$
buckle ( $^{\circ}$ )	$6.53$	$0.55$	$0.5$	$0.3$	$(- 7.5, 7.5)$
propeller twist ( $^{\circ}$ )	$- 10.40$	$- 1.13$	$- 11.4$	$- 13.7$	$11.5$	$- 12.60 \pm 3.2$
opening ( $^{\circ}$ )	$1.06$	$- 0.66$	$0.6$	$1.0$	$(- 2, 2)$

Table 4. Ionization and excitation energies (eV).

I_{LCAO}

and

E_{LCAO}

are the ionization and excitation energies obtained by our LCAO scheme, including all valence orbitals.

f_{LCAO}

is the relevant oscillator strength.

I_{CC}

and

E_{CC}

are the energies calculated at the IP-EOMCCSD/aug-cc-pVDZ and CR-EOMCCSD(T)/aug-cc-pVDZ level of theory [29].

I_{\exp}

and

E_{\exp}

are the experimental data. In parentheses, the character of the transition.

Table 4. Ionization and excitation energies (eV).

I_{LCAO}

and

E_{LCAO}

are the ionization and excitation energies obtained by our LCAO scheme, including all valence orbitals.

f_{LCAO}

is the relevant oscillator strength.

I_{CC}

and

E_{CC}

are the energies calculated at the IP-EOMCCSD/aug-cc-pVDZ and CR-EOMCCSD(T)/aug-cc-pVDZ level of theory [29].

I_{\exp}

and

E_{\exp}

are the experimental data. In parentheses, the character of the transition.

Name Formula	$I_{LCAO}$	$E_{LCAO}$	$f_{LCAO}$	$I_{CC}$	$E_{CC}$	$I_{\exp}$	$E_{\exp}$
Adenine
$C_{5} H_{5} N_{5}$	8.44	4.20	0.330	8.23	5.04	8.44 [31]	4.84 [32,33]
(Isomer 1)
2-Aminopurine
$C_{5} H_{5} N_{5}$	8.56	3.84	0.239	7.95	4.27		4.11 [34]
(Isomer 2)
1H-pyrazolo[3,4-d]
pyrimidin-4-amine
$C_{5} H_{5} N_{5}$	8.78	4.25	0.328	8.51	4.92
(Isomer 3)
Pyrimido [5,4-e]-as-
triazine, 1,2-dihydro-
$C_{5} H_{5} N_{5}$	8.04	3.21	0.282	7.18	3.16
(Isomer 4)
Guanine					4.77 ( $π \to σ^{*}$ )
$C_{5} H_{5} N_{5} O$	8.36	4.25	0.288	7.83	4.85	8.24 [31]	4.51 [33]
(Isomer 1)
7-Amino-S-triazolo(1,5-a)
pyrimidin-5(4H)-one
$C_{5} H_{5} N_{5} O$	8.42	4.37	0.285	8.60	4.91
(Isomer 2)
Pyrimido[5,4-e]-as-triazin-
5[6h]-one, 1,2-dihydro-
$C_{5} H_{5} N_{5} O$	8.19	3.42	0.198	6.68	2.54
(Isomer 3)
7H-imidazo[4,5-d]-v
triazin-4-one, 6-methyl-					4.47 ( $n \to σ^{*}$ )
$C_{5} H_{5} N_{5} O$	8.93	3.64	0.302	8.92	4.55
(Isomer 4)
9H-purine					4.49 ( $n \to π^{*}$ )		4.28 [35] ( $n \to π^{*}$ )
$C_{5} H_{4} N_{4}$	9.20	4.40	0.313	9.34	4.92	9.52 [31]	4.68 [35]
(Isomer 1)
7H-purine				9.34 (n)	4.36 ( $n \to π^{*}$ )
$C_{5} H_{4} N_{4}$	9.08	4.26	0.295	9.40	4.79
(Isomer 1 taut.)
1H-1,2,3-triazolo
[4,5-b]pyridine					4.49
$C_{5} H_{4} N_{4}$	9.42	4.12	0.340	9.41	4.54
(Isomer 2)
[1,2,4]Triazolo
[1,5-a]pyrazine
$C_{5} H_{4} N_{4}$	8.95	4.20	0.230	9.27	4.63
(Isomer 3)
[1,2,3]Triazolo
[1,5-a]pyrazine
$C_{5} H_{4} N_{4}$	8.64	3.96	0.172	8.95	4.31
(Isomer 4)
Formula	$I_{LCAO}$	$E_{LCAO}$	$f_{LCAO}$	$I_{CC}$	$E_{CC}$	$I_{\exp}$	$E_{\exp}$
Thymine					5.07 ( $n \to π^{*}$ )
$C_{5} H_{6} N_{2} O_{2}$	9.09	4.77	0.316	9.03	5.17	9.14 [31]	4.69 [33]
Cytosine
$C_{4} H_{5} N_{3} O$	8.68	4.54	0.306	8.67	4.64	8.94 [31]	4.64 [33]
Uracil					5.03 ( $n \to π^{*}$ )
$C_{4} H_{4} N_{2} O_{2}$	8.89	4.70	0.286	9.44	5.27	9.50 [31]	4.79 [33,35]
(Isomer 1)
Pyrazine, 1,4-dioxide
$C_{4} H_{4} N_{2} O_{2}$	8.77	4.28	0.403	8.11	3.30	8.33 [36]	4.05 [37]
(Isomer 2)
4(1H)-pyrimidinone,
6-hydroxy-
$C_{4} H_{4} N_{2} O_{2}$	9.01	4.95	0.103	9.66	5.29
(Isomer 3)
Maleic hydrazide
$C_{4} H_{4} N_{2})_{2}$	8.77	3.34	0.113	8.77	4.11
(Isomer 4)
Pyrazine				9.49 (n)	4.07 ( $n \to π^{*}$ )	9.63 [38]	4.20 [39]
$C_{4} H_{4} N_{2}$	9.53	4.39	0.258	10.09	4.88	10.18 [38]	4.79 [40,41]
(Isomer 1)
Pyrimidine					4.41 ( $n \to π^{*}$ )		4.35 [39]
$C_{4} H_{4} N_{2}$				9.56 (n)	4.84 ( $n \to π^{*}$ )	9.73 [38]	4.62 [40]
(Isomer 2)	9.98	5.28	0.249	10.44	5.25	10.41 [38]	5.13 [33,35,40,41]
Pyridazine					3.76 ( $n \to π^{*}$ )		3.70 [39]
$C_{4} H_{4} N_{2}$	9.41 (n)	4.28	0.000 ( $n \to π^{*}$ )	9.07 (n)	4.47 ( $n \to π^{*}$ )	9.31 [38]
(Isomer 3)	10.39	5.26	0.253	10.59	5.12	10.61 [38]	5.00 [41]
1H-imidazole		4.97	0.000 ( $π \to σ^{*}$ )		5.50 ( $π \to σ^{*}$ )
$C_{3} H_{4} N_{2}$	8.80	5.77	0.171	8.90	6.29	8.96 [42]	5.99 [43]
(Isomer 1)
1H-pyrazole		5.69	0.000 ( $π \to σ^{*}$ )
$C_{3} H_{4} N_{2}$	9.69	5.90	0.000 ( $π \to σ^{*}$ )		6.11 ( $π \to σ^{*}$ )
(Isomer 2)	9.48	5.97	0.196	9.35	6.25	9.38 [44]	5.90 [45]
1H-benzimidazole
$C_{7} H_{6} N_{2}$	8.84	4.63	0.245	8.40	4.67	8.44 [42]	4.47 [46]
(Isomer 1)
1H-indazole
$C_{7} H_{6} N_{2}$	8.41	3.85	0.217	8.26	4.50	8.35 [47]	4.27 [48]
(Isomer 2)
2H-indazole
$C_{7} H_{6} N_{2}$	8.42	3.84	0.229	7.90	4.54
(Isomer 2 taut.)
1H-pyrrolo[2,3-b]
pyridine
$C_{7} H_{6} N_{2}$	8.47	3.82	0.184	8.17	4.50	8.11 [49]	4.28 [50]
(Isomer 3)

Table 5. HOMO (

E_{LCAO, H}

) and LUMO (

E_{LCAO, L}

) eigenenergies of the base pairs A-T and G-C, obtained in this work using LCAO with all valence orbitals, along with the corresponding HOMO–LUMO energy gaps (

E_{LCAO, g}

) in eV (rows 6 and 7). Rows 2–5 contain the calculated HOMO and LUMO energies of each distorted base making up these base pairs. The third, fifth, and the seventh columns list the corresponding energies from Reference [52] where only 2p

_{z}

orbitals had been used.

Table 5. HOMO (

E_{LCAO, H}

) and LUMO (

E_{LCAO, L}

) eigenenergies of the base pairs A-T and G-C, obtained in this work using LCAO with all valence orbitals, along with the corresponding HOMO–LUMO energy gaps (

E_{LCAO, g}

) in eV (rows 6 and 7). Rows 2–5 contain the calculated HOMO and LUMO energies of each distorted base making up these base pairs. The third, fifth, and the seventh columns list the corresponding energies from Reference [52] where only 2p

_{z}

orbitals had been used.

Base or Base Pair	$E_{LCAO, H}$	$E_{H}$ [52]	$E_{LCAO, L}$	$E_{L}$ [52]	$E_{LCAO, g}$	$E_{g}$ [52]
A	$- 8.50$	$- 8.30$	$- 4.19$	$- 4.40$	$4.31$	$3.90$
T	$- 9.12$	$- 9.00$	$- 4.30$	$- 4.90$	$4.82$	$4.10$
G	$- 8.31$	$- 8.00$	$- 4.12$	$- 4.50$	$4.19$	$3.50$
			$- 4.43$ ( $σ^{*}$ )		$4.24$ ( $π \to σ^{*}$ )
C	$- 8.67$	$- 8.80$	$- 4.11$	$- 4.30$	$4.56$	$4.50$
A-T	$- 8.49$	$- 8.30$	$- 4.31$	$- 4.90$	$4.18$	$3.40$
			$- 4.43$ ( $σ^{*}$ )		$3.87$ ( $π \to σ^{*}$ )
G-C	$- 8.30$	$- 8.00$	$- 4.14$	$- 4.50$	$4.16$	$3.50$

Table 6. Close-to-ideal geometrical conformations. The absolute values of transfer parameters for all possible combinations of successive base pairs.

| t_{LCAO, H} |

(

| t_{LCAO, L} |

) of the second (fifth) column refer to hole (electron) transfer parameters obtained from our LCAO calculations using all valence orbitals. The third column lists hole transfer parameters of Reference [55], an estimation from various articles found in bibliography. The sixth column lists the electron transfer parameters of Reference [52], where only 2p

_{z}

orbitals had been used. The fourth and seventh columns list the transfer parameters with the parameterization of Reference [29], where only 2p

_{z}

orbitals had been used. All transfer parameters are given in meV.

Table 6. Close-to-ideal geometrical conformations. The absolute values of transfer parameters for all possible combinations of successive base pairs.

| t_{LCAO, H} |

(

| t_{LCAO, L} |

) of the second (fifth) column refer to hole (electron) transfer parameters obtained from our LCAO calculations using all valence orbitals. The third column lists hole transfer parameters of Reference [55], an estimation from various articles found in bibliography. The sixth column lists the electron transfer parameters of Reference [52], where only 2p

_{z}

orbitals had been used. The fourth and seventh columns list the transfer parameters with the parameterization of Reference [29], where only 2p

_{z}

orbitals had been used. All transfer parameters are given in meV.

XY	$\| t_{LCAO, H} \|$	$\| t_{H} \|$ [55]	$\| t_{H} \|$ [29]	$\| t_{LCAO, L} \|$	$\| t_{L} \|$ [55]	$\| t_{L} \|$ [29]
				92 ( $σ^{*}$ )
GG, CC	116	100	51	2	20	8
				11 ( $σ^{*}$ )
AG, CT	37	30	32	11	3	10
				2 ( $σ^{*}$ )
TG, CA	28	10	4	9	17	10
				1 ( $σ^{*}$ )
AC, GT	16	10	3	1	32	23
				3 ( $σ^{*}$ )
TC, GA	142	110	57	6	1	7
AA, TT	38	20	32	22	29	17
AT	50	35	6	1	1	1
TA	37	50	10	2	2	1
				2 ( $σ^{*}$ )
GC	10	10	10	19	10	19
				1 ( $σ^{*}$ )
CG	75	50	13	9	8	13

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mantela, M.; Simserides, C.; Di Felice, R. LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability. Materials 2021, 14, 4930. https://doi.org/10.3390/ma14174930

AMA Style

Mantela M, Simserides C, Di Felice R. LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability. Materials. 2021; 14(17):4930. https://doi.org/10.3390/ma14174930

Chicago/Turabian Style

Mantela, Marilena, Constantinos Simserides, and Rosa Di Felice. 2021. "LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability" Materials 14, no. 17: 4930. https://doi.org/10.3390/ma14174930

APA Style

Mantela, M., Simserides, C., & Di Felice, R. (2021). LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability. Materials, 14(17), 4930. https://doi.org/10.3390/ma14174930

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

LCAO Electronic Structure of Nucleic Acid Bases and Other Heterocycles and Transfer Integrals in B-DNA, Including Structural Variability

Abstract

1. Introduction

2. Theory

2.1. LCAO with All Valence Orbitals for Nucleic Acid Bases or Similar Molecules

2.2. LCAO with All Valence Orbitals for B-DNA Base Pairs

2.3. Coherent Charge Transfer and Transport Parameters for a TB Wire Model

2.3.1. Eigenstates

2.3.2. Coherent Charge Transfer

2.3.3. Coherent Charge Transport

2.3.4. TB Parameters for a Wire Model Description

2.4. DNA Fragments Generated by MD

3. Results and Discussion

3.1. Heterocyclic Planar Molecules including Nucleic Acid Bases

3.2. B-DNA Base Pairs

3.3. Effects of Structural Variability

4. Conclusions and Outlook

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI