Statistical Mechanical Treatments of Protein Amyloid Formation

Schreck, John S.; Yuan, Jian-Min

doi:10.3390/ijms140917420

Open Access Review

by John S. Schreck and Jian-Min Yuan *

Department of Physics, Drexel University, Philadelphia, PA 19104, USA

* Author to whom correspondence should be addressed.

Int. J. Mol. Sci. 2013, 14(9), 17420-17452; https://doi.org/10.3390/ijms140917420

Submission received: 27 June 2013 / Revised: 5 August 2013 / Accepted: 9 August 2013 / Published: 23 August 2013

(This article belongs to the Special Issue Molecular Self-Assembly 2012)

Abstract:

Protein aggregation is an important field of investigation because it is closely related to the problem of neurodegenerative diseases, to the development of biomaterials, and to the growth of cellular structures such as the cytoskeleton. Self-aggregation of protein amyloids, for example, is a complicated process involving many species and levels of structure. This complexity, however, can be dealt with using statistical mechanical tools, such as free energies, partition functions, and transfer matrices. In this article, we review general strategies for studying protein aggregation using statistical mechanical approaches and show that canonical and grand canonical ensembles can be used in such approaches. The grand canonical approach is particularly convenient since competing pathways of assembly and disassembly can be considered simultaneously. Another advantage of using statistical mechanics is that numerically exact solutions can be obtained for all of the thermodynamic properties of fibrils, such as the amount of fibrils formed, as a function of initial protein concentration. Furthermore, statistical mechanics models can be used to fit experimental data when they are available for comparison.

Keywords:

protein aggregation; protein amyloid; statistical mechanics; partition function; transfer matrix

1. Introduction

Protein aggregation is an active, multidisciplinary science, with researchers and practitioners working in broad disciplines, including biophysics, medicine, biomaterials, and pharmaceuticals. With diverse perspectives, it is not surprising that the papers on protein aggregation differ widely in their emphasis and methodologies: from fundamental research related to molecular mechanisms and aggregation pathways to searching for biomarkers, drug targets, even to imaging of plaques in the brain, dissolution of fibrils and amyloids in vivo, etc. The present article is only concerned with the fundamental investigations into the aggregation mechanisms of amyloid formation related to neurodegenerative disease. Many review articles have been written on the approaches based on molecular dynamical simulations [1–4], as well as kinetic studies [5–7]. In our examination, we will focus instead on the statistical mechanical approaches to equilibrium assembly processes and present some new results while summarizing past approaches.

Early applications of statistical mechanical methods to the studies of protein problems can best be represented by the treatment of helix-coil transitions in proteins by Zimm and Bragg in the 1950s [8–13]. They assumed that each peptide bond linking amino acid residues together can exist in two states: a helical or non-helical state, and characterized the linear chain of residues with a partition function, which is a sum of all possible combinations of states.

Analogous to a one-dimensional Ising model [14], Zimm and Bragg expressed the partition function in terms of transfer matrices and solved the problem analytically in the large polymerization limit, and also for finite chains [8]. Over the years, researchers have extended the original Ising-type models to study sheet-coil [15–21] and helix-sheet [21–23] transitions in proteins, as well as helix-coil [24], sheet-coil [25], and helix-sheet-coil [23,26] transitions in protein aggregates.

It is in a similar spirit that our statistical mechanical treatment of protein aggregation has been developed [21], which is the main subject of this article. Our formalism of the aggregation processes was stimulated by other statistical mechanical studies. These works will be briefly reviewed in Section 3, along with conceptual developments of statistical mechanical techniques beyond those used in the Zimm-Bragg model. In Sections 4–6, canonical ensemble and grand canonical ensemble treatments of the aggregation processes will be separately presented, which is followed by a conclusion section. In Section 2, we first review some properties of amyloid proteins and aggregates.

2. Amyloid Aggregation

To illustrate the complexity of protein aggregation processes, we use β-amyloid as an example. Under suitable conditions, such as high concentration, amyloid monomers can aggregate into dimers, trimers, tetramers, . . ., oligomers. These co-existing oligomers are in rapid kinetic equilibrium, making it difficult to determine their structures or numbers [27]. Oligomers of the same size may exist in different conformations: some are partially ordered and some are disordered. As soluble oligomers grow larger they can become richer in β-sheet structures, but overall they lack secondary structure. They may resemble micelles, which have a spherical or cylindrical shape [28]. Aβ oligomers vary in size, with diameters ranging from 5–15 nm and molar masses ranging from 20–50 kDa up to 1 MDa [29,30]. Additionally, oligomers with ordered β-sheet or β-hairpin structures are believed to form protofibrils at a higher rate than their disordered counterparts [3,31–33]. For example, Li et al. [3] showed in numerical simulations that a native chain has to unfold partially into an intermediate with β-hairpin structure before an ordered assembly can be formed. Since protein aggregation is thought to be a nucleation process, some of the ordered oligomers may act as paranuclei [34]. Once formed, paranuclei can lead to the formation of protofibrils in a downhill fashion. The nucleation is illustrated in Figure 1b,c. Since Aβ oligomers may contain β-sheet structure, they could be on-pathway intermediates [28]. However, the same cannot be said about other oligomers, such as those comprised of prion proteins [35]. Additionally, Aβ(1-40) and Aβ(1-42) monomers may self-associate to form off-pathway globular assemblies including amylospheroids [34,36] and β-amyloid balls [37] [formed by Aβ(1-40) only]. Both of these structures can grow to be quite large.

Protofibrillar intermediates are heterogeneous, metastable aggregates already containing β-sheet regions in the core [38], but retaining some features that are similar to oligomers. The term “protofibril” has varying definitions throughout the literature, where for Aβ the term could refer to structures ranging from 4–11 nm in diameter, up to 200 nm in length, and possibly even longer [39,40]. Protofibrils are considered to be on-pathway during fibrillogenesis, and they could grow larger via monomer addition or by merging with other oligomers or protofibrils [41,42]. As protofibrils grow longer, the β-sheet region grows larger. Eventually, a stable and tight β-sheet network is formed in the core by the backbone H-bonds and the hydrophobic interactions of side chains [38]. Figure 2 illustrates the cross-β structure of fibrils comprised of the prion Sup35. These features are generally associated with protofibrils and fibrils [38]. Another early intermediate thought to play a role in the formation of fibrils is the Aβ protofilament, which can range in diameter from 2.5 nm up to about 6 nm [43] and in length from 50–100 nm [44]. Several protofilaments may then merge and form fibrils, which may exhibit a twisted, helical ribbon structure [45] and contain highly-ordered β-sheet regions. A cartoon illustration of an Aβ protofilament is shown in Figure 3. The fibrils may even be composed of several segments with distinct morphologies and varying levels of ordered structure [46]. Additionally, mature fibrils have a diameter ranging from 7–12 nm, and may grow longer than 1 μm [45]. Typically, fibrils are linear, non-branching structures. They contain very large amounts of β-structure, and are generally insoluble. Fibrils can further assemble into bundles [47], and may form plaques outside the neurons. The total assembly process from monomers to fibrils for a simple 1D model is illustrated in Figure 1a, and models for Aβ(1-42) and Aβ(1-40) fibrils are illustrated in Figure 1b and 1c, respectively.

3. Statistical Mechanical Approaches to Protein Folding and Aggregation

One of the earlier applications of statistical mechanical methods to protein systems is the Zimm-Bragg model [8–10,12], originally developed for the study of helix-coil transitions in proteins. Although the problems of macromolecular self-assembly that we are dealing with are quite different from conformational changes in proteins, the lessons that can be learned from the model are fundamental. Like the Ising model, the Zimm-Bragg model captures some essential features of a problem and can be solved rigorously using statistical mechanical methods [14]. The extension of the ZB model to protein aggregation was first advanced by Oosawa and Kasai [51] and Terzi et al. [52], and was more recently applied by van Gestel and de Leeuw [25], Schmit et al. [53], and others [54–60] to the study of macromolecular aggregation. These simple statistical mechanical models may be used to predict the average lengths of protofibrils and fibrils, and the fraction of protein molecules that assume various conformational secondary structures, including sheet, coil, and possibly helix. The main models summarized here focus primarily on computing partition functions for protofibrils and fibrils using simple effective Hamiltonians and transfer matrices.

3.1. Partition Function for Helix-Coil Transitions in Proteins

Although protein aggregation is the subject of the article, we use the simplest model, i.e., the Zimm-Bragg model for helix-coil transitions in proteins, to illustrate the power of the statistical mechanical techniques. We first define the coil conformation of a protein residue as the reference state, which means its statistical weight is one [61,62]. The parameter s is the equilibrium constant for a coil residue converting into a helical residue, and relates to the free energy change of adding a helical residue to one end of a helical block, ΔG_s = −RT ln s, where R is the gas constant and T is the temperature. The nucleation step is to convert a coil residue into a helical residue in a chain of coil residues. The equilibrium constant associated with this event is σs, where ΔG_nuc = −RT ln σ is the additional free energy barrier the coil must overcome before converting into the first helical residue that can eventually be part of the helical block. Conventionally, σ and s are referred to as the initiation and propagation parameters. A single helical block in a protein thus has free energy ΔG = ΔG_nuc + n_hΔG_s, where n_h is the number of helical residues in the block.

In direct combinatorial approaches, the partition function Z_N for helix-coil transitions in a protein of N amino acids is often expressed in terms of the initiation and propagation parameters. Some approximations can be made to simplify the mathematical expression for Z_N [62,63]. For example, the simplest model for helix-coil transitions assumes a single helical stretch where all of the residues in the protein are locked into the helix conformation. A more general approach assumes that the conformation of a residue could depend on its neighbors and that nucleation and propagation occur via a “zipper” mechanism, that is, a single stretch of helix can form along the chain of N residues but may vary in length from one up to N. The partition function for the chain of N residues in the zipper model can be written as:

\begin{array}{l} Z_{N} = 1 + \sum_{k = 1}^{N} (N - k + 1) σ s^{k} \\ = 1 + σ s \frac{s^{N + 1} + N - s - N s}{{(s - 1)}^{2}} \end{array}

(1)

where the factor (N − k + 1) is the degeneracy, i.e., the number of ways of placing k contiguous helical residues along a chain of length N.
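As a hedged numerical illustration of the zipper model, the sketch below evaluates Z_N both by the direct sum over helix lengths and by the geometric-series closed form σs[s^(N+1) − (N+1)s + N]/(s − 1)², which is valid for s ≠ 1. The function names and the parameter values are illustrative assumptions, not taken from the original work.

```python
def zipper_z_sum(n, sigma, s):
    """Direct sum: Z_N = 1 + sum_k (N - k + 1) * sigma * s**k."""
    return 1.0 + sum((n - k + 1) * sigma * s**k for k in range(1, n + 1))


def zipper_z_closed(n, sigma, s):
    """Closed form of the same sum (geometric series), valid for s != 1."""
    numerator = s ** (n + 1) - (n + 1) * s + n
    return 1.0 + sigma * s * numerator / (s - 1.0) ** 2


if __name__ == "__main__":
    # Illustrative (assumed) parameters: weak nucleation, mild propagation.
    for n in (1, 2, 5, 20):
        print(n, zipper_z_sum(n, 1e-3, 1.5), zipper_z_closed(n, 1e-3, 1.5))
```

The two routes should agree to rounding error for any N, which gives a quick check on the closed-form expression.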

The Zimm-Bragg (ZB) model for helix-coil transitions in proteins assumes that any number of helical stretches may form along the chain, where residues could be involved in short-ranged interactions with other residues. Each residue could assume either a coil or a helical conformation, thus for nearest-neighbor interactions between residues, there are four possible combinations of a pair of residues. That is, two residues at positions j − 1, and j, respectively, could have states cc, hc, ch, or hh. The jth residue involved in the pair is assigned a weight depending on its conformation, and the conformation of the residue at j − 1. For cc, the weight is one. If two neighboring residues adopt the ch conformations, the helical residue is assigned the weight σs, which corresponds to the nucleation of a helical block. On the other hand, in the original ZB model, the state hc simply has weight one. That is, the growth of a helical block only proceeds in one direction along the chain. Finally, the hh state corresponds with a weight of s at the jth residue. The ZB model is summarized in Table 1.

The partition function for the ZB model, Z_N, can be easily computed by using a transfer matrix [64,65]. A transfer matrix is a device that can be used when the interactions among the N units of a system can be decomposed into nearest-neighbor, or next-nearest-neighbor, etc., interactions. Since the weight of the residue located at position j along the chain only depends on the conformation of the residue at chain position j − 1, the transfer matrix factors the partition function for some given energy function H(r) of a system of N units as:

\begin{array}{l} Z_{N} = \sum_{r} e^{- β H (r)} \\ = 〈 f ∣ T^{N} ∣ i 〉 \\ = \sum_{i = 1}^{N_{λ}} c_{i} λ_{i}^{N} \end{array}

(2)

where the vectors 〈f| and |i〉 represent the states of the residues at either end of the chain, and λ_i and N_λ are the eigenvalues and the total number of eigenvalues of T, respectively. Additionally, in the third line of Equation (2), the coefficients c_i take into account the effect of the boundary conditions. The notation r = (r_1, r_2, . . ., r_N) refers to the set of all N spins that represent the states of the residues, where r_i denotes the state of the ith residue.

The transfer matrix elements are the statistical weights assigned to a residue in a given state, conditioned on the state of its neighbor. Thus, the transfer matrix used in the ZB model for helix-coil transitions in proteins has the following form:

T = \begin{array}{c|cc} & h & c \\ \hline h & s & 1 \\ c & σ s & 1 \end{array}

(3)

where the labels in the left-hand column give the state of the (j − 1)th residue and the labels in the top row give the state of the jth residue. As indicated in Equation (2), the matrix T can be readily diagonalized and the partition function written in terms of its eigenvalues. We show the diagonalization as an example in the next section while discussing the helix-coil and sheet-coil transitions in equilibrium protein aggregation. For periodic boundary conditions, the c_i’s all equal unity in Equation (2) and the partition function can be written as:

\begin{array}{l} Z = tr (T^{N}) \\ \approx λ_{1}^{N} \end{array}

(4)

where “tr” refers to the trace operation and λ₁ is the largest eigenvalue of the transfer matrix in Equation (3). The approximation in Equation (4) is valid only when N is large and becomes exact in the thermodynamic limit N → ∞. The partition function then reduces to:

Z = {(\frac{1 + s + \sqrt{{(s - 1)}^{2} + 4 s σ}}{2})}^{N}

(5)
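The transfer-matrix factorization can be checked numerically on short chains. The following is a minimal sketch that enumerates all helix/coil strings with the ZB weights (cc and hc carry weight one, ch carries σs, hh carries s) and compares the result with the matrix product. The end vectors are an assumption made here for illustration: a fictitious coil residue is placed before the chain, and the last residue is summed over both states. All function names are hypothetical.

```python
from itertools import product
from math import sqrt


def zb_weight(prev, cur, sigma, s):
    """ZB weight of residue `cur` given the state of its predecessor."""
    if cur == "c":
        return 1.0                           # cc and hc: weight one
    return s if prev == "h" else sigma * s   # hh: s, ch: sigma * s


def z_brute_force(n, sigma, s):
    """Sum the weights of all 2**n helix/coil strings of an open chain."""
    total = 0.0
    for conf in product("hc", repeat=n):
        weight, prev = 1.0, "c"   # assumption: a coil precedes residue 1
        for cur in conf:
            weight *= zb_weight(prev, cur, sigma, s)
            prev = cur
        total += weight
    return total


def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size))
             for j in range(size)] for i in range(size)]


def matpow(t, n):
    size = len(t)
    result = [[float(i == j) for j in range(size)] for i in range(size)]
    for _ in range(n):
        result = matmul(result, t)
    return result


def zb_matrix(sigma, s):
    """Rows: state of residue j-1 (h, c); columns: state of residue j."""
    return [[s, 1.0], [sigma * s, 1.0]]


def z_open(n, sigma, s):
    """<f| T^N |i> with <f| = (0, 1) and |i> = (1, 1)^T (assumed ends)."""
    tn = matpow(zb_matrix(sigma, s), n)
    return tn[1][0] + tn[1][1]


def z_periodic(n, sigma, s):
    """tr(T^N) for periodic boundary conditions."""
    tn = matpow(zb_matrix(sigma, s), n)
    return tn[0][0] + tn[1][1]


def eigenvalues(sigma, s):
    root = sqrt((s - 1.0) ** 2 + 4.0 * sigma * s)
    return (1.0 + s + root) / 2.0, (1.0 + s - root) / 2.0


if __name__ == "__main__":
    sigma, s = 0.01, 1.4   # illustrative values only
    lam1, _ = eigenvalues(sigma, s)
    for n in (5, 20, 80):
        print(n, z_periodic(n, sigma, s) / lam1 ** n)   # approaches 1
```

For these assumed parameters the ratio tr(T^N)/λ₁^N tends to one as N grows, which is the sense in which the largest eigenvalue dominates the partition function.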

3.2. Thermodynamic Properties of Proteins

Once the partition function for the protein is known explicitly using the ZB theory, some average quantities can be defined and compared with experiments. The average fraction of residues in a chain of length N that are helical is referred to as the helicity, θ. It can be defined as:

θ \equiv \frac{1}{N_{H}} \frac{\partial ln Z}{\partial ln s}

(6)

where N_H is the maximum number of helical residues in the chain. The helicity is akin to the magnetization of a spin system in an external magnetic field. For the helical protein, the average number of helical segments, ν, and the average helical length, L, are also found using Equation (2),

ν = \frac{\partial ln Z}{\partial ln σ}

(7)

L = N_{H} \frac{θ}{ν} = \frac{s}{σ} \frac{\partial Z / \partial s}{\partial Z / \partial σ}

(8)

Similar averages are calculated for the sheet-coil and helix-sheet-coil systems. In general, for a chain of length N, the average fraction of residues having the property x_j is given by:

〈 n_{j} 〉 = \frac{1}{N_{H}} \frac{\partial ln Z}{\partial ln x_{j}}

(9)

where j could refer to helix, coil, sheet, etc. The helicity and the average length of helical segments for long chains are now easily computed by inserting Equation (5) into Equations (6) and (8), which leaves:

θ = \frac{1}{2} + \frac{s - 1}{2 \sqrt{{(s - 1)}^{2} + 4 σ s}}

(10)

L = 1 + \frac{2 s}{1 - s + \sqrt{{(1 - s)}^{2} + 4 σ s}}

(11)

and a similar result can be derived for θ and L when using finite boundary conditions.
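The long-chain expressions for θ and L are easy to evaluate and to cross-check. The sketch below, written under the assumption N_H = N and with hypothetical function names, compares the closed form for the helicity with a central finite difference of ln λ₁ with respect to ln s.

```python
from math import exp, log, sqrt


def lambda1(s, sigma):
    """Largest eigenvalue of the ZB transfer matrix."""
    return 0.5 * (1.0 + s + sqrt((s - 1.0) ** 2 + 4.0 * sigma * s))


def helicity(s, sigma):
    """Closed-form helicity for long chains."""
    return 0.5 + (s - 1.0) / (2.0 * sqrt((s - 1.0) ** 2 + 4.0 * sigma * s))


def mean_helix_length(s, sigma):
    """Closed-form average length of a helical segment for long chains."""
    return 1.0 + 2.0 * s / (1.0 - s + sqrt((1.0 - s) ** 2 + 4.0 * sigma * s))


def helicity_numeric(s, sigma, h=1e-5):
    """Central difference of ln(lambda1) with respect to ln(s)."""
    upper = log(lambda1(s * exp(h), sigma))
    lower = log(lambda1(s * exp(-h), sigma))
    return (upper - lower) / (2.0 * h)


if __name__ == "__main__":
    sigma = 1e-4   # illustrative value; small sigma gives a sharp transition
    for s in (0.9, 0.98, 1.0, 1.02, 1.1):
        print(s, helicity(s, sigma), mean_helix_length(s, sigma))
```

At s = 1 the helicity is exactly one half and the mean segment length reduces to 1 + 1/√σ, so a smaller initiation parameter produces fewer but longer helical segments.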

4. Equilibrium Protein Aggregation

In 1961, Oosawa and Kasai constructed a model for equilibrium protein aggregation using ideas from the helix-coil theory that were being developed at the same time. First, in the model the total number of proteins in the system is fixed and is denoted by m_tot. Next, the model assumes that a dimer is the smallest aggregate that may form. The chemical reaction for dimer formation represents the nucleation of an aggregate [51,52,66] and can be quantified by an equilibrium constant denoted K_eq:

A + A \overset{K_{e q}}{⇋} A_{2}

(12)

where A represents a monomer, and A_k represents the aggregate containing k proteins, referred to as a k-mer.

The nucleation equilibrium constant is often denoted K_eq = σs, for example, in Terzi’s model [52]. The concentrations of monomers, n₁, and dimers, n₂, are then related by:

n_{2} = σ s n_{1}^{2}

(13)

Once a dimer is formed, then trimer, . . ., k-mer may form by successive addition of a protein to an aggregate. Any of these reactions can be described by the monomer addition mechanism, represented by:

A_{i} + A \overset{s}{⇋} A_{i + 1}

(14)

where s is the equilibrium constant for monomer addition. Combining the nucleation step with the k − 2 subsequent monomer additions, we can write the equilibrium concentration of the k-mer as:

n_{k} = σ s^{k - 1} n_{1}^{k}

(15)

Since the total protein mass in the system is conserved, m_tot can be written in terms of the concentrations of monomers and aggregates as:

\begin{array}{l} m_{t o t} = \sum_{k = 1}^{\infty} k n_{k} \\ = 1 \cdot n_{1} + 2 \cdot n_{2} + \dots + k \cdot n_{k} + \dots \\ = 1 \cdot n_{1} + 2 σ s n_{1}^{2} + \dots + k σ s^{k - 1} n_{1}^{k} + \dots \end{array}

(16)

Therefore, in the thermodynamic limit N → ∞ the expression can be written as:

\begin{array}{l} m_{t o t} = n_{1} (1 + \sum_{k = 2}^{\infty} k σ s^{k - 1} n_{1}^{k - 1}) \\ = n_{1} (1 - σ + \frac{σ}{{(1 - s n_{1})}^{2}}) \end{array}

(17)

where the sum converges when sn₁ < 1. If m_tot is known, Equation (17) can be solved for the monomer concentration n₁.
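Because the right-hand side of Equation (17) increases monotonically with n₁ on the interval 0 < n₁ < 1/s, the monomer concentration can be found by simple bisection. The following is a minimal sketch under that observation; the function names, the bisection routine, and the parameter values are illustrative assumptions rather than the procedure used in the cited works.

```python
def total_mass(n1, sigma, s):
    """Right-hand side of Equation (17); diverges as s * n1 -> 1."""
    x = s * n1
    if x >= 1.0:
        return float("inf")
    return n1 * (1.0 - sigma + sigma / (1.0 - x) ** 2)


def solve_monomer(m_tot, sigma, s, iterations=200):
    """Bisection for n1 on (0, 1/s), where total_mass is increasing."""
    low, high = 0.0, 1.0 / s
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if total_mass(mid, sigma, s) < m_tot:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


def kmer_concentration(k, n1, sigma, s):
    """n_k = sigma * s**(k-1) * n1**k for k >= 2, and n_1 = n1."""
    if k == 1:
        return n1
    return sigma * n1 * (s * n1) ** (k - 1)


if __name__ == "__main__":
    sigma, s = 0.01, 1.0   # illustrative values only
    for m_tot in (0.5, 1.0, 2.0, 5.0):
        n1 = solve_monomer(m_tot, sigma, s)
        print(m_tot, n1, kmer_concentration(2, n1, sigma, s))
```

With a small σ the computed n₁ tracks m_tot at low concentration and then saturates just below 1/s, so that additional protein goes almost entirely into aggregates.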

4.1. A Generalized Zimm-Bragg Model for Protein Aggregation

A more recent approach to equilibrium peptide assembly introduced by van Gestel and van der Schoot relates the concentrations of protein aggregates to equilibrium partition functions [25]. The partition functions of aggregates are expressed in terms of ZB initiation and propagation parameters and a transfer matrix. The model describes 1D protein aggregation, where the protein monomer is dominated by coil, sheet, or helical conformations, discussed below, and may participate in short-ranged interactions with other proteins. Hence, the aggregates may exhibit various degrees of conformational order, where helix-coil [24,67], sheet-coil [25,68], or helix-sheet transitions [21,26] may occur.

This modeling approach is consistent with a recent set of experiments [69], for example, that have shown that oligomers of Aβ(1-40) and Aβ(1-42) are dominated by antiparallel β-sheet structures, while their fibrils are mainly characterized by parallel β-sheet structures. Thus major conformational changes may take place somewhere between the oligomer and the fibril formations, and using Ising-like ZB models may be advantageous for studying conformational transitions involved in protein aggregation.

The isolated monomer in the ZB model for aggregation is assumed to be a natively unstructured protein. A protein is defined as a “helix” protein if θ_helix > θ_sheet and θ_helix > θ_coil, where θ_helix can be defined by using Equation (10), or a related equation for sheets. Similar definitions apply to “sheet” and “coil” proteins. The random coil does not have stable secondary structures. This toy model is suitable for understanding how proteins form fibrils in 1D. A word of caution, however, is that, as mentioned above, amyloid formation can be sequence-dependent; for example, Aβ(1-40) and Aβ(1-42) have different pathways [27].

All of the aggregate species in a system of volume V are assumed to be in kinetic equilibrium with each other, where interest lies in studying the thermodynamic properties of these aggregates. The system of proteins and aggregates is also assumed to be well mixed and to contain a fixed amount of protein mass given by Equation (16). If the system contains only low concentrations of proteins and aggregates, the solution properties of the aggregates can be calculated by employing a standard ideal gas approximation for the k-mers, where the partition function for the system can be written as:

Z_{T} = \prod_{k} \frac{Z_{k}^{n_{k}}}{n_{k}!}

(18)

where Z_k is the canonical partition function for the k-mer. The number distributions n_k per unit volume define the number densities ρ_k ≡ n_k/V. The relative densities of k-mers can be derived by considering the total free energy density Δℱ, which may be written compactly, with N now denoting the number of proteins in an aggregate, as:

Δ ℱ = \sum_{N = 1}^{\infty} ρ (N) [ln ρ (N) - 1 - ln Z (N)]

(19)

which contains an entropy of mixing term as well as the free energy of the aggregate of size N. We can minimize Equation (19) with respect to the number densities ρ(N), subject to the constraint given in Equation (16), i.e., conservation of mass, which yields for the number densities:

ρ (N) = Z (N) exp (μ N)

(20)

where μ is the Lagrange multiplier, which can be identified as the chemical potential of a protein, and ρ(N) is just the Nth moment of the quasi-grand ensemble

Ω = \prod_{k = 1}^{N_{T}} ρ (k)

As in the 1D model by van Gestel et al. [24,25], the state of an aggregate is directly coupled to the aggregate size distribution.
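The constrained minimization that leads to Equation (20) can be written out in a few lines. In this sketch the symbol \rho_{tot}, the fixed total protein density appearing in the mass-conservation constraint, and the auxiliary function \mathcal{L} are notational assumptions introduced here for illustration.

```latex
\begin{aligned}
\mathcal{L} &= \sum_{N=1}^{\infty} \rho(N)\,\bigl[\ln \rho(N) - 1 - \ln Z(N)\bigr]
             \;-\; \mu \Bigl(\sum_{N=1}^{\infty} N\,\rho(N) - \rho_{tot}\Bigr), \\
\frac{\partial \mathcal{L}}{\partial \rho(N)} &= \ln \rho(N) - \ln Z(N) - \mu N = 0, \\
\rho(N) &= Z(N)\, e^{\mu N}.
\end{aligned}
```

The term −1 in the bracket cancels against the derivative of ρ ln ρ, which is why only ln ρ(N), ln Z(N), and μN survive in the stationarity condition.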

A generalized ZB model for protein aggregation can now be defined by an effective Hamiltonian. The effective Hamiltonian is used to find a transfer matrix by assuming the interactions between proteins in an aggregate are described by a nearest-neighbor, Ising-like model, in which each protein can be in either of two states: a sheet (or helix) or a coil conformation. The interactions include the free energy R > 0, which describes the interfacial tension between adjacent sheet and coil proteins in an aggregate. The parameter P > 0 represents the interaction between two neighboring proteins, where one of the proteins located at position j along the chain is in a sheet conformation. P for sheets is measured relative to the coil interaction energy, which was taken to be zero. Additionally, the free energy K > 0 quantifies a polymerizing interaction between any two monomers along the 1D lattice that does not depend on their respective conformations. In this sign convention, where R is counted as a penalty and P and K as gains, the effective Hamiltonian used by van Gestel and others takes the following form for N monomers [25,70]:

E (r) = - \frac{1}{2} R \sum_{i = 1}^{N - 1} (r_{i} r_{i + 1} - 1) - \frac{1}{2} P \sum_{i = 1}^{N} (r_{i} + 1) - (N - 1) K

(21)

where r = (r_1, r_2, . . ., r_N) and r_i can take on values {1, −1} corresponding to the spin states {↑, ↓} in the Ising model, and to {s, c} in a Zimm-Bragg model for sheet-coil aggregates. With periodic boundary conditions, the partition function for the two-state model can be written as:

\begin{array}{l} Z_{N} = \sum_{r} e^{- β E (r)} \\ = e^{(N - 1) K} \sum_{r} exp [\frac{1}{2} R \sum_{i = 1}^{N - 1} (r_{i} r_{i + 1} - 1) + \frac{1}{2} P \sum_{i = 1}^{N} (r_{i} + 1)] \\ = k^{N - 1} \sum_{r} T (r_{1}, r_{2}) T (r_{2}, r_{3}) \dots T (r_{N - 1}, r_{N}) T (r_{N}, r_{1}) \end{array}

(22)

where e^{β(N−1)K} ≡ k^{N−1} with β = 1/k_BT, and, from the second line onward, the parameters are redefined in dimensionless form as K ≡ K/k_BT, R ≡ R/k_BT, P ≡ P/k_BT. The transfer matrix can be written as:

\begin{array}{l} T (r, r^{'}) = exp [\frac{R}{2} (r r^{'} - 1) + \frac{P}{2} (r + 1)] \\ = (\begin{matrix} 1 & \sqrt{σ_{1}} \\ s_{1} \sqrt{σ_{1}} & s_{1} \end{matrix}) \end{array}

(23)

where the rows and columns are ordered as (c, s), and σ₁ = exp(−2R) and s₁ = exp(P) are the initiation and propagation parameters for the aggregate system. We note that Equation (23) and Equation (3) yield the same characteristic equation, hence they predict the same thermodynamic results. The difference in the formulation of the folding model versus the aggregation model is that the helical regions of proteins, for example, may only elongate in one direction, while helical aggregates may grow longer at either end. While amyloid fibrils during fibril formation [71,72]. It may also be advantageous to study statistical mechanical models that can describe the interaction of molecules capable of binding to Aβ structure in fibrils, which may inhibit oligomer formation in the early stages of aggregation [73].
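The statement that the aggregate transfer matrix and the ZB folding matrix share a characteristic equation can be verified directly, since a 2 × 2 characteristic polynomial is fixed by the trace and the determinant. The sketch below builds both matrices from σ₁ = exp(−2R) and s₁ = exp(P) as defined in the text; the function names and the numerical values of R and P are assumptions made for illustration.

```python
from math import exp, sqrt


def aggregate_matrix(r_energy, p_energy):
    """Two-state aggregate matrix written in terms of sigma_1 and s_1."""
    sigma1 = exp(-2.0 * r_energy)
    s1 = exp(p_energy)
    root = sqrt(sigma1)
    return [[1.0, root], [s1 * root, s1]]


def zb_matrix(sigma, s):
    """Zimm-Bragg folding matrix with weights s, 1, sigma*s, 1."""
    return [[s, 1.0], [sigma * s, 1.0]]


def trace_and_det(t):
    trace = t[0][0] + t[1][1]
    det = t[0][0] * t[1][1] - t[0][1] * t[1][0]
    return trace, det


def largest_eigenvalue(t):
    """Largest root of lambda**2 - trace * lambda + det = 0."""
    trace, det = trace_and_det(t)
    return 0.5 * (trace + sqrt(trace * trace - 4.0 * det))


if __name__ == "__main__":
    r_energy, p_energy = 1.5, 0.3   # dimensionless, illustrative values
    agg = aggregate_matrix(r_energy, p_energy)
    fold = zb_matrix(exp(-2.0 * r_energy), exp(p_energy))
    print(trace_and_det(agg), trace_and_det(fold))
    print(largest_eigenvalue(agg), largest_eigenvalue(fold))
```

Both matrices have trace 1 + s₁ and determinant s₁(1 − σ₁), so their eigenvalues, and hence the long-chain thermodynamics, coincide.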

5. Partition Functions for Fibrils

Amyloid formation is generally believed to be dominated by 1D or quasi-1D chains of proteins, which may then bundle into protofibrils and fibrils. It is because of this fact that the transfer matrix formulation in statistical mechanics, if extended successfully, is a powerful technique for the studies of amyloid formation. We focus on this extension in this and the following sections.

5.1. Potts Model for 1D Filaments

To include helix, sheet, and coil conformations in a single model, a Potts model [74] for 3-state proteins can be used [26]. The spin variable, r, may now assume values of r = 0 for coil proteins, r = 1 for helical proteins, and r = 2 for sheet proteins. Interactions between proteins are assumed to be with nearest-neighbors only so that a dimensionless Potts model for the aggregate containing N number of proteins can be written as:

- β H_{f i l} = - \sum_{i = 1}^{N - 2} R (r_{i}, r_{i + 1}) [1 - δ (r_{i}, r_{i + 1})] + \sum_{i = 1}^{N - 1} (P_{1} δ (r_{i}, 1) + P_{2} δ (r_{i}, 2))

(24)

where the Kronecker delta δ(x, y) equals one if x = y and zero otherwise. Equation (24) is illustrated in Figure 4. As in the two-state model of Equation (21), the initiation parameters are defined as σ(r_j, r_{j+1}) ≡ exp(−2R(r_j, r_{j+1})), and R(r_j, r_{j+1}) > 0 is the free energy of the interfacial tension between proteins at positions j, j + 1 that are not in the same conformation [24]. Thus, three types of interfaces between neighboring proteins in a generalized model are possible: hc or ch, R(0, 1) = R(1, 0) ≡ R₁; sc or cs, R(0, 2) = R(2, 0) ≡ R₂; and sh or hs, R(1, 2) = R(2, 1) ≡ R₃. The notation can be simplified by letting σ(1, 0) = σ(0, 1) ≡ σ₁, σ(0, 2) = σ(2, 0) ≡ σ₂, and finally σ(2, 1) = σ(1, 2) ≡ σ₃.

The propagation parameters s₁ and s₂ are associated with the free energies P₁ and P₂ that refer to the interaction between the ith protein that is helix or sheet, respectively, and the nearest neighbor protein at location i + 1. The coil protein interaction energy is assumed to be zero so that it may serve as a reference state. The transfer matrix can then be written as

\begin{array}{l} T (r, r^{'}) = exp \{- R (r, r^{'}) [1 - δ (r, r^{'})] + P_{1} δ (r, 1) + P_{2} δ (r, 2)\} \\ = (\begin{matrix} 1 & \sqrt{σ_{1}} & \sqrt{σ_{2}} \\ s_{1} \sqrt{σ_{1}} & s_{1} & s_{1} \sqrt{σ_{3}} \\ s_{2} \sqrt{σ_{2}} & s_{2} \sqrt{σ_{3}} & s_{2} \end{matrix}) \end{array}

(25)

and the partition function for N > 2 proteins in the aggregate can be calculated by diagonalizing Equation (25) and inserting the eigenvalues into Equation (2).
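As a hedged illustration of the three-state case, the sketch below builds the Potts transfer matrix from the parameters s₁, s₂, σ₁, σ₂, σ₃ and evaluates the partition function of a closed ring as tr(T^N), checking it against explicit enumeration of all 3^N configurations. Periodic boundaries are an assumption chosen here for simplicity (the filament Hamiltonian in the text uses open sums), and all function names and parameter values are hypothetical.

```python
from itertools import product
from math import sqrt


def potts_matrix(s1, s2, sig1, sig2, sig3):
    """Rows and columns ordered as (coil, helix, sheet) = (0, 1, 2)."""
    return [
        [1.0, sqrt(sig1), sqrt(sig2)],
        [s1 * sqrt(sig1), s1, s1 * sqrt(sig3)],
        [s2 * sqrt(sig2), s2 * sqrt(sig3), s2],
    ]


def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size))
             for j in range(size)] for i in range(size)]


def z_ring_transfer(t, n):
    """tr(T^N) for a ring of n proteins."""
    size = len(t)
    power = [[float(i == j) for j in range(size)] for i in range(size)]
    for _ in range(n):
        power = matmul(power, t)
    return sum(power[i][i] for i in range(size))


def z_ring_brute_force(t, n):
    """Explicit sum over all configurations of the ring."""
    total = 0.0
    for conf in product(range(len(t)), repeat=n):
        weight = 1.0
        for i in range(n):
            weight *= t[conf[i]][conf[(i + 1) % n]]
        total += weight
    return total


if __name__ == "__main__":
    t = potts_matrix(1.2, 1.5, 0.01, 0.02, 0.03)   # illustrative values
    for n in (3, 5, 8):
        print(n, z_ring_transfer(t, n), z_ring_brute_force(t, n))
```

Setting s₂ = 0 and σ₂ = σ₃ = 0 removes the sheet state, and the trace then reduces to the sum of the Nth powers of the two eigenvalues of the helix-coil matrix.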

The Ising-like ZB model for sheet-coil (or helix-coil) transitions in aggregates, Equation (21), can be recovered by writing the q = 2 version of the effective Hamiltonian given in Equation (24). By choosing internal states such that r_i = −1 is the coil state and r_i = +1 is the helix state, and by making the substitution

δ (r_{i}, r_{j}) = \frac{1}{2} (1 + r_{i} r_{j})

the effective Hamiltonian and corresponding transfer matrix for the 1D, two-state helix-coil model are recovered [24]. Next, the Hamiltonian for the 1D model given by Equation (24) can be applied to describe the interactions between proteins in aggregates on quasi-1D lattices.

5.2. Simple Model for Fibrils

To describe the fibrils, which may contain several filaments that can be described by Equation (24), various models could be used. For example, two identical filaments could align in register, and all of the proteins could be in the sheet conformation already, or all coil, or some combination of sheet, coil, and even helix proteins. A simple Hamiltonian for the all-sheet case can be written as [25,55,68]:

- β H = L_{y} (L_{x} - 1) (K + P_{1}) + L_{x} (L_{y} - 1) B

(26)

where the free energy B > 0 describes a lateral binding interaction between two sheet proteins on different filaments, L_y refers to the number of filaments, and L_x is the length of each filament. Additionally, in our approach, K is the polymerizing interaction between two adjacent proteins in a fibril, and P₁ is the free energy of an interaction between a sheet protein and one of its neighbors. Other effective Hamiltonians could also be written to describe the case where the filaments are not aligned in register with each other, as well as cases where the protein conformation may also play a role in the assembly of the filaments into full fibrils [55,68].
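The prefactors in Equation (26) are simply bond counts on an L_x × L_y rectangular lattice: L_y(L_x − 1) bonds along the filaments and L_x(L_y − 1) lateral bonds between them. The short sketch below confirms the counts by enumerating lattice sites and evaluates the all-sheet exponent; the function names and the parameter values are illustrative assumptions.

```python
from math import exp


def count_bonds(lx, ly):
    """Count nearest-neighbor bonds along (x) and between (y) filaments."""
    sites = {(i, j) for i in range(lx) for j in range(ly)}
    longitudinal = sum((i + 1, j) in sites for i, j in sites)
    lateral = sum((i, j + 1) in sites for i, j in sites)
    return longitudinal, lateral


def all_sheet_exponent(lx, ly, k_poly, p1, b_lat):
    """-beta*H for in-register filaments with every protein in a sheet."""
    return ly * (lx - 1) * (k_poly + p1) + lx * (ly - 1) * b_lat


if __name__ == "__main__":
    lx, ly = 5, 2                       # two filaments of five proteins
    k_poly, p1, b_lat = 1.0, 0.5, 0.25  # dimensionless, illustrative
    print(count_bonds(lx, ly))
    exponent = all_sheet_exponent(lx, ly, k_poly, p1, b_lat)
    print(exponent, exp(exponent))      # exponent and Boltzmann weight
```

For a single filament (L_y = 1) the lateral term vanishes and the exponent reduces to (L_x − 1)(K + P₁).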

5.3. Quasi-1D Models for Aggregates

In addition to conformational changes in filaments and fibrils, another complication is that the kinetic pathways to the formation of fibrils seem to be sequence-dependent [27]. The Aβ(1–40) isoform solution is abundant in dimers, then trimers, tetramers, . . . , in decreasing order. However, the Aβ(1–42) isoform is more abundant in hexamers and pentamers than in dimers and trimers [4,27]. These facts seem to be consistent with recent experiments [38,69,75], which indicate that the Aβ(1–40) dimer is particularly stable and contributes to protofibril formation. On the other hand, circular hexamers seem to play a role in the protofibril formation of Aβ(1–42) [4,27].

We can model the equilibrium aggregates of Aβ(1–40) and Aβ(1–42) proteins by using finite strips of a two-dimensional N × N square lattice. In Figure 5 and Figure 6a, two identical 1D lattices stacked in-register, that is, a strip lattice of width two, are used to represent an Aβ(1–40) aggregate. For Aβ(1–40), we assume that the smallest equilibrium aggregate is the critical nucleus, which in this case is taken to be the dimer, while the smallest aggregate for Aβ(1–42) is the hexamer, as illustrated in Figure 7 for a strip lattice of width six.

The position of a vertex within the strip is specified by coordinates (i, j), where i is the position along the x-axis of length L_x vertices and j is the position along the y-axis of width L_y vertices. The total number of vertices is N_T = L_xL_y. In Figure 5a, strips of spin variables s_i^j along the y-axis are referred to as L_y-mers, where s_i^j = −1 or +1 for Ising-type models and 0, 1, or 2 for Potts models. The critical nucleus can be represented by a column of L_y proteins on the strip lattice. The proteins in the nucleus could also participate in inter-protein interactions other than the polymerizing interaction K in the y-direction. These interactions can be described by using the sheet and helix interactions from the filament model, plus the free energy introduced above, B > 0, that quantifies the inter-filament interactions between two sheet proteins.

The interactions between the proteins in these aggregates are modeled similarly to the 2-helix chain model for proteins proposed by Skolnick [76] and others [59,63,77,78], which use Zimm-Bragg or Lifson-Roig (LR) parameters to quantify the inter-chain interactions between residues in independent chains. When the inter-chain interactions between two helical residues are set to zero, for example, the partition function reduces to a direct product of Zimm-Bragg [76] (or LR [63,77]) transfer matrices. Since the model for aggregates proposed here uses a strip of a finite 2D lattice, as illustrated in Figure 6a, the two-protein case studied by Skolnick can be considered as a special case in which protein folding, rather than aggregation, is considered. The lessons learned from the folding models can guide us in constructing aggregation models.

As mentioned, the nucleus is the smallest equilibrium aggregate in our formulation. The next smallest aggregate occupies the first two columns of the strip lattice and contains 2L_y proteins, then 4L_y proteins, and so on. The interactions between L_y-mers along the x-axis can be described by generalizing the effective Hamiltonian for the 1D aggregation model, that is, P₁ > 0 and P₂ > 0 represent the interaction between helical and sheet proteins, respectively, and their nearest-neighbors in the aggregate. The total effective Hamiltonian for aggregates on a strip lattice with boundary conditions can be written as:

- β H_{2 D} = \sum_{j = 1}^{L_{y}} H_{f i l} (j) + B \sum_{i = 1}^{L_{x} - 1} \sum_{j = 1}^{L_{y} - 1} δ (s_{i}^{j}, + 2) δ (s_{i}^{j + 1}, + 2) + (L_{y} - 1) L_{x} K

(27)

where H_fil is given by Equation (24) upon substituting r_{i} \to s_{i}^{j}, and where s_{i}^{j} can assume the values of 0, 1, 2 for coil, helix, or sheet monomers, respectively. Additionally, B is the lateral binding interaction between two proteins from different filaments. The third term, involving K, accounts for the polymerizing interactions between proteins in both the x and y directions on the quasi-1D lattice. We assume that all polymerizing interactions between proteins on a quasi-1D lattice are equal in magnitude in order to keep the number of parameters used in the model to a minimum. This K is similar to the parameter K in Equation (21), which took into account the polymerizing interaction between two adjacent proteins on the 1D lattice. For the case q = 2 and L_y = 2 of Equation (27), the Hamiltonian for sheet-coil protofibrils can be explicitly written as:

- β H_{2 D} (s) = H_{f i l} (s^{1}) + H_{f i l} (s^{2}) + B \sum_{i = 1}^{L_{x} - 1} δ (s_{i}^{1}, + 2) δ (s_{i}^{2}, + 2) + L_{x} K

(28)

where 2D refers to aggregates on a strip lattice. The transfer matrix can then be written as:

T_{2 D} = (\begin{matrix} 1 & \sqrt{σ_{1}} & \sqrt{σ_{1}} & σ_{1} \\ s_{1} \sqrt{σ_{1}} & s_{1} & s_{1} σ_{1} & s_{1} \sqrt{σ_{1}} \\ s_{1} \sqrt{σ_{1}} & s_{1} σ_{1} & s_{1} & s_{1} \sqrt{σ_{1}} \\ s_{1}^{2} σ_{1} & s_{1}^{2} \sqrt{σ_{1}} & s_{1}^{2} \sqrt{σ_{1}} & s_{1}^{2} b \end{matrix})

(29)

where b ≡ exp(B). As mentioned, in the limit when the inter-filament interactions between two sheet proteins B → 0, the transfer matrix given by Equation (29) decomposes into a direct product of transfer matrices given by Equation (23) for 1D filaments of length N along the x-axis [25]. Moreover, the limit in which the sheet-coil interfacial interactions R → 0 (or σ₁ → 1) yields independent strips parallel to the y-axis. The 2^{L_y} × 2^{L_y} transfer matrix is also symmetric with respect to the sheet-coil interfacial interaction R. This fact is analogous to the 1D case, where the transfer matrix was symmetric in σ₁, σ₂ and σ₃. The partition function for aggregates on the strip lattice can be calculated by plugging the eigenvalues, λ_{2D,i}, of Equation (29) into Equation (2) and specifying boundary conditions. The result is:

Z_{2 D} = k^{L_{y} (2 L_{x} - 1) - L_{x}} \sum_{i = 1}^{N_{λ_{2 D}}} c_{i} λ_{2 D, i}^{L_{x} - 2} .

(30)

where, again, c_i are determined by boundary conditions and k was defined in Equation (22).
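The direct-product limit can be checked numerically. The following is a minimal sketch, not the authors' code: it builds the matrix of Equation (29) and compares it, at b = 1 (that is, B = 0), with the Kronecker product of two 2 × 2 filament matrices. The 2 × 2 matrix is inferred here from the b = 1 limit of Equation (29) and is only assumed to coincide with Equation (23); the function names and parameter values are illustrative.

```python
import numpy as np

def t_filament(s1, sigma1):
    """2 x 2 sheet-coil matrix inferred from Equation (29) at b = 1
    (assumed, not confirmed, to be the form of Equation (23))."""
    r = np.sqrt(sigma1)
    return np.array([[1.0, r],
                     [s1 * r, s1]])

def t_strip(s1, sigma1, b):
    """4 x 4 transfer matrix of Equation (29) for the L_y = 2 sheet-coil strip."""
    r = np.sqrt(sigma1)
    return np.array([
        [1.0,            r,           r,           sigma1],
        [s1 * r,         s1,          s1 * sigma1, s1 * r],
        [s1 * r,         s1 * sigma1, s1,          s1 * r],
        [s1**2 * sigma1, s1**2 * r,   s1**2 * r,   s1**2 * b],
    ])

# Illustrative parameter values, not fitted ones.
s1, sigma1 = 1.8, 0.05

# B -> 0 means b = exp(B) -> 1: the strip matrix should factorize.
decoupled = t_strip(s1, sigma1, b=1.0)
factorizes = np.allclose(decoupled, np.kron(t_filament(s1, sigma1),
                                            t_filament(s1, sigma1)))

# Largest eigenvalues without and with lateral binding (B = 1 chosen arbitrarily).
lam_decoupled = max(np.linalg.eigvals(decoupled).real)
lam_coupled = max(np.linalg.eigvals(t_strip(s1, sigma1, b=np.exp(1.0))).real)
```

In this sketch, a positive B raises only the sheet-sheet entry of the matrix, so the largest eigenvalue (and hence the weight of long strips) grows relative to two independent filaments.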

Using the ZB formalism, we can also model fibrils in the x-, y- and z-directions using a quasi-1D lattice in 3D. For example, two or more filaments could join to form a protofibril or fibril, as illustrated in Figure 6 for two simple geometric configurations. For simplicity, we study the case where two 2D aggregates represented by strip lattices, as depicted in Figure 6b, have the same geometrical shape and are stacked one on top of the other, in-register. The spin variables associated with one of the aggregates are denoted by s, while those associated with the second aggregate are denoted by t. The effective Hamiltonian for the fibril model in Figure 6b, referred to as the “cube” model, may be written using the strip model for aggregates as:

- β H_{3 D} = - β H_{2 D} (s) - β H_{2 D} (t) + B \sum_{i = 1}^{N} δ (s_{i}^{1}, + 2) δ (t_{i}^{1}, + 2) δ (s_{i}^{2}, + 2) δ (t_{i}^{2}, + 2) + L_{x} L_{z} K

(31)

where we assume that the interaction between any two sheet proteins from adjacent aggregates is also quantified by the free energy B > 0. Additionally, the aggregates can now polymerize in any direction. To help keep the number of parameters used in the model to a minimum, we assume the polymerizing interactions between two adjacent proteins in the z-direction have the same strength as the polymerizing interactions, K, in the x and y directions. The corresponding transfer matrices for each fibril model are found just as they were for the 1D and strip models discussed earlier. The resulting partition function is:

Z_{3 D} = k^{- L_{y} + L_{x} (- 1 + 2 L_{y} + L_{z})} \sum_{i = 1}^{N_{λ_{3 D}}} c_{i} λ_{3 D, i}^{L_{x} - 2} .

(32)

In general, the transfer matrix for the 2D model has dimension q^{L_y} × q^{L_y}, while that for the 3D model has dimension q^{2L_y} × q^{2L_y}, where we assumed that the two protofibrils composing the fibril each contain L_y filaments. The lattices for the 2D and 3D models are illustrated in Figure 6a and Figure 6b, respectively. Since the Potts model is used, q is 2 for sheet-coil, helix-coil, or helix-sheet models, and 3 for the helix-sheet-coil model.
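The growth of the matrix dimension with strip width can be made concrete by enumerating the states of a single lattice column. This is only an illustrative sketch; the helper names and the `layers` argument (1 for a strip, 2 for two stacked strips as in the cube model) are hypothetical conveniences rather than notation from the text.

```python
from itertools import product

def column_states(q, n_sites):
    """All conformations of one lattice column of n_sites proteins,
    each protein taking one of q Potts states."""
    return list(product(range(q), repeat=n_sites))

def transfer_matrix_dim(q, l_y, layers=1):
    """Number of rows (and columns) of the transfer matrix:
    one per column state; layers=2 mimics the cube model."""
    return len(column_states(q, layers * l_y))

dim_strip = transfer_matrix_dim(q=2, l_y=2)            # sheet-coil strip, L_y = 2
dim_cube = transfer_matrix_dim(q=2, l_y=2, layers=2)   # two stacked L_y = 2 strips
dim_hsc = transfer_matrix_dim(q=3, l_y=4)              # helix-sheet-coil, L_y = 4
```

The exponential growth with L_y is the practical reason why only narrow strips can be diagonalized directly.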

5.4. Dilute Thermodynamic Averages

To compare with experiments, some average properties of the dilute system of monomers and aggregates can be defined. The total number density for the strip or cube model for fibrils can be written as:

ρ (L) = Z (L) exp (μ L)

(33)

where Z can be given by Equation (30) or (32) for 2D or 3D aggregates, respectively, and L is the total number of proteins in aggregates. In the 2D case, L = L_xL_y. The average fraction that an aggregate is helix (i = 1) or sheet (i = 2) can be defined as:

〈 θ_{i} 〉 \equiv \frac{\sum_{L_{x} = 1}^{\infty} θ_{i} (L) L ρ (L)}{\sum_{L_{x} = 1}^{\infty} L ρ (L)}

(34)

where the x-axis is the axis of propagation of the aggregate, and θ_i can be calculated for any of the aggregate species discussed earlier by computing Equation (9). Equation (34) can be used as a fit function for CD spectral data points. The average degree of polymerization of an aggregate growing in the x-direction can also be defined as:

〈 L_{x} 〉 \equiv \frac{\sum_{L_{x} = 1}^{\infty} L ρ (L)}{\sum_{L_{x} = 1}^{\infty} ρ (L)}

(35)

and is directly related to the length of the fibrils. The expressions for 〈θ₁〉 and 〈L〉 can be obtained for systems of α-synuclein (αS) and Aβ(1–40) aggregates. The α-synuclein fibril is modeled by placing the proteins in the aggregates onto the L_y = 4 strip lattice, as illustrated in Figure 6a. The average length of these fibrils is then:

〈 L 〉 = \frac{〈 L_{x} 〉}{4}

(36)

which can be used to fit the AFM measurements of the average lengths of the fibrils. This relation also holds for the average length of a fibril described by the cube lattice model as depicted in Figure 6b for the Aβ(1–40) fibrils. As an example, we calculate these quantities explicitly for a system of equilibrium Aβ(1–40) aggregates using a sheet-coil model. Specifically, for a system where 1D filaments, L_y = 2 strip aggregates, or 3D cube aggregates could be present at equilibrium, we first define:

\begin{array}{l} ρ_{A β} \equiv \sum_{L_{x} = 1}^{\infty} \left[ ρ (L_{x}) + ρ (2 L_{x}) + ρ (4 L_{x}) \right] \\ = z + k z^{2} {〈 i ∣ f 〉}_{1 D} + \sum_{i = 1}^{N_{λ_{1 D}}} \frac{k^{2} z^{3} x_{i} λ_{i}}{1 - k z λ_{i}} + z^{2} + k^{4} z^{4} {〈 i ∣ f 〉}_{2 D} + \sum_{j = 1}^{N_{λ_{2 D}}} \frac{k^{7} z^{6} x_{j} λ_{j}}{1 - k^{3} z^{2} λ_{j}} + z^{4} + k^{8} z^{8} {〈 i ∣ f 〉}_{3 D} + \sum_{l = 1}^{N_{λ_{3 D}}} \frac{k^{13} z^{12} x_{l} λ_{l}}{1 - k^{5} z^{4} λ_{l}} \end{array}

(37)

where the fugacity z ≡ exp(βμ) and in each sum λ_k is the kth eigenvalue of the 1D, 2D, or 3D transfer matrix for filament, strip, or cube models, respectively. The Aβ(1–40) aggregates considered here at equilibrium are illustrated in Figure 8. Additionally, x_k is the kth term of the expression 〈i|f〉, where |i〉 and |f〉 are the specified boundary conditions. The sums computed converge only if kzλ_j< 1 for all j. Details on boundary conditions for ZB-type models can be found in [67]. By using Equation (16), φ = m_tot/V can also be written explicitly for Aβ(1–40) aggregates as:

\begin{array}{l} φ \equiv \sum_{L_{x} = 1}^{\infty} \left[ L_{x} ρ (L_{x}) + 2 L_{x} ρ (2 L_{x}) + 4 L_{x} ρ (4 L_{x}) \right] \\ = z + 2 k z^{2} {〈 i ∣ f 〉}_{1 D} + \sum_{j = 1}^{N_{λ_{1 D}}} \frac{(k^{2} z^{3} x_{j} λ_{j}) (3 - 2 k z λ_{j})}{{(1 - k z λ_{j})}^{2}} \\ + 2 z^{2} + 4 k^{4} z^{4} {〈 i ∣ f 〉}_{2 D} + \sum_{j = 1}^{N_{λ_{2 D}}} \frac{2 (k^{7} z^{6} x_{j} λ_{j}) (3 - 2 k^{3} z^{2} λ_{j})}{{(1 - k^{3} z^{2} λ_{j})}^{2}} \\ + 4 z^{4} + 8 k^{8} z^{8} {〈 i ∣ f 〉}_{3 D} + \sum_{j = 1}^{N_{λ_{3 D}}} \frac{4 (k^{13} z^{12} x_{j} λ_{j}) (3 - 2 k^{5} z^{4} λ_{j})}{{(1 - k^{5} z^{4} λ_{j})}^{2}} \end{array}

(38)
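The closed-form sums in Equations (37) and (38) are geometric series in the combination u = kzλ. A minimal numerical check for a single eigenvalue term of the 1D filament contribution is sketched below. It assumes ρ(L) = k^{L−1} z^{L} x λ^{L−2} for L ≥ 3, which is the form implied by the 1D terms of Equation (37); the parameter values and function names are illustrative only.

```python
import numpy as np

def brute_force_sums(k, z, lam, x, l_max=2000):
    """Sum rho(L) and L*rho(L) over filament lengths L = 3..l_max.

    rho(L) = k**(L-1) * z**L * x * lam**(L-2) is rewritten as
    k * z**2 * x * u**(L-2), with u = k*z*lam, to avoid overflow.
    """
    u = k * z * lam
    lengths = np.arange(3, l_max + 1)
    rho = k * z**2 * x * u ** (lengths - 2)
    return rho.sum(), (lengths * rho).sum()

def closed_form_sums(k, z, lam, x):
    """Closed forms of the same two sums, as they appear in Equations (37) and (38)."""
    u = k * z * lam
    if not u < 1.0:
        raise ValueError("series diverges unless k*z*lam < 1")
    number = k**2 * z**3 * x * lam / (1.0 - u)
    mass = k**2 * z**3 * x * lam * (3.0 - 2.0 * u) / (1.0 - u) ** 2
    return number, mass

# Illustrative values giving u = 0.6.
number_bf, mass_bf = brute_force_sums(2.0, 0.1, 3.0, 1.0)
number_cf, mass_cf = closed_form_sums(2.0, 0.1, 3.0, 1.0)
mean_length = mass_cf / number_cf   # average L over filaments with L >= 3
```

The ratio of the two sums, in the spirit of Equation (35), grows without bound as u approaches 1, which is the convergence condition quoted after Equation (37).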

Now the average lengths of the aggregates can be computed. Next, the average fraction of the aggregates that are sheet, 〈θ₂〉, is calculated for the Aβ(1–40) model. By plugging Equation (6) for filament, strip, and cube aggregates into Equation (34), 〈θ₂〉 can be written as:

\begin{array}{l} {〈 θ_{2} 〉}_{A β} = \frac{s_{2}}{φ} \sum_{k = 1}^{N_{1 D}} \frac{x_{k} \frac{\partial λ_{k}}{\partial s_{2}} k^{2} z^{3} (3 - 2 k z λ_{k})}{{(1 - k z λ_{k})}^{2}} + \frac{\frac{\partial x_{k}}{\partial s_{2}} k z^{2} (k z λ_{k} + 2 (- 1 + k z λ_{k}) log (1 - k z λ_{k}))}{1 - k z λ_{k}} \\ + \frac{s_{2}}{φ} \sum_{k = 1}^{N_{2 D}} \frac{2 x_{k} \frac{\partial λ_{k}}{\partial s_{2}} k^{7} z^{6} (3 - 2 k^{3} z^{2} λ_{k})}{{(1 - k^{3} z^{2} λ_{k})}^{2}} + \frac{2 \frac{\partial x_{k}}{\partial s_{2}} k^{7} z^{6} λ_{k}}{1 - k^{3} z^{2} λ_{k}} - 4 \frac{\partial x_{k}}{\partial s_{2}} k^{4} z^{4} log (1 - k^{3} z^{2} λ_{k}) \\ + \frac{s_{2}}{φ} \sum_{k = 1}^{N_{3 D}} \frac{4 x_{k} \frac{\partial λ_{k}}{\partial s_{2}} k^{13} z^{12} (3 - 2 k^{5} z^{4} λ_{k})}{{(1 - k^{5} z^{4} λ_{k})}^{2}} \\ + 4 \frac{\partial x_{k}}{\partial s_{2}} k^{8} z^{8} λ_{k} (- 1 + \frac{1}{1 - k^{5} z^{4} λ_{k}} - 2 log (1 - k^{5} z^{4} λ_{k})) \end{array}

(39)

where s₂ is the sheet propagation parameter. The procedure for finding 〈θ₂〉 is quite general, and works for all of the transfer matrices that we have considered in this model.
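The logarithms in Equation (39) are the kind of term produced by sums of the form Σ uⁿ/n. The sketch below checks one such identity numerically and compares it with the 1D boundary term of Equation (39). That the relevant sum is Σ_{n≥1} (n + 2)/n · uⁿ is an inference from the form of Equation (39), not something stated in the text, and the function names are hypothetical.

```python
import math

def log_series_closed_form(u):
    """Closed form of sum_{n>=1} (n + 2)/n * u**n for 0 <= u < 1."""
    return u / (1.0 - u) - 2.0 * math.log(1.0 - u)

def log_series_brute_force(u, n_max=5000):
    """Direct truncated evaluation of the same series."""
    return sum((n + 2) / n * u**n for n in range(1, n_max + 1))

def eq39_filament_boundary_factor(k, z, lam):
    """Factor multiplying (dx_k/ds_2) * k * z**2 in the 1D term of Equation (39)."""
    u = k * z * lam
    return (u + 2.0 * (u - 1.0) * math.log(1.0 - u)) / (1.0 - u)

# Illustrative values with u = k*z*lam = 0.6.
k, z, lam = 2.0, 0.1, 3.0
factor = eq39_filament_boundary_factor(k, z, lam)
series = log_series_closed_form(k * z * lam)
```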

5.5. Comparison to Experiment

In this subsection, our ZB-like model predictions are compared to the experimental results for the CD spectra of Aβ(1–40) fibrils [52] and the AFM measurements of the lengths of α-synuclein fibrils [68]. The CD and the AFM measurements were made at various initial mass concentrations of each protein, when the fibrils had reached a steady state. For the fit of the CD data, we used as our fit function the total fractional amount of sheet structure in all of the aggregates, 〈θ₂〉_Aβ, given by Equation (39). The proteins at the boundaries of aggregates could be coil or sheet for 1D, 2D, and 3D lattices. The fit and the values for P₂, the sheet interaction free energy, K, the free energy describing the polymerization of the aggregate in any direction, R₂, the sheet-coil interfacial free energy, and B, the lateral binding free energy for the q = 2 sheet-coil model, are given in Figure 9a. The fit parameters are then used to predict the average length of the fibrils in the system, 〈L〉, which is illustrated in Figure 9b.

For the α-synuclein model, where L_y = 1, 2, 3 and 4 strip aggregates could be present, we fit the L_y = 4 contribution from Equation (36) to the AFM average length data for the fibrils. The α-synuclein aggregates at equilibrium are illustrated in Figure 8. The fit is illustrated in Figure 9c. The fit parameters are then used to predict the total fraction of aggregates that are sheet, which can be written for the α-synuclein model as:

{〈 θ_{2} 〉}_{α S} = {〈 θ_{2} 〉}_{L_{y} = 1} + {〈 θ_{2} 〉}_{L_{y} = 2} + {〈 θ_{2} 〉}_{L_{y} = 3} + {〈 θ_{2} 〉}_{L_{y} = 4}

(40)

where each contribution in Equation (40) can be calculated using transfer matrices derived from Equation (27) for L_y = 1, ..., 4. Equation (40) is illustrated in Figure 9d. The model predicts the average length data well (note the log scales used in the plot; at low coverage, the predicted points actually fall within the error bars), as expected because the other variations of the Ising-ZB model fit the data well [25,68]. The resulting predictions for 〈θ₂〉_αS illustrate that the concentration at which the fibril concentration takes off is around 15 μM, again as expected [68].

The fit of 〈θ₂〉_Aβ to the CD data predicts that the fibrils are held together tightly due to the relatively large value of the sheet interaction between proteins in the aggregates, P₂, and to a lesser extent due to the binding between filaments as quantified by B. The fitted value for the sheet-coil interface free energy, R₂, indicates that the interfacial tension between sheet and coil regions in the aggregates is modest. However, the fibril concentrations do not really increase from zero until nearly 100 μM according to the model predictions, whereas β-rich fibrils have been observed at lower concentrations, as seen in Figure 9a. The fit could be improved by considering other types of models for the Aβ fibrils as well as different boundary conditions.

The model predictions for the strip model of α-synuclein fibrils seem to agree with the experimentally determined average lengths, as illustrated in Figure 9c, where the boundary conditions were set so that the ends of fibrils could be sheet or coil proteins. Additionally, as illustrated in Figure 9d, the model for synuclein fibrils predicts that the sheet-coil transition of proteins in fibrils largely drives the polymerization process, where ρ_fib/φ and 〈θ₂〉_{L_y = 4} give nearly the same result for the concentrations used in the AFM experiments.

The fits of the AFM data done by van Raaij et al. [68] and Schmit et al. [53] needed only 2 parameters, compared to 3 in the present model. When compared with van Raaij’s fit of the AFM data, our model predicts that the probability that fibrils contain sheet structure is high once a sheet-coil free energy barrier R₂ is overcome, whereas in van Raaij’s model prediction, the free energy barrier between adjacent sheet and coil proteins in the aggregates does not seem to be present. A finite contribution from R₂ means the fibrils will have longer stretches of sheet content than in cases where R₂ is closer to zero, that is, where there is little or no penalty for an interface between coil and sheet regions in the fibrils. Additionally, our model predicts that the inter-filament interactions, B, are slightly weaker than the interactions between sheets along the axis of growth of the fibril, whereas van Raaij’s fit is the other way around: the inter-filament interactions are stronger than those between proteins along the fibril axis of growth.

When fitting the Aβ(1–40) CD data, our model predicts that the value of the polymerizing interaction between proteins on a quasi-1D lattice, K, is small and could be due to modeling uncertainty; thus, fewer parameters may be needed to fit the CD data. The fact that not all adjustable parameters were required to fit CD and AFM data suggests that the model Hamiltonians introduced throughout the paper may be simplified to describe only the relevant interactions quantified by the non-zero fit parameters. It may also imply that the fibrils are mainly held together by a few types of energetic interactions; for example, the inter-filament interaction B for Aβ(1–40) had a finite contribution in the Hamiltonian. Other interactions described by the Hamiltonians could be non-existent or very small. More detailed experimental results are needed to discern the correctness of the models in predicting the values of interaction energy parameters, which in turn describe the dominant interactions within the fibrils.

As mentioned, the data sets were fit using various boundary conditions, and the open boundary case (proteins could adopt either conformation at the boundaries) was found to be the best choice for fitting the average lengths of the α-synuclein fibrils. The CD fits could not be shown to depend on particular boundary conditions since there are currently no AFM measurements of the fibrils to complement the CD data. This means we could fit the CD data using most choices for boundary conditions, including the case where all proteins at the ends of fibrils are in the coil conformation. However, for some choices of boundary conditions [25,67], the corresponding average length predictions yielded unreasonable lengths (not shown) for protein aggregates in vivo.

6. Grand Canonical Approach

The models for fibrils discussed so far do not take into account interactions between protein and solvent, or any free energy associated with nucleus formation. The ZB model for aggregation can be extended to take into account these phenomena by using a grand-canonical model. We summarize several main differences between canonical and grand-canonical approaches: (1) in the grand canonical model, aggregates of all sizes are included; (2) an aggregate phase and solution phase are in equilibrium; (3) chemical potentials can be used to relate the solution phase to the aggregate phase [26].

The solution phase is defined by specifying the chemical potential for protein monomers in solution, which can be written [79–81] as:

μ_{s o l n} = μ_{S T} + μ_{S R} + R T ln c

(41)

where the subscript “S” stands for solution, μ_ST and μ_SR are the free energy contributions arising from the translational and rotational motion of monomers moving in solution, respectively, and c is the concentration of monomers in solution. The aggregate phase is defined by specifying the chemical potential of the aggregates, μ_agg, by assuming a crystalline approximation so that μ_agg can be written as [82]:

μ_{a g g} = μ_{P C} + μ_{P V}

(42)

where “P” stands for polymers of proteins. μ_PC is the free energy contribution arising from the contact interactions between proteins in aggregates, which may vary for different monomer organizations in the aggregates. We assume this term also includes the conformational entropy of the backbone and side-chains. The term μ_PV is the free energy arising from the proteins vibrating about their equilibrium positions, but not molecular internal vibrations within the proteins [79,82]. When the phases are at equilibrium, the chemical potentials for each phase are equal:

μ_{a g g} = μ_{s o l n}

(43)

With the simple statistical mechanical model summarized in the sections to follow, we can relate the chemical potential contribution from the protein interactions in aggregates, μ_PC, to the experimental concentration of protein in solution via Equation (43).
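Equations (41)–(43) can be combined to give μ_PC as a function of the monomer concentration, μ_PC = μ_ST + μ_SR + RT ln c − μ_PV. A minimal sketch follows; it uses the values quoted in Section 7 (μ_ST + μ_SR ≈ −29 kcal/mol and μ_PV ≈ ¾ of that sum) as defaults and assumes that c is expressed in mol/L relative to a 1 M standard state, which the text does not specify. The function name and default temperature are illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)

def mu_pc(c_molar, temperature=298.15, mu_st_plus_sr=-29.0, mu_pv_fraction=0.75):
    """Contact chemical potential (kcal/mol) from mu_agg = mu_soln.

    c_molar: monomer concentration in mol/L (assumed 1 M standard state).
    mu_pv_fraction: mu_PV as a fraction of mu_ST + mu_SR (3/4 in Section 7).
    """
    mu_soln = mu_st_plus_sr + R_KCAL * temperature * math.log(c_molar)  # Equation (41)
    mu_pv = mu_pv_fraction * mu_st_plus_sr
    return mu_soln - mu_pv  # Equations (42) and (43) solved for mu_PC

mu_at_100_uM = mu_pc(100e-6)
mu_at_1_mM = mu_pc(1e-3)
```

Under these assumptions μ_PC increases logarithmically with concentration, so the fugacity z = exp(βμ_PC) used later scales linearly with c.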

As a first step in generalizing the canonical effective-Hamiltonian models presented earlier, the free energy A is introduced to quantify the entropic penalty needed to nucleate the aggregate, i.e., the first column of a strip lattice that contains protein aggregates. This free energy may also be viewed as a boundary between proteins and solvent. The aggregate phase then assumes strip or cube lattices may be occupied by aggregates and any other species, including solvent clusters.

To write down an effective Hamiltonian that can include the free energy A, the 1D or quasi-1D lattices used in constructing fibrils can be generalized to allow solvent clusters to occupy the lattice sites. For example, in Figure 10, a square represents a solvent cluster, whereas a circle represents a protein. Both solvent and proteins can occupy sites along 1D or quasi-1D lattices. By introducing a lattice gas model into the aggregate phase, a Potts Hamiltonian for the 1D lattice that quantifies the interactions between helix, sheet, or coil proteins and solvent can be written as [26]:

- β H_{f i l} = - β H_{p p} - β H_{p s}^{n_{c}}

(44)

- β H_{p p} = \sum_{i = 1}^{N_{T} - 1} {P_{1} δ (t_{i}, 1) + K - R_{1} χ (t_{i}, t_{i + 1})} n_{i} n_{i + 1} - \sum_{i = 1}^{N_{T} - 1} R_{1} χ (n_{i}, n_{i + 1}) [δ (t_{i}, 1) n_{i} + δ (t_{i + 1}, 1) n_{i + 1}]

(45)

- β H_{p s}^{n_{c}} = - \sum_{i = 1}^{N_{T} - n_{c} - 1} A χ (n_{i}, n_{i + n_{c}}) \prod_{j = i + 1}^{i + n_{c} - 1} δ (n_{j}, 1)

(46)

where the lattice-gas variable n_i = 1 refers to a protein-occupied lattice site, and n_i = 0 to a solvent-occupied site. Additionally, “pp” in −βℋ_pp refers to “protein-protein” interactions and “ps” in −βℋ_{ps}^{n_c} refers to “protein-solvent” interactions. The term χ(n_i, n_{i+n_c}) = 1 − δ(n_i, n_{i+n_c}) ensures that there is solvent at site i and a protein at i + n_c, or vice-versa.

Since the number of proteins on the lattice can fluctuate, protein aggregation is described here using the grand canonical ensemble. The lattice-gas formalism, i.e., Equation (44), is able to describe a variety of elongation mechanisms including merging and fracturing of aggregates of different sizes along the 1D lattice. The partition function can be written as:

Q = \sum_{{t}, {n}} exp (- β H_{f i l} + β μ_{P C} N_{p})

(47)

where βμ_PC is the dimensionless chemical potential arising from the contact and interfacial interactions between proteins in aggregates, and where the sum is performed over both spin and lattice-gas variables. Just like in the canonical models, Q may be solved for exactly by a transfer matrix T. A simple example illustrating T for the case n_c = 1 in a sheet-coil (t_i = −1 for coil, t_i = 1 for sheet) system can be written as:

T = \begin{array}{cc|ccc} & t_{i + 1} & & - 1 & 1 \\ & n_{i + 1} & 0 & 1 & 1 \\ t_{i} & n_{i} & & & \\ \hline & 0 & 1 & \sqrt{α} & \sqrt{α σ_{1}} \\ - 1 & 1 & z \sqrt{α} & k z & k z \sqrt{σ_{1}} \\ 1 & 1 & z \sqrt{α σ_{1}} & k z s_{1} \sqrt{σ_{1}} & k z s_{1} \end{array}

(48)

where s₁, σ₁, and k were defined earlier and α ≡ exp(−2A) is a new Zimm-Bragg-like parameter. Additionally, the fugacity is now defined as z ≡ exp(βμ_PC).
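As a rough illustration of how Equation (48) is used, the sketch below assembles the 3 × 3 matrix (states ordered as solvent, coil protein, sheet protein), extracts its largest eigenvalue, and compares ln(tr T^N)/N for a periodic lattice of N sites with ln λ₁. The parameter values and function names are illustrative assumptions, not fitted values or the authors' implementation.

```python
import numpy as np

def lattice_gas_matrix(z, k, s1, sigma1, alpha):
    """Transfer matrix of Equation (48) for n_c = 1, sheet-coil proteins plus solvent.

    Rows label site i, columns site i + 1; order: solvent, coil, sheet.
    """
    ra, rs = np.sqrt(alpha), np.sqrt(sigma1)
    return np.array([
        [1.0,         ra,              ra * rs],     # site i is solvent
        [z * ra,      k * z,           k * z * rs],  # site i is a coil protein
        [z * ra * rs, k * z * s1 * rs, k * z * s1],  # site i is a sheet protein
    ])

def largest_eigenvalue(matrix):
    """Largest real part among the eigenvalues."""
    return float(np.max(np.linalg.eigvals(matrix).real))

params = dict(k=1.5, s1=2.0, sigma1=0.1, alpha=0.2)  # illustrative only

# With z = 0 no protein can occupy the lattice, so only the solvent state survives.
lam_empty = largest_eigenvalue(lattice_gas_matrix(z=0.0, **params))

# Periodic lattice of n_sites: Q = tr T^N, dominated by the largest eigenvalue.
T = lattice_gas_matrix(z=0.5, **params)
lam1 = largest_eigenvalue(T)
n_sites = 200
ln_q_per_site = np.log(np.trace(np.linalg.matrix_power(T, n_sites))) / n_sites
```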

The inter-filament interactions between two 1D filaments are treated using the same methodology introduced in earlier sections. In general, the Hamiltonian for an L_x × L_y strip lattice that includes inter-filament interactions can be written using the 1D Hamiltonian, Equation (44), by changing the spin and lattice-gas variables t_{i} \to t_{i}^{j} and n_{i} \to n_{i}^{j}, respectively, as:

- β H_{s t r i p}^{A} = - \sum_{j = 1}^{L_{y}} β H_{f i l} (j) + F \sum_{i = 1}^{N} \sum_{j = 1}^{L_{y} - 1} δ (t_{i}^{j}, 1) δ (t_{i}^{j + 1}, 1) n_{i}^{j} n_{i}^{j + 1}

(49)

where ℋ_fil(j) refers to the jth filament. For Aβ(1–40), L_y = 2, as illustrated in Figure 5b,c. The parameter F quantifies the interaction energy between two sheet-linked proteins from adjacent filaments, and plays the same role as the free energy B in earlier models for fibrils in this article. In our treatment F > 0, so the proto-fibrils and fibrils are more stable than single filaments.

Since nucleation cannot in reality occur in 1D, we consider a similar model for aggregates that positions the nucleus along the y-axis, as shown in Figure 6a and Figure 11. From this point of view the orientations of proteins in the nucleus are perpendicular to the direction of propagation (x-axis) of the fibrils, and the nucleus is now a multi-layer, 1D aggregate. The nuclei may assemble into proto-fibrils that grow longer on the quasi-1D lattice. An effective Hamiltonian for protein aggregation, including the quasi-1D nucleus, can be written:

- β H_{s t r i p}^{B} = - \sum_{j = 1}^{L_{y}} β H_{p p} (j) - \sum_{j = 1}^{L_{y} - 1} β H_{y} (j) - β H_{n u c}

(50)

- β H_{y} (j) = \sum_{i = 1}^{N_{T}} {F δ (t_{i}^{j}, 1) + K - R_{1} χ (t_{i}^{j}, t_{i}^{j + 1})} n_{i}^{j} n_{i}^{j + 1} - \sum_{i = 1}^{N_{T}} R_{1} χ (n_{i}^{j}, n_{i}^{j + 1}) [δ (t_{i}^{j}, 1) n_{i}^{j} + δ (t_{i}^{j + 1}, 1) n_{i}^{j + 1}]

(51)

- β H_{n u c} = - \sum_{i = 1}^{N_{T} - 1} A \prod_{j = 1}^{L_{y}} χ (n_{i}^{j}, n_{i + 1}^{j}) \prod_{j = 1}^{L_{y} - 1} δ (n_{i}^{j}, n_{i}^{j + 1})

(52)

where −βH_pp(j) is given by Equation (45) after substituting t_{i} \to t_{i}^{j} and n_{i} \to n_{i}^{j}. In the y-direction we write analogous interactions, −βH_y, similar to those in the x-direction. Also included in the y-direction is the nucleus term containing the parameter A, which has the same meaning of a surface energy as before. The effective Hamiltonians given by Equations (49) and (50) are the most general descriptions of fibrils that we have considered so far.

For either description of fibrils (model A or B), the total number of proteins on a strip lattice is then N_{s t r i p} \equiv \sum_{i = 1}^{N_{T}} \sum_{j = 1}^{L_{y}} n_{i}^{j}. The grand partition function can be written as:

Q_{s t r i p}^{A (B)} = \sum_{{t}, {n}} exp (- β H_{s t r i p}^{A (B)} + β μ_{P C} N_{s t r i p})

(53)

where the sums over {t}, {n} are for all i and j, and A, B refers to the effective Hamiltonians given by Equation (49) or (50), respectively. For periodic boundary conditions, Equation (53) can be solved as Q_{s t r i p}^{A (B)} = tr {(T_{s t r i p}^{A (B)})}^{N}, where T_{s t r i p}^{A (B)} is the transfer matrix for the lattice-gas model (A or B). Just as in Subsection 3.1, in the thermodynamic limit N_T → ∞:

{(L_{y} N_{T})}^{- 1} ln Q_{s t r i p}^{A (B)} = ln λ_{1}^{A (B)}

(54)

where λ_{1}^{A (B)} is the largest eigenvalue of T_{s t r i p}^{A (B)}. In general, the dimension of the transfer matrix T_{s t r i p}^{A} is (q + 1)^{n_c L_y} × (q + 1)^{n_c L_y} and it has (q + 1)^{n_c L_y} eigenvalues, whereas the transfer matrix T_{s t r i p}^{B} is (q + 1)^{L_y} × (q + 1)^{L_y} and has (q + 1)^{L_y} eigenvalues.

To compare with experiments, we can define quantities similar to Equations (34) and (36), and others. For example, in the grand canonical ensemble, the average number of proteins on the lattice, 〈N_p〉, referred to as the occupation of the lattice, the number of proteins in filaments, 〈ψ〉, the average number of filaments, 〈γ〉, and the average number of sheet segments, 〈θ〉, can be written as:

〈 N_{p} 〉 \equiv z \frac{\partial}{\partial z} ln Q

(55)

〈 ψ 〉 \equiv \frac{\partial}{\partial K} ln Q + 〈 γ 〉

(56)

〈 θ 〉 \equiv \frac{\partial}{\partial P_{1}} ln Q

(57)

〈 γ 〉 \equiv \frac{1}{2} \frac{\partial}{\partial A} ln Q

(58)

respectively. Other quantities may also be defined including the average lengths of filaments, and the average length of sheet stretches in aggregates [26,83].
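In the thermodynamic limit, ln Q per site reduces to ln λ₁, so the derivatives in Equations (55) and (57) can be estimated by finite differences of ln λ₁. The sketch below does this for the 1D, n_c = 1 matrix of Equation (48). It assumes s₁ = exp(P₁), so that ∂/∂P₁ becomes a derivative with respect to ln s₁; that relation is defined earlier in the paper and is not restated in this section. The default parameter values, step size, and function names are illustrative.

```python
import numpy as np

def ln_lambda1(ln_z, ln_s1, k=1.5, sigma1=0.1, alpha=0.2):
    """ln of the largest eigenvalue of the Equation (48) matrix (illustrative defaults)."""
    z, s1 = np.exp(ln_z), np.exp(ln_s1)
    ra, rs = np.sqrt(alpha), np.sqrt(sigma1)
    T = np.array([
        [1.0,         ra,              ra * rs],
        [z * ra,      k * z,           k * z * rs],
        [z * ra * rs, k * z * s1 * rs, k * z * s1],
    ])
    return float(np.log(np.max(np.linalg.eigvals(T).real)))

def per_site_averages(ln_z, ln_s1, h=1e-5):
    """Central-difference estimates of <N_p>/N (Eq. 55) and <theta>/N (Eq. 57)."""
    occupancy = (ln_lambda1(ln_z + h, ln_s1) - ln_lambda1(ln_z - h, ln_s1)) / (2 * h)
    sheet = (ln_lambda1(ln_z, ln_s1 + h) - ln_lambda1(ln_z, ln_s1 - h)) / (2 * h)
    return occupancy, sheet

occ_lo, sheet_lo = per_site_averages(ln_z=-1.0, ln_s1=0.5)
occ_hi, sheet_hi = per_site_averages(ln_z=1.0, ln_s1=0.5)
sheet_fraction_hi = sheet_hi / occ_hi   # ratio <theta>/<N_p>
```

The ratio computed on the last line is the kind of quantity that can be compared with a measured β-sheet fraction once z is tied to the experimental concentration through the chemical potential.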

7. Comparison to Experiment

We solve for μ_PC for Aβ(1–40) from Equation (43) in terms of μ_ST, μ_SR, μ_PV, and the experimental concentration c. Then, μ_PC is plugged into Equation (53), from which Equation (57) and other thermodynamic quantities can be calculated. For Aβ(1–40), we have μ_ST + μ_SR ≈ −29 kcal/mol [79,81]. In reference [79], μ_PV for hemoglobin was found to be approximately \frac{3}{4} (μ_{S T} + μ_{S R}). Mutated hemoglobin does polymerize as amyloid, but amyloid proteins are often natively unstructured, unlike hemoglobin. We nevertheless use a similar result for μ_PV for Aβ(1–40) and Curli. Equation (57) divided by Equation (55), 〈θ〉/〈N_p〉, the β-sheet fraction, is used as our fitting function. The results are plotted in Figure 12a for Aβ(1–40) fibrils, and Figure 12b for the Curli fibrils. The fit yields reasonable free energies at room temperature for the Aβ(1–40) fibrils, P₁ ≈ K ≈ A ≈ 0 kcal/mol, R₁ = 0.35 kcal/mol, and F = 16.4 kcal/mol. For the Curli fibrils, we found P₁ = 7.26 kcal/mol, K = 2.2 kcal/mol, R₁ ≈ 0 kcal/mol, and A = 1.2 kcal/mol [26]. Clearly for the experiment involving Aβ(1–40) fibrils, the grand canonical approach to modeling fibrils does a better job than earlier canonical approaches. The grand-canonical model also suggests that the Aβ fibrils are more strongly held together by inter-filament interactions when compared to the fit from the canonical model, and also that the penalty in going from sheet to coil regions in the aggregates is very small and could be due to modeling uncertainty. The minimum number of parameters needed to fit the CD data is also the same as in the van Raaij and Schmit models, and if the value of R₁ in the Aβ fit is taken to be within modeling uncertainty, then only one parameter is needed to fit the CD data.

8. Conclusions

By focusing on the aggregation of proteins in forming oligomers, protofibrils, and fibrils, and their relations to neurodegenerative diseases, statistical mechanical approaches to protein aggregation have been developed. We have made a general summary of the field, presenting recent formulations of the ZB model based on the canonical, as well as the grand canonical, approaches to the amyloid formation processes. Some results are presented to show that these models can be used to interpret experimental observations as well as to provide phase diagrams [26] showing the parameter dependence of the β-sheet dominating regions.

More experimental data like the CD results of Terzi et al. [52] and the AFM measurements performed by van Raaij et al. [68] would help validate the ZB approach. For example, the ZB model for protein folding has been used to classify all the amino acids based on their propensity to fold from coil to helix [12]. Similar classification schemes could potentially be devised for the many proteins that can aggregate to form fibrils if a much larger collection of experimental data (like the CD and AFM results) were available.

Of course, a statistical mechanical approach to protein aggregation has serious limitations. It can only be used to study the equilibrium properties of the systems, not the rates of the processes involved nor the transient behaviors, such as quasi-equilibrium or kinetic trapping. Furthermore, the available experimental data that we can compare our theories with are so far extremely limited. Therefore, statistical mechanical models are not a tool for predicting assembly pathways. However, statistical mechanics can be used to show that experimental observations are consistent with the predictions of a certain route of aggregation, that is, given a route of aggregation and the associated effective free energy, statistical mechanical models can predict equilibrium distributions of oligomers, protofibrils, fibrils, etc., which can then be compared to observed data.

The pathway prediction function is better achieved by using kinetic models [7,41,42,85,86], or better still, by molecular dynamics simulations [1,2,32,33,87,88]. Unfortunately, the latter are highly restricted by the system size and length of time that molecular dynamics can be used to simulate. On the other hand, protein aggregation, as discussed earlier, is a complex process spanning many levels of structure and many chemical species. Thus, at present, the kinetic models or coarse-graining models may be better tools for the purpose of predicting pathways. Moreover, many more experimental data are available in the literature for non-equilibrium or kinetic studies. It would be interesting, for example, to develop a kinetic approach based on statistical mechanics, similar to a kinetic Ising model derived from an Ising model. The kinetic study of protein aggregation is a rich field of investigation and of great current interest. We hope to be able to report progress in the non-equilibrium studies of protein aggregation in future work.

Acknowledgements

We would like to thank Frank Ferrone, Brigita Urbanc, and J. van Gestel for stimulating discussions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Thirumalai, D.; Klimov, D.; Dima, R. Emerging ideas on the molecular basis of protein and peptide aggregation. Curr. Opin. Struct. Biol. 2003, 13, 146–159.
2. Ma, B.; Nussinov, R. Simulations as analytical tools to understand protein aggregation and predict amyloid conformation. Curr. Opin. Chem. Biol. 2006, 10, 445–452.
3. Li, M.; Klimov, D.; Straub, J.; Thirumalai, D. Probing the mechanisms of fibril formation using lattice models. J. Chem. Phys. 2008, 129, 175101:1–175101:10.
4. Urbanc, B.; Betnel, M.; Cruz, L.; Bitan, G.; Teplow, D.B. Elucidation of amyloid beta-protein oligomerization mechanisms: Discrete molecular dynamics study. J. Am. Chem. Soc. 2010, 132, 4266–4280.
5. Pallitto, M.M.; Murphy, R.M. A mathematical model of the kinetics of beta-amyloid fibril growth from the denatured state. Biophys. J. 2001, 81, 1805–1822.
6. Powers, E.T.; Powers, D.L. Mechanisms of protein fibril formation: Nucleated polymerization with competing off-pathway aggregation. Biophys. J. 2008, 94, 379–391.
7. Knowles, T.; Waudby, C.; Devlin, G.; Cohen, S.; Aguzzi, A.; Vendruscolo, M.; Terentjev, E.; Welland, M.; Dobson, C. An analytical solution to the kinetics of breakable filament assembly. Science 2009, 326, 1533–1537.
8. Zimm, B.; Bragg, J. Theory of the phase transition between helix and random coil in polypeptide chains. J. Chem. Phys. 1959, 31, 526–535.
9. Lifson, S.; Roig, A. On the theory of helix-coil transition in polypeptides. J. Chem. Phys. 1961, 34, 1963–1974.
10. Poland, D.; Scheraga, H. Theory of Helix-Coil Transitions in Biopolymers; Academic Press: New York, NY, USA, 1970.
11. Bloomfield, V. Statistical thermodynamics of helix-coil transitions in biopolymers. Am. J. Phys. 1999, 67, 1212–1215.
12. Scheraga, H.; Vila, J.; Ripoll, D. Helix-coil transitions re-visited. Biophys. Chem. 2002, 101, 255–265.
13. Chen, Y.; Zhou, Y.; Ding, J. The helix-coil transition revisited. Proteins 2007, 69, 58–68.
14. Baxter, R. Exactly Solved Models in Statistical Mechanics; Dover Books on Physics Series; Dover Publications: San Diego, CA, USA, 2008.
15. Wako, H.; Saitô, N. Statistical mechanical theory of the protein conformation. I. General considerations and the application to homopolymers. J. Phys. Soc. Jpn. 1978, 44, 1931–1938.
16. Muñoz, V.; Thompson, P.A.; Hofrichter, J.; Eaton, W.A. Folding dynamics and mechanism of β-hairpin formation. Nature 1997, 390, 196–199.
17. Mattice, W.; Scheraga, H. Matrix formulation of the transition from a statistical coil to an intramolecular antiparallel β sheet. Biopolymers 2004, 23, 1701–1724.
18. Mattice, W. The beta-sheet to coil transition. Ann. Rev. Biophys. Biophys. Chem. 1989, 18, 93–111.
19. Sun, J.; Doig, A. A statistical mechanical model for β-sheet formation. J. Phys. Chem. B 2000, 104, 1826–1836.
20. Hong, L. A statistical mechanical model for antiparallel β-sheet/coil equilibrium. J. Chem. Phys. 2008, 129, 225101:1–225101:7.
21. Schreck, J.; Yuan, J. Exactly solvable model for helix-coil-sheet transitions in protein systems. Phys. Rev. E 2010, 81, 061919:1–061919:4.
22. Mattice, W.; Scheraga, H. Suppression of the statistical coil state during the α–β transition in homopolypeptides. Biopolymers 2004, 23, 2879–2890.
23. Hong, L.; Lei, J. Statistical mechanical model for helix-sheet-coil transitions in homopolypeptides. Phys. Rev. E 2008, 78, 051904:1–051904:6.
24. Van Gestel, J.; van der Schoot, P.; Michels, M.A.J. Helical transition of polymer-like assemblies in solution. J. Phys. Chem. B 2001, 105, 10691–10699.
25. Van Gestel, J.; de Leeuw, S. A statistical-mechanical theory of fibril formation in dilute protein solutions. Biophys. J. 2006, 90, 3134–3145.
26. Schreck, J.S.; Yuan, J. A statistical mechanical approach to protein aggregation. J. Chem. Phys. 2011, 135, 235102:1–235102:12.
27. Bitan, G.; Kirkitadze, M.D.; Lomakin, A.; Vollers, S.S.; Benedek, G.B.; Teplow, D.B. Amyloid β-protein (Aβ) assembly: Aβ(1–40) and Aβ(1–42) oligomerize through distinct pathways. Proc. Natl. Acad. Sci. USA 2003, 100, 330–335.
28. Yong, W.; Lomakin, A.; Kirkitadze, M.; Teplow, D.; Chen, S.; Benedek, G. Structure determination of micelle-like intermediates in amyloid β-protein fibril assembly by using small angle neutron scattering. Proc. Natl. Acad. Sci. USA 2002, 99, 150–154.
29. Lambert, M.; Barlow, A.; Chromy, B.; Edwards, C.; Freed, R.; Liosatos, M.; Morgan, T.; Rozovsky, I.; Trommer, B.; Viola, K.; et al. Diffusible, nonfibrillar ligands derived from Aβ1–42 are potent central nervous system neurotoxins. Proc. Natl. Acad. Sci. USA 1998, 95, 6448–6453.
30. Huang, T.; Yang, D.; Plaskos, N.; Go, S.; Yip, C.; Fraser, P.; Chakrabartty, A. Structural studies of soluble oligomers of the Alzheimer β-amyloid peptide. J. Mol. Biol. 2000, 297, 73–87.
31. Goldschmidt, L.; Teng, P.K.; Riek, R.; Eisenberg, D. Identifying the amylome, proteins capable of forming amyloid-like fibrils. Proc. Natl. Acad. Sci. USA 2010, 107, 3487–3492.
32. Straub, J.E.; Thirumalai, D. Toward a molecular theory of early and late events in monomer to amyloid fibril formation. Ann. Rev. Phys. Chem. 2011, 62, 437–463.
33. Morriss-Andrews, A.; Shea, J.E. Kinetic pathways to peptide aggregation on surfaces: The effects of β-sheet propensity and surface attraction. J. Chem. Phys. 2012, 136, 065103:1–065103:11.
34. Roychaudhuri, R.; Yang, M.; Hoshi, M.; Teplow, D. Amyloid β-protein assembly and Alzheimer disease. J. Biol. Chem. 2009, 284, 4749–4753.
35. Baskakov, I.; Legname, G.; Baldwin, M.; Prusiner, S.; Cohen, F. Pathway complexity of prion protein assembly into amyloid. J. Biol. Chem. 2002, 277, 21140–21148.
36. Haass, C.; Selkoe, D. Soluble protein oligomers in neurodegeneration: Lessons from the Alzheimer’s amyloid β-peptide. Nat. Rev. Mol. Cell Biol. 2007, 8, 101–112.
37. Hoshi, M.; Sato, M.; Matsumoto, S.; Noguchi, A.; Yasutake, K.; Yoshida, N.; Sato, K. Spherical aggregates of β-amyloid (amylospheroid) show high neurotoxicity and activate tau protein kinase I/glycogen synthase kinase-3β. Proc. Natl. Acad. Sci. USA 2003, 100, 6370–6375.
38. Kheterpal, I.; Wetzel, R. Hydrogen/deuterium exchange mass spectrometry: A window into amyloid structure. Acc. Chem. Res. 2006, 39, 584–593.
39. Hartley, D.; Walsh, D.; Chianping, P.; Diehl, T.; Vasquez, S.; Vassilev, P.; Teplow, D.; Selkoe, D. Protofibrillar intermediates of amyloid β-protein induce acute electrophysiological changes and progressive neurotoxicity in cortical neurons. J. Neurosci. 1999, 19, 8876–8884.
40. Harper, J.; Wong, S.; Lieber, C.; Lansbury, P., Jr. Assembly of Aβ amyloid protofibrils: An in vitro model for a possible early event in Alzheimer’s disease. Biochemistry 1999, 38, 8972–8980.
41. Hong, L.; Yong, W.A. Simple moment-closure model for the self-assembly of breakable amyloid filaments. Biophys. J. 2013, 104, 533–540.
42. Schreck, J.S.; Yuan, J.M. A kinetic study of amyloid formation: Fibril growth and length distributions. J. Phys. Chem. B 2013, 107, 6574–6583.
43. Serpell, L.; Blake, C.; Fraser, P. Molecular structure of a fibrillar Alzheimer’s Aβ fragment. Biochemistry 2000, 39, 13269–13275.
44. Lashuel, H.; Wurth, C.; Woo, L.; Kelly, J. The most pathogenic transthyretin variant, L55P, forms amyloid fibrils under acidic conditions and protofilaments under physiological conditions. Biochemistry 1999, 38, 13560–13573.
45. Makin, O.; Serpell, L. Structures for amyloid fibrils. FEBS J. 2005, 272, 5950–5961.
46. Urbanc, B. Drexel University, Philadelphia, PA, USA. Unpublished work, 2012.
47. Ward, R.; Jennings, K.; Jepras, R.; Neville, W.; Owen, D.; Hawkins, J.; Christie, G.; Davis, J.; George, A.; Karran, E.; et al. Fractionation and characterization of oligomeric, protofibrillar and fibrillar forms of beta-amyloid peptide. Biochem. J. 2000, 348, 137–144.
48. Nelson, R.; Sawaya, M.; Balbirnie, M.; Madsen, A.; Riekel, C.; Grothe, R.; Eisenberg, D. Structure of the cross-β spine of amyloid-like fibrils. Nature 2005, 435, 773–778.
49. Eisenberg, D.; Jucker, M. The amyloid state of proteins in human diseases. Cell 2012, 148, 1188–1203.
50. Petkova, A.T.; Yau, Y.M.; Tycko, R. Experimental constraints on quaternary structure in Alzheimer’s beta-amyloid fibrils. Biochemistry 2006, 45, 498–512.
51. Oosawa, F.; Kasai, M. Theory of linear and helical aggregations of macromolecules. J. Mol. Biol. 1962, 4, 10–21.
52. Terzi, E.; Hölzemann, G.; Seelig, J. Self-association of β-amyloid peptide (1–40) in solution and binding to lipid membranes. J. Mol. Biol. 1995, 252, 633–642.
53. Schmit, J.; Ghosh, K.; Dill, K. What drives amyloid molecules to assemble into oligomers and fibrils? Biophys. J. 2011, 100, 450–458.
54. Lee, C.F. Self-assembly of protein amyloids: A competition between amorphous and ordered aggregation. Phys. Rev. E 2009, 80, 031922:1–031922:5.
55. Nyrkova, I.A.; Semenov, A.N.; Aggeli, A.; Bell, M.; Boden, N.; McLeish, T.C.B. Self-assembly and structure transformations in living polymers forming fibrils. Eur. Phys. J. B 2000, 17, 499–513.
56. Van der Schoot, P.; Michels, M.A.J.; Brunsveld, L.; Sijbesma, R.P.; Ramzi, A. Helical transition and growth of supramolecular assemblies of chiral discotic molecules. Langmuir 2000, 16, 10076–10083.
57. Kunes, K.C.; Cox, D.L.; Singh, R.R.P. One dimensional model of yeast prion aggregation. Phys. Rev. E 2005, 72, 051915:1–051915:8.
58. Nicodemi, M.; de Candia, A.; Coniglio, A. Aggregation of fibrils and plaques in amyloid molecular systems. Phys. Rev. E 2009, 80, 041914:1–041914:4.
59. Badasyan, A.V.; Hayrapetyan, G.N.; Tonoyan, S.A.; Mamasakhlisov, Y.S.; Benight, A.S.; Morozov, V.R. Intersegment interactions and helix-coil transition within the generalized model of polypeptide chains approach. J. Chem. Phys. 2009, 131, 1115104:1–1115104:8.
60. Zamparo, M.; Trovato, A.; Maritan, A. Simplified exactly solvable model for β-amyloid aggregation. Phys. Rev. Lett. 2010, 105, 108102:1–108102:4.
61. Schellman, J.A. The factors affecting the stability of hydrogen-bonded polypeptide structures in solution. J. Phys. Chem. 1958, 62, 1485–1494.
62. Qian, H.; Schellman, J.A. Helix-coil theories: A comparative study for finite length polypeptides. J. Phys. Chem. 1992, 96, 3987–3994.
63. Qian, H. A thermodynamic model for helix-coil transition coupled to dimerization of short coiled-coil peptides. Biophys. J. 1994, 67, 349–355.
64. Kramers, H.A.; Wannier, G.H. Statistics of the two-dimensional ferromagnet. Part I. Phys. Rev. 1941, 60, 252–262.
65. Onsager, L. Crystal statistics. I. A two-dimensional model with an order-disorder transition. Phys. Rev. 1944, 65, 117–149.
66. Oosawa, F.; Asakura, S. Thermodynamics of the Polymerization of Protein; Academic Press: New York, NY, USA, 1975.
67. Van Gestel, J.; van der Schoot, P.; Michels, M.A.J. Role of end effects in helical aggregation. Langmuir 2003, 19, 1375–1383.
68. Van Raaij, M.E.; van Gestel, J.; Segers-Nolten, I.M.J.; de Leeuw, S.W.; Subramaniam, V. Concentration dependence of α-synuclein fibril length assessed by quantitative atomic force microscopy and statistical-mechanical theory. Biophys. J. 2008, 95, 4871–4878.
69. Sarroukh, R.; Cerf, E.; Derclaye, S.; Dufrêne, Y.; Goormaghtigh, E.; Ruysschaert, J.; Raussens, V. Transformation of amyloid β(1–40) oligomers into fibrils is characterized by a major change in secondary structure. Cell. Mol. Life Sci. 2011, 68, 1429–1438.
70. Grosberg, A.; Khokhlov, A.; Atanov, Y. Statistical Physics of Macromolecules; AIP Press: New York, NY, USA, 1994.
71. Marini, D.M.; Hwang, W.; Lauffenburger, D.A.; Zhang, S.; Kamm, R.D. Left-handed helical ribbon intermediates in the self-assembly of a β-sheet peptide. Nano Lett. 2002, 2, 295–299.
72. Takahashi, Y.; Ueno, A.; Mihara, H. Mutational analysis of designed peptides that undergo structural transition from α helix to β sheet and amyloid fibril formation. Structure 2000, 8, 915–925.
73. Takahashi, T.; Mihara, H. Peptide and protein mimetics inhibiting amyloid β-peptide aggregation. Acc. Chem. Res. 2008, 41, 1309–1318.
74. Wu, F.Y. The Potts model. Rev. Mod. Phys. 1982, 54, 235–268.
75. Shankar, G.M.; Li, S.; Mehta, T.H.; Garcia-Muñoz, A.; Shepardson, N.E.; Smith, I.; Brett, F.M.; Farrell, M.A.; Rowan, M.J.; Lemere, C.A.; et al. Amyloid-β protein dimers isolated directly from Alzheimer’s brains impair synaptic plasticity and memory. Nat. Med. 2008, 14, 837–842.
76. Skolnick, J.; Holtzer, A. Theory of helix-coil transitions of α-helical, two-chain, coiled coils. Macromolecules 1985, 15, 303–314.
77. Hausrath, C.A. A model for the coupling of α-helix and tertiary contact formation. Protein Sci. 2006, 15, 2051–2061.
78. Ghosh, K.; Dill, K.A. Theory for protein folding cooperativity: Helix bundles. J. Am. Chem. Soc. 2009, 131, 2306–2312.
79. Ferrone, F.A. Nucleation: The connections between equilibrium and kinetic behavior. Methods Enzymol. 2006, 412, 285–299.
80. Cao, Z.; Ferrone, F. Homogeneous nucleation in sickle hemoglobin: Stochastic measurements with a parallel method. Biophys. J. 1997, 72, 343–352.
81. Hill, T. An Introduction to Statistical Thermodynamics; Dover Publications: New York, NY, USA, 1987.
82. Abraham, F.F. Homogeneous Nucleation Theory; Academic Press: New York, NY, USA, 1974.
83. Hong, L.; Qi, X.; Zhang, Y. Dissecting the kinetic process of amyloid fiber formation through asymptotic analysis. J. Phys. Chem. B 2011, 116, 6611–6617.
84. Hammer, N.D.; Schmidt, J.C.; Chapman, M.R. The curli nucleator protein, CsgB, contains an amyloidogenic domain that directs CsgA polymerization. Proc. Natl. Acad. Sci. USA 2007, 104, 12494–12499.
85. Vitalis, A.; Pappu, R.V. Assessing the contribution of heterogeneous distributions of oligomers to aggregation mechanisms of polyglutamine peptides. Biophys. Chem. 2011, 159, 14–23.
86. Ricchiuto, P.; Brukhno, A.V.; Auer, S. Protein aggregation: Kinetics versus thermodynamics. J. Phys. Chem. B 2012, 116, 5384–5390.
87. Peng, S.; Ding, F.; Urbanc, B.; Buldyrev, S.; Cruz, L.; Stanley, H.; Dokholyan, N. Discrete molecular dynamics simulations of peptide aggregation. Phys. Rev. E 2004, 69, 041908:1–041908:7.
88. Hall, C.K.; Nguyen, H.D.; Marchut, A.J.; Wagoner, V. Simulations of Protein Aggregation. In Misbehaving Proteins; Springer: New York, NY, USA, 2006; pp. 47–77.

Figure 1. (a) A linear model for aggregation where M-mers grow one monomer at a time in a one-dimensional fashion; (b) A model for Aβ(1–42) wild-type proteins aggregating into fibrils. Monomers assemble until reaching a critical concentration, for example, a hexamer, as illustrated. Hexamers then aggregate into more complex structures such as protofibrils, and eventually full fibrils; and (c) a model for Aβ(1–40) wild-type protein aggregation where monomers join to form dimers, which may grow into protofibrils and eventually fibrils.

Figure 2. In both (a) and (b), the illustrated cross-β structure is the sequence segment GNNQQNY from the prion Sup35. Carbon atoms are purple or grey/white, oxygen atoms are red, and nitrogen atoms are blue. In (a), the cross-β structure is illustrated. Grey arrows represent the backbone of a β-strand, and the side chains are shown projecting from the strands. Purple arrows represent the strands residing in the back of the structure. The regions between the strands are referred to as the dry interfaces, whereas the regions just outside the strands are wet interfaces. The fibril axis is indicated by an arrow running through the dry regions between the strands; (b) Side view of the fibril. The H-bonds are formed between red carboxyl groups and blue amide groups from adjacent layers; in (c), a top view of the fibrils shows the interdigitation of two β-sheets, referred to as the steric zipper. Within the steric zipper, water molecules are absent (a red plus sign indicates water). Both images are reprinted from Nelson et al. [48,49].

Figure 3. Cartoon illustrations of an Aβ protofibril are shown. Left: Looking down the axis of the fibril (z-axis); Right: A side view of the protofibril illustrating the twisted, helical pattern. Proteins are spaced at ≈5 Å, and the chiral twist of 0.833 degree/Å was arbitrarily chosen for illustrative purposes. Image reprinted from Petkova et al. [50].

Figure 4. A ZB model for protein aggregation is illustrated, where the proteins (circles) in aggregates can be coil (white), sheet (black), or helix (red marked with X) in conformation. The free energies associated with each conformation are listed, as well as the interfacial free energies R₁, R₂, and R₃ between helix-coil, sheet-coil, or helix-sheet regions, respectively.

Figure 5. (a) Front-view (y-z plane) of a strip lattice representing an aggregate of Aβ(1–40) proteins. Black circles correspond to coil proteins while white circles denote sheet or helix proteins. Dashed lines represent interactions between proteins in the y-direction whereas solid lines are the interactions between proteins along the x-axis; (b) Side-view (x-y plane) of Aβ(1–40) proteins illustrating the steric zipper; and (c) strip lattice representation of the Aβ(1–40) proteins, where the parameters B and K are illustrated.

Figure 6. (a) Lattice representation and transfer matrix of α-synuclein fibrils composed of 4 filaments; (b) The fibril for Aβ is composed of two protofibrils, each represented by a strip lattice and spin-variables s and t. The strips are stacked in-register along the z-axis, and the parameter B (dashed green lines) represents the bonds between sheet proteins in protofibrils or fibrils.

Figure 7. (a) A strip lattice with periodic boundary conditions along the y-axis is used to model the nucleus for Aβ(1–42) as a hexamer. The strip has L_y monomers per column in general. B (blue, dotted lines) is the free energy contribution from the interaction between proteins s_{i}^{j} and s_{i}^{j+1} that are both sheet along the y-axis; (b) Oligomers aggregate along the x-axis with a total of L_x sites, where L_x → ∞ is the thermodynamic limit.

Figure 8. Partial lists of chemical species that may exist in dynamic equilibrium with fibrils for Aβ(1–40) (top) and α-synuclein (bottom). In the Aβ model, the different types of aggregates that could be present at equilibrium are 1D filaments, strips of length L_y = 2 that represent protofibrils, and 3D cubes that represent fibrils. The cubes are composed of two identical proto-fibrils stacked in-register. In the model for α-synuclein aggregates, L_y = 1, 2, 3, and 4 strip lattices are used to describe the aggregates at equilibrium, with the L_y = 4 strip lattice representing the fibril. For both Aβ(1–40) and α-synuclein, we assumed n_c = 2.

Figure 9. (a) Plot of 〈θ₂〉 for 1D, 2D, and 3D structures in the Aβ(1–40) model. Black dots represent the CD data from Terzi et al. [52], to which we fit the total fraction of sheet proteins in aggregates of any species; (b) Predicted average lengths, 〈L〉, of the Aβ(1–40) fibrils using the fit parameters found in (a); In plot (c), the AFM data for the α-synuclein fibrils are plotted as black dots, along with the fit function 〈L〉 [68]. We fit 〈L〉 using the L_y = 4 strip lattice model; In (d), 〈θ₂〉 for α-synuclein (solid, purple curve) is compared with ρ_fib/φ (dashed, black curve) by using the fit parameters found in (c); In (a) and (b), the fit parameters for the Aβ(1–40) model were: P₁ = 7.41RT, B = 1.4RT, R₂ = −2.47RT, and K = 0.45RT. In (a), η = 1 − (z + z² + z⁴)/φ; In (c) and (d), the fit parameters for the α-synuclein model were P₁ = K = 2.7RT, B = 1.95RT and R₂ = −1.64RT.

Figure 10. Summary of protein conformation energies. A site could be occupied with a solvent cluster, denoted by n = 0 (square), or a protein, n = 1 (circles). Proteins may assume a particular conformation (sheet, black/solid circle; coil, white circle). A dilute q = 2 Potts model for sheet-coil conformations is shown, where n_c = 1 and the free energies P₁, K, R, and A are illustrated.

Figure 11. Proteins or solvent clusters may occupy lattice sites, where the front-view (y–z plane) of an aggregate of Aβ(1–40) proteins is shown along with the interactions between proteins and solvent clusters. The n_c = 2 nucleus is represented by dashed-dotted lines (free energy A denoting the nucleation). Dotted and solid lines illustrate interactions between sheet proteins. Double solid lines illustrate a protein-solvent interface. Dashed (blue) lines have no meaning.

Figure 12. (a) 〈θ〉/〈N_p〉 is fitted to the results of the Terzi et al. experiment [52] involving Aβ(1–40) aggregates; (b) The fraction of sheet proteins in curli fibrils is fitted to the scaled results of the Hammer et al. experiment [84]. In (a), the fit parameters were P₁ ≈ K ≈ A ≈ 0 kcal/mol, R₁ = 0.35 kcal/mol, and F = 16.4 kcal/mol, while in (b) we have P₁ = 7.26 kcal/mol, K = 2.2 kcal/mol, R₁ ≈ 0 kcal/mol, and A = 1.2 kcal/mol [26]. In (a) we used case B of the strip models with n_c = 2, whereas in (b) we used the 1D model with n_c = 2 for aggregation. In both cases q = 2.

**Table 1.** Summary of the ZB weights for two residues that are adjacent to each other in a protein.

| Residue j − 1 | Residue j | Weight |
|---|---|---|
| c | c | 1 |
| h | c | 1 |
| c | h | σs |
| h | h | s |
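The weights in Table 1 can be arranged into a 2 × 2 transfer matrix whose repeated application gives the helix-coil partition function. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, the chain is assumed to be preceded by a fictitious coil residue (so that the first helical residue carries the nucleation weight σs), and the helix fraction is estimated by a numerical derivative, θ = (1/N) ∂ln Z/∂ln s. A brute-force sum over all 2^N conformations is included only as a cross-check for short chains.

```python
import itertools
import math


def zb_transfer_matrix(s, sigma):
    """Table 1 weights as a matrix W[prev][curr], with index 0 = c, 1 = h."""
    return [[1.0, sigma * s],   # previous residue coil:  cc = 1, ch = sigma*s
            [1.0, s]]           # previous residue helix: hc = 1, hh = s


def partition_function(n_res, s, sigma):
    """Z for n_res residues, assuming a fictitious coil residue before the chain."""
    w = zb_transfer_matrix(s, sigma)
    row = [1.0, 0.0]            # boundary condition: start in the coil state
    for _ in range(n_res):
        row = [row[0] * w[0][0] + row[1] * w[1][0],
               row[0] * w[0][1] + row[1] * w[1][1]]
    return row[0] + row[1]


def partition_function_brute_force(n_res, s, sigma):
    """Same Z by explicit enumeration; only practical for short chains."""
    w = zb_transfer_matrix(s, sigma)
    total = 0.0
    for config in itertools.product((0, 1), repeat=n_res):
        prev, weight = 0, 1.0
        for state in config:
            weight *= w[prev][state]
            prev = state
        total += weight
    return total


def helix_fraction(n_res, s, sigma, rel_step=1e-6):
    """theta = (1/N) d ln Z / d ln s via a central finite difference."""
    up = math.log(partition_function(n_res, s * (1.0 + rel_step), sigma))
    down = math.log(partition_function(n_res, s * (1.0 - rel_step), sigma))
    d_ln_s = math.log(1.0 + rel_step) - math.log(1.0 - rel_step)
    return (up - down) / d_ln_s / n_res
```

For example, `helix_fraction(50, 1.2, 1e-3)` can be compared with `helix_fraction(50, 0.8, 1e-3)` to see how a small σ sharpens the transition around s = 1; in the non-cooperative limit σ = 1 the partition function reduces to (1 + s)^N.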

© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
