Next Article in Journal
A Metabolite Perspective on the Involvement of the Gut Microbiota in Type 2 Diabetes
Previous Article in Journal
Prokaryotic Argonaute Proteins: A New Frontier in Point-of-Care Viral Diagnostics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Three-Dimensional Modeling of CpG DNA Binding with Matrix Lumican Shows Leucine-Rich Repeat Motif Involvement as in TLR9-CpG DNA Interactions

1
Department of Ophthalmology, NYU Grossman School of Medicine, New York, NY 10016, USA
2
Department of Pathology, NYU Grossman School of Medicine, New York, NY 10016, USA
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(19), 14990; https://doi.org/10.3390/ijms241914990
Submission received: 22 August 2023 / Revised: 26 September 2023 / Accepted: 27 September 2023 / Published: 8 October 2023
(This article belongs to the Section Molecular Immunology)

Abstract

:
Lumican is an extracellular matrix proteoglycan known to regulate toll-like receptor (TLR) signaling in innate immune cells. In experimental settings, lumican suppresses TLR9 signaling by binding to and sequestering its synthetic ligand, CpG-DNA, in non-signal permissive endosomes. However, the molecular details of lumican interactions with CpG-DNA are obscure. Here, the 3-D structure of the 22 base-long CpG-DNA (CpG ODN_2395) bound to lumican or TLR9 were modeled using homology modeling and docking methods. Some of the TLR9-CpG ODN_2395 features predicted by our model are consistent with the previously reported TLR9-CpG DNA crystal structure, substantiating our current analysis. Our modeling indicated a smaller buried surface area for lumican-CpG ODN_2395 (1803 Å2) compared to that of TLR9-CpG ODN_2395 (2094 Å2), implying a potentially lower binding strength for lumican and CpG-DNA than TLR9 and CpG-DNA. The docking analysis identified 32 amino acids in lumican LRR1–11 interacting with CpG ODN_2395, primarily through hydrogen bonding, salt-bridges, and hydrophobic interactions. Our study provides molecular insights into lumican and CpG-DNA interactions that may lead to molecular targets for modulating TLR9-mediated inflammation and autoimmunity.

1. Introduction

Lumican is a member of the small leucine-rich repeat proteoglycan (SLRP) family, widely present in various extracellular matrix (ECM) types [1,2]. A few SLRPs are post-translationally modified with glycosaminoglycan side chains in mature ECMs. Lumican is a modified keratan sulfate (KS) proteoglycan in connective tissues like the tendon, cornea, and cartilage, but it is secreted as a simple glycoprotein by activated fibroblasts during infections and inflammation [3,4,5]. Many of the core proteins consist of tandem repeats of leucine-rich repeat (LRR) motifs. A body of work, including transmission electron microscopy of collagen-rich tissues, recombinant core proteins, and biochemical approaches show that lumican and other SLRP members associate with collagen fibrils to restrict their lateral growth and assembly, and that they localize to different bands on the fibrils [6,7,8,9,10]. Functionally, the SLRPs regulate collagen fibril growth and proper assembly of ECMs, such that SLRP-null mice develop abnormal connective tissues [11,12,13,14]. SLRP interactions with collagen fibrils engage multiple LRR repeats in the core proteins [15,16]. In addition to collagen-fibril regulations, emerging studies indicate that SLRPs interact with cell surface receptors, chemokines, and pathogen-associated molecular patterns, cementing their significant involvement with host immune responses [2,4]. Given that the SLRP core proteins are largely comprised of LRR motifs, they are members of the LRR superfamily of proteins, which includes pathogen recognition toll-like receptors (TLRs).
Host pathogen recognition receptors (PRR) include TLRs that mediate the host response to microbial-pathogen-associated molecular patterns, as well as certain host-derived-danger-associated molecular patterns [17,18]. Human and mice have 10 and 12 TLRs, respectively. TLRs 1, 2, 4, and 5 localize to the plasma membrane, whereas TLR3, 7, 8, and 9 are destined to endosomal compartments for PAMP recognition and signaling [18]. TLR9 recognizes viral double stranded (ds) DNA in acidic endolysosomal compartments. TLR9 may also interact with host nuclear DNA as well as mitochondrial DNA released from dying cells during infections and inflammation, which contributes to an increased risk for autoimmunity [19]. Adequate recognition of pathogenic DNA, but minimal recognition of self-DNA, is achieved through guided trafficking and cleavage of TLR9 in endolysosomes. Our experimental study shows that ECM proteins like lumican may modulate the recognition of pathogenic DNA [5]. Future studies will also address whether ECM proteins minimize recognition of host-derived DNA by TLR9. Lumican and TLR9 have multiple tandem repeats of LRR motifs. LRR motifs from TLR9 have already been shown to interact with some synthetic forms of DNA ligands [20,21]. Our 3D modeling of lumican binding to DNA ligands, for the first time, identifies candidate DNA-binding LRR motifs in lumican.
LRR motifs, first discovered in 1985, are approximately 24-amino-acid-long highly conserved leucine-rich stretches, with regularly spaced hydrophobic residues [22]. Over 6000 proteins having LRR motifs are listed in PFAM, PRINTS, SMART, and InterPro databases [23]. These LRR domains have a highly conserved segment (HCS) and a variable segment (VS) [24,25]. The HCS, LxxLxLxxNxL, contains conserved leucine residues (Leu), which may be substituted with Isoleucine (lle), phenylalanine (Phe), or valine (Val). The ninth residue is usually an asparagine (Asn), serine (Ser), threonine (Thr), or cysteine (Cys). LRR superfamily proteins generally have a horseshoe shape, where the Leu, Ile, Phe, and Val at position 1, 4, 6, and 11 of HCS form the hydrophobic core of the protein comprised of beta sheets [24]. The Asn at position 9 connects LRR motifs to each other by forming a hydrogen bond. The VS part forms loop and helix conformation at the outer convex surface [24].
In addition to the shared structural similarities discussed above, the SLRP core proteins harbor some TLR-like functions and interact with pathogen recognition signals. In a recent study, we found that macrophages from lumican-null mice show increased innate immune responses to CpG DNA, a ligand for endosomal TLR9, while the addition of recombinant lumican suppressed TLR9 response [5]. The study further demonstrated that exogenous lumican and CpG DNA are co-localized in early endosomes, which are not enriched for TLR9. An in vitro recombinant lumican binds CpG DNA in a dose-dependent manner. The data collectively indicated that lumican binds CpG DNA and sequesters it in TLR9-poor endosomes to restrict responses to DNA. However, the molecular underpinnings in the lumican-DNA interactions are unknown.
Here, we used in-silico approaches to predict the LRR sequences in lumican and their mode of interactions with CpG DNA. We modeled lumican-CpG DNA interactions using the 22-mer synthetic CpG DNA, henceforth termed as CpG ODN_2395, which has been shown to experimentally stimulate mouse and human TLR9 responses. A few in silico studies have examined TLR9 interactions with CpG DNA forms other than the CpG ODN_2395 we studied here [21,26]. Prior crystallographic studies on the TLR9 extracellular domain (TLR9 ECD) used a variation of the 12-mer synthetic CpG ODN_1668, CATGACGTTCCT, and a 6-mer that binds to a second DNA binding region to enhance signals in human, equine, and bovine TLR9 [27]. However, there are no experimental or model predictions of TLR9 interactions with the 22-mer CpG ODN_2395 ligand. Therefore, to ascertain the extent to which lumican mimics TLR9 in its interactions with the CpG ODN_2395 ligand, we also modeled mouse TLR9 ECD interactions with this ligand. Our studies provide new insights into matrix lumican and mouse TLR9 interactions with a CpG ODN_2395 known to evoke a strong innate immune response in vivo.

2. Results

2.1. 3D Modeling of Lumican and LRR Motif-Carrying Domains in Lumican and TLR9

Because mouse lumican has not been crystallized, we used the AlphaFold Protein Structure Data Base [28,29] to visualize the predicted structure of mouse lumican in a PDB format. We also superimposed TLR9 LRR1–13 on the lumican model for visual comparison (Figure 1A). The analysis range of lumican (amino acid residues 40–309) yielded a high per residue confidence metric or predicted local distance difference test (pLDDT) score of more than 90%, and a predicted aligned error (PAE) ranging from 0 to 15 Å.
The LRR motifs, based on their consensus sequence, can be categorized into seven classes: bacterial (S), cysteine containing (CC), plant specific (PS), SDS22, ribonuclease inhibitor like (RI), irregular (I), and typical (T) [25]. In a pairwise alignment using Clustal Omega, lumican shared an overall identity of 22% and a similarity of ~40% with TLR9 (Figure S1). We detected the highest similarity between lumican LRR motifs 1–11 and LRR 1–13 of mouse TLR9 (Figure 1A). Lumican follows a typical pattern of LRR repeat consensus, with approximately 20 residues per repeat, where the central ten LRR motifs (LRR1–11) are a mixture of T, S, SDS22, and PS classes, flanked by an N-terminal LRRNT motif with 4 cysteine residues and a C-terminal LRRCT motif with 2 cysteine residues (Figure S2). TLR9 has a total of 26 LRR motifs, where several are irregular, with insertions of variable lengths of amino acid that disrupt the average 26 residues per LRR structure. Thus, LRR 2, 5, 8, and 11 of mouse TLR9 have 10–16 insertions at position 10 after the 9th consensus Asn residue [30]. Surface charge distribution analysis along the concave surface of lumican shows that it consists of basic and acidic patches, and involves an equal number of basic and acidic interface residues (Figure 1B and Figure S3A). On the other hand, in TLR9, LRR1–13 and LRR13–26 motifs along the concave surface are predominantly basic patches (Figure 1C and Figure S3B,C).

2.2. CpG ODN_2395 Docking and Interaction Surfaces on Lumican and TLR9 ECD

As TLR9 interactions with CpG ODN_2395 has not been studied before, we evaluated the docking of CpG ODN_2395 on the “unliganded form” (PDB: 3WPF) of mouse TLR9. Our docking analysis of CpG ODN_2395 on lumican identified LRR 1–11 (Table 1) and on LRR2–8, 13, 15, and 20–25 on TLR9-ECD (Table 2) as the most likely sites of interactions. The reasons for opting for a rigid-docking model is provided in the Section 4. Briefly, TLR9 has been crystalized with the 12 mer shorter ligand, CpG ODN_1668, and has been reported (PDB: 3WPC). The longer CpG ODN_2395 has only 6 aligned sequences in common with the shorter ligand, and therefore, HDOCK rejected the template with sequence coverage < 50% from the candidate templates. Instead, it selected TLR3 complexed with the double stranded (ds) lncRNA Rmrp (PDB: 7DA7) as a common template for lumican and TLR9 docking with CpG ODN_2395 that gave sequence coverage greater than 50% for both the ligand (CpG ODN_2395) and the receptors (lumican and TLR9). While this is interesting, it is not related to the biologically relevant question of how lumican interacts with the longer CpG ODN_2395 to modulate the TLR9 response in mouse models and cell culture systems.
We next visualized CpG ODN_2395 interactions with lumican and TLR9 using 3D modeling. A large area along the concave surface of lumican engages with CpG ODN_2395, where LRR2–8 from the N-terminal end forms a binding groove for the 5′ end of CpG ODN_2395 (Figure 2A,B).
We also visualized the CpG ODN_2395 interacting surfaces in TLR9 ECD. The binding region includes LRR2–13 from the N-terminus and LRR15, 20–25 from the C-terminus (Figure 2C,D and Table 2). This is different from those reported earlier for interactions of TLR9 ECD with the shorter CpG ODN_1668, where only the N-terminal, and C-terminus LRR20–22 are engaged (Figure S4). We further examined the predicted buried surface area (BSA), defined as the surface area buried or unavailable to solvents in protein bound to CpG DNA. Analysis of the lumican—CpG ODN_2395 interaction yielded a BSA of 1803 Å2 (Figure 2A), whereas the TLR9 ECD-CpG ODN_2395 interaction yielded a BSA of 2094 Å2 (Figure 2B). This indicates that compared to lumican, a slightly larger surface area of TLR9 engages with CpG ODN_2395. By contrast, a previous study on TLR9 ECD interactions with the smaller 12 mer CpG ODN_1668 showed 1430 Å2 BSA, predictably suggesting smaller surface occupancy by the smaller CpG DNA [21].

2.3. Lumican and CpG ODN_2395 Interactions Involve Hydrogen Bonding, Salt-Bridges and Hydrophobic Interactions

The lumican-CpG ODN_2395 interface involves 32 amino acid residues from lumican, and 9 nucleotide bases from the CpG ODN_2395, at an average resolution of 3.3 Å. The interacting atoms from the bases in CpG ODN_2395 and the amino acids from lumican are shown (Figure 3A and Table 3). The interacting residues include 7 basic (Lys119, His98, His121, His237, Arg73, Arg234, Arg309) residues, two hydrophilic asparagine residues (Asn74, Asn123), seven acidic residues (Glu257, Glu281, Asp97, Asp140, Asp189, Asp212, Asp259), and 16 other hydrophobic and polar uncharged amino acid residues. The interactions include 11 hydrogen bonding interactions followed by 4 salt-bridge and 2 hydrophobic interactions (Figure 3A and Table 3). Hydrogen bonding interactions with CpG ODN_2395 pyrimidine/purine bases are predominant, while 82% H-bond donor residues reside in lumican, and 82% of H-bond acceptors reside in CpG ODN_2395 (Table 3). Biochemically, the amino acids in lumican that form the 11 H-bonds include Asn74, Arg234, and Arg309, two acidic Asp259 and Glu281, a neutral Ser261, two acidic Gln142, Gln166, and three aromatic Tyr164, Tyr187 and Tyr210. The four salt bridges are formed between opposite charges in polar/charged residues, including Glu257, Asp189, His98, and His121. Acidic residues Glu257 and Asp189 form salt bridges with the G3 and G6 purine bases, respectively, of CpG ODN_2395 (Figure 3B–D). However, the positive charge centers of the bases form electrostatic interactions with the Glu257, and negative charges of the carboxyl side chains form interactions with the Asp189 (Figure 3D). The positive imidazole side chain of His98 and 121 forms salt-bridges with the ribose-phosphate backbones of CpG ODN_2395. Hydrophobic/pi-stacking interactions are observed between the aromatic ring of Tyr 262 and the pyrimidine base ring of T1, where the Tyr aromatic ring is perpendicular to the thymidine base ring, making it ideal for pi-stacking interactions (Figure 3C).
The hydrophobic and the basic residues of lumican form the interface with the first and third groove of CpG ODN_2395. The lumican residues that form close contact with the first groove of CpG ODN_2395 include Asp189, Asp259, Glu257, Glu28, Arg234, Arg309 and Tyr164. While Tyr187, Tyr210 and Ser261 form salt bridges/H-bonds with the two 5′ CpG repeats (5′-TCGTCG). These “hot spot” lumican residues in LRR5–11 form a groove that interacts with the two 5′ CpG repeats. These lumican residues are therefore somewhat biochemically similar to the TLR9 ECD LRR2–13, as predicted by the pairwise alignment score (Figure S5A–C).

2.4. TLR9 ECD Interactions with CpG ODN_2395 Involve N-Terminal LRR2–13 Motifs

The TLR9-CpG ODN_2395 interface encompasses 33 residues of TLR9, and 10 nucleic acids of CpG ODN_2395 at an average resolution of 3.5 Å (Table 2, Figure 2C,D). As indicated by the APBS map, the interface residues involve many basic residues that form distinct basic patches in the N and C terminal side of TLR9 (Figure S3B,C). The N-terminal LRR2–13 of TLR9 contributes the most towards the binding affinity, comprising 553 Å2 out of 878 Å2 of the buried surface area of TLR9 (Figure 2C). The TLR9 N-terminal LRR2–13 region contains seven basic residues: Lys95, Asn129, His152, Lys181, His203, Lys207, Asn263, and one acidic residue, Asp175 (Figure S3B). There are 15 additional neutral, polar, or hydrophobic (Ser131, Ser149, Ser151, Ser205, Tyr132, Tyr179–180, Tyr208, Tyr224, Val171, Phe173, Leu226, Pro262, Gln399 and Met400) residues in these interacting LRRs (Table 2). However, the C-terminal LRR15–25 contains six basic residues: Arg482, Arg613, Asn640, His642, Arg662, and His735 (Figure S3C). Our results show that the predominant type of interaction for TLR9-CpG ODN_2395 is hydrogen bonding, involving 12 H-bond interactions where 75% of the H-bond donors reside in TLR9 and 75% of the acceptors in CpG ODN_2395. Specifically, the H-bond donors are four tyrosine (Tyr132, Tyr179, Tyr180, and Tyr665) and other residues (His203, Arg613, Asn263, Ser131, Ser205, and Lys207) (Figure 4A–E). The salt bridge interactions are the next prevailing interaction, mostly involving charge–charge interactions between positively charged basic residues (Lys95, His152, Arg482, Arg613, His642, and His735) and negatively charged sugar-phosphate backbone atoms (Figure 4A and Table 4). One exception was the acidic residue Asp175, where the carboxylic side chain forms a salt bridge with the positive charge center of the G3 purine base ring (Figure 4B,C). Overall, 11 out of the 18 bonding interactions, including H-bond/salt-bridges, are from LRR2–13. The N-terminus LRR2–13 of the TLR9-CpG ODN_2395 interface comprises 63% of BSA (878 Å2), while LRR15–26 from the C-terminus makes up only 37% of BSA (Table 2). Thus, the C-terminus of LRR15–26 has low binding affinity for CpG ODN_2395, which is consistent with experimental data that shows its reduced involvement in TLR9 downstream signals [20]. The His152, His203, Tyr132, Tyr179, Tyr180, Ser131, Ser205, Lys207, and Asn263 of LRR2–8 interacts closely with the 5′ end hexamer (5′-TCGTCG) of CpG ODN_2395, forming the H-bonds and salt bridges (Figure 4B,C). Additional interacting atoms from CpG ODN_2395 are also shown (Figure 4D,E and Table 4).
As indicated previously, TLR9 interacting residues with CpG ODN_1668 12 mer are clustered in LRR1–8 from the N-terminus, which includes the generic PAMP binding surface LRR2, 5, and 8, followed by LRR20–22 from the C-terminus [21]. The TLR9-CpG ODN_1668 12 mer interaction region includes N-terminus basic patches of TLR9 that engage in hydrogen bonding and satisfies charge complementarity to the negatively charged ribose-phosphate backbone (Figure S4). The prevailing interactions between LRR1–8 and ODN_1668 are H-bonding between sugar-phosphate backbones.

2.5. The Predicted Effects of Collagen Type I on Lumican Interactions with CpG ODN_2395

Multiple studies have shown that lumican associations with collagen type I regulate collagen fibrillogenesis in vitro [7], and that similar interactions in vivo impact collagen fibril assembly [12,15,31]. One study found that synthetic peptides from the lumican LRR5–7 region competes with full length lumican and significantly compromises collagen type 1 fibrillogenesis [9]. Moreover, the D212N recombinant lumican, which carries this mutation in LRR5–7, disrupts both the binding of lumican to collagen and subsequent collagen fibrillogenesis [9]. Our simulation shows that lumican, upon binding to CpG ODN_2395, creates a significant amount of BSA (760 Å2 of 1803 Å2, which is 42%). The BSA of most bona fide protein-ligand interactions are observed within this range [32]. Our prior interface analysis identified Asp212, Gln166, Tyr210, Asp189, Tyr164, Thr208, Tyr187, and Phe162 as interacting with CpG ODN_2395, and these fall within the LRR5–7 that interact with collagen (Figure 5A). The Asp212 (D212) residue within LRR7, which is involved in fibrillogenesis, is predicted to be buried upon CpG binding. This site is also highly conserved in lumican from different species as shown by multiple sequence alignment, indicating its functional importance (Figure 5B). These analyses also imply that collagen binding will likely interfere with the ability of LRR5–7 to bind CpG-DNA.

3. Discussion

Our earlier experimental studies showed that lumican binds CpG ODN_2395, a synthetic ligand for TLR9, to restrict response and subsequent induction of proinflammatory cytokines by TLR9 [5]. Here, for the first time, we used in silico approaches to identify the DNA-interacting region within the core protein of lumican at a molecular level, and characterize the types of bonding involved in these interactions. Like other LRR family members, the lumican core protein is made up almost entirely of tandem repeats of LRR motifs. Our analysis shows that while overall identity (22%) and similarity (~40%) between mouse lumican and TLR9 were not high, the lumican LRR motifs, except its last LRRCE motif, share the strongest similarity with the anterior half of TLR9—namely LRR 1–13. Like other LRR proteins, lumican and TLR9 have a solenoid shape, where highly conserved residues are mainly located in the concave surfaces. Interactions with CpG DNA largely occur through the concave surface of the solenoid structures. Both proteins have a mixture of T, S, SDS22, and PS classes of LRRs, where S and T are considered to interact with proteins and DNA, and also have role in immunity and defense. Indeed, lumican LRR–1, 2, and 5–7, which are a mixture of S and T class, are predicted to engage with CpG ODN_2395.
Analysis of BSA in the interactions of lumican (1803 Å2) or TLR9 ECD (2094 Å2) with CpG ODN_2395 suggest that, compared to TLR9 ECD, a slightly lower surface area of lumican is involved. This may imply a weaker binding strength between lumican and CpG ODN_2395 than between TLR9 ECD and CpG ODN_2395. However, from a functional standpoint in tissues, ECM proteins like lumican are present at a much greater abundance than pathogen recognition receptors like TLR9, and are thus expected to be impactful. Multiple, still incompletely understood mechanisms, are in place to control the TLR9 response to pathogenic DNA as well as self-DNA for controlled inflammatory responses to infections, and minimize harmful responses to self-DNA [18,19,33,34,35,36]. Synthesized in the endoplasmic reticulum, TLR9 is bound by Unc93B1, a membrane adaptor protein, which facilitates proper folding and incorporation into transport vesicles that apparently takes it to the plasma membrane first, before its final location in acidic endolysosomes where it is proteolytically cleaved to release a smaller functional protein [37]. Our prior studies show that there is an abundance of lumican at the cell surface, and it is also endocytosed by immune cells [5]. Therefore, binding of DNA ligands by lumican at the cell surface can counteract potential ligand interactions with TLR9 transiently located at the cell surface. We also showed that endocytosed lumican colocalizes with CpG DNA, and these endosomal sites are low in TLR9. This is another way that ligand interactions with TLR9 are reduced by lumican. We still do not know how lumican favors the separation of the DNA ligand from TLR9 in endosomes. The overall effects of lumican, and likely some of the other SLRPs, are to reduce ligand availability and interactions with TLR9, as well as to reduce inflammatory responses to pathogenic and self-DNA.
A potential limitation of our study is that we used the lumican core protein instead of the glycoprotein or the proteoglycan forms of lumican in our 3D modeling. The negatively charged KS side chains of lumican will impact interactions of the core protein. However, we have shown experimentally that during infections and inflammation, much of the lumican interacting with macrophages is not the proteoglycan form, but the simple glycoprotein form without the KS side chains [5].
Another important limitation of our study is the use of the rigid, B-form of the CpG ODN_2395 in predicting its interactions with lumican and TLR9. The electrostatic repulsions between the highly negatively charged phosphate groups lead to conforma-tional flexibility in ssDNA posing a challenge in modeling the ssDNA-protein interac-tions. Earlier studies have used small-angle x-ray scattering (SAXS) to probe the dy-namic and flexible conformations of ssDNA in solution of varying ionic strength [38,39,40]. Sim et al, have used SAXS to study the flexibility of ssDNA under different salt concentration [41]. In another study the interactions between the eukaryotic replication protein A (RPA) and an 8 mer or 14 mer ssDNA was studied by combining SAXS data to generate numerous dynamic conformation with molecular dynamics (MD) simulations that may predict the dynamic interactions more accurately [42]. Few MD simulations studies have been instrumental in understanding the flexible interactions of ssDNA with ssDNA-binding proteins (SSB) [43,44] and the K homology domains (KH3 and KH4) [45]. MD simulations are highly accurate in probing the atomic mo-tions and structural flexibility of dsDNA, but often less effective in case of ssDNA [46,47]. MD simulations were also used to study the conformational changes of TLR7 and TLR8 and their agonists, where the RMSD values of TLR7 and 8 backbone atoms fluctuate from their mean position within a scope of 2–4 Å after 30 ns simulation [48]. Therefore, under physiological solutions the interacting atoms between CpG ODN_2395 and lumican and TLR9 could be different from our results generated using the rigid B form CpG DNA model. The flexible nature of CpG ODN_2395 can lead to an ensemble of major and minor conformational derivatives with lumican and TLR9 that might yield mean interacting amino acids and nucleotides that could be different from this limited study. To further validate our study, MD simulations like AMBER [49] that calculate the force of each atom as a function of their positions can be used over 100 ns period for CpG ODN_2395, lumican and TLR9. Applying similar ap-proaches to study CpG DNA – lumican or TLR9 complexes is also challenging due to few available structures of CpG DNA in PDB database and low sequence identity, making it difficult to develop template-based models. Despite these limitations, this study is the first in silico attempt to predict the amino acid residues of lumican that might be interacting with CpG DNA, corroborating our previous experimental finding that lumican binds to CpG ODN_2395 in solid-phase binding assay [5].
In summary, our study provides molecular insights into the interactions between lumican and a synthetic ligand for TLR9, providing an avenue for the future development of drug targets. Additional studies are required to expand our knowledge of the full extent of natural TLR9 ligands that lumican and other SLRPs interact with to regulate inflammation and autoimmunity.

4. Materials and Methods

4.1. Data Sources

FASTA sequences of mouse TLR9 and lumican were obtained from UniProt with primary accession numbers Q9EQU3 and P51885, respectively. The Class C CpG ODN_2395 sequence (5′-TCGTCGTTTTCGGCGCGCGCCG-3′) was obtained from Invivogen (#tlrl-2395). We used the crystal structure of the mouse TLR9 extracellular domain (TLR9 ECD) not bound to a ligand “unliganded form” (PDB:3WPF) obtained from Protein Data Bank [50]. Because mouse lumican has not been crystallized, we used the AlphaFold Protein Structure Database [28,29] to visualize the predicted structure of mouse lumican in a PDB format. AlphaFold-predicted structures are highly accurate, with a median backbone accuracy of 2.8 Å r.m.s.d.95 (95% confidence interval = 2.7–4.0 Å), and produces highly accurate side chains. The all-atom accuracy of AlphaFold is 1.5 Å r.m.s.d.95 (95% confidence interval = 1.2–1.6 Å) compared to other currently used methods. AlphaFold generates per-residue confidence metrics called the predicted local distance difference test (pLDDT) score on a scale of 0–100. Structures with pLDDT > 90 are expected to be modelled with high accuracy [28]. The analysis range of lumican (amino acid residues 40–309) yielded a high pLDDT score of more than 90%, and a predicted aligned error (PAE) ranging from 0 to 15 Å.

4.2. TLR9 and Lumican Local Similarity Analysis

Lumican and TLR9 FASTA sequences from UniProt were submitted to Clustal Omega local alignment tools [51] to align the most similar regions between two sequences. The lumican sequence 3–321—corresponding to LRR1–11 out of 338—and TLR9 sequence 19–408—corresponding to LRR1–13 out of 1032—were aligned. We introduced 121 gaps to align both sequences with a default gap opening penalty score of 10 and extension penalty score of 0.5. The matched sequences have 165 conserved and semi-conserved residues out of 415, showing 22% sequence identity and 40% sequence similarity, with an alignment score of 225.5 after subtracting the gap opening and extension penalty score.
The results are the following:
  • Aligned_sequences: LUM_MOUSE and TLR9_MOUSE
  • Gap_penalty: 10.0
  • Extend_penalty: 0.5
  • Length: 415
  • Identity: 92/415 (22.2%)
  • Similarity: 165/415 (39.8%)
  • Gaps: 121/415 (29.2%)
  • Alignment Score: 225.5
The local structural alignment between two proteins were conducted using the PyMOL Align command line accessed on 25 June 2023 (https://pymolwiki.org/index.php/Align) [52]. It performs a local sequence alignment between two proteins using a Clustal Omega local sequence alignment tool, followed by structural superimposition with the cycles of refinement in order to reject structural outliers. The lumican LRR1–11 and TLR9 LRR1–13, totaling 1494 atoms, were aligned with five cycles of structural refinement—the average distance between the atoms of two superimposed proteins, or an RMSD value 6.4 Å. Structural outliers with the average RMSD = 6.83 Å were rejected during the cycles (53 atoms).
The following parameters were used to run PyMOL structural alignment between Lumican and TLR9:
  • outlier rejection cycles: 5
  • RMS outlier rejection cutoff: 2.0
  • Mobile_state: 0 = all states
  • Target_state: 0 = all states
  • Transform: 1 = superposition
The results obtained are the following:
  • Alignment score: 225.5
  • ExecutiveAlign: 1819 atoms aligned.
  • ExecutiveRMS: 53 atoms rejected during cycle 5 (RMSD = 6.83 Å)
  • Executive: RMSD = 6.411 Å (1494 to 1494 atoms)

4.3. Surface Charge Distribution Analysis

The PyMOL Plugin APBS (Adaptive Poisson-Boltzmann solver) electrostatics software version 2.1 allows visualization of the distribution of acidic (negatively charged) and basic (positively charged) amino acids on the surface of lumican and TLR9 3D structures, but does not provide the deprotonation/protonation status of individual amino acid residues. The preparation of molecules, including adding missing molecules and assigning atomic charges, was automated by pdb2pqr, and grid spacing for APBS calculation was 0.50 Å, which are the default settings. The following parameters were used to run APBS electrostatics to calculate the surface charge of lumican and TLR9:
  • Prepare molecule with method: pdb2pqr
  • Calculate map with APBS with grid spacing: 0.50 Å
  • Molecular surface visualization with electrostatic potential range (+/−): 5.00

4.4. Molecular Modeling of CpG ODN_2395

The model structure of Class C CpG ODN_2395 was built using BIOVIA Discovery Studio (version 4.5) biopolymer tools. Given that TLR9 demonstrated strong binding and immunostimulatory activity with single stranded CpG DNA, we built a single stranded CpG ODN_2395 as a B-form using standard helix settings (right-handed, 10 bp per turn, 3.4 Å vertical rise per bp, helical diameter 19 Å) without minimizing the structure.

4.5. Rigid Docking Prediction

We used the HDOCK server for protein-ssDNA interaction. The HDOCK server searches for sequence similarities for the query receptor and ligand against the PDB database for homology modeling and homologous template searching, followed by the prediction of putative binding sites using the HDOCKlite global docking algorithm version 1.1 that uses the homologous template for accurate docking prediction [53]. The HDOCK software version 1.1 chooses a template based on maximum sequence coverage and maximum sequence identity for both ligand and receptor, and the threshold for sequence coverage is 50%. Although the TLR9 crystallized structure in complex with CpG ODN_1668 12 mer exists in the PDB database (PDB: 3WPC), the CpG DNA sequence coverage was 27% (6/22), which was below the HDOCK threshold of 50%. Therefore, HDOCK automatically excludes the templates with sequence coverages less than 50% of the template library. Instead, HDOCK chose TLR3 complexed with the double stranded lncRNA (PDB: 7DA7) as a modeling template that gave a sequence coverage greater than 50% for both the ligand (CpG ODN_2395) and receptors (lumican and TLR9), as depicted in the table below (Table 5).
The docking prediction was conducted with the following settings:
  • ligand type: ssDNA
  • receptor/ligand format: PDB file.
Docking type: global docking (default)
The interface residues were determined within a distance of 5.0 Å between the receptor and the ligand. The prediction generated 100 complexes. In this study, only top1 with the highest confidence and lowest docking score was considered for the analysis.

4.6. Protein-DNA Interface Analysis

The protein-DNA complex and residues at the interface were obtained from the HDOCK server and visualized in PyMOL. The interface area of the protein-CpG DNA complex was determined by the buried surface area (BSA), which is the sum of the accessible surface area (ASA) of unliganded protein and free CpG DNA, subtracted from that of the protein-CpG DNA complex. The buried surface area of each individual residue was calculated from the ASA of the unbound state of the individual residue subtracted from the ASA of the CpG ODN-bound state of the individual residue. The buried interfacial surface area is directly proportional to the affinity; therefore, with an increase in BSA, the binding affinity between the receptor and ligand increases [54]. The buried surface area of the protein, CpG DNA, and protein-CpG DNA complex was calculated using the PyMOL built-in command line Get_area. The probe diameter was set to 1.4 Å [26], which is the diameter of water molecule.
The following equation was used to evaluate the BSA and loss of accessible surface area of individual residue:
  • Total buried surface area (Å2) = (accessible surface areaprotein + accessible aurface areaCpG_DNA) − (accessible surface areaprotein-CpG DNA)
  • Buried surface area of individual residue (%) = (accessible surface areaFree_residue − accessible surface areaBound_residue)/accessible surface areaFree_residue
The hydrogen bonding, salt-bridges, and hydrophobic interactions in the protein-DNA interface were analyzed using the Protein-Ligand Interaction Profiler (PLIP) [55]. PLIP uses four steps of algorithms to detect interaction patterns: structure preparation to add missing atoms, functional characterization to detect hydrogen bonding acceptor/donor atoms and hydrophobic molecules, rule-based matching based on geometric criteria, and filtering of redundant interactions. The complex generated from the HDOCK server was submitted to PLIP as a PDB format, and the interaction data were visualized in PyMOL.

5. Conclusions

Our results indicate that 32 amino acids within the concave surface of lumican primarily interact with CpG ODN_2395. The residues encompassing LRR2–8 form a binding groove for the 5′ end of CpG ODN_2395. The lumican-CpG ODN_2395 interactions are mainly through hydrogen bonding, salt-bridges, and hydrophobic interactions. Lumican shares an overall identity of 22% with TLR9 ECD, which is the specific pathogen recognition receptor for CpG ODN_2395. TLR9 ECD comprises LRR1–26, with 33 amino acids interacting with CpG ODN_2395 through hydrogen bonding and salt-bridges. Our docking analysis demonstrated a smaller buried surface area for lumican-CpG ODN_2395 (1803 Å2) compared to that of TLR9-CpG ODN_2395 (2094 Å2), which may imply a lower binding strength for lumican with CpG-DNA than that of TLR9 with CpG-DNA.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms241914990/s1.

Author Contributions

Conceptualization: T.C., G.M. and S.C.; Funding acquisition: S.C.; Supervision: G.M. and S.C.; Investigation: T.C. and G.M.; Methodology and Software: T.C.; Analysis: T.C., G.M. and S.C.; Writing—original draft, review and editing: T.C., G.M. and S.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work is funded by the National Eye Institute (NEI)/National Institutes of Health (NIH) grant R01EY026104 and R01EY030917 to Shukti Chakravarti and the Research to Prevent Blindness unrestricted fund to the Department of Ophthalmology, NYU Grossman School of Medicine.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data and materials used to support the findings of this study are available from the corresponding authors (George Maiti or Shukti Chakravarti) upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hassell, J.R.; Blochberger, T.C.; Rada, J.A.; Chakravarti, S.; Noonan, D. Proteoglycan Gene Families; Bittar, E.E., Kleinman, H.K., Eds.; Elsevier: Amsterdam, The Netherlands, 1993; Volume 6. [Google Scholar]
  2. Schaefer, L.; Iozzo, R.V. Biological functions of the small leucine-rich proteoglycans: From genetics to signal transduction. J. Biol. Chem. 2008, 283, 21305–21309. [Google Scholar] [PubMed]
  3. Chakravarti, S.; Stallings, R.L.; SundarRaj, N.; Cornuet, P.K.; Hassell, J.R. Primary structure of human lumican (keratan sulfate proteoglycan) and localization of the gene (LUM) to chromosome 12q21.3-q22. Genomics 1995, 27, 481–488. [Google Scholar] [CrossRef] [PubMed]
  4. Frikeche, J.; Maiti, G.; Chakravarti, S. Small leucine-rich repeat proteoglycans in corneal inflammation and wound healing. Exp. Eye Res. 2016, 151, 142–149. [Google Scholar]
  5. Maiti, G.; Frikeche, J.; Lam, C.Y.; Biswas, A.; Shinde, V.; Samanovic, M.; Kagan, J.C.; Mulligan, M.J.; Chakravarti, S. Matrix lumican endocytosed by immune cells controls receptor ligand trafficking to promote TLR4 and restrict TLR9 in sepsis. Proc. Natl. Acad. Sci. USA 2021, 118, e2100999118. [Google Scholar]
  6. Scott, J.E. Proteoglycan: Collagen interactions in connective tissues. Ultrastructural, biochemical, functional and evolutionary aspects. Int. J. Biol. Macromol. 1991, 13, 157–161. [Google Scholar] [CrossRef] [PubMed]
  7. Rada, J.A.; Cornuet, P.K.; Hassell, J.R. Regulation of corneal collagen fibrillogenesis in vitro by corneal proteoglycan (lumican and decorin) core proteins. Exp. Eye Res. 1993, 56, 635–648. [Google Scholar] [CrossRef]
  8. Svensson, L.; Närlid, I.; Oldberg, A. Fibromodulin and lumican bind to the same region on collagen type I fibrils. FEBS Lett. 2000, 470, 178–182. [Google Scholar]
  9. Kalamajski, S.; Oldberg, A. Homologous sequence in lumican and fibromodulin leucine-rich repeat 5–7 competes for collagen binding. J. Biol. Chem. 2009, 284, 534–539. [Google Scholar] [CrossRef]
  10. Orgel, J.P.; Eid, A.; Antipova, O.; Bella, J.; Scott, J.E. Decorin core protein (decoron) shape complements collagen fibril surface structure and mediates its binding. PLoS ONE 2009, 4, e7028. [Google Scholar]
  11. Danielson, K.G.; Baribault, H.; Holmes, D.F.; Graham, H.; Kadler, K.E.; Iozzo, R.V. Targeted disruption of decorin leads to abnormal collagen fibril morphology and skin fragility. J. Cell Biol. 1997, 136, 729–743. [Google Scholar]
  12. Chakravarti, S.; Magnuson, T.; Lass, J.H.; Jepsen, K.J.; LaMantia, C.; Carroll, H. Lumican regulates collagen fibril assembly: Skin fragility and corneal opacity in the absence of lumican. J. Cell Biol. 1998, 141, 1277–1286. [Google Scholar] [PubMed]
  13. Xu, T.; Bianco, P.; Fisher, L.W.; Longenecker, G.; Smith, E.; Goldstein, S.; Bonadio, J.; Boskey, A.; Heegaard, A.M.; Sommer, B.; et al. Targeted disruption of the biglycan gene leads to an osteoporosis-like phenotype in mice. Nat. Genet. 1998, 20, 78–82. [Google Scholar] [PubMed]
  14. Svensson, L.; Aszodi, A.; Reinholt, F.P.; Fassler, R.; Heinegard, D.; Oldberg, A. Fibromodulin-null mice have abnormal collagen fibrils, tissue organization, and altered lumican deposition in tendon. J. Biol. Chem. 1999, 274, 9636–9647. [Google Scholar] [PubMed]
  15. Kalamajski, S.; Oldberg, A. The role of small leucine-rich proteoglycans in collagen fibrillogenesis. Matrix Biol. 2010, 29, 248–253. [Google Scholar]
  16. Chen, S.; Young, M.F.; Chakravarti, S.; Birk, D.E. Interclass small leucine-rich repeat proteoglycan interactions regulate collagen fibrillogenesis and corneal stromal assembly. Matrix Biol. 2014, 35, 103–111. [Google Scholar]
  17. Kawai, T.; Akira, S. The role of pattern-recognition receptors in innate immunity: Update on Toll-like receptors. Nat. Immunol. 2010, 11, 373–384. [Google Scholar]
  18. Fitzgerald, K.A.; Kagan, J.C. Toll-like Receptors and the Control of Immunity. Cell 2020, 180, 1044–1066. [Google Scholar]
  19. Lamphier, M.S.; Sirois, C.M.; Verma, A.; Golenbock, D.T.; Latz, E. TLR9 and the recognition of self and non-self nucleic acids. Ann. N. Y. Acad. Sci. 2006, 1082, 31–43. [Google Scholar]
  20. Collins, B.; Wilson, I.A. Crystal structure of the C-terminal domain of mouse TLR9. Proteins 2014, 82, 2874–2878. [Google Scholar]
  21. Ohto, U.; Ishida, H.; Shibata, T.; Sato, R.; Miyake, K.; Shimizu, T. Toll-like Receptor 9 Contains Two DNA Binding Sites that Function Cooperatively to Promote Receptor Dimerization and Activation. Immunity 2018, 48, 649–658.e644. [Google Scholar]
  22. Takahashi, N.; Takahashi, Y.; Putnam, F.W. Periodicity of leucine and tandem repetition of a 24-amino acid segment in the primary structure of leucine-rich alpha 2-glycoprotein of human serum. Proc. Natl. Acad. Sci. USA 1985, 82, 1906–1910. [Google Scholar] [PubMed]
  23. Matsushima, N.; Miyashita, H.; Mikami, T.; Kuroki, Y. A nested leucine rich repeat (LRR) domain: The precursor of LRRs is a ten or eleven residue motif. BMC Microbiol. 2010, 10, 235. [Google Scholar]
  24. Wei, T.; Gong, J.; Jamitzky, F.; Heckl, W.M.; Stark, R.W.; Rossle, S.C. LRRML: A conformational database and an XML description of leucine-rich repeats (LRRs). BMC Struct. Biol. 2008, 8, 47. [Google Scholar]
  25. Ng, A.; Xavier, R.J. Leucine-rich repeat (LRR) proteins: Integrators of pattern recognition and signaling in immunity. Autophagy 2011, 7, 1082–1084. [Google Scholar]
  26. Zhou, W.; Li, Y.; Pan, X.; Gao, Y.; Li, B.; Qiu, Z.; Liang, L.; Zhou, H.; Yue, J. Toll-like receptor 9 interaction with CpG ODN—An in silico analysis approach. Theor. Biol. Med. Model. 2013, 10, 18. [Google Scholar]
  27. Ishida, H.; Ohto, U.; Shibata, T.; Miyake, K.; Shimizu, T. Structural basis for species-specific activation of mouse Toll-like receptor 9. FEBS Lett. 2018, 592, 2636–2646. [Google Scholar]
  28. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar]
  29. Varadi, M.; Anyango, S.; Deshpande, M.; Nair, S.; Natassia, C.; Yordanova, G.; Yuan, D.; Stroe, O.; Wood, G.; Laydon, A.; et al. AlphaFold Protein Structure Database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022, 50, D439–D444. [Google Scholar]
  30. Bell, J.K.; Mullen, G.E.; Leifer, C.A.; Mazzoni, A.; Davies, D.R.; Segal, D.M. Leucine-rich repeats and pathogen recognition in Toll-like receptors. Trends Immunol. 2003, 24, 528–533. [Google Scholar]
  31. Ezura, Y.; Chakravarti, S.; Oldberg, A.; Chervoneva, I.; Birk, D.E. Differential expression of lumican and fibromodulin regulate collagen fibrillogenesis in developing mouse tendons. J. Cell Biol. 2000, 151, 779–788. [Google Scholar]
  32. Nadassy, K.; Wodak, S.J.; Janin, J. Structural features of protein-nucleic acid recognition sites. Biochemistry 1999, 38, 1999–2017. [Google Scholar] [PubMed]
  33. Ewald, S.E.; Engel, A.; Lee, J.; Wang, M.; Bogyo, M.; Barton, G.M. Nucleic acid recognition by Toll-like receptors is coupled to stepwise processing by cathepsins and asparagine endopeptidase. J. Exp. Med. 2011, 208, 643–651. [Google Scholar] [PubMed]
  34. Ewald, S.E.; Lee, B.L.; Lau, L.; Wickliffe, K.E.; Shi, G.P.; Chapman, H.A.; Barton, G.M. The ectodomain of Toll-like receptor 9 is cleaved to generate a functional receptor. Nature 2008, 456, 658–662. [Google Scholar] [PubMed]
  35. Fukui, R.; Saitoh, S.; Kanno, A.; Onji, M.; Shibata, T.; Ito, A.; Onji, M.; Matsumoto, M.; Akira, S.; Yoshida, N.; et al. Unc93B1 restricts systemic lethal inflammation by orchestrating Toll-like receptor 7 and 9 trafficking. Immunity 2011, 35, 69–81. [Google Scholar]
  36. Moresco, E.M.; Beutler, B. Special delivery: Granulin brings CpG DNA to Toll-like receptor 9. Immunity 2011, 34, 453–455. [Google Scholar]
  37. Lee, B.L.; Moon, J.E.; Shu, J.H.; Yuan, L.; Newman, Z.R.; Schekman, R.; Barton, G.M. UNC93B1 mediates differential trafficking of endosomal TLRs. ELife 2013, 2, e00291. [Google Scholar]
  38. Plumridge, A.; Meisburger, S.P.; Pollack, L. Visualizing single-stranded nucleic acids in solution. Nucleic Acids Res. 2017, 45, e66. [Google Scholar] [CrossRef]
  39. Plumridge, A.; Meisburger, S.P.; Andresen, K.; Pollack, L. The impact of base stacking on the conformations and electrostatics of single-stranded DNA. Nucleic Acids Res. 2017, 45, 3932–3943. [Google Scholar] [CrossRef]
  40. Barzak, F.M.; Ryan, T.M.; Mohammadzadeh, N.; Harjes, S.; Kvach, M.V.; Kurup, H.M.; Krause, K.L.; Chelico, L.; Filichev, V.V.; Harjes, E.; et al. Small-Angle X-ray Scattering (SAXS) Measurements of APOBEC3G Provide Structural Basis for Binding of Single-Stranded DNA and Processivity. Viruses 2022, 14, 1974. [Google Scholar] [CrossRef]
  41. Sim, A.Y.; Lipfert, J.; Herschlag, D.; Doniach, S. Salt dependence of the radius of gyration and flexibility of single-stranded DNA in solution probed by small-angle X-ray scattering. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2012, 86, 021901. [Google Scholar] [CrossRef]
  42. Pretto, D.I.; Tsutakawa, S.; Brosey, C.A.; Castillo, A.; Chagot, M.E.; Smith, J.A.; Tainer, J.A.; Chazin, W.J. Structural dynamics and single-stranded DNA binding activity of the three N-terminal domains of the large subunit of replication protein A from small angle X-ray scattering. Biochemistry 2010, 49, 2880–2889. [Google Scholar] [CrossRef] [PubMed]
  43. Maffeo, C.; Aksimentiev, A. Molecular mechanism of DNA association with single-stranded DNA binding protein. Nucleic Acids Res. 2017, 45, 12125–12139. [Google Scholar] [CrossRef] [PubMed]
  44. Guo, J.T.; Malik, F. Single-Stranded DNA Binding Proteins and Their Identification Using Machine Learning-Based Approaches. Biomolecules 2022, 12, 1187. [Google Scholar] [CrossRef]
  45. Chakraborty, K.; Bandyopadhyay, S. Dynamics of water around the complex structures formed between the KH domains of far upstream element binding protein and single-stranded DNA molecules. J. Chem. Phys. 2015, 143, 045106. [Google Scholar] [CrossRef]
  46. Dans, P.D.; Ivani, I.; Hospital, A.; Portella, G.; González, C.; Orozco, M. How accurate are accurate force-fields for B-DNA? Nucleic Acids Res. 2017, 45, 4217–4230. [Google Scholar] [CrossRef] [PubMed]
  47. Mu, Z.C.; Tan, Y.L.; Liu, J.; Zhang, B.G.; Shi, Y.Z. Computational Modeling of DNA 3D Structures: From Dynamics and Mechanics to Folding. Molecules 2023, 28, 4833. [Google Scholar] [CrossRef]
  48. Wang, X.; Chen, Y.; Zhang, S.; Deng, J.N. Molecular dynamics simulations reveal the selectivity mechanism of structurally similar agonists to TLR7 and TLR8. PLoS ONE 2022, 17, e0260565. [Google Scholar] [CrossRef]
  49. Oweida, T.J.; Kim, H.S.; Donald, J.M.; Singh, A.; Yingling, Y.G. Assessment of AMBER Force Fields for Simulations of ssDNA. J. Chem. Theory Comput. 2021, 17, 1208–1217. [Google Scholar] [CrossRef]
  50. wwPDB consortium. Protein Data Bank: The single global archive for 3D macromolecular structure data. Nucleic Acids Res. 2019, 47, D520–D528. [Google Scholar]
  51. Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T.J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Söding, J.; et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539. [Google Scholar]
  52. DeLano, W.L. PyMOL: An Open-Source Molecular Graphics Tool. CCP4 Newsl. Protein Crystallogr. 2002, 40, 82–92. [Google Scholar]
  53. Yan, Y.; Huang, S.Y. Modeling Protein-Protein or Protein-DNA/RNA Complexes Using the HDOCK Webserver. Methods Mol. Biol. 2020, 2165, 217–229. [Google Scholar] [PubMed]
  54. Chen, J.; Sawyer, N.; Regan, L. Protein-protein interactions: General trends in the relationship between binding affinity and interfacial buried surface area. Protein Sci. 2013, 22, 510–515. [Google Scholar] [PubMed]
  55. Salentin, S.; Schreiber, S.; Haupt, V.J.; Adasme, M.F.; Schroeder, M. PLIP: Fully automated protein-ligand interaction profiler. Nucleic Acids Res. 2015, 43, W443–W447. [Google Scholar] [PubMed]
Figure 1. Lumican model structure and similarities with TLR9. (A) The model depicts the structural alignment of TLR9 LRR1–13 and lumican LRR1–11 based on ClustalW pairwise sequence alignment. (B,C) Horseshoe-like structure model and surface charge representation showing basic (blue) and acidic (red) properties of the lumican concave surface showing LRR1–11 repeats (B), and TLR9 showing LRR1–26 repeats (C). N or NT indicates N-terminal end and C or CT indicates C-terminal end of the protein.
Figure 1. Lumican model structure and similarities with TLR9. (A) The model depicts the structural alignment of TLR9 LRR1–13 and lumican LRR1–11 based on ClustalW pairwise sequence alignment. (B,C) Horseshoe-like structure model and surface charge representation showing basic (blue) and acidic (red) properties of the lumican concave surface showing LRR1–11 repeats (B), and TLR9 showing LRR1–26 repeats (C). N or NT indicates N-terminal end and C or CT indicates C-terminal end of the protein.
Ijms 24 14990 g001
Figure 2. Modeling of CpG ODN_2395 bound Lumican. (A) Surface view of the lumican-CpG ODN_2395 complex shows the buried surface area (BSA) in red, lumican (blue), CpG ODN_2395 (golden). The combined BSA for Lumican-CpG DNA complex is 1803 Å2. Lumican residues contribute 42% (761 Å2) of the BSA, whereas CpG ODN_2395 contributes 58% (1042 Å2). (B) A ribbon 3D model showing all lumican residues (red) that interact with CpG ODN_2395 (golden). (C) Surface view of the TLR9-CpG DNA complex shows the buried surface area (BSA) in red, TLR9 (blue), and CpG ODN_2395 (golden). The combined BSA for the TLR9-CpG DNA complex is 2094 Å2. TLR9 residues contribute 42% (878 Å2) of the BSA. (D) A ribbon 3D model showing all TLR9 residues (red) that interact with the CpG ODN_2395 (golden).
Figure 2. Modeling of CpG ODN_2395 bound Lumican. (A) Surface view of the lumican-CpG ODN_2395 complex shows the buried surface area (BSA) in red, lumican (blue), CpG ODN_2395 (golden). The combined BSA for Lumican-CpG DNA complex is 1803 Å2. Lumican residues contribute 42% (761 Å2) of the BSA, whereas CpG ODN_2395 contributes 58% (1042 Å2). (B) A ribbon 3D model showing all lumican residues (red) that interact with CpG ODN_2395 (golden). (C) Surface view of the TLR9-CpG DNA complex shows the buried surface area (BSA) in red, TLR9 (blue), and CpG ODN_2395 (golden). The combined BSA for the TLR9-CpG DNA complex is 2094 Å2. TLR9 residues contribute 42% (878 Å2) of the BSA. (D) A ribbon 3D model showing all TLR9 residues (red) that interact with the CpG ODN_2395 (golden).
Ijms 24 14990 g002
Figure 3. Close-up view of Lumican and CpG ODN_2395 interactions. (A) A schematic diagram showing lumican interacting with LRRs from lumican and nucleotides from CpG ODN_2395. The dotted lines in blue indicate H-bonds, red/yellow salt-bridges, and black pi-stacking interactions. (B) A 3D ribbon model showing all lumican residues (red) that interact with CpG ODN_2395 (golden). (C,D) A stick model showing the interaction of lumican with the first 5′ thymidine (T) base (C), and the next 8 bases of CpG ODN_2395 (D). The yellow ball shows the charge centers in CpG ODN_2395, and the grey ball indicates the aromatic charge centers of amino acids.
Figure 3. Close-up view of Lumican and CpG ODN_2395 interactions. (A) A schematic diagram showing lumican interacting with LRRs from lumican and nucleotides from CpG ODN_2395. The dotted lines in blue indicate H-bonds, red/yellow salt-bridges, and black pi-stacking interactions. (B) A 3D ribbon model showing all lumican residues (red) that interact with CpG ODN_2395 (golden). (C,D) A stick model showing the interaction of lumican with the first 5′ thymidine (T) base (C), and the next 8 bases of CpG ODN_2395 (D). The yellow ball shows the charge centers in CpG ODN_2395, and the grey ball indicates the aromatic charge centers of amino acids.
Ijms 24 14990 g003
Figure 4. Close up view of TLR9 ECD and CpG ODN_2395 interactions. (A) A schematic diagram showing all TLR9 LRR and CpG ODN_2395 interactions. The lines in blue indicates H-bonds and red/yellow salt-bridges. (B) A 3D ribbon model showing all the TLR9 residues (red) that interact with the CpG ODN_2395 (golden). (CE) Stick model indicates the interaction of TLR9 LRR2–8 with the bases or sugar phosphate backbone of the 5′-TCGTCG of CpG ODN_2395. (C) The LRR15, 20–22, and 25 interacting with phosphate backbone of the thymidine (T) at position 8, guanosine (G) at position 13, and cytosine (C) at position 20 of CpG ODN_2395, respectively. (D) LRR21 interacting with the phosphate backbone of guanosine (G) at position 22 of CpG ODN_2395. (E) The yellow ball shows the charge centers in CpG ODN_2395.
Figure 4. Close up view of TLR9 ECD and CpG ODN_2395 interactions. (A) A schematic diagram showing all TLR9 LRR and CpG ODN_2395 interactions. The lines in blue indicates H-bonds and red/yellow salt-bridges. (B) A 3D ribbon model showing all the TLR9 residues (red) that interact with the CpG ODN_2395 (golden). (CE) Stick model indicates the interaction of TLR9 LRR2–8 with the bases or sugar phosphate backbone of the 5′-TCGTCG of CpG ODN_2395. (C) The LRR15, 20–22, and 25 interacting with phosphate backbone of the thymidine (T) at position 8, guanosine (G) at position 13, and cytosine (C) at position 20 of CpG ODN_2395, respectively. (D) LRR21 interacting with the phosphate backbone of guanosine (G) at position 22 of CpG ODN_2395. (E) The yellow ball shows the charge centers in CpG ODN_2395.
Ijms 24 14990 g004
Figure 5. CpG DNA binding sites in LRR5–7 of lumican also known to bind collagen I. (A) CpG ODN_2395 binding residues in LRR5–7 of lumican. CpG ODN_2395 binding residues colored magenta. (B) Multiple sequence alignment of lumican LRR5–7 from different species. Interface residues are marked in magenta. Asp212 (D212) in cyan is the interface residue that interacts with type I collagen.
Figure 5. CpG DNA binding sites in LRR5–7 of lumican also known to bind collagen I. (A) CpG ODN_2395 binding residues in LRR5–7 of lumican. CpG ODN_2395 binding residues colored magenta. (B) Multiple sequence alignment of lumican LRR5–7 from different species. Interface residues are marked in magenta. Asp212 (D212) in cyan is the interface residue that interacts with type I collagen.
Ijms 24 14990 g005
Table 1. Lumican—CpG ODN_2395 interacting amino acid residues.
Table 1. Lumican—CpG ODN_2395 interacting amino acid residues.
Lum InterfaceBSALRRLRR Type
SER4517%NT-
ARG7322%LRR1T
ASN7423%
ILE9534%LRR2T
ASP9790%
HIS9842%
LYS11919%LRR3PS
HIS12165%
ASN12372%
TYR12420%
ASP14059%LRR4SDS22
GLN14296%
PHE16242%LRR5T
TYR16499%
GLN16667%
TYR18794%LRR6S
ASP18958%
THE20871%LRR7T
TYR21093%
ASP2127%
TYR23259%LRR8SDS22
ARG23471%
SER23684%
HIS23715%
GLU25777%LRR9SDS22
ASP25994%
SER261100%
TYR26241%
TYR27986%LRR10Irregular
GLU28194%
VAL2822%
ARG30937%LRR11Irregular
Table 2. TLR9—CpG ODN_2395 interacting amino acid residues.
Table 2. TLR9—CpG ODN_2395 interacting amino acid residues.
TLR9 InterfaceBSALRRLRR Type
LYS9529%LRR2Irregular
ASN12957%LRR3S
SER131100%
TYR13257%
SER14966%--
SER15185%LRR4T
HIS15270%
VAL17148%LRR5Irregular
PHE17379%
ASP17584%
TYR17963%
TYR18023%
LYS1819%
HIS20366%LRR6S
SER2050%
LYS20767%
TYR20856%
TYR22452%LRR7T
LEU22648%
PRO26217%LRR8Irregular
ASN26348%
GLN39955%LRR13T
MET40047%
PHE46737%-
ARG48224%LRR15T
ARG61336%LRR20Irregular
ASN64052%LRR21T
HIS64251%
ARG66223%LRR22T
TYR66550%
GLN68915%LRR23SDS22
SER71114%LRR24T
HIS73520%LRR25T
Table 3. Different types of interaction between lumican and CpG ODN_2395. Blue: H-Bond Donor; Orange: H-Bond Acceptor.
Table 3. Different types of interaction between lumican and CpG ODN_2395. Blue: H-Bond Donor; Orange: H-Bond Acceptor.
Lum-CpG
Hydrogen bondCpG ODN_2395AtomLumAtom
T1N3GLU281OE1
N3SER261OG
O2ARG309NH2
C2N4ASP259OD2
T4O2ARG234NH1
C5N3TYR210OH
O2TYR187OH
G6O4TYR164OH
T7O2GLN166NE2
T8O5GLN142NE2
T9OP1ASN74ND2
Salt bridgeCpG ODN_2395Charge CenterLumCharge Center
G3Purine ringGLU257Carboxylic
G6Purine ringASP189Carboxylic
T8P-backboneHIS121Imidazole
T9P-backboneHIS98Imidazole
Van der WaalsCpG ODN_2395AtomLumAtom
T1C7TYR262CE2
Pi-stackingCpG ODN_2395AtomLumAtom
T1Pyrimidine ringTYR262Tyrosyl ring
Table 4. Different types of interaction between TLR9 and CpG ODN_2395. Blue: H-Bond Donor; Orange: H-Bond Acceptor.
Table 4. Different types of interaction between TLR9 and CpG ODN_2395. Blue: H-Bond Donor; Orange: H-Bond Acceptor.
TLR9-CpG
Hydrogen bondCpG ODN_2395AtomTLR9Atom
T1N3HIS203NE2
G3N1LYS207NZ
N2NZ
N2SER205OG
T4OP1SER131OG
O3TYR132OH
C5O5TYR179OH
G6OP1TYR180OH
OP1ASN263OD1
O5ND2
C20O2TYR665OH
O4ARG613NH2
Salt bridgeCpG ODN_2395Charge CenterTLR9Charge Center
G3Purine ringASP175Carboxylic
T4P-backboneLYS95Amine
C5P-backboneHIS152Imidazole
T9P-backboneARG482Amine
G13P-backboneHIS735Imidazole
C21P-backboneARG613Amine
G22P-backboneHIS642Imidazole
Table 5. Sequence coverage and sequence identity between the target sequence and the modeling sequence.
Table 5. Sequence coverage and sequence identity between the target sequence and the modeling sequence.
Lumican-CpG ODN_2395 Docking
Template MoleculePDB IDChain IDAlgin_lengthCoverage (%)Seq_identity (%)
TLR3 (Receptor)7DA7A255 amino acids7631
dsRNA (Ligand)7DA7C14 bases6479
TLR9-CpG ODN_2395 Docking
Template MoleculePDB IDChain IDAlgin_lengthCoverage (%)Seq_identity (%)
TLR3 (Receptor)7DA7A733 amino acids9926
dsRNA (Ligand)7DA7C14 bases6479
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Choi, T.; Maiti, G.; Chakravarti, S. Three-Dimensional Modeling of CpG DNA Binding with Matrix Lumican Shows Leucine-Rich Repeat Motif Involvement as in TLR9-CpG DNA Interactions. Int. J. Mol. Sci. 2023, 24, 14990. https://doi.org/10.3390/ijms241914990

AMA Style

Choi T, Maiti G, Chakravarti S. Three-Dimensional Modeling of CpG DNA Binding with Matrix Lumican Shows Leucine-Rich Repeat Motif Involvement as in TLR9-CpG DNA Interactions. International Journal of Molecular Sciences. 2023; 24(19):14990. https://doi.org/10.3390/ijms241914990

Chicago/Turabian Style

Choi, Tansol, George Maiti, and Shukti Chakravarti. 2023. "Three-Dimensional Modeling of CpG DNA Binding with Matrix Lumican Shows Leucine-Rich Repeat Motif Involvement as in TLR9-CpG DNA Interactions" International Journal of Molecular Sciences 24, no. 19: 14990. https://doi.org/10.3390/ijms241914990

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop