Next Article in Journal
Genome-Wide Analysis of LysM-Containing Gene Family in Wheat: Structural and Phylogenetic Analysis during Development and Defense
Next Article in Special Issue
Epigenetic Evolution of ACE2 and IL-6 Genes: Non-Canonical Interferon-Stimulated Genes Correlate to COVID-19 Susceptibility in Vertebrates
Previous Article in Journal
Plasmodium vivax Cysteine-Rich Protective Antigen Polymorphism at Exon-1 Shows Recombination and Signatures of Balancing Selection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

IMGT® Biocuration and Comparative Analysis of Bos taurus and Ovis aries TRA/TRD Loci

by
Perrine Pégorier
,
Morgane Bertignac
,
Viviane Nguefack Ngoune
,
Géraldine Folch
,
Joumana Jabado-Michaloud
,
Véronique Giudicelli
,
Patrice Duroux
,
Marie-Paule Lefranc
and
Sofia Kossida
*
IMGT®, The International ImMunoGeneTics Information System®, Institut de Génétique Humaine (IGH), Centre National de la Recherche Scientifique (CNRS), Université de Montpellier (UM), 34000 Montpellier, France
*
Author to whom correspondence should be addressed.
Genes 2021, 12(1), 30; https://doi.org/10.3390/genes12010030
Submission received: 25 November 2020 / Revised: 17 December 2020 / Accepted: 22 December 2020 / Published: 28 December 2020
(This article belongs to the Special Issue The Immune Response in the Time of Omics Research)

Abstract

:
The adaptive immune response provides the vertebrate immune system with the ability to recognize and remember specific pathogens to generate immunity, and mount stronger attacks each time the pathogen is encountered. T cell receptors are the antigen receptors of the adaptive immune response expressed by T cells, which specifically recognize processed antigens, presented as peptides by the highly polymorphic major histocompatibility (MH) proteins. T cell receptors (TR) are divided into two groups, α β and γ δ , which express distinct TR containing either α and β , or γ and δ chains, respectively. The TR α locus (TRA) and TR δ locus (TRD) of bovine (Bos taurus) and the sheep (Ovis aries) have recently been described and annotated by IMGT® biocurators. The aim of the present study is to present the results of the biocuration and to compare the genes of the TRA/TRD loci among these ruminant species based on the Homo sapiens repertoire. The comparative analysis shows similarities but also differences, including the fact that these two species have a TRA/TRD locus about three times larger than that of humans and therefore have many more genes which may demonstrate duplications and/or deletions during evolution.

1. Introduction

The adaptive immune response arose in jawed vertebrates or gnathostomata more than 450 million years ago. It is characterized by the remarkable specificity and the extreme diversity of their antigen receptors [1]. These antigen receptors of the adaptive immune response are the immunoglobulins (IG) or antibodies of the B cells and plasmocytes [2], and the T cell receptors (TR) of the T cells [3]. The IG recognize antigens in their native form, whereas the TR recognize processed antigens, which are presented as peptides by the major histocompatibility (MH) proteins.
T cell receptors (TR) are divided into two groups, α β and γ δ , which express distinct TR containing either α and β , or γ and δ chains, respectively [3]. Each TR chain comprises a variable and a constant domain. The variable domain is the result of one rearrangement between variable (V) and joining (J) genes for α and γ chains, and two consecutive rearrangements between diversity (D) and J genes then between V and partially rearranged D-J genes for β and δ chains. After transcription, the V–(D)–J sequence is spliced to the constant (C) gene to give the final transcript [3].
The human TR α (TRA) locus consists of a cluster of 56 TRAV genes located upstream (in 5 ) of a J-C cluster, composed of sixty-one TRAJ and one TRAC [3]. The TR δ (TRD) locus is nested in the TRA locus between the TRAV and the TRAJ genes [3]. This locus comprises a cluster of one TRDV, three TRDD, four TRDJ, one TRDC and another TRDV, TRDV3, in inverted transcriptional orientation downstream of TRDC. One TRDV gene is also present among the TRAV cluster. The particularity of this locus is that some V genes rearrange to both TRAJ and TRDD-TRDJ genes [3].
Animal species, such as mice and larger animals, are essential models for biological research and studies on farm animals, and contribute, for example, to fundamental and applied immunology [4]. Furthermore, several veterinary species are useful for biotechnological applications that can also be applied to human medicine. This justifies the interest of scientists in the locus genomic organization of IG and TR genes involved in the adaptive immune responses. Ruminants such as sheep and cattle are considered among the “ γ δ T cell high” species, “ γ δ high” refering to the level of γ δ T cells in circulation. Bovine γ δ T cells were shown for example to respond to components of mycobacteria [5], to autologous antigens on monocytes [6]. The bovine is a valuable model to study respiratory disorders as coronaviruses [7] and influenza viruses [8]. Sheep is also a valuable model to study respiratory disorders as allergic asthma during pregnancy in relation with lung and immune development [9]. Several studies have already been done on the TRA/TRD locus of cattle [10,11,12,13] and sheep [14,15,16]. The complete genome assemblies, qualified as “representative genome”, are available at NCBI [17] for both species: ARS-UCD1.2 (de novo assembly, using long reads for assembly and short reads for scaffolding and polishing, of a Hereford cattle) for Bos taurus, and Oar_rambouillet_v1.0 (de novo assembly, using Pacific Biosciences, Menlo Park, CA, USA long reads for assembly, Illumina, San Diego, CA, USA short reads for polishing, and Hi-C Illumina data for scaffolding, of a Rambouillet sheep) for Ovis aries.
IMGT®, the international ImMunoGeneTics information system®, http://www.imgt.org [18], is the global reference in immunogenetics and immunoinformatics [1], founded in 1989 by Marie-Paule Lefranc at Montpellier (Université de Montpellier and CNRS). IMGT® is a high-quality integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), major histocompatibility (MH) of human and other vertebrate species, and in the immunoglobulin superfamily (IgSF), the MH superfamily (MhSF) and related proteins of the immune system (RPI) of vertebrates and invertebrates.
IMGT has performed the biocuration of the TRA/TRD locus extracted from the representative genome ARS-UCD1.2 (Bos taurus) and Oar_rambouillet_v1.0 (Ovis aries) in order to provide a complete overview regarding the gene content and organization for both species. The aim of this study is to present the results of the annotation of Bos taurus and Ovis aries TRA/TRD loci and to highlight the differences of the newly annotated locus compared to the previous published genomic assemblies (UMD3.1 [13], Btau_3.1 [12] and Btau 4.0 [11] for Bos taurus; Oar_v3.1 for Ovis aries [16]). The comparison of the TRA/TRD locus for both species and human is also provided.

2. Materials and Methods

2.1. Annotation of the TRA/TRD Locus

The IMGT biocuration pipeline for locus annotation has been described previously [19]. Each locus sequence was localized on the corresponding chromosome and subsequently extracted from NCBI assembly [17] in GenBank format. The delimitation of the locus was performed by the search of the “IMGT bornes” which are coding genes (other than IG or TR) conserved between species, located upstream of the first or downstream of the last gene of an IG or TR locus (http://www.imgt.org/IMGTindex/IMGTborne.php). The IMGT 5′ borne of the TRA/TRD locus is the olfactory receptor 10G3 (OR10G3) gene and the IMGT 3′ borne of the locus is the defender against cell death (DAD1) gene. The locus orientation on a chromosome can be either forward (FWD) or reverse (REV) therefore the REV locus sequences were placed in the 5′ to 3′ locus orientation. Each locus sequence was assigned to an IMGT® accession number (bovine: IMGT000049 (CM008177.2 (22253137-25584362, complement)) and sheep: IMGT000048 (CM008478.1 (23556113-26437716, complement))).
The biocuration has been performed manually assisted by internally developed tools (IMGT/LIGMotif [20], NtiToVald and IMGT/Automat [21]) based on the IMGT-ONTOLOGY axioms and concepts: ‘IDENTIFICATION’, ‘DESCRIPTION’, ‘CLASSIFICATION’, ‘NUMEROTATION’, ‘LOCALIZATION’, ‘ORIENTATION’ and ‘OBTENTION’ [22]. IMGT-ONTOLOGY includes the controlled vocabulary and annotation rules which are indispensable to ensure accuracy, consistency and coherence.
The IMGT nomenclature [23] of all TRAV and TRDV genes, ‘CLASSIFICATION’ axiom of IMGT-ONTOLOGY, was characterized according to the human TRAV/TRDV genes using IMGT/V-QUEST [24] and NGPhylogeny.fr [25] (using MAFFT [26] and PhyML [27] programs) to define the subgroups. TRAV/TRDV genes are designated by a number for the subgroup followed, whenever there are several genes belonging to the same subgroup, by a hyphen and a number picturing their relative localization in the locus. Numbers increase from 5′ to 3′ in the locus [3]. Two genes belong to the same subgroup if their identity percentage is greater than 75% in their V-REGION.
The IMGT unique numbering for the V-DOMAIN [28] and for the C-DOMAIN [29] (NUMEROTATION axiom) was used for the IMGT Colliers de Perles representations [30,31] and for the strands and loops delimitations of the V-REGION (FR1-IMGT to FR3-IMGT and CDR1-IMGT, CDR2-IMGT and germline CDR3-IMGT) and of the C-ALPHA and C-DELTA domains, respectively.
The functionality of the genes was defined according to the IMGT ‘functionality’ concept, part of the ‘IDENTIFICATION’ axiom of IMGT-ONTOLOGY, described in http://www.imgt.org/IMGTScientificChart/SequenceDescription/IMGTfunctionality.html. A gene is considered as functional if it has an open reading frame without stop codon, no defect in the splicing sites, recombination signals and/or regulatory elements; a gene is considered as ORF if the coding region has an open reading frame, but alterations in the splicing sites, recombination signals, regulatory elements and/or changes of conserved amino acids; a gene is considered as pseudogene if the coding region has stop codon(s) and/or frameshift mutation(s).
The main concept of the ‘DESCRIPTION’ axiom of IMGT-ONTOLOGY corresponds to IMGT® standardized labels in the databases and tools. A set of specific labels was defined to describe the different organizations of IG and TR genes in clusters at the scale of the locus or of the chromosome. They are available from the IMGT/LIGM-DB database http://www.imgt.org/ligmdb/label#.
The standardized annotation allows data entry in the IMGT® reference directory used in IMGT® databases and tools (IMGT/LIGM-DB [32], IMGT/GENE-DB [33], IMGT/ 3Dstructure-DB and IMGT/2Dstructure-DB [34], IMGT/V-QUEST [24], IMGT/HighV-QUEST [35] and IMGT/DomainGapAlign [36]). IMGT® genomic annotated data are then synthesized in IMGT Repertoire (http://www.imgt.org/IMGTrepertoire/) including several organized web pages (Locus representation, Locus description, Locus in genome assembly, Locus gene order, Gene tables, Potential germline repertoire, Protein displays, Alignments of alleles, Colliers de Perles [30,31], and [CDR1-IMGT.CDR2-IMGT.CDR3-IMGT] lengths) [19].

2.2. Comparison of the TRA/TRD Locus

The data obtained by biocuration were compared to the human TRA/TRD locus. The human TRA/TRD locus is located on chromosome 14 (14q11.2) on FWD orientation and spans 1000 kilobases (kb) [3]. The IMGT 5′ borne (OR10G3) has been identified 51 kb upstream of the first gene of the locus and the IMGT 3′ borne (DAD1), 13 kb downstream (in 3′) of the last gene of the locus. The potential repertoire consists of a total of 64 V genes: 56 TRAV genes (38 functional (F), 16 pseudogenes (P) and 2 F or P (depending on alleles)) belonging to 42 TRAV subgroups, 3 TRDV genes (F) belonging to 3 TRDV subgroups and 5 TRAV/DV genes (4 F and 1 F or P (depending on alleles)) belonging to 5 subgroups, 3 TRDD genes (F), 65 J genes: 61 TRAJ genes (50 F, 7 ORF, 3 P and 1 F or P) and 4 TRDJ genes (F), 1 TRAC gene (F) and 1 TRDC gene (F) [3].
A comparison was performed based on the number of genes in the locus as well as the number of genes per subgroup (potential germline repertoire), the locus representation, the functionality of genes and the CDR lengths. Potential duplications and/or deletions that may have occurred during evolution are susceptible to be highlighted from this sort of comparison.

3. Results

3.1. Annotation of TRA/TRD Loci

The two TRA/TRD loci were annotated following the pipeline described in the Materials and Methods. The results of the annotation described below are summarized in Table 1. The information regarding the genome assemblies and the boundaries is provided in Supplementary Table S1.
The bovine TRA/TRD locus, on chromosome 10 (REV), spans 3331 kb and consists of a total of 238 V genes: 183 TRAV genes (79 F, 14 ORF, 74 P, 3 F or ORF, 9 F or P, 3 ORF or P and 1 F or ORF or P) belonging to 40 TRAV subgroups and 39 (+16 non localized) TRDV genes (45 F, 5 ORF and 5 P) belonging to 5 TRDV subgroups, 9 TRDD genes (6 F and 3 ORF), 64 J genes: 60 TRAJ genes (52 F, 2 ORF, 4 P and 2 F or P) and 4 TRDJ genes (3 F and 1 ORF), 1 TRAC gene (F) and 1 TRDC gene (F). The IMGT 5’ borne (OR10G3) has been identified 24 kb upstream of the first gene of the locus and the IMGT 3’ borne (DAD1), has been identified 12 kb downstream of the last gene of the locus (cf. Supplementary Figure S1).
The sheep TRA/TRD locus, on chromosome 7 (REV), spans 2882 kb and consists of a total of 381 V genes: 277 (+16 non localized) TRAV genes (124 F, 11 ORF, 149 P, 1 F or ORF, 7 F or P and 1 ORF or P) belonging to 39 TRAV subgroups and 70 (+18 non localized) TRDV genes (34 F, 12 ORF, 28 P, 5 F or ORF, 6 F or P and 3 ORF or P) belonging to 5 TRDV subgroups, 9 TRDD genes (5 F and 4 ORF), 84 J genes: 79 (+1 non localized) TRAJ genes (61 F, 6 ORF and 13 P) and 4 TRDJ genes (3 F and 1 ORF), 1 TRAC gene (F) and 1 TRDC gene (F). The IMGT 5′ borne (OR10G3) was not found and IMGT 3′ borne (DAD1) has been identified 12 kb downstream of the last gene of the locus (cf. Supplementary Figure S2).

3.2. Comparison with Previous Studies

Regarding the sequences and the number of gaps, the quality of the last assemblies (this study) is better than the previous studies. For the bovine, the entire locus is localized on the chromosome 10 and there is only seven gaps. In all the previous assemblies there are genes on unplaced scaffolds and there are more than 260 gaps, except for [11]. On the other hand, many more genes have been described in previous studies (cf. Table 2). For the sheep, the entire locus is localized on the chromosome 7 and there are eighteen gaps. In the previous assembly there are genes on unplaced scaffolds and there are more than 80 gaps. Unlike cattle, fewer genes have been described in previous studies (cf. Table 3).
Given that there is access to two full assemblies (ARS-UCD1.2 for Bos taurus and Oar_rambouillet_v1.0 for Ovis aries), qualified as “representative genome” and as the corresponding TRA/TRD locus has been fully localized on a single chromosome with fewer gaps than in previous IMGT annotated genomic sequences, IMGT000049 and IMGT000048 are considered as IMGT references loci. It has allowed the establishment of the bovine and sheep TRA/TRD gene nomenclature, as well as the evaluation of the functionality of genes. The previous IMGT genomic sequences were re-annotated accordingly and the allelic variants determined based on nucleotide differences in the core region (V-REGION, D-REGION, J-REGION, C-REGION).

3.3. Comparison of the TRA J-C-CLUSTER

The number of TRAJ genes of human and bovine is similar and there are 19 more genes in sheep (cf. Table 1). Two TRAJ genes (TRAJ51 and TRAJ55) are missing in cattle and sheep compared to humans, and there are two TRAJ8 genes while there is only one in human. (cf. Table 4). The 19 TRAJ supplementary genes found in the sheep as a consequence of a duplication (or triplication for some genes) from TRAJ29 to TRAJ39 maybe due to a sequencing error or an amplification. Regarding the functionality, TRAC genes are functional and few TRAJ genes are P in human and bovine (3–4 and 4–6, depending on alleles, respectively). On the other hand, there are more pseudogenes in sheep mostly due to the duplicated genes (11 P out of 13 are duplicated genes) (cf. Table 4).
At the genomic level, each TRAC gene consists of several exons whose sizes are the same for all species except for exon 4 which is untranslated (EX4UTR) (cf. Figure 1). On the other hand, the size of the introns varies according to the species, especially between human and bovine/sheep. In humans, the intron between the exon 1 (EX1) and the exon 2 (EX2) and the intron between EX2 and the exon 3 (EX3) are shorter while the intron between EX3 and EX4UTR is longer compared to bovine and sheep. Each TRAC gene encodes a similar protein of 142 AA with the exon 1 (EX1) encoding the constant domain, the exon 2 (EX2) and the 5’ part of the exon 3 (EX3) encoding the connecting region, the middle of EX3 encoding the transmembrane region and the 3’ part of EX3 encoding the cytoplasmic region (cf. Figure 2). Nevertheless, the structure of EX1 is different, there are fewer AA in the E and F strand and more AA in the G strand of human TRAC compared to bovine/sheep.

3.4. Comparison of the TRD D-J-C-CLUSTER

The number of TRDJ genes of human, bovine and sheep is the same but there are more TRDD genes in bovine and sheep (nine against three in human) (cf. Table 1). Regarding the functionality, TRDC genes are functional, few TRDD genes are ORF in bovine and sheep (three and four, respectively) (cf. Table 5) and one TRDJ gene is ORF both in bovine and sheep (TRDJ2) (cf. Table 6).
Unlike TRAC, the size of the exons of TRDC varies depending on the species except for EX1 (cf. Figure 3). The EX2 is shorter in human but the EX3 is longer compared to bovine and sheep. In the same way, the size of the introns varies according to the species. Each TRDC gene encodes a similar protein of 155-156 AA with EX1 encoding the constant domain, EX2 and the 5′ part of EX3 encoding the connecting region and the 3′ part of EX3 encoding the transmembrane region (cf. Figure 4).

3.5. Comparison of the V-CLUSTER

The size of the V-CLUSTER (which describes the principal set of TRAV/TRDV genes) varies (cf. Figure 5). The V-CLUSTER is less extensive in human (56 genes on 900 kb) than in the bovine and sheep, which is consistent with the number of genes in these species (221 genes over 2200 kb and 346 genes on 2700 kb, respectively). Regarding the functionality of V genes, the proportion of functional genes is more important in human and in bovine compared to pseudogenes. However, there are more pseudogenes in sheep.

3.5.1. Comparison of the TRAV genes

All subgroups were defined according to those of the human genome. A phylogenetic tree with one representative gene by subgroup (except for TRAVA, TRAVB and TRAVC, highly degenerated pseudogenes present only in human) for the human, the bovine and the sheep was created in order to highlight the distance between the species within a subgroup (cf. Figure 6). This phylogenetic tree shows that, for the two species, the genes of a subgroup are grouped in the same branch with a corresponding human gene. Nonetheless there are subgroups missing in both cattle and sheep (TRAV7, TRAV15, TRAV30, TRAV31, TRAV32, TRAVA, TRAVB and TRAVC) and only in sheep (TRAV40), new subgroups in bovine and sheep (TRAV43, TRAV44 and TRAV45) and three subgroups are intermingled: TRAV4, TRAV26 and TRAV44 (cf. Supplementary Figure S3). However, there is less than 75% identity among the genes of these three subgroups for a given species, so they cannot be considered as genes belonging to the same subgroup.
The number of TRAV genes varies depending on the species. There are fewer genes in human than in bovine and fewer genes in bovine than in sheep (cf. Table 1). The number of genes per subgroup also varies according to the species (cf. Table 7). In humans there are one or two genes by subgroup except for TRAV8 and TRAV12 (eight and three genes, respectively) while in cattle and sheep there are subgroups highly developed. In the sheep, there are six subgroups with more than 20 genes (TRAV8, TRAV13, TRAV22, TRAV23, TRAV25 and TRAV44) and three subgroups with more than 10 genes (TRAV9, TRAV14 and TRAV43) although there are only five subgroups in bovine with more than 10 genes (TRAV22, TRAV23, TRAV25, TRAV44 and TRAV45). In addition, as show in the phylogenetic tree (cf. Figure 6) eight subgroups are absent in both species and one subgroup is missing only in sheep.
The CDR lengths are relatively well conserved between the different species (cf. Table 8). The most important differences are in bovine where for some subgroups there are two or three different lengths (TRAV10, TRAV20, TRAV22 and TRAV38) and for three human subgroups in which the CDR length is different from bovine and sheep (TRAV11, TRAV35 and TRAV39). These differences are shown in red in Table 8. For two subgroups (TRAV17 and TRAV18) the bovine has some genes with the same CDR lengths as human (in blue) and some with the same CDR lengths as sheep (in green).

3.5.2. Comparison of the TRDV genes

Like for the TRAV genes, the subgroups were defined according to those of the human genome and a phylogenetic tree with all genes was created (cf. Figure 7). This phylogenetic tree shows that, except for the TRDV1 subgroup, the genes are grouped in the same branch with a corresponding human gene. However the TRDV1 subgroup is divided in two branches even if there is more than 75% identity between all those genes.
As for the TRAV genes, the number of TRDV genes varies depending on the species. There are fewer genes in human than in bovine and fewer genes in bovine than in sheep (cf. Table 1). There are two new subgroups in bovine and sheep compared to human (TRDV4 and TRDV5) and the TRDV1 subgroup much larger in cattle and sheep with 50 and 84 genes, respectively, compared to 1 in human (cf. Table 9).
Contrary to TRAV genes, the CDR lengths are not conserved between human and bovine/sheep for TRDV2 and TRDV3 subgroups (cf. Table 10). For TRDV1 subgroups, there are several different lengths for bovine and sheep (nine and five respectively) due to the high number of genes in this subgroup. There are also genes with lack of CDR2-IMGT and part of CDR3-IMGT (deletion of nine amino acids (AA), not shown in Table 10). This particularity was already described in bovine by [11] and is present in sheep too. Four genes are concerned in bovine (three in-frame and one out-of-frame (P with frameshift)) and two in sheep (six in-frame and two out-of-frame). The in-frame genes are shown in Figure 8.

3.6. Analysis of the cDNA Sequences

The last step of the biocuration pipeline consists of the automatic annotation of the cDNAs available in IMGT/LIGM-DB database with the IMGT/Automat tool [21]: 176 cDNA sequences for cattle and 102 for sheep were annotated. This annotation allowed to highlight the transcription of approximately 50% (for cattle) and 40% (sheep) of the germline genes. Interestingly, TRAJ54 which has a stop codon in position 1 of the J-REGION, and TRDV1-13 with a stop codon in position 108, last position of the V-REGION have been found rearranged and give a productive sequence (with no stop codon and an in-frame junction) in accessions numbers JX065661 (http://www.imgt.org/ligmdb/result.action?accessionNumber=JX065661) and BC113229 (http://www.imgt.org/ligmdb/result.action?accessionNumber=BC113229) respectively, showing the trimming of the stop codon during the rearrangement.

4. Discussion

This study was carried out in order to highlight the differences between the IMGT® annotation and the data previously published and to compare the TRA/TRD loci among bovine and sheep against the human locus. The annotation of each locus followed the pipeline defined in Materials and Methods. The expertise that follows this pipeline permits to establish the TRA/TRD germline repertoire according to IMGT® nomenclature and the IMGT® reference directory (IMGT® reference sequences used by IMGT® tools) of each locus and thus obtain sequence, gene and structure data. For each gene analyzed, there are more than 200 pieces of information available in IMGT® databases, tools and web pages. The comparison of the data obtained after the biocuration was carried out against the data of the human TRA/TRD loci. This analysis was done with respect to the data entered in IMGT Repertoire.
The two loci in the last assemblies have fewer gaps and are localized on a chromosome without unplaced scaffold compared to the previous studies (cf. Table 2 and Table 3). Indeed, it is a basic requirement, with an expected positional organization of genes in the locus, for the annotation of a complete locus with a definitive nomenclature in IMGT®. We rely on publicly available data, which is why we need good quality data so that we can annotate what we see with good quality annotations.
It is worth noting that the nomenclature presented in this manuscript, for the under question loci and species, is carved on stone and it will not change in the future. As a matter of fact, once the IMGT biocuration team gets hold of a genomic assembly covering the whole locus (no contigs, no scaffolds), then a reference assembly is established which gives rise to the definite IMGT nomenclature. Obviously enough, subsequent assemblies might/will be available either for the same individual or for other individuals which will constitute novel haplotypes in the latter case, but will not afffect the original nomenclature.
During the analysis of the TRA/TRD locus in bovine and sheep, it was noted that the general organization of the locus is conserved and is similar to the human one even if the V-CLUSTER is more extensive (cf. Figure 5). It should be noted that the IMGT® unique nomenclature, based on subgroup assignment and position of genes within the locus, represents a valuable help in highlighting locus organizational similarities or differences.
The results show that some subgroups are missing and three new subgroups were described in bovine and sheep compared to human. Some subgroups are more represented in bovine and in sheep than in human, which may indicate potential duplications during evolution. It can also explain the difference in the proportion of functional genes. Indeed, duplicated subgroups in bovine and sheep are composed of an important proportionality of pseudogenes resulting higher number of pseudogenes compared to human. Another indication of duplication during evolution is the presence of an important number of TRDV1 genes (50 in bovine and 66 in sheep) compared to 1 in human [13].
In the TRAV genes, there is only one CDR length for most of human, bovine and sheep subgroups, except for six bovine subgroups (TRAV10, TRAV17, TRAV18, TRAV20, TRAV22 and TRAV38) (cf. Table 8) while in the TRDV1 subgroups there are several lengths (cf. Table 10) and even some genes without CDR2-IMGT (cf. Figure 8).
It would be interesting to see if these specificities (expansion of the TRDV1 subgroup and of the TRAV subgroups, absence of CDR2-IMGT for some TRDV1 genes, etc.) are also found in other ruminant species.
The veterinary species are valuable models for immunological and medical research. The comparison of the TRA/TRD locus among bovine and sheep presented here allow to have a global vision of the TRA/TRD locus in Bovidae and will be a useful resource to analyze the TRA/TRD locus in new species not yet analyzed. The work carried out and the use of the methodology established for the analysis of the TRB locus [19] show that this procedure can be used to facilitate the analysis of IG (IGH, IGK and IGL) and TR (TRA, TRB, TRD and TRG) loci among different species.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/12/1/30/s1. Table S1: Information regarding the genome assembly and TRA/TRD locus IMGT 5′ and 3′ borne in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries). Figure S1: Locus representation of the bovine (Bos taurus) TRA/TRD locus. Colors are according to IMGT color menu for genes (http://www.imgt.org/IMGTScientificChart/RepresentationRules/colormenu.php#h1_28). The dotted line indicates the distance in kb between two genes not represented at scale. Figure S2: Locus representation of the sheep (Ovis aries) TRA/TRD locus. Colors are according to IMGT color menu for genes (http://www.imgt.org/IMGTScientificChart/RepresentationRules/colormenu.php#h1_28). Figure S3: Phylogenetic tree of all TRAV genes for all species (using V-REGION). Homsap: human, Bostau: bovine and Oviari: sheep. Tree generated using NGPhylogeny.fr [25] (with MAFFT [26] and PhyML [27] programs) and iTOL v4 [37].

Author Contributions

Data curation, P.P., V.N.N., G.F. and J.J.-M. ; formal analysis and writing—original draft preparation, P.P.; validation, M.B., V.N.N., G.F., J.J.-M., V.G. and P.D.; supervision, M.-P.L. and S.K. All authors have read and agreed to the published version of the manuscript.

Funding

IMGT® was funded in part by the BIOMED1 (BIOCT930038), Biotechnology BIOTECH2 (BIO4CT960037), fifth PCRDT Quality of Life and Management of Living Resources (QLG2-2000-01287), and sixth PCRDT Information Science and Technology (ImmunoGrid, FP6 IST-028069) programmes of the European Union (EU). IMGT® received financial support from the GIS IBiSA, BioCampus Montpellier, the Région Occitanie (Grand Plateau Technique pour la Recherche (GPTR)), the Agence Nationale de la recherche (ANR) and the Labex MabImprove (ANR-10-LABX-53-01). IMGT® is currently supported by the Centre National de la Recherche Scientifique (CNRS), the Ministère de l’Enseignement Supérieur, de la Recherche et de l’Innovation (MESRI) and the University of Montpellier (UM). This work was granted access to the HPC@LR and to the High Performance Computing (HPC) resources of the Centre Informatique National de l’Enseignement Supérieur (CINES) and to Très Grand Centre de Calcul (TGCC) of the Commissariat à l’Energie Atomique et aux Energies Alternatives (CEA) under the allocation [036029] (2010–2020) made by GENCI (Grand Equipement National de Calcul Intensif).

Data Availability Statement

The IMGT® software and data are provided to the academic users and NPO’s (Not for Profit Organization(s)) under the CC BY-NC-ND 4.0 license. Any other use of IMGT® material, from the private sector, needs a financial arrangement with CNRS.

Acknowledgments

We are grateful to Gérard Lefranc for helpful discussion, to the IMGT® team for their expertise and constant motivation, to Dominique Scaviner for the initial annotation of the human TRA/TRD, to Amandine Lacan (deceased October 19, 2018) for the initial annotation of the bovine TRA/TRD (based on [12]) and to Imène Chentli for the initial annotation of the sheep TRA/TRD locus (based on [16]). IMGT® is a registered trademark of CNRS. IMGT® is a member of the International Medical Informatics Association (IMIA) and of the Global Alliance for Genomics and Health (GA4GH).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
IGImmunoglobulin
TRT cell receptor
MHMajor histocompatibility
IgSFImmunoglobulin Superfamily
MhSFMH Superfamily
RPIRelated Protein of the Immune system
VVariable
DDiversity
JJoining
CConstant
OR10G3Olfactory Receptor 10G3
DAD1Defender Against cell Death
FWDForward
REVReverse
kbkilobase
FFunctional
PPseudogene
AAAmino acid
EX4UTRExon 4 untranslated
EX1Exon 1
EX2Exon 2
EX3Exon 3

References

  1. Lefranc, M.P. Immunoglobulin and T Cell Receptor Genes: IMGT(®) and the Birth and Rise of Immunoinformatics. Front. Immunol. 2014, 5, 22. [Google Scholar] [CrossRef] [Green Version]
  2. Lefranc, M.P.; Lefranc, G. The Immunoglobulin FactsBook; Academic Press: London, UK, 2001. [Google Scholar]
  3. Lefranc, M.P.; Lefranc, G. The T Cell Receptor FactsBook; Academic Press: London, UK, 2001. [Google Scholar]
  4. Guzman, E.; Montoya, M. Contributions of Farm Animals to Immunology. Front. Vet. Sci. 2018, 5, 307. [Google Scholar] [CrossRef] [Green Version]
  5. Welsh, M.D.; Kennedy, H.E.; Smyth, A.J.; Girvin, R.M.; Andersen, P.; Pollock, J.M. Responses of bovine WC1(+) gammadelta T cells to protein and nonprotein antigens of Mycobacterium bovis. Infect. Immun. 2002, 70, 6114–6120. [Google Scholar] [CrossRef] [Green Version]
  6. Sathiyaseelan, T.; Naiman, B.; Welte, S.; Machugh, N.; Black, S.J.; Baldwin, C.L. Immunological characterization of a gammadelta T-cell stimulatory ligand on autologous monocytes. Immunology 2002, 105, 181–189. [Google Scholar] [CrossRef]
  7. Saif, L.J. Bovine respiratory coronavirus. Vet. Clin. N. Am. Food Anim. Pract. 2010, 26, 349–364. [Google Scholar] [CrossRef]
  8. Su, S.; Fu, X.; Li, G.; Kerlin, F.; Veit, M. Novel Influenza D virus: Epidemiology, pathology, evolution and biological characteristics. Virulence 2017, 8, 1580–1591. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Wooldridge, A.L.; Clifton, V.L.; Moss, T.J.M.; Lu, H.; Jamali, M.; Agostino, S.; Muhlhausler, B.S.; Morrison, J.L.; De Matteo, R.; Wallace, M.J.; et al. Maternal allergic asthma during pregnancy alters fetal lung and immune development in sheep: Potential mechanisms for programming asthma and allergy. J. Physiol. 2019, 597, 4251–4262. [Google Scholar] [CrossRef] [PubMed]
  10. Fries, R.; Ewald, D.; Thaller, G.; Buitkamp, J. Assessment of the nucleotide sequence variability in the bovine T-cell receptor alpha delta joining gene region. Anim. Biotechnol. 2001, 12, 29–49. [Google Scholar] [CrossRef] [PubMed]
  11. Reinink, P.; Van Rhijn, I. The bovine T cell receptor alpha/delta locus contains over 400 V genes and encodes V genes without CDR2. Immunogenetics 2009, 61, 541–549. [Google Scholar] [CrossRef] [Green Version]
  12. Herzig, C.T.A.; Lefranc, M.P.; Baldwin, C.L. Annotation and classification of the bovine T cell receptor delta genes. BMC Genom. 2010, 11, 100. [Google Scholar] [CrossRef] [Green Version]
  13. Connelley, T.K.; Degnan, K.; Longhi, C.W.; Morrison, W.I. Genomic analysis offers insights into the evolution of the bovine TRA/TRD locus. BMC Genom. 2014, 15, 994. [Google Scholar] [CrossRef] [PubMed]
  14. Massari, S.; Antonacci, R.; De Caro, F.; Lipsi, M.R.; Ciccarese, S. Assignment of the TCRA/TCRD locus to sheep chromosome bands 7q1.4–>q2.2 by fluorescence in situ hybridization. Cytogenet. Cell Genet. 1997, 79, 193–195. [Google Scholar] [CrossRef] [PubMed]
  15. Antonacci, R.; Lanave, C.; Del Faro, L.; Vaccarelli, G.; Ciccarese, S.; Massari, S. Artiodactyl emergence is accompanied by the birth of an extensive pool of diverse germline TRDV1 genes. Immunogenetics 2005, 57, 254–266. [Google Scholar] [CrossRef] [PubMed]
  16. Piccinni, B.; Massari, S.; Caputi Jambrenghi, A.; Giannico, F.; Lefranc, M.P.; Ciccarese, S.; Antonacci, R. Sheep (Ovis aries) T cell receptor alpha (TRA) and delta (TRD) genes and genomic organization of the TRA/TRD locus. BMC Genom. 2015, 16, 709. [Google Scholar] [CrossRef] [PubMed]
  17. Kitts, P.A.; Church, D.M.; Thibaud-Nissen, F.; Choi, J.; Hem, V.; Sapojnikov, V.; Smith, R.G.; Tatusova, T.; Xiang, C.; Zherikov, A.; et al. Assembly: A resource for assembled genomes at NCBI. Nucleic Acids Res. 2016, 44, D73–D80. [Google Scholar] [CrossRef] [PubMed]
  18. Lefranc, M.P.; Giudicelli, V.; Duroux, P.; Jabado-Michaloud, J.; Folch, G.; Aouinti, S.; Carillon, E.; Duvergey, H.; Houles, A.; Paysan-Lafosse, T.; et al. IMGT®, the international ImMunoGeneTics information system® 25 years on. Nucleic Acids Res. 2015, 43, D413–D422. [Google Scholar] [CrossRef] [Green Version]
  19. Pégorier, P.; Bertignac, M.; Chentli, I.; Nguefack Ngoune, V.; Folch, G.; Jabado-Michaloud, J.; Hadi-Saljoqi, S.; Giudicelli, V.; Duroux, P.; Lefranc, M.P.; et al. IMGT® Biocuration and Comparative Study of the T Cell Receptor Beta Locus of Veterinary Species Based on Homo sapiens TRB. Front. Immunol. 2020, 11, 821. [Google Scholar] [CrossRef]
  20. Lane, J.; Duroux, P.; Lefranc, M.P. From IMGT-ONTOLOGY to IMGT/LIGMotif: The IMGT standardized approach for immunoglobulin and T cell receptor gene identification and description in large genomic sequences. BMC Bioinform. 2010, 11, 223. [Google Scholar] [CrossRef] [Green Version]
  21. Giudicelli, V.; Chaume, D.; Jabado-Michaloud, J.; Lefranc, M.P. Immunogenetics Sequence Annotation: The Strategy of IMGT based on IMGT-ONTOLOGY. Stud. Health Technol. Inform. 2005, 116, 3–8. [Google Scholar]
  22. Giudicelli, V.; Lefranc, M.P. IMGT-ONTOLOGY 2012. Front. Genet. 2012, 3, 79. [Google Scholar] [CrossRef] [Green Version]
  23. Lefranc, M.P. WHO-IUIS Nomenclature Subcommittee for immunoglobulins and T cell receptors report. Immunogenetics 2007, 59, 899–902. [Google Scholar] [CrossRef] [PubMed]
  24. Brochet, X.; Lefranc, M.P.; Giudicelli, V. IMGT/V-QUEST: The highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis. Nucleic Acids Res. 2008, 36, W503–W508. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Lemoine, F.; Correia, D.; Lefort, V.; Doppelt-Azeroual, O.; Mareuil, F.; Cohen-Boulakia, S.; Gascuel, O. NGPhylogeny.fr: New generation phylogenetic services for non-specialists. Nucleic Acids Res. 2019, 47, W260–W265. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Guindon, S.; Dufayard, J.F.; Lefort, V.; Anisimova, M.; Hordijk, W.; Gascuel, O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 2010, 59, 307–321. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Lefranc, M.P.; Pommié, C.; Ruiz, M.; Giudicelli, V.; Foulquier, E.; Truong, L.; Thouvenin-Contet, V.; Lefranc, G. IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Dev. Comp. Immunol. 2003, 27, 55–77. [Google Scholar] [CrossRef]
  29. Lefranc, M.P.; Pommié, C.; Kaas, Q.; Duprat, E.; Bosc, N.; Guiraudou, D.; Jean, C.; Ruiz, M.; Da Piédade, I.; Rouard, M.; et al. IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains. Dev. Comp. Immunol. 2005, 29, 185–203. [Google Scholar] [CrossRef]
  30. Lefranc, M.P. IMGT Collier de Perles for the variable (V), constant (C), and groove (G) domains of IG, TR, MH, IgSF, and MhSF. Cold Spring Harb. Protoc. 2011, 2011, 643–651. [Google Scholar] [CrossRef]
  31. Ehrenmann, F.; Giudicelli, V.; Duroux, P.; Lefranc, M.P. IMGT/Collier de Perles: IMGT standardized representation of domains (IG, TR, and IgSF variable and constant domains, MH and MhSF groove domains). Cold Spring Harb. Protoc. 2011, 2011, 726–736. [Google Scholar] [CrossRef]
  32. Giudicelli, V.; Duroux, P.; Ginestoux, C.; Folch, G.; Jabado-Michaloud, J.; Chaume, D.; Lefranc, M.P. IMGT/LIGM-DB, the IMGT comprehensive database of immunoglobulin and T cell receptor nucleotide sequences. Nucleic Acids Res. 2006, 34, D781–D784. [Google Scholar] [CrossRef] [Green Version]
  33. Giudicelli, V.; Chaume, D.; Lefranc, M.P. IMGT/GENE-DB: A comprehensive database for human and mouse immunoglobulin and T cell receptor genes. Nucleic Acids Res. 2005, 33, D256–D261. [Google Scholar] [CrossRef] [PubMed]
  34. Kaas, Q.; Ruiz, M.; Lefranc, M.P. IMGT/3Dstructure-DB and IMGT/StructuralQuery, a database and a tool for immunoglobulin, T cell receptor and MHC structural data. Nucleic Acids Res. 2004, 32, D208–D210. [Google Scholar] [CrossRef] [PubMed]
  35. Alamyar, E.; Duroux, P.; Lefranc, M.P.; Giudicelli, V. IMGT(®) tools for the nucleotide analysis of immunoglobulin (IG) and T cell receptor (TR) V-(D)-J repertoires, polymorphisms, and IG mutations: IMGT/V-QUEST and IMGT/HighV-QUEST for NGS. Methods Mol. Biol. 2012, 882, 569–604. [Google Scholar] [CrossRef] [PubMed]
  36. Ehrenmann, F.; Kaas, Q.; Lefranc, M.P. IMGT/3Dstructure-DB and IMGT/DomainGapAlign: A database and a tool for immunoglobulins or antibodies, T cell receptors, MHC, IgSF and MhcSF. Nucleic Acids Res. 2010, 38, D301–D307. [Google Scholar] [CrossRef] [PubMed]
  37. Letunic, I.; Bork, P. Interactive Tree of Life (iTOL) v4: Recent updates and new developments. Nucleic Acids Res. 2019, 47, W256–W259. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Structure of the TRAC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). The numbers correspond to the size of the exons and introns in nucleotides.
Figure 1. Structure of the TRAC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). The numbers correspond to the size of the exons and introns in nucleotides.
Genes 12 00030 g001
Figure 2. Protein display of the TRAC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). Only alleles *01 are shown. The description of the strands and loops is according to the IMGT unique numbering for C-DOMAIN [29]. The AA between parentheses at the beginning of EX1, EX2 and EX3 corresponds to the first codon resulting from a splicing frame 1 (sf1). (http://www.imgt.org/IMGTeducation/Aide-memoire/_UK/splicing/). Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Proteins and alleles > Protein displays > C-DOMAIN > TRAC > Human, ibid. Bovine, ibid. Sheep.
Figure 2. Protein display of the TRAC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). Only alleles *01 are shown. The description of the strands and loops is according to the IMGT unique numbering for C-DOMAIN [29]. The AA between parentheses at the beginning of EX1, EX2 and EX3 corresponds to the first codon resulting from a splicing frame 1 (sf1). (http://www.imgt.org/IMGTeducation/Aide-memoire/_UK/splicing/). Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Proteins and alleles > Protein displays > C-DOMAIN > TRAC > Human, ibid. Bovine, ibid. Sheep.
Genes 12 00030 g002
Figure 3. Structure of the TRDC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). The numbers correspond to the size of the exons and introns in nucleotides.
Figure 3. Structure of the TRDC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). The numbers correspond to the size of the exons and introns in nucleotides.
Genes 12 00030 g003
Figure 4. Protein display of the TRDC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). Only alleles *01 are shown. The description of the strands and loops is according to the IMGT unique numbering for C-DOMAIN [29]. The AA between parentheses at the beginning of EX1, EX2 and EX3 corresponds to the first codon resulting from a splicing frame 1 (sf1). (http://www.imgt.org/IMGTeducation/Aide-memoire/_UK/splicing/). Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Proteins and alleles > Protein displays > C-DOMAIN > TRAC > Human, ibid. Bovine, ibid. Sheep.
Figure 4. Protein display of the TRDC genes in human (Homsap), bovine (Bostau) and sheep (Oviari). Only alleles *01 are shown. The description of the strands and loops is according to the IMGT unique numbering for C-DOMAIN [29]. The AA between parentheses at the beginning of EX1, EX2 and EX3 corresponds to the first codon resulting from a splicing frame 1 (sf1). (http://www.imgt.org/IMGTeducation/Aide-memoire/_UK/splicing/). Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Proteins and alleles > Protein displays > C-DOMAIN > TRAC > Human, ibid. Bovine, ibid. Sheep.
Genes 12 00030 g004
Figure 5. Schematic comparison of the TRA/TRD V-CLUSTER among human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries). Colors are according to IMGT color menu for genes (http://www.imgt.org/IMGTScientificChart/RepresentationRules/colormenu.php#h1_28): in green: functional genes, in yellow: ORF genes and in red: pseudogenes. The dotted line in Bos taurus indicates the distance in kb between two genes not represented at scale. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Locus representations > TRA, ibid. TRD > Human, ibid. Bovine, ibid. Sheep.
Figure 5. Schematic comparison of the TRA/TRD V-CLUSTER among human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries). Colors are according to IMGT color menu for genes (http://www.imgt.org/IMGTScientificChart/RepresentationRules/colormenu.php#h1_28): in green: functional genes, in yellow: ORF genes and in red: pseudogenes. The dotted line in Bos taurus indicates the distance in kb between two genes not represented at scale. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Locus representations > TRA, ibid. TRD > Human, ibid. Bovine, ibid. Sheep.
Genes 12 00030 g005
Figure 6. Phylogenetic tree of all TRAV subgroups for all species with one representative gene per subgroup (using V-REGION). Homsap: human, Bostau: bovine and Oviari: sheep. The different colors highlight the different subgroups. In black: subgroups only present in humans. Tree generated using NGPhylogeny.fr [25] (with MAFFT [26] and PhyML [27] programs) and iTOL v4 [37].
Figure 6. Phylogenetic tree of all TRAV subgroups for all species with one representative gene per subgroup (using V-REGION). Homsap: human, Bostau: bovine and Oviari: sheep. The different colors highlight the different subgroups. In black: subgroups only present in humans. Tree generated using NGPhylogeny.fr [25] (with MAFFT [26] and PhyML [27] programs) and iTOL v4 [37].
Genes 12 00030 g006
Figure 7. Phylogenetic tree of all TRDV genes for all species (using V-REGION). Homsap: human, Bostau: bovine and Oviari: sheep. The different colors highlight the different subgroups. Tree generated using NGPhylogeny.fr [25] (with MAFFT [26] and PhyML [27] programs) and iTOL v4 [37].
Figure 7. Phylogenetic tree of all TRDV genes for all species (using V-REGION). Homsap: human, Bostau: bovine and Oviari: sheep. The different colors highlight the different subgroups. Tree generated using NGPhylogeny.fr [25] (with MAFFT [26] and PhyML [27] programs) and iTOL v4 [37].
Genes 12 00030 g007
Figure 8. Protein display of the TRDV1 genes with lack of CDR2-IMGT and part of FR3-IMGT in bovine (Bostau) and sheep (Oviari). Only alleles *01 are shown. The description of the strands and loops is according to the IMGT unique numbering for V-REGION [28]. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Proteins and alleles > Protein displays > V-REGION > TRDV > Human, ibid. Bovine, ibid. Sheep.
Figure 8. Protein display of the TRDV1 genes with lack of CDR2-IMGT and part of FR3-IMGT in bovine (Bostau) and sheep (Oviari). Only alleles *01 are shown. The description of the strands and loops is according to the IMGT unique numbering for V-REGION [28]. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Proteins and alleles > Protein displays > V-REGION > TRDV > Human, ibid. Bovine, ibid. Sheep.
Genes 12 00030 g008
Table 1. Results of the analysis of TRA/TRD loci in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 1. Results of the analysis of TRA/TRD loci in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SpeciesHomo sapiensBos taurusOvis aries
Chromosome (Orientation)14 (forward (FWD))10 (REV)7 (REV)
Size (kb)100033312882
Number of TRAV genes56183277 (+16 non localized)
Number of TRDV genes339 (+ 16 non localized)70 (+18 non localized)
Number of TRAV/DV genes500
Number of TRDD genes399
Number of TRAJ genes616079 (+1 non localized)
Number of TRDJ genes444
Number of TRAC genes111
Number of TRDC genes111
Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Locus descriptions > Locus description > TRA, ibid. TRD > Human, ibid. Bovine, ibid. Sheep.
Table 2. Comparison of the different studies of TRA/TRD loci in bovine (Bos taurus).
Table 2. Comparison of the different studies of TRA/TRD loci in bovine (Bos taurus).
This StudyConnelley et al.,
2014 [13]
Herzig et al., 2010, [12]Reinink and Van
Rhijn, 2009 [11]
AssemblyARS-UCD1.2UMD3.1Btau_3.1Btau4.0
Chromosomechr 10chr 10 + chr 9 + 1 unplaced scaffold2 scaffolds on chr 10 + 27 unplaced scaffoldchr 10 + 3 unplaced scaffolds
Number of gap7Around 2602655
Number of TRAV183306NA318
Number of TRDV39655680
Number of TRDD955NA
Number of TRAJ6062NANA
Number of TRDJ433NA
Number of TRAC11NANA
Number of TRDC111NA
chr: chromosome; NA: not applicable.
Table 3. Comparison of the different studies of TRA/TRD loci in sheep (Ovis aries).
Table 3. Comparison of the different studies of TRA/TRD loci in sheep (Ovis aries).
This StudyPiccinni et al., 2015 [16]Antonacci et al., 2005 [15]
AssemblyOar_rambouillet_v1.0Oar_v3.1individual sequences
Chromosomechr 7chr 7 + 25 unplaced scaffoldsNA
Number of gap1883NA
Number of TRAV27766NA
Number of TRDV702523
Number of TRDD97NA
Number of TRAJ7961NA
Number of TRDJ44NA
Number of TRAC11NA
Number of TRDC11NA
chr: chromosome; NA: not applicable.
Table 4. IMGT Potential germline repertoires of the TRAJ sets in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 4. IMGT Potential germline repertoires of the TRAJ sets in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SetsHomo sapiensBos taurusOvis aries
TRAJ11 O1 F1 F
TRAJ21 O1 F1 F
TRAJ31 F1 F1 O (2)
TRAJ41 F1 FP (2)1 O
TRAJ51 F1 F1 F (2)
TRAJ61 F1 F1 O
TRAJ71 F1 F1 F
TRAJ81 FP (2)2 F (3)2 F
TRAJ91 F1 F1 F (2)
TRAJ101 F1 F1 F (2)
TRAJ111 F1 F (2)1 F
TRAJ121 F1 F1 F
TRAJ131 F (2)1 P (2)1 O
TRAJ141 F1 P (2)1 O
TRAJ151 F (2)1 FP (2)1 F
TRAJ161 F (2)1 F1 F
TRAJ171 F1 F1 F
TRAJ181 F1 F1 F
TRAJ191 O1 P1 P
TRAJ201 F1 F1 F
TRAJ211 F1 F1 F
TRAJ221 F1 F1 F
TRAJ231 F (2)1 F1 F
TRAJ241 F (3)1 F1 F
TRAJ251 O1 F (2)1 F
TRAJ261 F1 O1 F
TRAJ271 F1 F1 F
TRAJ281 F1 F1 F
TRAJ291 F1 F2 F, 1 O
TRAJ301 F1 F1 F, 1 P
TRAJ311 F1 F (2)1 F, 2 P
TRAJ321 F (2)1 F2 F (3), 1 P
TRAJ331 F1 F2 F, 1 P
TRAJ341 F1 F2 F, 1 P
TRAJ351 F1 O2 F (3), 1 P
TRAJ361 F1 F2 F, 1 P
TRAJ371 F (2)1 F (2)2 F (3), 1 P
TRAJ381 F1 F1 F, 1 P
TRAJ391 F1 F (2)1 F, 1 P
TRAJ401 F1 F1 F
TRAJ411 F1 F (2)1 F
TRAJ421 F1 F1 F
TRAJ431 F1 F1 F
TRAJ441 F1 F2 F
TRAJ451 F1 F (2)1 F
TRAJ461 F1 F1 F
TRAJ471 F (2)1 F1 F
TRAJ481 F1 F1 F
TRAJ491 F1 F1 F
TRAJ501 F1 F1 F
TRAJ511 P--
TRAJ521 F1 F1 F
TRAJ531 F1 F1 F
TRAJ541 F1 F1 F
TRAJ551 P--
TRAJ561 F1 F1 F
TRAJ571 F1 F1 F
TRAJ581 O1 F1 F
TRAJ591 O1 F1 F
TRAJ601 P1 F1 F
TRAJ611 O1 P1 P
Total per Fct50 F + 7 O + 3 P + 1 FP52 F + 2 O + 4 P + 2 FP61 F + 6 O + 13 P
Total genes61 (71)60 (72)80 (87)
For each TRAJ set, in each species, the number of TRAJ genes by functionality and, between parentheses, the number of alleles are shown. F: functional; O: ORF; P: pseudogene; FP: genes with alleles of different functionalities. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Potential germline repertoires > TRAV and TRAJ > Human, ibid. Bovine, ibid. Sheep.
Table 5. IMGT Potential germline repertoires of the TRDD sets in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 5. IMGT Potential germline repertoires of the TRDD sets in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SetsHomo sapiensBos taurusOvis aries
TRDD11 F1 O1 O (2)
TRDD21 F1 F1 F
TRDD31 F1 O1 O
TRDD4-1 F1 F
TRDD5-1 O1 O
TRDD6-1 F1 F
TRDD7-1 F1 O
TRDD8-1 F1 F
TRDD9-1 F1 F
Total per Fct3 F6 F + 3 O5 F + 4 O
Total genes3 (3)9 (9)9 (10)
For each TRDD set, in each species, the number of TRDD genes by functionality and, between parentheses, the number of alleles are shown. F: functional; O: ORF. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Potential germline repertoires > TRDV, TRDD and TRDJ > Human, ibid. Bovine, ibid. Sheep.
Table 6. IMGT Potential germline repertoires of the TRDJ sets in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 6. IMGT Potential germline repertoires of the TRDJ sets in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SetsHomo sapiensBos taurusOvis aries
TRDJ11 F1 F1 F
TRDJ21 F1 O1 O
TRDJ31 F1 F1 F
TRDJ41 F1 F1 F
Total per Fct4 F3 F + 1 O3 F + 1 O
Total genes4 (4)4 (4)4 (4)
For each TRDJ set, in each species, the number of TRDJ genes by functionality and, between parentheses, the number of alleles are shown. F: functional; O: ORF. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Potential germline repertoires > TRDV, TRDD and TRDJ > Human, ibid. Bovine, ibid. Sheep.
Table 7. IMGT Potential germline repertoires of the TRAV subgroups in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 7. IMGT Potential germline repertoires of the TRAV subgroups in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SubgroupsHomo sapiensBos taurusOvis aries
TRAV12 F (5)1 F1 O
TRAV21 F (2)5 F, 2 P1 O (2)
TRAV31 FP (2)6 F , 1 FP (2)1 F (2)
TRAV41 F1 P2 F (3)
TRAV51 F1 F (3)1 FP (2)
TRAV61 F (7)1 P (2)1 P (2)
TRAV71 F--
TRAV85 F (17), 3 P (4)1 F (3), 4 P (6)5 F (6), 22 P (24), 1 FO (2), 2 FP (4)
TRAV92 F (5)4 F (7), 2 P7 F (8), 5 P (7)
TRAV101 F (2)1 O, 2 P1 F (2), 1 P
TRAV112 P3 P3 P (4)
TRAV123 F (7)2 F, 1 FP (3)5 F (6), 2 P
TRAV132 F (5)2 F (4), 2 P (4)11 F, 10 P
TRAV141 F (4), 1 P (2)1 F, 1 O (2), 1 P, 1 FP (3)7 F, 1 O, 6 P
TRAV151 P--
TRAV161 F1 F1 F (2)
TRAV171 F1 F (2), 2 P1 P, 1 FP (2)
TRAV181 F1 F, 2 O, 1 P (2), 1 FO (3)3 F, 1 P
TRAV191 F2 F, 1 P (2), 1 FOP (3)1 P
TRAV201 F (4)2 F (3), 1 O, 1 P2 F, 2 P
TRAV211 F (2)2 F (4), 1 O, 1 P2 F (3), 3 P
TRAV221 F6 F (7), 3 O, 5 P (7), 1 FO (3), 1 FP (3)20 F (22), 22 P (24), 1 FP (2)
TRAV231 F (5)1 F (2), 10 P (14), 2 OP (4)5 F, 2 O, 32 P (37)
TRAV241 F (2)1 F , 5 P (7), 1 FO (2)1 F, 2 O, 5 P
TRAV251 F4 F (6), 1 O, 9 P, 1 FP (2)10 F (11), 2 O, 13 P, 1 FP (2)
TRAV262 F (5)1 P (3)2 F (3)
TRAV271 F (3)1 OP (2)1 FP (2)
TRAV281 P (2)2 F (4), 1 P2 F (4)
TRAV291 FP (4)1 F (3), 1 O2 F (3)
TRAV301 F (5)--
TRAV311 P (2)--
TRAV321 P--
TRAV331 P2 P (3), 1 FP (4)1 F, 1 P (2)
TRAV341 F2 P (4)1 P
TRAV351 FP (3)2 F1 P (2)
TRAV361 F (5)1 F, 1 P1 F (2)
TRAV371 P2 P1 P
TRAV382 F (5)6 F (9)2 F (3)
TRAV391 F1 F, 1 O1 O (2)
TRAV401 F1 P-
TRAV411 F1 F1 F (2)
TRAV43-3 F (6)13 F (15), 2 P (3)
TRAV44-8 F, 1 O, 8 P (11), 1 FP (2)13 F (14), 1 O, 11 P (12)
TRAV45-10 F (14), 1 O, 2 P (3), 2 FP (4)4 F (5), 1 P, 1 OP (2)
TRAV461 P1 P1 P (2)
TRAVA1 P (2)--
TRAVB1 P (2)--
TRAVC1 P--
Total per Fct42 F + 16 P + 3 FP79 F + 14 O + 74 P + 3 FO + 9 FP + 3 OP + 1 FOP124 F + 11 O + 149 P + 1 FO + 7 FP + 1 OP
Total genes61 (134)183 (263)293 (344)
For each TRAV subgroup, in each species, the number of TRAV genes by functionality and, between parentheses, the number of alleles are shown. F: functional; O: ORF; P: pseudogene; FO, FP, PO, FOP: genes with alleles of different functionalities. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Potential germline repertoires > TRAV and TRAJ > Human, ibid. Bovine, ibid. Sheep.
Table 8. TRAV [CDR1-IMGT.CDR2-IMGT.CDR3-IMGT] lengths by subgroup and species in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 8. TRAV [CDR1-IMGT.CDR2-IMGT.CDR3-IMGT] lengths by subgroup and species in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SubgroupsHomo sapiensBos taurusOvis aries
TRAV1[6.6.3][6.6.3][6.6.3]
TRAV2[6.4.3][6.4.3][6.4.3]
TRAV3[6.8.4][6.8.3][6.8.3]
TRAV4[7.5.4][7.5.4][7.5.4]
TRAV5[6.7.3][6.7.3][6.7.3]
TRAV6[6.7.3]--
TRAV7[6.7.3]--
TRAV8[6.8.3][6.8.3][6.8.3]
TRAV9[6.7.3][6.7.3][6.7.3]
TRAV10[6.7.3][5.7.3]
[6.7.3]
[6.7.3]
TRAV11[6.7.2][6.7.3][6.7.3]
TRAV12[6.6.3][6.6.3][6.6.3]
TRAV13[6.7.3][6.7.3][6.7.3]
TRAV14[7.8.4][7.8.4][7.8.4]
TRAV16[6.4.3][6.4.3][6.4.3]
TRAV17[5.7.3][5.7.3]
[6.7.3]
[6.7.3]
TRAV18[6.6.3][6.6.3]
[6.7.3]
[6.7.3]
TRAV19[7.8.4][7.8.4][7.8.4]
[4.7.3]
TRAV20[6.7.3][6.7.3][6.7.3]
TRAV21[6.7.3][6.7.3][6.7.3]
[4.5.3]
TRAV22[5.5.3][5.5.3][5.5.3]
TRAV23[6.7.3][6.7.3][6.7.3]
TRAV24[6.7.2][6.7.2][6.7.2]
TRAV25[5.7.2][5.7.2][5.7.2]
TRAV26[7.5.4][7.5.4][7.5.4]
TRAV27[5.7.2][5.7.2]
TRAV28 [5.5.3][5.5.3]
TRAV29[6.7.3][6.7.3][6.7.3]
TRAV30[5.7.3]--
TRAV33--[7.7.5]
TRAV34[5.7.3]--
TRAV35[5.7.3][5.7.2][5.7.2]
TRAV36[6.7.3][6.7.3][6.7.3]
[6.8.4]
TRAV38[7.8.4][7.8.4][7.8.4]
[8.8.4]
TRAV39[5.7.3][6.7.3][6.7.3]
TRAV40[6.4.3]--
TRAV41[5.5.3][5.5.3][5.5.3]
TRAV43-[5.5.3][5.5.3]
TRAV44-[7.5.4][7.5.4]
TRAV45-[7.7.3][7.7.3]
Only in-frame genes are considered. The differences in CDR length are shown in red. The correspondances for subgrousp TRAV17 and TRAV18 are shown in blue and green. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > 2D and 3D structures > FR-IMGT and CDR-IMGT lengths (V-REGION and V-DOMAIN) > [CDR1-IMGT.CDR2-IMGT.] length per subgroup > TRAV > Human, ibid. Bovine, ibid. Sheep.
Table 9. IMGT Potential germline repertoires of the TRDV subgroups in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 9. IMGT Potential germline repertoires of the TRDV subgroups in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SubgroupsHomo sapiensBos taurusOvis aries
TRDV11 F41 F (55), 4 O (5), 5 P32 F (41), 11 O, 27 P (28), 5 FO (11), 6 FP (13), 3 OP (6)
TRDV21 F (3)1 F1 O (2)
TRDV31 F (2)1 F1 F (2)
TRDV4-1 O1 P
TRDV5-2 F (4)1 F (2)
Total per Fct3 F45 F + 5 O + 5 P34 F + 12 O + 28 P + 5 FO + 6 FP + 3 OP
Total genes3 (6)55 (72)88 (117)
For each TRDV subgroup, in each species, the number of TRDV genes by functionality and, between parentheses, the number of alleles are shown. F: functional; O: ORF; P: pseudogene; FO, FP, PO: genes with alleles of different functionalities. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > Locus and genes > Potential germline repertoires > TRDV, TRDD and TRDJ > Human, ibid. Bovine, ibid. Sheep.
Table 10. TRDV [CDR1-IMGT.CDR2-IMGT.CDR3-IMGT] lengths by subgroup and species in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
Table 10. TRDV [CDR1-IMGT.CDR2-IMGT.CDR3-IMGT] lengths by subgroup and species in human (Homo sapiens), bovine (Bos taurus) and sheep (Ovis aries).
SubgroupsHomo sapiensBos taurusOvis aries
[5.3.4]
[7.3.4]
[8.1.4][7.3.3]
[8.3.4][7.3.4]
TRDV1[7.3.4][9.3.3][7.3.5]
[9.3.4][9.3.4]
[9.3.15][13.3.4]
[10.3.4]
[18.3.4]
TRDV2[8.3.4][9.3.4][9.3.4]
TRDV3[7.6.2][7.6.4][7.6.4]
TRDV4-[8.3.4]-
TRDV5-[7.3.5][7.3.5]
Only in-frame genes are considered. The differences in CDR length are shown in red. The correspondances for subgrousp TRDV1 are shown in blue. Data available in IMGT Repertoire (IG and TR) http://www.imgt.org/IMGTrepertoire/ > 2D and 3D structures > FR-IMGT and CDR-IMGT lengths (V-REGION and V-DOMAIN) > [CDR1-IMGT.CDR2-IMGT.] length per subgroup > TRDV > Human, ibid. Bovine, ibid. Sheep.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pégorier, P.; Bertignac, M.; Nguefack Ngoune, V.; Folch, G.; Jabado-Michaloud, J.; Giudicelli, V.; Duroux, P.; Lefranc, M.-P.; Kossida, S. IMGT® Biocuration and Comparative Analysis of Bos taurus and Ovis aries TRA/TRD Loci. Genes 2021, 12, 30. https://doi.org/10.3390/genes12010030

AMA Style

Pégorier P, Bertignac M, Nguefack Ngoune V, Folch G, Jabado-Michaloud J, Giudicelli V, Duroux P, Lefranc M-P, Kossida S. IMGT® Biocuration and Comparative Analysis of Bos taurus and Ovis aries TRA/TRD Loci. Genes. 2021; 12(1):30. https://doi.org/10.3390/genes12010030

Chicago/Turabian Style

Pégorier, Perrine, Morgane Bertignac, Viviane Nguefack Ngoune, Géraldine Folch, Joumana Jabado-Michaloud, Véronique Giudicelli, Patrice Duroux, Marie-Paule Lefranc, and Sofia Kossida. 2021. "IMGT® Biocuration and Comparative Analysis of Bos taurus and Ovis aries TRA/TRD Loci" Genes 12, no. 1: 30. https://doi.org/10.3390/genes12010030

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop