Next Article in Journal
Grape Seed Procyanidin Extract (GSPE) Improves Goat Sperm Quality When Preserved at 4 °C
Next Article in Special Issue
Association of Twelve Candidate Gene Polymorphisms with the Intramuscular Fat Content and Average Backfat Thickness of Chinese Suhuai Pigs
Previous Article in Journal
Local Knowledge for Addressing Food Insecurity: The Use of a Goat Meat Drying Technique in a Rural Famine Context in Southern Africa
Previous Article in Special Issue
Theoretical Evaluation of Multi-Breed Genomic Prediction in Chinese Indigenous Cattle
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Copy Number Variation in Domestic Chicken Using Whole-Genome Sequencing Reveals Evidence of Selection in the Genome

1
Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, Korea
2
C&K Genomics, 26 Beonbwon-ro, Songpa-gu, Seoul 05836, Korea
3
Animal Genomics & Bioinformatics Division, National Institute of Animal Science, RDA 1500, Kongjiwipatiwi-ro, Iso-myeon, Wanju-gun, Jeollabuk-do 55365, Korea
*
Author to whom correspondence should be addressed.
Animals 2019, 9(10), 809; https://doi.org/10.3390/ani9100809
Submission received: 27 August 2019 / Revised: 13 October 2019 / Accepted: 14 October 2019 / Published: 15 October 2019
(This article belongs to the Collection Applications of Quantitative Genetics in Livestock Production)

Abstract

:

Simple Summary

Chickens have been bred for meat and egg production as a source of animal protein. With the increase of productivity as the main purpose of domestication, factors such as metabolism and immunity were boosted, which are detectable signs of selection on the genome. This study focused on copy number variation (CNV) to find evidence of domestication on the genome. CNV was detected from whole-genome sequencing of 65 chickens including Red Jungle Fowl, broilers, and layers. After that, CNV region, the overlapping region of CNV between individuals, was made to identify which genomic regions showed copy number differentiation. The 663 domesticated-specific CNV regions were associated with various functions such as metabolism and organ development. Also, by performing population differentiation analyses such as clustering analysis and ANOVA test, we found that there are a lot of genomic regions with different copy number patterns between broilers and layers. This result indicates that different genetic variations can be found, depending on the purpose of artificial selection and provides considerations for future animal breeding.

Abstract

Copy number variation (CNV) has great significance both functionally and evolutionally. Various CNV studies are in progress to find the cause of human disease and to understand the population structure of livestock. Recent advances in next-generation sequencing (NGS) technology have made CNV detection more reliable and accurate at whole-genome level. However, there is a lack of CNV studies on chickens using NGS. Therefore, we obtained whole-genome sequencing data of 65 chickens including Red Jungle Fowl, Cornish (broiler), Rhode Island Red (hybrid), and White Leghorn (layer) from the public databases for CNV region (CNVR) detection. Using CNVnator, a read-depth based software, a total of 663 domesticated-specific CNVRs were identified across autosomes. Gene ontology analysis of genes annotated in CNVRs showed that mainly enriched terms involved in organ development, metabolism, and immune regulation. Population analysis revealed that CN and RIR are closer to each other than WL, and many genes (LOC772271, OR52R1, RD3, ADH6, TLR2B, PRSS2, TPK1, POPDC3, etc.) with different copy numbers between breeds found. In conclusion, this study has helped to understand the genetic characteristics of domestic chickens at CNV level, which may provide useful information for the development of breeding systems in chickens.

1. Introduction

Structural variation refers to the variation above 1 kb in the genome including copy number variation (CNV), translocation, and inversion [1]. CNV is a genomic region that shows quantitative variants when compared to the reference genome and is comprised of deletion and duplication regarding copy number [2]. With the alteration of gene structure or dosage [3], CNV shows larger variation compared to small-scale variation such as single nucleotide polymorphism (SNP). This means that CNV could have a significant impact on the phenotype of an individual [4]. Indeed, CNV has great impact both functionally and evolutionally [5,6] and has been studied on a variety of animals such as humans [7,8,9], cattle [10,11,12], pigs [13,14,15], chickens [16,17,18], goats [19], dogs [20], cats [21], etc. Recent studies have shown the importance of CNV [22] regarding the relations to human diseases (cancer [23,24], schizophrenia [25], depression [26]) and the major traits of livestock [27,28,29].
Chicken is not only a major source of protein providing meat and eggs [30] but also has great value to be utilized as bioreactors [31] and animal models of disease [32]. Therefore, various studies from the genomic point of view have been conducted to develop economic traits and reveal the mechanisms underlying phenotypic variation [33]. Beginning with avian evolution study through the comparative genomics approach between chickens and turkeys [34], CNV studies have several interesting findings such as identification of CNVs related to pea-comb [35] or muscle development [17] and breed-specific CNVs detected on the population-level [16,36].
Identification of CNV was usually processed through two types of microarray; array-based comparative genomic hybridization and SNP array. But now next-generation sequencing (NGS)based method is preferred for the following reasons: Microarray-based method cannot identify 1) small CNV, and 2) CNV at genome-wide level due to low probe density. Also, 3) the price of whole-genome sequencing is more affordable due to the development of NGS technology [37]. Although NGS is more expensive and requires more effort while used for diagnostic tests [38], it has more benefits; it is applicable for various analysis other than CNV, can be detected in poorly annotated regions, and other researchers’ data can be used as a control. Although NGS-based identification has been widely used on various animals, it has not been used much on chickens [39,40,41,42,43]. Existing studies also have limitations regarding analyzing breed-specific CNV region (CNVR) due to small sample sizes of 2–12 chickens. Kerstens et al. analyzed 100 chickens (25 per breed) using the reduced representation library only sequencing the regions around the restriction site which led to the limitations with identification of CNV compared to individual sequencing that uses full genetic information [39,44].
The goal of this study is to understand the phenotypic characteristics of commercial chicken breeds at the CNV-level with an NGS-based method, especially on the artificial selection during domestication. We used CNVnator [45], a widely used read-depth based CNV detection program. CNVs were identified on three breeds naturalized in South Korea [46]: Cornish (CN; broiler), White Leghorn (WL; layer), and Rhode Island Red (RIR; hybrid) [30], 20 per breed, a total of 60 chickens by whole-genome sequencing with approximately 10× depth from the public database. The identified CNVRs were compared with five Red Jungle Fowl (RJF; Gallus gallus) to distinguish CNVRs related to domestication. Finally, through functional annotation, we confirmed which gene ontology (GO) terms were enriched. Our findings will deepen the understanding of the breed-specific physiology and characteristics on a genome-wide level as the basis for future breeding systems.

2. Materials and Methods

2.1. Collection of Chicken Whole-Genome Sequencing Data

Twenty samples each from CN, RIR, and WL were downloaded (60 total samples), from the National Agricultural Biotechnology Information Center (NABIC; https://www.nabic.rda.go.kr) public database [47]. The project name is ‘Construction of standard genome maps of Korean native chicken’ and the accession number for the CN sample are NN-0991-000001 to NN-0991-000020, the RIR NN-0991-000081 to NN-0991-000100, and the WL NN-0991-000101 to NN-0991-000120.
Five RJF samples were downloaded from NCBI Sequence Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra) and were used as the control group. The experiment numbers for the five RJF chickens are SRX408166 [48], SRX511213, SRX511214, SRX511216, and SRX511217 [49].
All the data were sequenced on an Illumina HiSeq 2000 sequencer (Illumina Inc.) with 100bp paired-end reads.

2.2. Sequence Alignment to Reference Genome

The adapters of sequencing reads were removed by Trimmomatic-0.36 [50], and trimming was also performed where the Phred-score at the end of the read was less than 20 (TRAILING:20). Trimmed reads were mapped to galGal5 (Gallus gallus) assembly of the chicken genome using the bowtie2-2.3.3.1 [51] with the default option. PCR duplicates that could affect the CNV analysis were removed by using Picard-2.9.2 (https://broadinstitute.github.io/picard/) with Markduplicates. Finally, Genome Analysis Toolkit-3.3.0 [52] was used to realign reads for correcting errors caused by InDels.

2.3. CNV and CNVR Detection

CNVnator-v0.4 software [45], a read-depth based method, was used for CNV prediction relative to the galGal5 reference assembly. Since the average depth of the realigned bam files was about 10×, the optimal bin size was unified to 300bp for all individuals. The CNV calls were filtered with p-value <0.001, q0 (zero mapping quality) <0.5, and size >1 kb. To compare the copy number between breeds, the estimated copy number for each region was calculated using the ‘-genotype’ option of CNVnator.
CNVRs were obtained using ‘CNV_overlap.py’ script on GitHub (https://github.com/bjtrost/TCAG-WGS-CNV-workflow) [53], and only CNVRs found in three or more individuals of any breed were used for downstream analysis to remove false-positive results [54].

2.4. CNVR Annotation

The list of chicken autosome galGal5 genes was downloaded from NCBI, and the gene annotation was performed using the ‘annotatePeak’ command of the Bioconductor package ChIPseeker [55] with ‘addFlankGeneInfo = TRUE’ option. GO enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were performed using Database for Annotation, Visualization and Integrated Discovery (DAVID; https://david.ncifcrf.gov/) [56,57] only on protein-coding genes. Biological process, cellular component, and molecular function were used as GO term categories, and the significance level was a p-value of 0.05. Also, the chicken’s quantitative trait loci (QTL) were downloaded from the chicken QTLdb Release 38 (https://www.animalgenome.org/cgi-bin/QTLdb/GG/summary) [58] and compared with the identified CNVRs. We only considered QTLs reported in galGal5 version with confidence interval less than 5 Mb. We used Bedtools-v2.27.1 [59] ‘intersect’ command to detect which QTL overlaps with the identified CNVRs.

2.5. Breed Differentiation

We examined population differentiation with CNVRs found only in domesticated chickens using three methods; clustering analysis, VST statistic, and ANOVA test. First, clustering analysis was performed using pvclust R package [60]. During this process, a scoring matrix encoded as 0 or 1 was used according to the presence or absence of CNVRs identified for each individual. Approximately unbiased (AU) p-value and bootstrap probability (BP) value were obtained through 10,000 multiscale bootstrap resampling based on unweighted pair-group average method (UPGMA), and they were shown in red and green on the edges of the clustering. Second, the VST statistic was applied to detect regions with high differentiation between populations [61]. To better understand the effect on each breed, protein-coding genes annotated in CNVRs were used for VST calculation, instead of CNVRs. VST was calculated for protein-coding genes as VST = (VT − VS) / VT, where VT is the total variance in copy number between two populations and VS is the average variance within each breed. Finally, ANOVA test was performed to detect differences in copy number of genes between breeds. The genes used for ANOVA were the same as for the VST analysis. Since the sample size for each breed was the same (20 samples per breed), Tukey’s HSD test for post-hoc analysis was selected. In addition, for some of the genes found to be significant in the ANOVA test, the depth of coverage was obtained at a specific bin size to see the part of the gene with different copy number pattern. Depth of coverage was obtained using Mosdepth-0.2.6 software (https://github.com/brentp/mosdepth) [62].

3. Results

3.1. Identification of CNVs and CNVRs

Statistics of read mapping and CNVs are shown in Table S1. Mapping sequencing data to galGal 5 assembly showed an average 96.21% mapping rate and 11.51× depth. Except for RJF, an average of 340 deletions and 120 duplications were detected on autosomes with an average size of 26.9 kb for deletion and 16.1 kb for duplication (Table 1).
The overlapping regions of CNVs obtained using the TCAG-WGS-CNV workflow (https://github.com/bjtrost/TCAG-WGS-CNV-workflow) consisted of 3079 CNVRs with 2443 deletions sized from 1.2 kb to 514.8 kb and 636 duplications from 3 kb to 198.9 kb. Among the CNVRs combined from 65 samples, 1061 (34.5%) were detected in only one individual, 331 (10.8%) in two individuals, and 259 (8.4%) in three individuals. Of the 1687 CNVRs detected in three or more individuals, a total of 663 CNVRs were detected only in domesticated breeds (CN, RIR, and WL). Among the three breeds, the highest number of 466 CNVRs were detected in CN, followed by 421 in RIR and 419 in WL. A total of 211 CNVRs were detected in all three breeds. The number of breed-specific CNVRs was 84 in CN, 53 in RIR, and 94 in WL, respectively. CNVR detected in most individuals was duplication detected in 44 individuals, located on chr5 at 6.31–6.33 Mb, and second was deletion in 40 individuals, on chr1 at 11.42–11.44 Mb (Figure 1A; Table S2). Figure 1B shows the density of gene and distribution of all CNVRs we used for analysis on chromosome ideogram except for the microchromosome. We used CNVs with a minimum length of 1 kb in the analysis and the shortest CNVRs had a length of 1.8 kb. Four hundred and ninety-nine deletions ranged in size from 1.8 kb to 501 kb, with 275 deletions (55%) less than 10kb and 164 duplications from 3.9 kb to 92.1 kb with 81 duplications (50%) less than 12 kb (Figure 1C). The number of CNVRs on the chromosomes was not significantly different among the breeds. In general, the number of CNVRs detected was proportional to the chromosome size, but chromosomes such as 16 and 27 showed more CNVRs than longer chromosomes (Figure 1D).

3.2. Annotation of CNVR

We annotated genes corresponding to 663 CNVRs using NCBI Gallus gallus Annotation Release 103 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Gallus_gallus/103/). When CNVR and gene overlap more than 1 bp, the relevant gene is annotated. If not, the closest gene is annotated. When there was another gene within 1 kb, it was additionally annotated because it may affect the expression of the gene. Six hundred and sixty-three CNVRs were annotated to a total of 856 genes, including 609 protein-coding genes, 200 lncRNAs, and 16 miRNAs. Four hundred and sixty CNVRs were annotated to only one gene, and up to 12 genes were annotated for duplication on chr27 at 578.7–641.7 kb (Table S3).
Functional enrichment analysis was performed on GO terms and KEGG pathways through DAVID with 491 protein-coding genes within 1 kb of CNVR. Twenty-two terms in biological process, five cellular components, seven molecular functions, and five pathways in KEGG pathway were significantly enriched (p-value < 0.05) and presented in Table S4. They were mainly involved in organ development, neuromodulation, immune regulation, and metabolism. In particular, in KEGG pathway category, pathways related to protein and carbohydrate metabolism (gga00051: Fructose and mannose metabolism, gga00350: Tyrosine metabolism, and gga00360: Phenylalanine metabolism) and xenobiotics metabolism (gga00982: Drug metabolism—cytochrome P450 and gga00980: Metabolism of xenobiotics by cytochrome P450) were significant.
The QTL reported in chicken QTLdb were downloaded to detect QTL associated with the identified CNVRs. A total of 432 QTLs reported as galGal 5.0 version and shorter than 5 Mb [16] were used (Production_QTL: 334, Physiology_QTL: 53, Health_QTL: 26, Exterior_QTL: 16, Reproduction_QTL: 3). The results showed that 165 QTLs overlapped more than 1 bp with some of the 663 CNVRs. The most detected category was Production_QTL, a total of 142 QTLs overlapped with CNVRs. The most detected QTL was breast muscle pH, followed by QTLs such as feed intake, ileum weight, abdominal fat weight, body weight and eggshell strength (Table S5).

3.3. Breed Differentiation of CNVR

Clustering analysis was performed on CNVRs identified only in domesticated breed through the result of each individual. Hierarchical clustering was created using the scoring matrix which represents the presence or absence of any CNVR in an individual, and the p-value was calculated by multiscale bootstrap resampling. The results are shown together with the AU p-value and BP value, and it was confirmed that they were significantly clustered within each breed (Figure 2).
We estimated VST statistic which is known to be similar to FST to examine population differentiation for CNVR [61]. VST value was calculated using the variance of copy number of 609 protein-coding genes annotated by CNVR to see population-specific selective pressure at gene-level. The average values of VST of the whole genes were 0.11 for ‘CN vs. WL’, 0.11 for ‘CN vs. RIR’, and 0.15 for ‘WL vs. RIR’ (Table S6). ‘WL vs. RIR’ showed the highest degree of differentiation, which is consistent with the above clustering result. To see genes with high differentiation between breeds, we examined genes with VST > 0.79, the top 98th percentile. Five genes (LOC772271, OR52R1, LOC107052719, LOC107052465, RD3) in the ‘CN vs. WL’ pairs and nine genes (LOC772271, OR52R1, LOC101752215, AKR1B1L, LOC101748892, LOC101751764, LOC107049090, LOC107054696, LOC107054697) in ‘WL vs. RIR’ pairs exceeded the threshold and none of the ‘CN vs. RIR’ pairs exceeded the threshold. LOC772271 (aldo-keto reductase family 1 member B10-like), which overlaps 1712 bp with CNVR41, and OR52R1, which is included in CNVR145 exceeded the threshold in both ‘CN vs. WL’ and ‘WL vs. RIR’ pairs (Figure 3).
ANOVA was also performed to examine the differentiation of CN frequency and post-test was performed with Tukey’s test. As a result, 232 genes showed a statistically significant difference (p-value < 0.01) between breeds (Table S7). Even after Bonferroni correction (p-value < 0.01/609 = 1.64 × 10−5) and removal of the microchromosome with a chromosome size less than 20Mb, 63 genes remained significant. LOC772271 (p-value < 3.55 × 10−27), and OR52R1 (p-value < 1.16 × 10−24) mentioned in VST analysis were most significant, and many other genes also showed significance, such as RD3 (p-value < 4.41 × 10−20), ADH6 (p-value < 1.42 × 10−17), TLR2B (p-value < 7.6 × 10−13), and POPDC3 (p-value < 6.16 × 10−11) (Figure S1). In most cases, only one out of the three breeds showed a difference in copy number. On the other hand, PRSS2 (p-value < 8.68 × 10−13) showed different copy numbers among all three breeds (CN:RIR:WL = 1.5:2:1). We looked at the region around PRSS2 through genome browser and found that TRBV6-5 (biotype: C_region) overlapped with PRSS2. Checking the mapping read, it showed the same depth of coverage as PRSS2 over the much larger region including TRBV6-5 (Figure 4A). This region was also included in QTL with body temperature. There were cases where different copy number patterns were observed between breeds in one gene. For example, within TPK1 gene (chr2: 52858574-53196804), WL showed deletion in CNVR175 (chr2: 52890601-52910100) and CNVR176 (chr2: 53185501-53190300), while CN and RIR with CNVR177 (chr2: 53190301-53209800) showed duplication (Figure 4B). Additionally, functional analysis was performed on significant genes through DAVID. As a result, pathways related to receptor signaling and metabolic activity were significantly enriched (Table 2).

4. Discussion

Using the NGS-based method to detect CNVs shows much better performance compared to the array-based method even in low-coverage situations [63]. However, since there is a possibility of false-positives [64,65], we used 20 individuals per breed with five RJF as a control group and only CNVRs identified in at least three individuals were used for analysis. Also, ANOVA test was performed and the actual depth of coverage was visualized for significant genes.
We identified a total of 1624 CNVRs for the CN, RIR, and WL, but only 633 (40.8%) remained after removing the commonly detected CNVRs from RJF. The reason for the high number of commonly detected CNVRs between domesticated breeds and RJF is probably that they share an evolutionary history before artificial selection. It is thought that genes which were important for survival such as nervous or immune system related genes were selected before domestication [66]. Even though microchromosomes such as chromosomes 16 and 27 are short in length, plenty of CNVRs have been detected in many previous studies [16,36,41]. CNVRs detected in various studies were combined into one CNVR on chromosome 16, which covers 99.95% of the full length of chromosome 16 [64]. This is because almost all genes mapped to chromosome 16 are immune-related, including major histocompatibility complex genes which have high levels of genetic diversity [67].
Chickens have been bred for a variety of purposes, including eggs, meat, and game [68]. For example, CN was bred for meat, WL for eggs, and RIR for both purposes [69]. The result of this breeding leads to unimaginable increases in productivity not found in RJF and is also found in various forms of variation on the genome [70]. Of course, this includes CNV, which affects productivity in a variety of ways, such as by deleting genes or changing gene dosage. The functional and QTL analysis resulted in enrichment of pathways of carbohydrate and protein related metabolism and organ development. This suggests that evidence of domestication can be confirmed at the CNV level.
Clustering and VST analyses showed that CN is closer to RIR than WL. In the VST results of ‘CN, RIR vs. WL’, two genes, LOC772271 and OR52R1, were found to be common. LOC772271 gene which is associated with fatty acid synthesis [71] was expressed more in the fast-growing line than in the slow-growing line [72] and has already reported a copy number difference between WL and ‘CN and RIR’ in the previous study [73]. OR52R1 is an olfactory receptor gene (OR) and the copy number variable of OR was found in chickens as well as in pigs [74] and cattle [75]. Olfactory is associated with social communication of chickens. Especially laying hens may have high selection pressure on the copy number of OR due to housing conditions of sex-homogenous groups that are restricted from communicating with males [76]. In addition, various factors such as cognition of predators [77], daily activity pattern, habitat, and diet can act as a selective pressure on OR [78], indicating that copy number could vary between breeds.
ANOVA test showed differences in copy number between breeds in many genes. In RD3 gene involved in photoreceptors [79], deletion occurred with CNVR261 in fifteen WL. Since light has a major effect on egg production [80,81], the variation on gene structure by deletion is thought to alter light acceptance in chickens and positively affect egg production. Similarly, in POPDC3 gene, the estimated copy number of CN and RIR was 2, which was very conserved among individuals. However, WL had an average copy number of 3. In the previous study, the same duplication was found in WL and associated with myometrium maturation, protein secretion, and eggshell formation [41]. Chicken breeds we analyzed are more likely to be subject to selective pressure for genes related to meat or egg production. Therefore, we thought that most of the genes with significant differences in ANOVA test would be ‘CN, RIR (meat) vs. WL’ or ‘WL, RIR (egg) vs. CN’. However, there were also many genes in the ‘CN, WL vs. RIR’ case. For example, ADH6 gene, associated with glycolysis [82], showed duplication only in RIR, and TLR2B associated with immune response [83,84] showed deletion only in RIR. PRSS2 gene showed deletion in both CN and WL breeds. However, when looking at the depth, the number of deleted copy numbers was different. Increasing the copy number of PRSS2 gene, which encodes trypsinogen, is associated with chronic pancreatitis in humans [85]. However, the read mapping showed similar deletion patterns in the PRSS2 gene as well as the overlapped gene, TRBV6-5. TRBV genes provide a variable component in the adaptive immune receptor biosynthesis, which is related to the beta chain of T-cell receptors [86]. The reason for the difference in copy number between breeds in TRBV6-5 is similar to that of many CNVs detected due to a large number of immune-related genes in chromosome 16. This means that the immunological response can vary greatly between breeds [75]. We could identify duplication in the promoter region of TPK1 and two deletions within the gene. Interestingly, duplication could only be found in CN and RIR, and deletion in WL. The known function of TPK1 in livestock is to produce ATP by hydrolyzing thiamine diphosphate to thiamine when ATP is exhausted in the early stages of the postmortem period [87]. In addition, it has been reported that when Escherichia coli suffers from amino acid deficiency, it accumulates thiamin triphosphate, a higher phosphate derivative of thiamin, and acts as a signal related to adaptation to nutritional condition changes [88]. However, it is unclear what physiologically important function TPK1 has in chickens. Therefore, further research is needed to determine the cause of copy number differentiation on TPK1 between breeds.

5. Conclusions

In this study, whole-genome sequencing data of 65 chickens was collected from public databases to detect domesticated-specific CNVRs. By analyzing the function of annotated genes in the CNVR and genes that showed copy number differentiation between breeds, we investigated how genetic characteristics are linked to phenotypic characteristics of increased productivity. As a result, we have discovered that the genetic variation caused by artificial selection is effective on CNV level. The findings will be a great contribution to the understanding of poultry breeding and the development of economical traits along with other genomics studies on domesticated chickens.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2615/9/10/809/s1, Figure S1: Estimated copy numbers of gene with different copy number between breeds and depth of coverage of gene region (expansion of Figure 4.), Table S1: Statistical summary of read mapping and detected CNVs, Table S2: CNVRs identified in domesticated chicken breeds, Table S3: Gene annotation of CNVRs, Table S4: Gene ontology analysis for identified CNVRs, Table S5: QTL annotation of CNVRs, Table S6: VST value of genes annotated in CNVRs, Table S7: ANOVA test of genes annotated in CNVRs.

Author Contributions

Conceptualization, D.S. and H.K.; methodology, D.S.; software, D.S. and B.K.; formal analysis, D.S.; investigation, D.S., B.J.K. and B.K.; resources, H.-H.C., D.L. and H.K.; data curation, D.S. and B.K.; visualization, D.S.; writing—original draft preparation, D.S.; writing—review editing, D.S., B.J.K., B.K., H.-H.C., D.L. and H.K.; supervision, B.J.K. and H.K.; project administration, H.K.; funding acquisition, H.-H.C., D.L. and H.K.

Funding

This work was funded by ‘Population genomics of Korean long-tailed fowl’ the Program for Agriculture Science and Technology Development (Project No. PJ0133402) of the Rural Development Administration (RDA).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Feuk, L.; Carson, A.R.; Scherer, S.W. Structural variation in the human genome. Nat. Rev. Genet. 2006, 7, 85–97. [Google Scholar] [CrossRef] [PubMed]
  2. Conrad, D.F.; Pinto, D.; Redon, R.; Feuk, L.; Gokcumen, O.; Zhang, Y.; Aerts, J.; Andrews, T.D.; Barnes, C.; Campbell, P.; et al. Origins and functional impact of copy number variation in the human genome. Nature 2010, 464, 704–712. [Google Scholar] [CrossRef] [PubMed]
  3. Zhou, J.; Lemos, B.; Dopman, E.B.; Hartl, D.L. Copy-number variation: The balance between gene dosage and expression in Drosophila melanogaster. Genome Biol. Evol. 2011, 3, 1014–1024. [Google Scholar] [CrossRef] [PubMed]
  4. Mills, R.E.; Walter, K.; Stewart, C.; Handsaker, R.E.; Chen, K.; Alkan, C.; Abyzov, A.; Yoon, S.C.; Ye, K.; Cheetham, R.K.; et al. Mapping copy number variation by population-scale genome sequencing. Nature 2011, 470, 59–65. [Google Scholar] [CrossRef] [PubMed]
  5. Keel, B.N.; Lindholm-Perry, A.K.; Snelling, W.M. Evolutionary and functional features of copy number variation in the cattle genome. Front. Genet. 2016, 7, 207. [Google Scholar] [CrossRef] [PubMed]
  6. Poptsova, M.; Banerjee, S.; Gokcumen, O.; Rubin, M.A.; Demichelis, F. Impact of constitutional copy number variants on biological pathway evolution. BMC Evol. Biol. 2013, 13, 19. [Google Scholar] [CrossRef]
  7. Poole, A.C.; Goodrich, J.K.; Youngblut, N.D.; Luque, G.G.; Ruaud, A.; Sutter, J.L.; Waters, J.L.; Shi, Q.; El-Hadidi, M.; Johnson, L.M.; et al. Human salivary Amylase gene copy number impacts oral and gut microbiomes. Cell Host Microbe 2019, 25, 553–564. [Google Scholar] [CrossRef]
  8. Levy-Sakin, M.; Pastor, S.; Mostovoy, Y.; Li, L.; Leung, A.K.Y.; McCaffrey, J.; Young, E.; Lam, E.T.; Hastie, A.R.; Wong, K.H.Y.; et al. Genome maps across 26 human populations reveal population-specific patterns of structural variation. Nat. Commun. 2019, 10, 1025. [Google Scholar] [CrossRef]
  9. Mathieson, S.; Mathieson, I. FADS1 and the timing of human adaptation to agriculture. Mol. Biol. Evol. 2018, 35, 2957–2970. [Google Scholar] [CrossRef]
  10. Kommadath, A.; Grant, J.R.; Krivushin, K.; Butty, A.M.; Baes, C.F.; Carthy, T.R.; Berry, D.P.; Stothard, P. A large interactive visual database of copy number variants discovered in taurine cattle. GigaScience 2019, 8, giz073. [Google Scholar] [CrossRef]
  11. Liu, M.; Fang, L.; Liu, S.; Pan, M.G.; Seroussi, E.; Cole, J.B.; Ma, L.; Chen, H.; Liu, G.E. Array CGH-based detection of CNV regions and their potential association with reproduction and other economic traits in Holsteins. BMC Genom. 2019, 20, 181. [Google Scholar] [CrossRef] [PubMed]
  12. Geistlinger, L.; da Silva, V.H.; Cesar, A.S.M.; Tizioto, P.C.; Waldron, L.; Zimmer, R.; Regitano, L.C.D.A.; Coutinho, L.L. Widespread modulation of gene expression by copy number variation in skeletal muscle. Sci. Rep. 2018, 8, 1399. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Revilla, M.; Puig-Oliveras, A.; Castelló, A.; Crespo-Piazuelo, D.; Paludo, E.; Fernández, A.I.; Ballester, M.; Folch, J.M. A global analysis of CNVs in swine using whole genome sequence data and association analysis with fatty acid composition and growth traits. PLoS ONE 2017, 12, e0177014. [Google Scholar] [CrossRef] [PubMed]
  14. Ran, X.-Q.; Pan, H.; Huang, S.-H.; Liu, C.; Niu, X.; Li, S.; Wang, J.-F. Copy number variations of MTHFSD gene across pig breeds and its association with litter size traits in Chinese indigenous Xiang pig. J. Anim. Physiol. Anim. Nutr. (Berl) 2018, 102, 1320–1327. [Google Scholar] [CrossRef]
  15. Dong, K.; Pu, Y.; Yao, N.; Shu, G.; Liu, X.; He, X.; Zhao, Q.; Guan, W.; Ma, Y. Copy number variation detection using SNP genotyping arrays in three Chinese pig breeds. Anim. Genet. 2015, 46, 101–109. [Google Scholar] [CrossRef]
  16. Gorla, E.; Cozzi, M.C.; Román-Ponce, S.I.; Ruiz López, F.J.; Vega-Murillo, V.E.; Cerolini, S.; Bagnato, A.; Strillacci, M.G. Genomic variability in Mexican chicken population using copy number variants. BMC Genet. 2017, 18, 61. [Google Scholar] [CrossRef]
  17. Lin, S.; Lin, X.; Zhang, Z.; Jiang, M.; Rao, Y.; Nie, Q.; Zhang, X. Copy number variation in SOX6 contributes to chicken muscle development. Genes 2018, 9, 42. [Google Scholar] [CrossRef]
  18. Rao, Y.S.; Li, J.; Zhang, R.; Lin, X.R.; Xu, J.G.; Xie, L.; Xu, Z.Q.; Wang, L.; Gan, J.K.; Xie, X.J.; et al. Copy number variation identification and analysis of the chicken genome using a 60K SNP BeadChip. Poult. Sci. 2016, 95, 1750–1756. [Google Scholar] [CrossRef]
  19. Liu, M.; Zhou, Y.; Rosen, B.D.; Van Tassell, C.P.; Stella, A.; Tosser-Klopp, G.; Rupp, R.; Palhière, I.; Colli, L.; Sayre, B.; et al. Diversity of copy number variation in the worldwide goat population. Heredity 2019, 122, 636–646. [Google Scholar] [CrossRef]
  20. Serres-Armero, A.; Povolotskaya, I.S.; Quilez, J.; Ramirez, O.; Santpere, G.; Kuderna, L.F.K.; Hernandez-Rodriguez, J.; Fernandez-Callejo, M.; Gomez-Sanchez, D.; Freedman, A.H.; et al. Similar genomic proportions of copy number variation within gray wolves and modern dog breeds inferred from whole genome sequencing. BMC Genom. 2017, 18, 977. [Google Scholar] [CrossRef]
  21. Genova, F.; Longeri, M.; Lyons, L.A.; Bagnato, A.; Gandolfi, B.; Aberdein, D.; Alves, P.C.; Barsh, G.S.; Beale, H.C.; Bergström, T.F.; et al. First genome-wide CNV mapping in FELIS CATUS using next generation sequencing data. BMC Genom. 2018, 19, 895. [Google Scholar] [CrossRef] [PubMed]
  22. Lauer, S.; Avecilla, G.; Spealman, P.; Sethia, G.; Brandt, N.; Levy, S.F.; Gresham, D. Single-cell copy number variant detection reveals the dynamics and diversity of adaptation. PLoS Biol. 2018, 16, e3000069. [Google Scholar] [CrossRef] [PubMed]
  23. Hosgood, H.D., III; Hu, W.; Rothman, N.; Klugman, M.; Weinstein, S.J.; Virtamo, J.R.; Albanes, D.; Cawthon, R.; Lan, Q. Variation in ribosomal DNA copy number is associated with lung cancer risk in a prospective cohort study. Carcinogenesis 2019, 40, 975–978. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Quigley, D.A.; Dang, H.X.; Zhao, S.G.; Lloyd, P.; Aggarwal, R.; Alumkal, J.J.; Foye, A.; Kothari, V.; Perry, M.D.; Bailey, A.M.; et al. Genomic hallmarks and structural variation in metastatic prostate cancer. Cell 2018, 174, 758–769. [Google Scholar] [CrossRef] [PubMed]
  25. Warland, A.; Kendall, K.M.; Rees, E.; Kirov, G.; Caseras, X. Schizophrenia-associated genomic copy number variants and subcortical brain volumes in the UK Biobank. Mol. Psychiatry 2019. [Google Scholar] [CrossRef]
  26. Kendall, K.M.; Rees, E.; Bracher-Smith, M.; Legge, S.; Riglin, L.; Zammit, S.; O’Donovan, M.C.; Owen, M.J.; Jones, I.; Kirov, G.; et al. Association of rare copy number variants with risk of depression. JAMA Psychiatry 2019, 76, 818–825. [Google Scholar] [CrossRef]
  27. Bhanuprakash, V.; Chhotaray, S.; Pruthviraj, D.R.; Rawat, C.; Karthikeyan, A.; Panigrahi, M. Copy number variation in livestock: A mini review. Vet. World 2018, 11, 535–541. [Google Scholar] [CrossRef] [Green Version]
  28. Bickhart, D.; Liu, G. The challenges and importance of structural variation detection in livestock. Front. Genet. 2014, 5, 37. [Google Scholar] [CrossRef]
  29. Clop, A.; Vidal, O.; Amills, M. Copy number variation in the genomes of domestic animals. Anim. Genet. 2012, 43, 503–517. [Google Scholar] [CrossRef]
  30. Al-Nasser, A.; Al-Khalaifa, H.; Al-Saffar, A.; Khalil, F.; Albahouh, M.; Ragheb, G.; Al-Haddad, A.; Mashaly, M. Overview of chicken taxonomy and domestication. World’s Poult. Sci. J. 2007, 63, 285–300. [Google Scholar] [CrossRef] [Green Version]
  31. Herron, L.R.; Pridans, C.; Turnbull, M.L.; Smith, N.; Lillico, S.; Sherman, A.; Gilhooley, H.J.; Wear, M.; Kurian, D.; Papadakos, G.; et al. A chicken bioreactor for efficient production of functional cytokines. BMC Biotechnol. 2018, 18, 82. [Google Scholar] [CrossRef] [PubMed]
  32. Schock, E.N.; Chang, C.-F.; Youngworth, I.A.; Davey, M.G.; Delany, M.E.; Brugmann, S.A. Utilizing the chicken as an animal model for human craniofacial ciliopathies. Dev. Biol. 2016, 415, 326–337. [Google Scholar] [CrossRef] [PubMed]
  33. Rubin, C.-J.; Zody, M.C.; Eriksson, J.; Meadows, J.R.S.; Sherwood, E.; Webster, M.T.; Jiang, L.; Ingman, M.; Sharpe, T.; Ka, S.; et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature 2010, 464, 587. [Google Scholar] [CrossRef] [PubMed]
  34. Griffin, D.K.; Robertson, L.B.; Tempest, H.G.; Vignal, A.; Fillon, V.; Crooijmans, R.P.M.A.; Groenen, M.A.M.; Deryusheva, S.; Gaginskaya, E.; Carré, W.; et al. Whole genome comparative studies between chicken and turkey and their implications for avian genome evolution. BMC Genom. 2008, 9, 168. [Google Scholar] [CrossRef] [PubMed]
  35. Wright, D.; Boije, H.; Meadows, J.R.S.; Bed’hom, B.; Gourichon, D.; Vieaud, A.; Tixier-Boichard, M.; Rubin, C.-J.; Imsland, F.; Hallböök, F.; et al. Copy number variation in Intron 1 of SOX5 causes the Pea-comb phenotype in chickens. PLoS Genet. 2009, 5, e1000512. [Google Scholar] [CrossRef] [PubMed]
  36. Jia, X.; Chen, S.; Zhou, H.; Li, D.; Liu, W.; Yang, N. Copy number variations identified in the chicken using a 60K SNP BeadChip. Anim. Genet. 2013, 44, 276–284. [Google Scholar] [CrossRef]
  37. Escaramís, G.; Docampo, E.; Rabionet, R. A decade of structural variants: Description, history and methods to detect structural variation. Brief. Funct. Genom. 2015, 14, 305–314. [Google Scholar] [CrossRef]
  38. Hayes, J.L.; Tzika, A.; Thygesen, H.; Berri, S.; Wood, H.M.; Hewitt, S.; Pendlebury, M.; Coates, A.; Willoughby, L.; Watson, C.M.; et al. Diagnosis of copy number variation by Illumina next generation sequencing is comparable in performance to oligonucleotide array comparative genomic hybridisation. Genomics 2013, 102, 174–181. [Google Scholar] [CrossRef] [Green Version]
  39. Kerstens, H.H.D.; Crooijmans, R.P.M.A.; Dibbits, B.W.; Vereijken, A.; Okimoto, R.; Groenen, M.A.M. Structural variation in the chicken genome identified by paired-end next-generation DNA sequencing of reduced representation libraries. BMC Genom. 2011, 12, 94. [Google Scholar] [CrossRef]
  40. Fan, W.-L.; Ng, C.S.; Chen, C.-F.; Lu, M.-Y.J.; Chen, Y.-H.; Liu, C.-J.; Wu, S.-M.; Chen, C.-K.; Chen, J.-J.; Mao, C.-T.; et al. Genome-wide patterns of genetic variation in two domestic chickens. Genome Biol. Evol. 2013, 5, 1376–1392. [Google Scholar] [CrossRef]
  41. Yi, G.; Qu, L.; Liu, J.; Yan, Y.; Xu, G.; Yang, N. Genome-wide patterns of copy number variation in the diversified chicken genomes using next-generation sequencing. BMC Genom. 2014, 15, 962. [Google Scholar] [CrossRef] [PubMed]
  42. Yan, Y.; Yang, N.; Cheng, H.H.; Song, J.; Qu, L. Genome-wide identification of copy number variations between two chicken lines that differ in genetic resistance to Marek’s disease. BMC Genom. 2015, 16, 843. [Google Scholar] [CrossRef] [PubMed]
  43. Sohrabi, S.S.; Mohammadabadi, M.; Wu, D.-D.; Esmailizadeh, A. Detection of breed-specific copy number variations in domestic chicken genome. Genome 2017, 61, 7–14. [Google Scholar] [CrossRef] [PubMed]
  44. Cutler, D.J.; Jensen, J.D. To pool, or not to pool? Genetics 2010, 186, 41–43. [Google Scholar] [CrossRef] [PubMed]
  45. Abyzov, A.; Urban, A.E.; Snyder, M.; Gerstein, M. CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011, 21, 974–984. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Seo, J.H.; Lee, J.H.; Kong, H.S. Assessment of genetic diversity and phylogenetic relationships of Korean native chicken breeds using microsatellite markers. Asian Australas. J. Anim. Sci. 2017, 30, 1365–1371. [Google Scholar] [CrossRef]
  47. Seol, Y.-J.; Lee, T.-H.; Park, D.-S.; Kim, C.-K. NABIC: A new access portal to search, visualize, and share agricultural genomics data. Evol. Bioinform. 2016, 12, EBO.S34493. [Google Scholar] [CrossRef]
  48. Yan, Y.; Yi, G.; Sun, C.; Qu, L.; Yang, N. Genome-wide characterization of insertion and deletion variation in chicken using next generation sequencing. PLoS ONE 2014, 9, e104652. [Google Scholar] [CrossRef]
  49. Wang, M.-S.; Zhang, R.-W.; Su, L.-Y.; Li, Y.; Peng, M.-S.; Liu, H.-Q.; Zeng, L.; Irwin, D.M.; Du, J.-L.; Yao, Y.-G.; et al. Positive selection rather than relaxation of functional constraint drives the evolution of vision during chicken domestication. Cell Res. 2016, 26, 556. [Google Scholar] [CrossRef]
  50. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  51. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Van der Auwera, G.A.; Carneiro, M.O.; Hartl, C.; Poplin, R.; del Angel, G.; Levy-Moonshine, A.; Jordan, T.; Shakir, K.; Roazen, D.; Thibault, J.; et al. From FastQ data to high-confidence variant calls: The genome analysis toolkit best practices pipeline. Curr. Protoc. Bioinform. 2013, 43, 11.10.1–11.10.33. [Google Scholar] [CrossRef] [PubMed]
  53. Trost, B.; Walker, S.; Wang, Z.; Thiruvahindrapuram, B.; MacDonald, J.R.; Sung, W.W.L.; Pereira, S.L.; Whitney, J.; Chan, A.J.S.; Pellecchia, G.; et al. A comprehensive workflow for read depth-based identification of copy-number variation from whole-genome sequence data. Am. J. Hum. Genet. 2018, 102, 142–155. [Google Scholar] [CrossRef] [PubMed]
  54. Pierce, M.D.; Dzama, K.; Muchadeyi, F.C. Genetic diversity of seven cattle breeds inferred using copy number variations. Front. Genet. 2018, 9, 163. [Google Scholar] [CrossRef]
  55. Yu, G.; Wang, L.-G.; He, Q.-Y. ChIPseeker: An R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 2015, 31, 2382–2383. [Google Scholar] [CrossRef]
  56. Huang da, W.; Sherman, B.T.; Lempicki, R.A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009, 37, 1–13. [Google Scholar] [CrossRef]
  57. Huang da, W.; Sherman, B.T.; Lempicki, R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009, 4, 44–57. [Google Scholar] [CrossRef]
  58. Hu, Z.L.; Park, C.A.; Reecy, J.M. Building a livestock genetic and genomic information knowledgebase through integrative developments of Animal QTLdb and CorrDB. Nucleic Acids Res. 2019, 47, D701–D710. [Google Scholar] [CrossRef]
  59. Quinlan, A.R.; Hall, I.M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26, 841–842. [Google Scholar] [CrossRef]
  60. Suzuki, R.; Shimodaira, H. Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics 2006, 22, 1540–1542. [Google Scholar] [CrossRef]
  61. Redon, R.; Ishikawa, S.; Fitch, K.R.; Feuk, L.; Perry, G.H.; Andrews, T.D.; Fiegler, H.; Shapero, M.H.; Carson, A.R.; Chen, W.; et al. Global variation in copy number in the human genome. Nature 2006, 444, 444–454. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Pedersen, B.S.; Quinlan, A.R. Mosdepth: Quick coverage calculation for genomes and exomes. Bioinformatics 2017, 34, 867–868. [Google Scholar] [CrossRef] [PubMed]
  63. Zhou, B.; Ho, S.S.; Zhang, X.; Pattni, R.; Haraksingh, R.R.; Urban, A.E. Whole-genome sequencing analysis of CNV using low-coverage and paired-end strategies is efficient and outperforms array-based CNV analysis. J. Med. Genet. 2018, 55, 735–743. [Google Scholar] [CrossRef] [PubMed]
  64. Wang, X.; Byers, S. Copy Number Variation in chickens: A review and future prospects. Microarrays 2014, 3, 24–38. [Google Scholar] [CrossRef]
  65. Zarrei, M.; MacDonald, J.R.; Merico, D.; Scherer, S.W. A copy number variation map of the human genome. Nat. Rev. Genet. 2015, 16, 172. [Google Scholar] [CrossRef]
  66. Lawal, R.A.; Al-Atiyat, R.M.; Aljumaah, R.S.; Silva, P.; Mwacharo, J.M.; Hanotte, O. Whole-genome resequencing of red Junglefowl and indigenous village chicken reveal new insights on the genome dynamics of the species. Front. Genet. 2018, 9, 264. [Google Scholar] [CrossRef]
  67. Miller, M.M.; Taylor, R.L., Jr. Brief review of the chicken major histocompatibility complex: The genes, their distribution on chromosome 16, and their contributions to disease resistance. Poult. Sci. 2016, 95, 375–392. [Google Scholar] [CrossRef]
  68. Moiseyeva, I.G.; Romanov, M.N.; Nikiforov, A.A.; Sevastyanova, A.A.; Semyenova, S.K. Evolutionary relationships of Red Jungle Fowl and chicken breeds. Genet. Sel. Evol. 2003, 35, 403. [Google Scholar] [CrossRef]
  69. Khawaja, T.; Khan, S.H.; Mukhtar, N.; Ullah, N.; Parveen, A. Production performance, egg quality and biochemical parameters of Fayoumi, Rhode Island Red and their reciprocal crossbred chickens. J. Appl. Anim. Res. 2013, 41, 208–217. [Google Scholar] [CrossRef]
  70. Qanbari, S.; Rubin, C.-J.; Maqbool, K.; Weigend, S.; Weigend, A.; Geibel, J.; Kerje, S.; Wurmser, C.; Peterson, A.T.; Brisbin, I.L., Jr.; et al. Genetics of adaptation in modern chicken. PLoS Genet. 2019, 15, e1007989. [Google Scholar] [CrossRef]
  71. Ma, J.; Yan, R.; Zu, X.; Cheng, J.-M.; Rao, K.; Liao, D.-F.; Cao, D. Aldo-keto reductase family 1 B10 affects fatty acid synthesis by regulating the stability of Acetyl-CoA Carboxylase-α in breast cancer cells. J. Biol. Chem. 2008, 283, 3418–3423. [Google Scholar] [CrossRef] [PubMed]
  72. Claire D’Andre, H.; Paul, W.; Shen, X.; Jia, X.; Zhang, R.; Sun, L.; Zhang, X. Identification and characterization of genes that control fat deposition in chickens. J. Anim. Sci. Biotechnol. 2013, 4, 43. [Google Scholar] [CrossRef] [PubMed]
  73. Wang, X.; Nahashon, S.; Feaster, T.K.; Bohannon-Stewart, A.; Adefope, N. An initial map of chromosomal segmental copy number variations in the chicken. BMC Genom. 2010, 11, 351. [Google Scholar] [CrossRef] [PubMed]
  74. Paudel, Y.; Madsen, O.; Megens, H.-J.; Frantz, L.A.F.; Bosse, M.; Bastiaansen, J.W.M.; Crooijmans, R.P.M.A.; Groenen, M.A.M. Evolutionary dynamics of copy number variation in pig genomes in the context of adaptation and domestication. BMC Genom. 2013, 14, 449. [Google Scholar] [CrossRef]
  75. Upadhyay, M.; da Silva, V.H.; Megens, H.-J.; Visker, M.H.P.W.; Ajmone-Marsan, P.; Bâlteanu, V.A.; Dunner, S.; Garcia, J.F.; Ginja, C.; Kantanen, J.; et al. Distribution and functionality of copy number variation across european cattle populations. Front. Genet. 2017, 8, 108. [Google Scholar] [CrossRef]
  76. Krause, E.T.; Schrader, L.; Caspers, B.A. Olfaction in chicken (Gallus gallus): A neglected mode of social communication? Front. Ecol. Evol. 2016, 4, 94. [Google Scholar] [CrossRef]
  77. Roth, T.C.; Cox, J.G.; Lima, S.L. Can foraging birds assess predation risk by scent? Anim. Behav. 2008, 76, 2021–2027. [Google Scholar] [CrossRef]
  78. Steiger Silke, S.; Fidler Andrew, E.; Valcu, M.; Kempenaers, B. Avian olfactory receptor gene repertoires: evidence for a well-developed sense of smell in birds? Proc. Biol. Sci. 2008, 275, 2309–2317. [Google Scholar] [CrossRef] [Green Version]
  79. Azadi, S.; Molday, L.L.; Molday, R.S. RD3, the protein associated with Leber congenital amaurosis type 12, is required for guanylate cyclase trafficking in photoreceptor cells. Proc. Natl. Acad. Sci. USA 2010, 107, 21158–21163. [Google Scholar] [CrossRef] [Green Version]
  80. Renema, R.A.; Robinson, F.E.; Feddes, J.J.R.; Fasenko, G.M.; Zuidhoft, M.J. Effects of light intensity from photostimulation in four strains of commercial egg layers: 2. egg production parameters. Poult. Sci. 2001, 80, 1121–1131. [Google Scholar] [CrossRef]
  81. Jácome, I.; Rossi, L.; Borille, R. Influence of artificial lighting on the performance and egg quality of commercial layers: A review. Braz. J. Poult. Sci. 2014, 16, 337–344. [Google Scholar] [CrossRef]
  82. Dalgin, G.S.; Holloway, D.T.; Liou, L.S.; DeLisi, C. Identification and characterization of renal cell carcinoma gene markers. Cancer Inform. 2007, 3, 65–92. [Google Scholar] [CrossRef] [PubMed]
  83. Huang, Y.; Temperley, N.D.; Ren, L.; Smith, J.; Li, N.; Burt, D.W. Molecular evolution of the vertebrate TLR1 gene family—A complex history of gene duplication, gene conversion, positive selection and co-evolution. BMC Evol. Biol. 2011, 11, 149. [Google Scholar] [CrossRef] [PubMed]
  84. Ruan, W.K.; Wu, Y.H.; An, J.; Cui, D.F.; Li, H.R.; Zheng, S.J. Toll-like receptor 2 type 1 and type 2 polymorphisms in different chicken breeds. Poult. Sci. 2012, 91, 101–106. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  85. Chen, J.M.; Masson, E.; Le Maréchal, C.; Férec, C. Copy number variations in chronic pancreatitis. Cytogenet. Genome Res. 2008, 123, 102–107. [Google Scholar] [CrossRef]
  86. Luo, S.; Yu, J.A.; Li, H.; Song, Y.S. Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans. Life Sci. Alliance 2019, 2, e201800221. [Google Scholar] [CrossRef] [Green Version]
  87. Muroya, S.; Oe, M.; Ojima, K. Thiamine accumulation and thiamine triphosphate decline occur in parallel with ATP exhaustion during postmortem aging of pork muscles. Meat Sci. 2018, 137, 228–234. [Google Scholar] [CrossRef]
  88. Bettendorff, L.; Wins, P. Thiamin diphosphate in biological chemistry: New aspects of thiamin metabolism, especially triphosphate derivatives acting other than as cofactors. FEBS J. 2009, 276, 2917–2925. [Google Scholar] [CrossRef]
Figure 1. General description of the identified copy number variation regions (CNVRs). (A) Venn diagram for overlapping CNVRs between Red Jungle Fowl (RJF) and domesticated breeds (above), and among Cornish (CN), Rhode Island Red (RIR), and White Leghorn (WL) (below); (B) distribution of CNVRs on chromosome ideogram according to their state (deletion and duplication). This was drawn using Rideogram R package. The color painted on the chromosome represents the gene density; (C) distribution of CNVRs size by state; (D) chromosomal distribution of CNVRs for three chicken breeds and length of chicken autosome.
Figure 1. General description of the identified copy number variation regions (CNVRs). (A) Venn diagram for overlapping CNVRs between Red Jungle Fowl (RJF) and domesticated breeds (above), and among Cornish (CN), Rhode Island Red (RIR), and White Leghorn (WL) (below); (B) distribution of CNVRs on chromosome ideogram according to their state (deletion and duplication). This was drawn using Rideogram R package. The color painted on the chromosome represents the gene density; (C) distribution of CNVRs size by state; (D) chromosomal distribution of CNVRs for three chicken breeds and length of chicken autosome.
Animals 09 00809 g001
Figure 2. Hierarchical clustering analysis according to the presence or absence of CNVR.
Figure 2. Hierarchical clustering analysis according to the presence or absence of CNVR.
Animals 09 00809 g002
Figure 3. Manhattan plot of genome-wide genes’ VST value. (A) CN vs. WL; (B) WL vs. RIR; (C) CN vs. RIR. The red line represents 0.79 VST.
Figure 3. Manhattan plot of genome-wide genes’ VST value. (A) CN vs. WL; (B) WL vs. RIR; (C) CN vs. RIR. The red line represents 0.79 VST.
Animals 09 00809 g003
Figure 4. Estimated copy numbers of genes with different copy numbers between breeds and depth of coverage of gene region. The center line of the box represents the median and the line at the end of the box represents the first and third quartiles. Tukey’s HSD test was used to determine which breeds were significantly different (left). The depth of coverage of the region around the gene was extracted for a specific window size and divided by the total depth of each sequence file. The window size was determined by gene size. The scatter plot represents each individual and the line represents breed average (right). (A) PRSS2, window size: 300bp; (B) TPK1, window size: 10,000bp.
Figure 4. Estimated copy numbers of genes with different copy numbers between breeds and depth of coverage of gene region. The center line of the box represents the median and the line at the end of the box represents the first and third quartiles. Tukey’s HSD test was used to determine which breeds were significantly different (left). The depth of coverage of the region around the gene was extracted for a specific window size and divided by the total depth of each sequence file. The window size was determined by gene size. The scatter plot represents each individual and the line represents breed average (right). (A) PRSS2, window size: 300bp; (B) TPK1, window size: 10,000bp.
Animals 09 00809 g004
Table 1. Statistical summary of detected CNVs.
Table 1. Statistical summary of detected CNVs.
BreedCNVsDeletionSize_Del (bp)DuplicationSize_Dup (bp)
Red Jungle Fowl71458420,377.413016,019.1
Cornish44332727,572.311616,156.9
Rhode Island Red47134626,778.912516,394.3
White Leghorn46734726,451.812015,823
Table 2. Functional annotation of genes with significant differences in copy number between breeds (p-value < 0.05).
Table 2. Functional annotation of genes with significant differences in copy number between breeds (p-value < 0.05).
CategoryIDTermCountp-ValueGenes
BPGO:2000251Positive regulation of actin cytoskeleton Reorganization21.88E-02NTRK3, HRAS
BPGO:0002224Toll-like receptor signaling pathway23.90E-02TLR2B, TLR2A
BPGO:0050852T cell receptor signaling pathway24.81E-02HRAS, CD247
BPGO:0060041Retina development in camera-type eye24.99E-02LPCAT1, RD3
CCGO:0031226Intrinsic component of plasma membrane21.58E-02TLR2B, TLR2A
MFGO:0016491Oxidoreductase activity53.44E-05LOC425137, LOC107053269, AKR1B1L, LOC772271, ADH6
MFGO:0004888Transmembrane signaling receptor activity32.57E-03CD247, TLR2B, TLR2A
KEGG PATHWAYgga01100Metabolic pathways73.10E-02TPK1, LOC425137, LPCAT1, AKR1B1L, ADH6, TWISTNB, ALDH3B1
KEGG PATHWAYgga00040Pentose and glucuronate interconversions24.25E-02LOC425137, AKR1B1L
BP = biological process; CC = cellular components; MF = molecular function.

Share and Cite

MDPI and ACS Style

Seol, D.; Ko, B.J.; Kim, B.; Chai, H.-H.; Lim, D.; Kim, H. Identification of Copy Number Variation in Domestic Chicken Using Whole-Genome Sequencing Reveals Evidence of Selection in the Genome. Animals 2019, 9, 809. https://doi.org/10.3390/ani9100809

AMA Style

Seol D, Ko BJ, Kim B, Chai H-H, Lim D, Kim H. Identification of Copy Number Variation in Domestic Chicken Using Whole-Genome Sequencing Reveals Evidence of Selection in the Genome. Animals. 2019; 9(10):809. https://doi.org/10.3390/ani9100809

Chicago/Turabian Style

Seol, Donghyeok, Byung June Ko, Bongsang Kim, Han-Ha Chai, Dajeong Lim, and Heebal Kim. 2019. "Identification of Copy Number Variation in Domestic Chicken Using Whole-Genome Sequencing Reveals Evidence of Selection in the Genome" Animals 9, no. 10: 809. https://doi.org/10.3390/ani9100809

APA Style

Seol, D., Ko, B. J., Kim, B., Chai, H. -H., Lim, D., & Kim, H. (2019). Identification of Copy Number Variation in Domestic Chicken Using Whole-Genome Sequencing Reveals Evidence of Selection in the Genome. Animals, 9(10), 809. https://doi.org/10.3390/ani9100809

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop