Next Article in Journal
Biodosimetry of Low Dose Ionizing Radiation Using DNA Repair Foci in Human Lymphocytes
Previous Article in Journal
Arterial Elasticity in Ehlers-Danlos Syndromes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Inference of Breed Structure in Farm Animals: Empirical Comparison between SNP and Microsatellite Performance

1
Université de Djelfa, Djelfa 17000, Algeria
2
Laboratory of Exploration and Valorization of Steppic Ecosystems (EVES), University of Djelfa, BP 3117, Djelfa 17000, Algeria
3
Science Veterinary Institute, University of Blida, BP 270, Blida 09000, Algeria
4
Laboratory of Biotechnology related to Animal Reproduction (LBRA), University of Blida, BP 270, Blida 09000, Algeria
5
Laboratory of Physiopathologie et biochimie de la Nutrition (PpBioNut) Department of Biology, Abou Bekr Belkaid Tlemcen University, Tlemcen 13000, Algeria
6
Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, 74100 Bari, Italy
7
Dr. Van Haeringen Laboratorium BV, Agro Business Park 100, 6708PW Wageningen, The Netherlands
8
Univ. Limoges, INRAE, EA7500, USC1061 GAMAA, F-87000 Limoges, France
9
Université Paris-Saclay, INRA, AgroParisTech, GABI 78350 Jouy-en-Josas, France
*
Author to whom correspondence should be addressed.
Genes 2020, 11(1), 57; https://doi.org/10.3390/genes11010057
Submission received: 15 December 2019 / Revised: 21 December 2019 / Accepted: 1 January 2020 / Published: 4 January 2020
(This article belongs to the Section Population and Evolutionary Genetics and Genomics)

Abstract

:
Knowledge of population structure is essential to improve the management and conservation of farm animal genetic resources. Microsatellites, which have long been popular for this type of analysis, are more and more neglected in favor of whole-genome single nucleotide polymorphism (SNP) chips that are now available for the main farmed animal species. In this study, we compared genetic patterns derived from microsatellites to that inferred by SNPs, considering three pairs of datasets of sheep and cattle. Population genetic differentiation analyses (Fixation index, FST), as well as STRUCTURE analyses showed a very strong consistency between the two types of markers. Microsatellites gave pictures that were largely concordant with SNPs, although less accurate. The best concordance was found in the most complex dataset, which included 17 French sheep breeds (with a Pearson correlation coefficient of 0.95 considering the 136 values of pairwise FST, obtained with both types of markers). The use of microsatellites reduces the cost and the related analyses do not require specific computer equipment (i.e., information technology (IT) infrastructure able to provide adequate computing and storage capacity). Therefore, this tool may still be a very appropriate solution to evaluate, in a first stage, the general state of livestock at national scales. At a time when local breeds are disappearing at an alarming rate, it is urgent to improve our knowledge of them, in particular by promoting tools accessible to the greatest number.

1. Introduction

According to the “Second Report on the State of the World’s Animal Genetic Resources for Food and Agriculture” [1], indiscriminate cross-breeding (i.e., breeding not carried out within the framework of selection plans) and the increased use of imported exotic breeds have been identified as the main causes leading to the genetic erosion of livestock. These practices directly threaten the genetic integrity of local breeds. Developing countries, under increasing economic pressure, are the most exposed to this phenomenon. Moreover, in these countries, locally adapted breeds are still mainly reared under traditional practices and generally there is less formal organization of the livestock sector. Thus, they are made more vulnerable. In addition, the ease of herds’ mobility, which became widespread in the age of motorization, has strengthened gene flows between populations. As emphasized by Sponenberg et al. [2], gene flows, until recently naturally limited by geographical, historical and cultural conditions, are intensifying, which endangers the genetic integrity of “traditional” breeds while “standardized” and “industrial” breeds are artificially maintained in a state of protective isolation.
In such a context, knowledge of population structure appears to be essential for managing and conserving Farm Animal Genetic Resources (FAnGR). This is particularly important in developing countries, where local breeds adapted to environment, disease and social uses, represent an invaluable reservoir of genetic diversity that is threatened by agricultural modernization.
In the last few decades, microsatellites (also called short tandem repeats, (STRs) or simple sequences repeats (SSRs)) have been the marker of choice in animal population genetics. In recent years, the use of single nucleotide polymorphisms (SNPs) has become increasingly important in this area. Nowadays, advances in high-throughput sequencing have identified thousands of SNPs, leading to the development of whole-genome SNP chips for the major farm animal species [3]. Illumina Inc. provides public and commercial SNP chips for cattle (BovineSNP50v2) [4], sheep (OvineSNP50) [5], goat (GoatSNP50) [6], chicken (chip with 57,636 SNPs) [7] and pig (PorcineSNP60) [8]. Faced with the power of chips, microsatellites are increasingly left aside by research focused on genome diversity. At this time of technological progress, the major concern in the Farm Animal Genetic Resources (FAnGR) field seems to be the management of the “transition” from microsatellite data to SNP data [3]. However, the use of array technology implies a higher cost as well as the availability of IT infrastructure able to provide adequate computing and storage capacity, and indeed the staff to maintain and operate these platforms. All these conditions can be difficult for developing countries to meet.
A significant amount of work has been dedicated to the comparison between microsatellites and SNPs considering animal and plant models. If we limit the overview to vertebrates alone, we can distinguish different types of research. Some studies have compared the effectiveness of the two markers: (i) for the purpose of performing kinship analyses [9,10,11,12,13], (ii) to infer diversity parameters e.g., [14,15], and (iii) to analyze genetic relationships between populations [16,17,18,19,20]. This last group is of particular interest for this study as it aims to enrich knowledge about the precise issue of the inference of population genetic structure. The studies [16,17,18,19,20] carried out comparative analyses using a few hundred SNPs, mainly due to the limited availability of SNPs markers for the models considered at the time of the investigation. The study by Gärke et al. [18] is an exception as it considers 29 microsatellites and 2931 SNPs in chicken, with the aim of determining the number of SNPs required to achieve the same differentiation power as for a given standard set of microsatellites.
In this study, we consider the case of farm animals for which the amount of SNP available is now very high. The question asked is whether a standard set of microsatellites can provide a picture of the population genetic structure that is comparable to the pattern deduced from a set of several tens of thousands of SNPs. From this perspective, we empirically compared the performance of microsatellites and SNPs to infer the relationship among breeds, considering datasets of sheep and cattle from the literature. For each breed, genotypes were extracted from studies conducted with microsatellites and from studies performed with SNP chips. The datasets were chosen to consider limited geographical scales (mainly national) so as to be able to conduct fine genetic analyses. The conclusions of this study may lead to reconsider the possible interest toward microsatellites in current and future management of FAnGR.

2. Materials and Methods

2.1. Datasets

We studied three pairs of datasets including (i) five local Algerian sheep breeds, (ii) 17 French sheep breeds (mostly local with one composite), and (iii) seven local French cattle breeds. All datasets were available online except the microsatellite dataset for French sheep [21]. For all the datasets, the sampling aimed to limit kinship as much as possible, using pedigree when available and/or sampling animals from as many different birth flocks as possible. As far as possible, the datasets were balanced, i.e., 20 individuals were considered for each population. The individual inbreeding levels (FIS) obtained with PLINK v1.07 [22] and Genepop v4.7 [23,24] software for SNPs and microsatellites, respectively, were used as criteria in the selection of individuals to be retained. All details concerning the datasets are shown in Table 1 and Table S1.

2.1.1. Microsatellites Datasets

The considered datasets included between 21 and 30 microsatellites (see details in Table 1). Most microsatellites, regardless of the dataset, were selected according to ISAG/FAO recommendations [25]. The list of the microsatellites used in each dataset is shown in Table S2. Details on primers, original references and experimental protocols (conditions of PCR, multiplexing) can be found in: [26] for the French cattle dataset; [27] for the Algerian sheep dataset; and [21] for the French sheep dataset.
Polymorphic information content (PIC), observed heterozygosity (HO) and effective number of alleles (Ae) were estimated for all markers using Molkin software (version 2.0) [28].

2.1.2. SNPs Datasets

Cattle genotypes [29] were obtained using the Illumina BovineSNP50v1 BeadChip. Algerian sheep genotypes [30,31] were obtained using the Illumina OvineSNP50K BeadChip. French sheep population genotypes [32] were obtained using the Illumina Ovine HD SNP chip; for the purpose of the study, a subset of SNPs was extracted using the OvineSNP50K BeadChip coordinates. The following filtering parameters were applied with PLINK: (i) SNP call rate ≤ 97%, (ii) SNP minor allele frequency (MAF) ≤ 10%, (iii) animals with ≥10% missing genotypes, and (iv) SNPs that did not pass the HWE test (p ≤ 0.001).

2.2. Data Analysis

We computed Nei’s [33] pair-wise FST values and the associated 95% confidence intervals using the Hierfstat package [34] available for R version 3.5.0 [35]. Following the recommendations of Malomane et al. [36], no linkage disequilibrium (LD)-based SNP pruning was performed prior to the FST computations. NeighborNet graphs based on Reynolds’ [37] genetic distances were constructed using SplitsTree [38].
To ensure that uncorrected LD did not distort the analysis, SNP pruning was used to identify a subset of SNPs using the --indep option of PLINK with the following settings: 50 SNPs per window, a shift of five SNPs between windows, and a variation inflation factor threshold of two (corresponding to r2 > 0.5).
Genetic clusters of individuals were identified via a Bayesian model-based approach implemented in STRUCTURE 2.3.4 [39,40,41] using: an admixture model, correlated allele frequencies, 50,000 burn-in followed by 200,000 simulations. Convergence was checked using ten runs for each K value. The most probable value of K was estimated on the basis of the ΔK statistic [42]. The program CLUMPAK [43] available at http://clumpak.tau.ac.il, was used to analyze the multiple independent runs at a single K value and to visualize the results.

3. Results and Discussion

Microsatellite datasets included 113 (Algerian sheep dataset) to 425 (French sheep dataset) individuals (Table 1). The total number of alleles varied from 283 (French cattle dataset) to 343 (Algerian sheep dataset) (Table 1). The informative significance of the different microsatellite datasets was substantial (see indices, mean A, mean PIC, mean HO and mean Ae).
The SNP datasets included 36 (Algerian sheep dataset) to 346 individuals (French sheep dataset). The number of SNPs available after filtration ranged from 36,493 (Algerian sheep dataset) to 47,286 (French cattle dataset) (Table 1).
Mean values of FST obtained from microsatellite datasets were very close to those obtained from SNP datasets (Table 1). Pearson correlation coefficient between pairwise FST values obtained for the microsatellite dataset and the SNP dataset ranged from 0.77 (French cattle dataset) to 0.95 (French sheep dataset). All correlation coefficients were significantly different from zero (Table 1). By considering in detail the 95% confidence intervals for each pair (values obtained from microsatellite and SNP datasets) of pairwise FST values, the following was observed: for the French sheep dataset, four pairs of confidence intervals were non-overlapping considering the 136 pairs available (Table S3). For the Algerian sheep dataset, out of the 10 pairs of FST values, one showed non-overlapping confidence intervals (Table S4). For the French cattle dataset, one pair on the 21 pairs of FST values showed no overlapping (Table S5).
NeighborNet graphs, based on Reynolds genetic distances, obtained from microsatellite genotyping or SNP genotyping gave highly coherent pictures of the breed relationships for each dataset (Figure 1, Figure 2 and Figure 3). The major difference was overestimation of genetic distance between Ouled-Djellal and Rembi in the Algerian dataset (Figure 1), by microsatellites compared to results obtained with SNP. The same situation occurred for Salers and Aubrac (cattle dataset, Figure 2). The French sheep dataset, comprising 17 breeds, requires more commentary (Figure 3). Both analyses correctly ordered the breeds according to their location of origin. Moreover, they clearly distinguished the breeds of the Massif Central/South of France from the others. In the same way, the genetic peculiarity of Mérinos de Rambouillet, a patrimonial breed maintained as a closed flock for around 230 years [21], was obvious. The Romane, a recent composite breed, appeared grouped with the Berrichon du Cher, an expected result given the breed was developed by crossing Romanov (not included in the dataset) with Berrichon du Cher. Differences were mainly seen in the cluster including North-West breeds (i.e., Roussin de la Hague, Rouge de l’Ouest) and Charmoise. This cluster appeared quite far from the cluster including Berrichon du Cher/Romane in the microsatellite analysis whereas the SNP analysis emphasized proximity for these two clusters versus the breeds of the Massif Central/South of France. Charmoise was developed in the middle of the 19th century by crossing local breeds (i.e., Berrichon du Cher, Solognot (not included in the dataset), and Mérinos de Rambouillet) with Romney sheep (not included in the dataset) imported from the United Kingdom. Moreover, Roussin de la Hague, Rouge de l’Ouest and Berrichon du Cher were largely subjected to the influences of English sheep breeds, to which is added the substantial influence of Merinos de Rambouillet for the Berrichon du Cher breed. The history of these breeds explains their grouping, which was only captured as a whole by the SNP analysis, whereas the microsatellite analysis was more sensitive to the strong link between Berrichon du Cher and Merinos de Rambouillet.
STRUCTURE analyses also showed very consistent patterns between the two types of markers (Figure 4, Figure 5 and Figure 6). In general, patterns obtained from microsatellite datasets were noisier than those from SNP datasets.
Considering the French sheep dataset (Figure 4), the most likely number of clusters according to the ΔK criterion was K = 3 for the microsatellite dataset, highlighting the existence of the three groups: (i) the Massif Central/South of France group, (ii) the Charmoise/Rouge de l’Ouest/Roussin de la Hague group, and (iii) the Berrichon/Romane/Mérinos de Rambouillet group. For the SNP analysis, the ΔK criterion identified K = 2 as the most likely number of clusters, highlighting Mérinos de Rambouillet as the major distinction versus the other breeds. Both analyses underlined the clear distinction between the Massif Central/South of France breeds and the others (K3 and K4). Considering the Massif Central/South of France group, both analyses emphasized the genetic peculiarity of Manech Tête Rousse, the proximity of Noire du Velay/Rava/Causse du Lot, as well as of Lacaune meat/Lacaune milk, and also Blanche du Massif Central/Préalpes du Sud/Moureros/Tarasconnaise. The major distinctions were as follows: for K = 2, the microsatellites clustered Charmoise/Roussin de la Hague/Rouge de l’Ouest/Berrichon/Romane/Mérinos de Rambouillet. Then, at K = 3, they postulate a split within this group, with Charmoise/Rouge de l’Ouest/Roussin de la Hague on one hand and Berrichon/Romane/Mérinos de Rambouillet on the other. The SNP analysis, on the contrary, grouped all these breeds together with the exception of the Mérinos de Rambouillet breed (K = 3). This difference was already discussed in relation with the results of the NeighborNet analysis (Figure 3).
Considering the Algerian sheep dataset (Figure 5), the most likely number of clusters according to the ΔK criterion was K = 3 for the microsatellite dataset, emphasizing the existence of the three groups: Sidaoun/D’Men, Ouled-Djellal/Rembi and Hamra. For the SNP analysis, K = 4 was identified as the most likely number of clusters. At this K value, all breeds were discriminated by both methods except the admixed Ouled-Djellal and Rembi breeds. Moreover, a clear link appeared between the Sidaoun and D’Men breeds. Hence, the two figures show a similar picture of the Algerian genetic structure.
Considering the French cattle dataset (Figure 6), the most likely number of clusters according to the ΔK criterion was K = 2 for both analyses. At this K value, the peculiarity of Rouge des Prés and its link with Bretonne Pie Noire and Charolais were highlighted. Both analyses were highly correlated and the main distinction was observed for Aubrac and Charolais, which show a clearer admixture with the SNP analysis (K = 7) than with microsatellites.
This study compared genetic structures derived from microsatellites to that inferred by SNPs, considering three pairs of datasets for two species of domesticated mammals.
Population genetic differentiation analyses, as well as STRUCTURE analyses showed a very strong consistency between the two types of markers. In particular, the French sheep dataset, including 17 breeds shaped by a rich history (English and Spanish influences among others), made it possible to fully appreciate the usefulness of microsatellite data. It must be noted that, for each pair of datasets (whatever the species considered), it is not the same individuals in each breed who have been genotyped with microsatellites and SNPs. Thus, it is quite remarkable to find such a correlation in the obtained patterns, while the variability introduced by this “sampling effect” constitutively limits the overlap of results from the beginning.
Faced with the power of SNP chips, which are able to capture even tenuous relationships between breeds and to estimate confidence intervals with the highest precision, etc., it turns out that microsatellites gave pictures that were largely concordant, although with less accuracy. Microsatellite markers were able to capture genetic patterns (gene flows, admixture, etc.) considering national scales, which is one of the first requirements for defining and prioritizing conservation measures.
According to the Food and Agriculture Organization (FAO) [1], the risk status for more than 80% of mammalian breeds in Africa and Latin America is unknown. These alarming figures reveal the extent to which ignorance about livestock is high in most developing countries. When the number of head per flock is often not even known, the implementation of expensive genetic analyses is difficult to envisage. In such a context, the use of microsatellites seems to be the most appropriate solution for various reasons. (i) The cost associated with microsatellite genotyping is significantly lower than that of the SNP chip. Indeed, commercially available kits (e.g., the Bovine Genotypes Panel 3.1, the Equine Genotypes Panel 1.1 Kit, the Canine ISAG STR Parentage Kit 2014) provide reagents for the genotyping of about twenty multiplexed microsatellites for an average sum of 13 USD per individual. Moreover, in-house genotyping may substantially reduce this price (i.e., approximately 7 USD per individual). For sheep, goat, pig and cattle the genotyping cost, via Beadchip of around 50K SNP markers, is 67 USD/individual on average, depending on the species and the manufacturer. (ii) High throughput genotyping techniques generate huge data files, the analyses of which require a powerful computing cluster and associated computer knowledge [44]. (iii) Finally, it should be stressed that breeds are still largely managed in the traditional way in developing countries. In order to understand the genetic pattern at a national scale and to answer very practical concerns such as the assessment of introgression and indiscriminate cross-breeding within local populations [45], a broad sample should be undertaken first. Indeed, sampling must be sufficiently broad to cover the global range of the breeds. Only this type of sampling will make it possible to really grasp the situation, knowing that a whole range of situations can be encountered between the preserved populations and the diluted populations [46]. The current cost of genetic chips is hardly compatible with the need for broad sampling, and once again emphasizes the appeal of using microsatellites to provide a first rough, but broad picture of the national situation (see [47]).
It should also be noted that microsatellite kits are still widely used in the private sector for paternity testing. As a result, there is a significant amount of data, especially for cattle and horses, that could be interesting to collect and analyze as part of studies focusing on genetic diversity.
By publishing microsatellite sets for different species of domestic animals [25], the FAO, has created an interesting dynamic in this context. For example, the use of common sets in different studies allows a comparison, although partial, between the datasets Indeed, data sets can be merged only when standardized protocols are followed (i.e., ISAG/ICAR standards) and laboratories use the same microsatellite panels and share a few individuals, playing the role of control, in common. The use of SNP chips, which implies a higher cost as well as the possession of associated computer platforms, could occur and be beneficial at the second stage in order to refine the conclusions, to push the studies towards much finer research (e.g., the search for selection signatures), or given the ease of merging, to compare datasets from different countries.

4. Conclusions

In conclusion, this study highlights the role that microsatellites still play in the management of FAnGR. Indeed, our results show that microsatellites provide a picture of the genetic structure, and even though it is less accurate than that obtained with SNPs, it is absolutely relevant. Therefore, microsatellites are a suitable tool to make an initial evaluation of the situation. Because the effects of climate change are becoming more pressing, it does not seem appropriate to wait until access to high throughput techniques is widespread to characterize the breeds of developing countries. These breeds, which are largely adapted to their environments, constitute an invaluable heritage that can help us to face climate changes [48].

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/11/1/57/s1, Table S1: Details of the datasets used, Table S2: List of microsatellites used for each dataset, Table S3: Pair-wise FST among French sheep breeds (with confidence intervals at 95, Table S4: Pair-wise FST among Algerian sheep breeds (with confidence intervals at 95%), Table S5: Pair-wise FST among French cattle breeds (with confidence intervals at 95%).

Author Contributions

Conceptualization, A.D.S.; Formal analysis, A.L., S.H. and A.D.S.; Resources, X.R.; Writing—Original draft, A.D.S.; Writing—Review & editing, A.L., S.H., M.L., S.B.S.G., I.B., E.C., M.D.G., V.B., G.L. and X.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

We would like to thank the breeders, the technicians of the Breeders Association, and Races-de-France (French Federation of Breeders Association) for their cooperation.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. FAO. The Second Report on the State of the World’s Animal Genetic Resources for Food and Agriculture; Scherf, B.D., Pilling, D., Eds.; FAO Commission on Genetic Resources for Food and Agriculture Assessments: Rome, Italy, 2015; Available online: http://www.fao.org/3/a-i4787e/index.html (accessed on 23 September 2019).
  2. Sponenberg, D.P.; Beranger, J.; Martin, A. An Introduction to Heritage Breeds: Saving and Raising Rare-Breed Livestock and Poultry; Storey Publishing: North Adams, MA, USA, 2014. [Google Scholar]
  3. Bruford, M.W.; Ginja, C.; Hoffmann, I.; Joost, S.; Orozco-terWengel, P.; Alberto, F.J.; Amaral, A.J.; Barbato, M.; Biscarini, F.; Colli, L.; et al. Prospects and challenges for the conservation of farm animal genomic resources, 2015–2025. Front. Genet. 2015, 6, 314. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Matukumalli, L.K.; Lawley, C.T.; Schnabel, R.D.; Taylor, J.F.; Allan, M.F.; Heaton, M.P.; O’Connell, J.; Moore, S.S.; Smith, T.P.; Sonstegard, T.S.; et al. Development and characterization of a high density SNP genotyping assay for cattle. PLoS ONE 2009, 4, e5350. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Magee, D.A.; Park, S.D.; Scraggs, E.; Murphy, A.M.; Doherty, M.L.; Kijas, J.W.; MacHugh, D.E. Technical note: High fidelity of whole-genome amplified sheep (Ovis aries) deoxyribonucleic acid using a high-density single nucleotide polymorphism array-based genotyping platform. J. Anim. Sci. 2010, 88, 3183–3186. [Google Scholar] [CrossRef] [Green Version]
  6. Tosser-Klopp, G.; Bardou, P.; Bouchez, O.; Cabau, C.; Crooijmans, R.; Dong, Y.; Donnadieu-Tonon, C.; Eggen, A.; Heuven, H.C.M.; Jamli, S.; et al. Design and Characterization of a 52K SNP Chip for Goats. PLoS ONE 2014, 9, e86227. [Google Scholar] [CrossRef] [PubMed]
  7. Groenen, M.A.; Megens, H.-J.; Zare, Y.; Warren, W.C.; Hillier, L.W.; Crooijmans, R.P.M.A.; Vereijken, A.; Okimoto, R.; Muir, W.M.; Cheng, H.H. The development and characterization of a 60K SNP chip for chicken. BMC Genom. 2011, 12, 274. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Ramos, A.M.; Crooijmans, R.P.M.A.; Affara, N.A.; Amaral, A.J.; Archibald, A.L.; Beever, J.E.; Bendixen, C.; Churcher, C.; Clark, R.; Dehais, P.; et al. Design of a high density SNP genotyping assay in the pig using SNPs identified and characterized by next generation sequencing technology. PLoS ONE 2009, 4, e6524. [Google Scholar] [CrossRef] [Green Version]
  9. Kaiser, S.A.; Taylor, S.A.; Chen, N.; Sillett, T.S.; Bondra, E.R.; Webster, M.S. A comparative assessment of SNP and microsatellite markers for assigning parentage in a socially monogamous bird. Mol. Ecol. Resour. 2017, 17, 183–193. [Google Scholar] [CrossRef]
  10. Trong, T.Q.; Bers, v.N.; Crooijmans, R.P.M.A.; Dibbits, B.W.; Komen, J. A comparison of microsatellites and SNPs in parental assignment in the GIFT strain of Nile tilapia (Oreochromis niloticus): The power of exclusion. Aquaculture 2013, 388–391, 14–23. [Google Scholar] [CrossRef]
  11. Weinman, L.R.; Solomon, J.W.; Rubenstein, D.R. A comparison of single nucleotide polymorphism and microsatellite markers for analysis of parentage and kinship in a cooperatively breeding bird. Mol. Ecol. Resour. 2015, 15, 502–511. [Google Scholar] [CrossRef]
  12. Fernández, M.E.; Goszczynski, D.E.; Lirón, J.P.; Villegas-Castagnasso, E.E.; Carino, M.H.; Ripoli, M.V.; Rogberg-Muoz, A.; Posik, D.M.; Peral-Garca, P.; Giovambattista, G. Comparison of the effectiveness of microsatellites and SNP panels for genetic identification, traceability and assessment of parentage in an inbred Angus herd. Genet. Mol. Biol. 2013, 36, 185–191. [Google Scholar] [CrossRef] [Green Version]
  13. Yu, G.C.; Tang, Q.Z.; Long, K.R.; Che, T.D.; Li, M.Z.; Shuai, S.R. Effectiveness of microsatellite and single nucleotide polymorphism markers for parentage analysis in European domestic pigs. Genet. Mol. Res. 2015, 14, 1362–1370. [Google Scholar] [CrossRef] [PubMed]
  14. Cortés, O.; Eusebi, P.; Dunner, S.; Sevane, N.; Cañón, J. Comparison of diversity parameters from SNP, microsatellites and pedigree records in the Lidia cattle breed. Livest. Sci. 2019, 219, 80–85. [Google Scholar] [CrossRef]
  15. Ljungqvist, M.; Akesson, M.; Hansson, B. Do microsatellites reflect genome-wide genetic diversity in natural populations? A comment on Väli et al. (2008). Mol. Ecol. 2010, 19, 851–855. [Google Scholar] [CrossRef] [PubMed]
  16. Morin, P.A.; Archer, F.I.; Pease, V.L.; Hancock-Hanser, B.L.; Robertson, K.M.; Huebinger, R.M.; Martien, K.K.; Bickham, J.W.; George, J.C.; Postma, L.D.; et al. Empirical comparison of single nucleotide polymorphisms and microsatellites for population and demographic analyses of bowhead whales. Endanger. Spec. Res. 2012, 19, 129–147. [Google Scholar] [CrossRef]
  17. Glover, K.A.; Hansen, M.M.; Lien, S.; Als, T.D.; Høyheim, B.; Skaala, Ø. A comparison of SNP and STR loci for delineating population structure and performing individual genetic assignment. BMC Genet. 2010, 11, 2. [Google Scholar] [CrossRef] [Green Version]
  18. Gärke, C.; Ytournel, F.; Bed’hom, B.; Gut, I.; Lathrop, M.; Weigend, S.; Simianer, H. Comparison of SNPs and microsatellites for assessing the genetic structure of chicken populations. Anim. Genet. 2012, 43, 419–428. [Google Scholar] [CrossRef]
  19. Saint-Pé, K.; Leitwein, M.; Tissot, L.; Poulet, N.; Guinand, B.; Berrebi, P.; Marselli, G.; Lascaux, J.-M.; Gagnaire, P.-A.; Blanchet, S. Development of a large SNPs resource and a low-density SNP array for brown trout (Salmo trutta) population genetics. BMC Genom. 2019, 20, 582. [Google Scholar] [CrossRef] [Green Version]
  20. Ciani, E.; Cecchi, F.; Castellana, E.; D’Andrea, M.; Incoronato, C.; D’Angelo, F.; Albenzio, M.; Pilla, F.; Matassino, D.; Cianci, D.; et al. Poorer resolution of low-density SNP vs. STR markers in reconstructing genetic relationships among seven Italian sheep breeds. Large Anim. Rev. 2013, 19, 236–241. [Google Scholar]
  21. Leroy, G.; Danchin-Burge, C.; Palhière, I.; SanCristobal, M.; Nédélec, Y.; Verrier, E.; Rognon, X. How do introgression events shape the partitioning of diversity among breeds: A case study in sheep. Genet. Sel. Evol. 2015, 47, 48. [Google Scholar] [CrossRef] [Green Version]
  22. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.W.; Darly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [Green Version]
  23. Raymond, M.; Rousset, F. GENEPOP (version 1.2): Population genetics software for exact tests and ecumenicism. J. Hered. 1995, 86, 248–249. [Google Scholar] [CrossRef]
  24. Rousset, F. Genepop’007: A complete reimplementation of the Genepop software for Windows and Linux. Mol. Ecol. Resour. 2008, 8, 103–106. [Google Scholar] [CrossRef] [PubMed]
  25. FAO, Food and Agricultural Organization of the United Nations. Molecular Genetic Characterization of Animal Genetic Resources; FAO Animal Production and Health Guidelines: Rome, Italy, 2011; pp. 65–84. [Google Scholar]
  26. FAO, Food and Agriculture Organization of the United Nations. Secondary Guidelines for Development of National Farm Animal Genetic Resources Management Plans. In Measurement of Domestic Animal Diversity (MoDAD): Recommended Microsatellite Markers; Available online: http://dad.fao.org/en/refer/library/guidelin/marker.pdf (accessed on 23 September 2019).
  27. Gaouar, S.B.S.; Da Silva, A.; Ciani, E.; Kdidi, S.; Aouissat, M.; Dhimi, L.; Lafri, M.; Maftah, A.; Mehtar, N. Admixture and Local Breed Marginalization Threaten Algerian Sheep Diversity. PLoS ONE 2015, 10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Gutiérrez, J.P.; Royo, L.J.; Álvarez, I.; Goyache, F. MolKin v2.0: A computer program for genetic analysis of populations using molecular coancestry information. J. Hered. 2005, 96, 718–721. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Sempéré, G.; Moazami-Goudarzi, K.; Eggen, A.; Laloë, D.; Gautier, M.; Flori, L. WIDDE: A Web-Interfaced next generation database for genetic diversity exploration, with a first application in cattle. BMC Genom. 2015, 16, 940. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Gaouar, S.B.S.; Lafri, M.; Djaout, A.; El-Bouyahiaoui, R.; Bouri, A.; Bouchatal, A.; Maftah, A.; Ciani, E.; Da Silva, A.B. Genome-wide analysis highlights genetic dilution in Algerian sheep. Heredity 2017, 118, 293–301. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Belabdi, I.; Ouhrouch, A.; Lafri, M.; Gaouar, S.B.S.; Ciani, E.; Benali, A.R.; Ouelhadj, H.O.; Haddioui, A.; Pompanon, F.; Blanquet, V.; et al. Genetic homogenization of indigenous sheep breeds in Northwest Africa. Sci. Rep. 2019, 9, 7920. [Google Scholar] [CrossRef] [Green Version]
  32. Moreno-Romieux, C.; Tortereau, F.; Raoul, J.; Servin, B. High density genotypes of French Sheep populations [Internet]. Zenodo 2017. [Google Scholar] [CrossRef]
  33. Nei, M. Analysis of gene diversity in subdivided populations. Proc. Natl. Acad. Sci. USA 1973, 70, 3321–3323. [Google Scholar] [CrossRef] [Green Version]
  34. Goudet, J. Hierfstat, a package for R to compute and test hierarchical F-statistics. Mol. Ecol. Notes 2005, 5, 184–186. [Google Scholar] [CrossRef] [Green Version]
  35. Team, R.C. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2006. [Google Scholar]
  36. Malomane, D.K.; Reimer, C.; Weigend, S.; Weigend, A.; Sharifi, A.R.; Simianer, H. Efficiency of different strategies to mitigate ascertainment bias when using SNP panels in diversity studies. BMC Genom. 2018, 19, 22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Reynolds, J.; Weir, B.S.; Cockerham, C. Estimation of the coancestry coefficient: Basis for a short-term genetic distance. Genetics 1983, 105, 767–779. [Google Scholar] [PubMed]
  38. Huson, D.H.; Bryant, D. Application of Phylogenetic Networks in Evolutionary Studies. Mol. Biol. Evol. 2006, 23, 254–627. [Google Scholar] [CrossRef] [PubMed]
  39. Pritchard, J.K.; Stephens, M.; Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 2000, 155, 945–959. [Google Scholar] [PubMed]
  40. Falush, D.; Stephens, M.; Pritchard, J.K. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics 2003, 164, 1567–1587. [Google Scholar]
  41. Falush, D.; Stephens, M.; Pritchard, J.K. Inference of population structure using multilocus genotype data: Dominant markers and null alleles. Mol. Ecol. Notes 2007, 7, 574–578. [Google Scholar] [CrossRef]
  42. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef] [Green Version]
  43. Kopelman, N.M.; Mayzel, J.; Jakobsson, M.; Rosenberg, N.A.; Mayrose, I. Clumpak: A program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. 2015, 15, 1179–1191. [Google Scholar] [CrossRef] [Green Version]
  44. Ducrocq, V.; Laloe, D.; Sawaminathan, M.; Rognon, X.; Tixier-Boichard, M.; Zerjal, T. Genomics for Ruminants in Developing Countries: From Principles to Practice. Front. Genet. 2018, 9, 251. [Google Scholar] [CrossRef] [Green Version]
  45. Leroy, G.; Kayang, B.B.; Youssao, I.A.K.; Yapi-Gnaore, C.V.; Osei-Amponsah, R.; Loukou, N.E.; Fotsa, J.C.; Benabdeljelil, K.; Bed’hom, B.; Tixier-Boichard, M.; et al. 2012 Gene diversity, agroecological structure and introgression patterns among village chicken populations across north, west and central Africa. BMC Genet. 2012, 13, 34. [Google Scholar] [CrossRef] [Green Version]
  46. Harkat, S.; Laoun, A.; Belabdi, I.; Benali, R.; Outayeb, D.; Payet-Duprat, N.; Blanquet, V.; Lafri, M.; Da Silva, A. Assessing patterns of genetic admixture between sheep breeds: Case study in Algeria. Ecol Evol. 2017, 7, 16. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Selepe, M.M.; Ceccobelli, S.; Lasagna, E.; Kunene, N.W. Genetic structure of South African Nguni (Zulu) sheep populations reveals admixture with exotic breeds. PLoS ONE 2018, 13, e0196276. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Taberlet, P.; Valentini, A.; Rezaei, H.R.; Naderi, S.; Pompanon, F.; Negrini, R.; Ajmone-Marsan, P. Are cattle, sheep, and goats endangered species? Mol. Ecol. 2008, 17, 275284. [Google Scholar] [CrossRef] [PubMed]
Figure 1. NeighborNet graph considering Algerian sheep breeds, from a matrix of Reynolds’ distances. (a) Plot obtained from the microsatellite dataset; (b) plot obtained from the single nucleotide polymorphism (SNP) dataset. For breed names see codes in Table S1.
Figure 1. NeighborNet graph considering Algerian sheep breeds, from a matrix of Reynolds’ distances. (a) Plot obtained from the microsatellite dataset; (b) plot obtained from the single nucleotide polymorphism (SNP) dataset. For breed names see codes in Table S1.
Genes 11 00057 g001
Figure 2. NeighborNet graph considering French cattle breeds, from a matrix of Reynolds’ distances. (a) Plot obtained from the microsatellite dataset; (b) plot obtained from the SNP dataset. For breed names see codes in Table S1.
Figure 2. NeighborNet graph considering French cattle breeds, from a matrix of Reynolds’ distances. (a) Plot obtained from the microsatellite dataset; (b) plot obtained from the SNP dataset. For breed names see codes in Table S1.
Genes 11 00057 g002
Figure 3. NeighborNet graph considering French sheep breeds, from a matrix of Reynolds’ distances. (a) Plot obtained from the microsatellite dataset; (b) plot obtained from the SNP dataset. Information concerning the region of origin of each breed was extracted from [21]; for breed names see codes in Table S1.
Figure 3. NeighborNet graph considering French sheep breeds, from a matrix of Reynolds’ distances. (a) Plot obtained from the microsatellite dataset; (b) plot obtained from the SNP dataset. Information concerning the region of origin of each breed was extracted from [21]; for breed names see codes in Table S1.
Genes 11 00057 g003
Figure 4. Genetic structure of French sheep breeds by Bayesian analysis (K = number of clusters). (a) STRUCTURE plot obtained from the microsatellite dataset; (b) graph showing ΔK calculated according to [42] for the microsatellite dataset; (c) STRUCTURE plot obtained from the SNP dataset; (d) graph showing ΔK calculated according to [42] for the SNP dataset. Comp.: composite breeds; information concerning the region of origin of each breed was extracted from [21]; for breed names see codes in Table S1.
Figure 4. Genetic structure of French sheep breeds by Bayesian analysis (K = number of clusters). (a) STRUCTURE plot obtained from the microsatellite dataset; (b) graph showing ΔK calculated according to [42] for the microsatellite dataset; (c) STRUCTURE plot obtained from the SNP dataset; (d) graph showing ΔK calculated according to [42] for the SNP dataset. Comp.: composite breeds; information concerning the region of origin of each breed was extracted from [21]; for breed names see codes in Table S1.
Genes 11 00057 g004
Figure 5. Genetic structure of Algerian sheep breeds by Bayesian analysis (K = number of clusters). (a) STRUCTURE plot obtained from the microsatellite dataset; (b) graph showing ΔK calculated according to [42] for the microsatellite dataset; (c) STRUCTURE plot obtained from the SNP dataset; (d) graph showing ΔK calculated according to [42] for the SNP dataset. For breed names see codes in Table S1.
Figure 5. Genetic structure of Algerian sheep breeds by Bayesian analysis (K = number of clusters). (a) STRUCTURE plot obtained from the microsatellite dataset; (b) graph showing ΔK calculated according to [42] for the microsatellite dataset; (c) STRUCTURE plot obtained from the SNP dataset; (d) graph showing ΔK calculated according to [42] for the SNP dataset. For breed names see codes in Table S1.
Genes 11 00057 g005
Figure 6. Genetic structure of French cattle breeds by Bayesian analysis (K = number of clusters). (a) STRUCTURE plot obtained from the microsatellite dataset; (b) graph showing ΔK calculated according to [42] for the microsatellite dataset; (c) STRUCTURE plot obtained from the SNP dataset; (d) graph showing ΔK calculated according to [42] for the SNP dataset. For breed names see codes in Table S1.
Figure 6. Genetic structure of French cattle breeds by Bayesian analysis (K = number of clusters). (a) STRUCTURE plot obtained from the microsatellite dataset; (b) graph showing ΔK calculated according to [42] for the microsatellite dataset; (c) STRUCTURE plot obtained from the SNP dataset; (d) graph showing ΔK calculated according to [42] for the SNP dataset. For breed names see codes in Table S1.
Genes 11 00057 g006
Table 1. Genetic diversity measured by dataset.
Table 1. Genetic diversity measured by dataset.
Algerian Sheep DatasetsFrench Sheep DatasetsFrench Cattle Datasets
no. of breeds5177
Microsatellite datasets:
nb. of individuals113425175
nb. of microsatellites292130
nb. of alleles343292283
mean A (s.d.)11.83 (13.29)13.90 (21.59)9.43 (11.97)
mean HO (s.d.)0.77 (0.008)0.73 (0.018)0.71 (0.015)
mean PIC (s.d.)0.74 (0.010)0.70 (0.018)0.67 (0.018)
mean Ae (s.d.)5.06 (4.67)4.46 (3.89)3.98 (3.02)
SNP datasets:
nb. of individuals *36346152
nb. of SNP52,41240,45452,324
nb. of SNP after filtration36,49339,80047,286
nb. of SNP after Pruning **15,56031,18424,841
Mean FST from microsatellites datasets (s.d.)0.048 (<0.001)0.104 (0.004)0.076 (<0.001)
Mean FST from SNP datasets (s.d.)0.048 (<0.001)0.105 (0.004)0.078 (<0.001)
r Pearson *** (p-value)0.87 (0.001)0.95 (<0.001)0.77 (<0.001)
no.: number; s.d.: standard deviation; A: number of alleles; HO: observed heterozygosity; PIC: polymorphic information content; Ae: effective number of alleles; *: after filtration (see Material and Methods); **: see Material and Methods; ***: Pearson correlation coefficient between pairwise FST values obtained with the microsatellite dataset and the SNP dataset.

Share and Cite

MDPI and ACS Style

Laoun, A.; Harkat, S.; Lafri, M.; Gaouar, S.B.S.; Belabdi, I.; Ciani, E.; De Groot, M.; Blanquet, V.; Leroy, G.; Rognon, X.; et al. Inference of Breed Structure in Farm Animals: Empirical Comparison between SNP and Microsatellite Performance. Genes 2020, 11, 57. https://doi.org/10.3390/genes11010057

AMA Style

Laoun A, Harkat S, Lafri M, Gaouar SBS, Belabdi I, Ciani E, De Groot M, Blanquet V, Leroy G, Rognon X, et al. Inference of Breed Structure in Farm Animals: Empirical Comparison between SNP and Microsatellite Performance. Genes. 2020; 11(1):57. https://doi.org/10.3390/genes11010057

Chicago/Turabian Style

Laoun, Abbas, Sahraoui Harkat, Mohamed Lafri, Semir Bechir Suheil Gaouar, Ibrahim Belabdi, Elena Ciani, Maarten De Groot, Véronique Blanquet, Gregoire Leroy, Xavier Rognon, and et al. 2020. "Inference of Breed Structure in Farm Animals: Empirical Comparison between SNP and Microsatellite Performance" Genes 11, no. 1: 57. https://doi.org/10.3390/genes11010057

APA Style

Laoun, A., Harkat, S., Lafri, M., Gaouar, S. B. S., Belabdi, I., Ciani, E., De Groot, M., Blanquet, V., Leroy, G., Rognon, X., & Da Silva, A. (2020). Inference of Breed Structure in Farm Animals: Empirical Comparison between SNP and Microsatellite Performance. Genes, 11(1), 57. https://doi.org/10.3390/genes11010057

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop