1. Introduction
The conservation and management of domestic animal genetic diversity has become a key issue in modern livestock breeding since the globalization of breeding programs [
1]. Genetic diversity erosion, the increase in recessive allele frequency, and reduced performances in traits of breeding interest are the main consequences of mating among relatives [
2]. The availability and completeness of population data are key aspects to assess genetic variability and develop strategic goals in biodiversity conservation [
3]. In the last decades, traditional estimations based on pedigree data have been replaced or integrated with microsatellite data. Nowadays, the availability of high-density SNP chips has fostered the development of new tools to assess genetic diversity, offering a detailed picture of diversity across the genome [
4]. The correlation between pedigree-based and molecular diversity depends on the completeness of pedigree information, as well as on the number and frequency of molecular markers. The number of markers is a key aspect, since too few are only able to reflect inbreeding at some (random) points along the genome. In contrast, pedigree-based diversity offers a global estimate [
4]. Larger numbers of markers allow stronger correlations with pedigree inbreeding [
5], despite the inability to obtain correlations equal to one since Mendelian sampling is ignored in pedigree-based inbreeding, and pedigree information are in some cases wrong or incomplete [
4]. Therefore, the integration of pedigree-based and molecular information may provide a more meaningful overview on genetic diversity. [
5].
A number of recent studies in livestock species have focused on the estimation of genetic diversity using both pedigree data and molecular information: in cattle [
4,
5], pigs [
6], and horses. In horses, genetic diversity was estimated using pedigree data and microsatellites (e.g., in thoroughbred horses [
7], Franches–Montagnes [
8], and Belgian Draught horse [
9]) and, more recently, using pedigree and high-density SNP information (Norik of Muran horses [
10]).
The Italian Heavy Draught Horse (IHDH;
Supplementary Figure S1) is a local Italian horse breed whose origin can be traced back to the formation of the Kingdom of Italy in 1861. The breed was established to support the development of a heavy horse for rapid draught purposes both in agriculture and field artillery [
11]. The breed was settled by crossing Norfolk–Breton stallions (imported and used since 1911) from France (“Posthorse” or “Postier Breton”) with local heavy mares from the northeast of Italy. Since the institution of the studbook in 1927, a selection program was developed to obtain a homogeneous local population, even if French Bretons stallions were still introduced to keep genetic variability [
11]. In the 1960s, the population size progressively decreased due to an increased mechanization in agriculture. Nonetheless, the breed survived thanks to the addition of the meat attitude (anacaitpr.it). Moreover, horses started to be selected also in Central and South Italy during the 1960s, also contributing to the official breeding nucleus since the end of the 1970s [
11]. Breton’s stallions and mares were widely used in the 1980s and the early 1990s to increase the genetic variability toward meat production, but their use officially stopped in 2004. In those years, a complex situation occurred in IHDH breeding management: in North Italy Bretons stallions and mares were still used for breeding; in Central and South Italy, local mares were used only.
Nowadays, the meat attitude of the breed is still present, but the original heavy draught purpose has acquired increasing interest in recent years for agriculture, especially in organic farms, and for leisure activities such as team races [
11,
12]. The last official update (August 2019) of the DAD-IS database of the Food and Agriculture Organization-FAO (fao.org/dad-is/) reported a population size of 5137 individuals, including 353 stallions and 2962 mares in 792 studs. The population size has a slight decreasing trend, but the inbreeding rate is routinely monitored [
11].
A selection program for meat and heavy draught attitudes based on linear type traits collected on young foals has been adopted in this breed [
13]. The current IHDH individual is a bulky animal with an adult weighing 700–900 kg and an average wither height of 157 cm in males and 154 cm in females (anacaitpr.it; fao.org/dad-is/). The IHDH is nowadays spread across the whole country and reared both in stables (North Italy), and in feral or semi-feral conditions (Central and South Italy) [
12].
Even though pedigree information has been recorded since the institution of the stud book (1927), an investigation of genetic diversity of the breed based on genealogical data has not been performed yet [
11]. Except for individual inbreeding and average generation length [
11], no additional information on genetic diversity parameters and on the effective population size were investigated before the present study.
A pilot investigation on genetic variability based on molecular information was carried out using a panel of 23 microsatellite markers [
14]. Here, a complex and fragmented structure of the population was observed, suggesting the occurrence of two subpopulations within IHDH breed and the importance of further analysis. Genotypic information on IHDH individuals was recently available, thanks to a national project for rural development in Italian horse breeds. These data may allow investigating both the population structure and the presence of genomic regions potentially under selection (also known as “selection signatures” [
15,
16]) in the breed. Recent developments in genomic methodologies have allowed further exploring the effects of a positive selection in the genome in terms of stretches of consecutive homozygous loci, which are also known as runs of homozygosity (ROH [
17]).
Moving from these considerations, the present study aimed to perform an investigation of the genetic diversity of the IHDH breed by means of both pedigree and genotype data, including the investigation of a possible fragmentation of population structure looking at genotypic information and the search of selection signatures using ROH.
4. Discussion
The present study intended to use both pedigree and molecular information to depict the IHDH genetic diversity as clearly as possible. Since the population size of the IHDH breed has decreased consistently over the last few years, the evaluation and knowledge of the current genetic diversity is essential to eventually perform genetic conservation actions [
2].
The pedigree data of the IHDH breed are likely to offer reliable information, since the pedigree completeness is high, especially in the last decade, as shown in the reference population. Likewise, the equivalent number of generations higher than 3 suggested a rather complete pedigree [
22]. The definition of a decade (2010–2019) as a reference population is mainly because the breed is late maturing [
11], and the average generation length is close to 9 years [
11]. Long generation intervals were found also in other horse breeds, such as Bardigiano (8.47; [
56]), Italian Haflinger (9.71; [
57]), and Lusitano (10.52; [
58]). The variation of pedigree completeness, inbreeding level, and effective population size over time reflected the IHDH population history. Lower inbreeding levels and thus an increased population size were found when the population basis of the breed has been enlarged. This happened in the 1960s, after the enlargement of the breeding area to the center and south of Italy, and in the 1980s, when the horses from the new breeding areas started to be used for selection purposes (see Introduction). In the 1980s, the selective bases of the population enlarged also due to the increased number of French Breton stallions and mares used for breeding.
French animals were only 0.42% of the individuals in pedigree, but their contribution is massive. The analysis of ancestors and founders showed that 78.9% of the ancestors explaining the 50% of the population is represented by French horses. Nevertheless, the IHDH population did not experience recent events of bottlenecks, as suggested by the ratio among the effective number of ancestors and founders (
fa/fe; [
24] as important bottlenecks are expected with a
fa/fe below 0.5 [
51]).
The massive introduction of new animals along IHDH history has led to a low and rather constant value of average relatedness over time. This later aspect will potentially allow a better control of the inbreeding in the long term [
59]. Inbreeding has remained low over time, undergoing a reduction when the breed was enlarged (the new individuals had a pedigree-based inbreeding of zero (F_trad, [
28]). A second pedigree-based inbreeding coefficient (F_rec) was considered able to recover incomplete pedigree information [
21], but it has to be noticed that the latter methodology could overestimate the inbreeding level. This is because the founders’ animals up to 2004 (the year in which the introduction of French horses stopped) are not related. The F_rec was able to determine a pretty constant increment of inbreeding (ΔF) over time. However, this inbreeding rate is largely lower than the threshold of ΔF = 1% that FAO recommends for small populations [
60]. In addition, the effective population size from pedigree data was definitely higher than the recommended threshold of 50 individuals per generation [
60] along the whole history of IHDH.
Molecular information was able to complement the results found from pedigree data. The Ne based on linkage disequilibrium information from SNPs allowed tracing an historical trend of the effective population size, back to the beginning of the history of the breed [
36]. A similar approach was used to detect the ancestral effective population size also in other horse breeds, such as in Finnish horse [
58]. The current population size estimated with this approach is a bit higher than what was measured with pedigree information. As a matter of fact, a low level of homozygosity was found in IDHD genotyped animals, which was even lower than what was expected from SNP information. As a consequence, the inbreeding level computed as the difference between the observed and expected homozygous genotypes (F_het) was lower than zero in 168 of the 267 genotyped individuals. However, highly correlated estimates, roughly differing for a constant value, were obtained between F_het and F_roh. Many studies in various species, including horses [
35,
45], have demonstrated that ROH are a feasible source of information to describe genomic inbreeding [
61]. Moreover, the identification of ROH segments can be useful to investigate complex population histories and structures [
61], commonly assuming that long consecutive homozygous segments are the result of identical haplotypes from common ancestors [
62]. Therefore, they are useful to estimate inbreeding coefficients for individuals with incomplete pedigree information. The high correlation between pedigree-based inbreeding and genomic inbreeding coefficients is the result of both the pedigree completeness of IHDH (reliable information on individual inbreeding needs high-quality pedigree data [
62]), and of the density of SNP information [
4]. Moreover, ROH segments of different lengths reflect inbreeding events that occurred in different time frames [
61]. Most of the IHDH genotyped individuals show ROH in all the length classes considered, suggesting the occurring of inbreeding events both in the recent history of the breed (likely to be due after the 1980s) and in the past decades (1940s).
The use of French Breton in the breed was reflected by the fragmented structure of the population at the genotype level, which is a trend previously found using microsatellites [
14]. Genotype data showed that one subpopulation, here referred to as
subpop1, showed highly “connected” individuals with a genomic inbreeding measured from ROH (F_roh) on average a bit higher than in individuals who belonged to the other subpopulation (called
subpop2). A complex population structure with subpopulations identified using clustering methods based on genomic data was also found in Lipizzan horse [
45] and in Noriker [
63]. However, pedigree information showed a certain degree of overlap between the two subpopulations. Mostly, the same ancestors (all French Norfolk–Breton stallions) explained most of the genetic diversity of the subpopulation to which they refer but with different amounts of explained genetic variability. The extensive use of a single stallion contributing to 19% of the whole genetic diversity of
subpop1 and a wide contribution of a local mare not included among the ancestors of the other subpopulation, which highly contributed to determining the separation of the two subpopulations. Looking at the history of the breed, it is possible to note that many ancestors of
subpop2 came from the Regional Equestrian Breeding Centre of Ferrara that was used to introduce Breton stallions from France for breeding purposes until this practice was officially stopped for the whole IHDH breed. However, the two subpopulations have never been recognized or treated as different in IHDH breeding management, and also the selection pressures to which they were subdued were the same.
Runs of homozygosity have been widely used in animal genetics to detect both within-breed loss of genetic diversity and the potential signature of selections [
4,
64,
65]. Overlapping homozygous regions, highly shared among individuals belonging to the same population, are thought to be potential signs of selection around a target locus. Several examples of ROH analyses in horses are available, and they address key aspects of the breed history and selection pressure [
17,
35,
46,
61,
63,
66]. In this study, we detected three ROH islands shared in over the 70% of the animals, highlighting potential signatures of selection in two regions on ECA3 and one region on ECA11. The ROH islands located on ECA3 overlapped with known QTLs for white markings which are indeed highly present in this breed (
Supplementary Figure S1). These two ROH islands overlapped also with QTLs for two disease-related traits: the insect bite hypersensitivity and the guttural pouch tympani; therefore, we cannot rule out the hypothesis that those regions might be under selection due to their association with disease resistance. In addition, those two regions highly overlapped with the potential selection signatures found in two other draught horse breeds: the Noriker horse breed [
46] and the Muran horse breed [
67]. Therefore, we can suggest that some of the genes located in those two regions might have an important function for draught horse. The current breeding program in the IHDH is mainly designed for meat production, but we did not find any signs of potential selection related to this trait. The only molecular information linked to meat production was found in a previous study finding some associations between the morphological traits evaluated for meat and myostatin gene polymorphism [
4]. Moreover, fewer ROH islands were detected in the IHDH breed if compared to European breeds mainly selected for sport disciplines [
35]. A possible explanation might be related to the multiple aims currently present in the IHDH breeding program, including meat production and several types of leisure activities. The lack of significant differences from the ROH island analysis based on the two subpopulations suggested that the difference found from the population structure analyses is mainly due to ancestors’ effects rather than different breeding purposes [
46,
67]. A further use of high-density SNP panels (600 K), with a 10-fold number of SNPs, or a different marker panel that is more appropriate for this breed could maybe allow identifying additional ROH, if present. However, a high density would not be useful for better clarifying the occurrence of two subpopulations more clearly. Only the handling of the animals over time would eventually favor the definition of subgroups or not, depending on the breeding purpose.