Next Article in Journal
Genetically Transitional Disease and the Road to Personalized Medicine
Previous Article in Journal
A Comprehensive Evolutionary Analysis of the Dihydroflavonol 4-Reductase (DFR) Gene Family in Plants: Insights from 237 Species
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

New Insights into Genetic Diversity and Differentiation of 11 Buffalo Populations Using Validated SNPs for Dairy Improvement

by
Alfredo Pauciullo
1,*,
Giustino Gaspa
1,
Carmine Versace
1,
Gianfranco Cosenza
2,
Nadia Piscopo
3,
Meichao Gu
4,
Angelo Coletta
5,
Tanveer Hussain
6,
Alireza Seidavi
7,
Ioana Nicolae
8,
Attawit Kovitvadhi
9,
Qingyou Liu
10,
Jianghua Shang
11,
Jingfang Si
12,
Dongmei Dai
12 and
Yi Zhang
12
1
Department of Agricultural, Forest and Food Science, University of Torino, Grugliasco, 10095 Torino, Italy
2
Department of Agriculture, University of Napoli Federico II, Portici, 80055 Naples, Italy
3
Department of Veterinary Medicine and Animal Production, University of Napoli Federico II, 80137 Naples, Italy
4
National Institute of Biological Sciences, Beijing 102206, China
5
Research, Innovation and Selection for Buffalo, 81100 Caserta, Italy
6
Department of Biological Sciences, Virtual University of Pakistan, Islamabad 44000, Pakistan
7
Department of Animal Science, Rasht Branch, Islamic Azad University, Rasht 4147654919, Iran
8
Research and Development Institute for Bovine Breeding, Balotesti, 077015 Bucharest, Romania
9
Department of Physiology, Kasetsart University, Bangkok 10900, Thailand
10
School of Life Science and Engineering, Foshan University, Foshan 528225, China
11
Buffalo Research Institute, Chinese Academy of Agricultural Sciences, Nanning 530001, China
12
College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
*
Author to whom correspondence should be addressed.
Genes 2025, 16(4), 400; https://doi.org/10.3390/genes16040400
Submission received: 13 March 2025 / Revised: 23 March 2025 / Accepted: 28 March 2025 / Published: 30 March 2025
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
Background/Objectives: Buffalo populations exhibit distinct genetic variations influenced by domestication history, geographic distribution, and selection pressures. This study investigates the genetic structure and differentiation of 11 buffalo populations, focusing on five loci related to milk protein (CSN1S1 and CSN3) and fat metabolism (LPL, DGAT1 and SCD). The aim is to assess genetic variation between river, swamp, and wild-type buffaloes and identify key loci contributing to population differentiation. Methods: Genetic diversity was analyzed through allele frequency distribution, the Hardy−Weinberg equilibrium testing, and observed (Ho) and expected heterozygosity (He) calculations. Population structure was assessed using principal component analysis (PCA), FST statistics, and phylogenetic clustering (k-means and UPGMA tree). The silhouette score (SS) and the Davies−Bouldin index (DBI) were applied to determine optimal population clustering. Results: Significant genetic differentiation was observed between river and swamp buffaloes (p < 0.001). DGAT1 and CSN3 emerged as key markers distinguishing buffalo types. The Italian Mediterranean buffalo exhibited the highest genetic diversity (Ho = 0.464; He = 0.454), while the Indonesian, Chinese, and Vietnamese populations showed low heterozygosity, likely due to selection pressures and geographic isolation. The global FST (0.2143; p = 0.001) confirmed moderate differentiation, with closely related populations (e.g., Nepal and Pakistan) exhibiting minimal genetic divergence, while distant populations (e.g., Egypt and Indonesia) showed marked differences, and the Romanian population showed a unique genetic position. Conclusions: These findings contribute to a deeper understanding of buffalo genetic diversity and provide a valuable basis for exploiting the potential of this species in the light of future breeding and conservation strategies specific for each buffalo type.

1. Introduction

The water buffalo (Bubalus bubalis) plays a significant role in global agriculture, particularly in regions such as South Asia, the Mediterranean, and South America. This species, domesticated over 5000 years ago, is essential in providing milk, meat, and draught power in various agro-ecological settings. In particular, buffalo milk, with its high fat content and rich nutrient profile, is a critical resource for dairy industries, contributing significantly to the production of high-quality dairy products like mozzarella cheese, yogurt, butter, and other dairy products [1,2]. The economic importance of buffalo extends beyond its dairy production, as its meat is also highly valued for its tenderness and nutritional qualities, and its adaptability to challenging environments makes it a vital resource in many developing countries [3].
The genetic diversity within the buffalo population is a critical aspect that influences their adaptability, productivity, and overall genetic improvement. This diversity is shaped by centuries of natural selection and human intervention through breeding practices, leading to distinct breeds with unique characteristics. Most of breeds belong to the river type (2n = 50) that has been reared more as a dairy animal than the swamp type (2n = 48), with fewer breeds, which have instead been used more as draught and meat production [1]. Domestication, history, and genetic diversity of both buffalo types have been studied at a genetic level using microsatellites and/or mitochondrial and Y-chromosome DNA markers [4,5,6,7,8,9,10,11] and a genomic level by the 90 K Array SNP chip [12,13] and the whole genome sequencing [14,15]. The application of the most advanced molecular tool helped to support the scenario of two independent domestication events for the river and swamp buffalo [16].
Understanding and harnessing the found genetic variability is essential for improving economically important traits, such as milk yield, meat quality, and disease resistance. Among the key genetic elements influencing these traits are the genes CSN1S1, CSN3, DGAT1, SCD, and LPL. These genes are known to play vital roles in milk production and quality. The CSN1S1 (αs1-casein) and CSN3 (k-casein) genes are involved in the synthesis of casein, the main protein found in milk, which directly affects milk’s nutritional and processing properties [17,18,19,20]. Variations in these genes can significantly influence milk protein content and cheese-making properties, making them important targets for genetic improvement in dairy buffalo [18,21].
The diacylglycerol O-acyltransferase 1 (DGAT1) gene is essential in the final step of triglyceride synthesis, a major component of milk fat. Variants in this gene have been associated with variations in milk fat percentage, making it a significant marker for selecting high-fat yielding buffalo breeds [22]. The same gene in cattle is a well-known QTL for fat in meat and milk, demonstrated in many different breeds [23,24,25].
Similarly, the stearoyl-CoA desaturase (SCD) gene is another important gene influencing the level of unsaturated fatty acids (FAs) as the SCD enzyme catalyses the insertion of a Δ9 double bound using the saturated FAs as a substrate. Therefore, impacting the FAs composition, SCD is considered a key factor in determining the quality of milk and dairy products [26,27]. In addition, the lipoprotein lipase gene (LPL) plays a crucial role in lipid metabolism, influencing the fat content and overall energy balance in buffalo, which are important factors for both milk yield and meat quality [28].
Former studies demonstrated that single nucleotide polymorphisms (SNPs) within these genes have been associated with favorable performances in milk yield and quality, as well as carcass characteristics, thus providing valuable markers for genetic selection and breeding programs [18,19,21,26,29,30], which can improve the overall efficiency and economic returns of buffalo farming.
The aim of the study was to use SNPs in these five genes (CSN1S1, CSN3, DGAT1, SCD, and LPL) to estimate the genetic relationships, distances, and diversity among 11 buffalo populations, including river (2n = 50), swamp (2n = 48), and wild type, which go from the Mediterranean area to the South East Asia, and to understand how selection differentiated allele frequencies in these key genes for milk and meat traits.

2. Materials and Methods

2.1. Samples and DNA Isolation

A total of 918 domestic buffaloes belonging to 11 populations (48 subpopulations, SPs) were considered in this study. Briefly, populations were indicated by the codes: BDG—Bangladesh (3 SPs, 46 samples); CHN—China (18 SPs, 149 samples); EGY—Egypt (1 SP, 19 samples); IDN—Indonesia (3 SPs, 56 samples); IRN—Iran (4 SPs, 132 samples); ITA—Italy (1 SP, 232 samples); NPL—Nepal (5 SPs, 53 samples); PAK—Pakistan (3 SPs, 27 samples); ROM—Romania (1 SP, 98 samples); THA—Thailand (5 SPs, 91 samples); and VNM—Vietnam (3 SPs, 15 samples). The geographical sampling area is reported in Figure 1, whereas details on subpopulations and their respective animal distributions are listed in Table S1.
Genomic DNA was isolated from hair root samples and from dry blood spots by NucleoSpin® Tissue kit (Macheray-Nagel, Düren, Germany) following the manufacturer’s guidelines. DNA quality and integrity were assessed using a Nanodrop One Spectrophotometer (Thermo-Scientific, Waltham, MA, USA).

2.2. PCR Amplification and Genotyping

Standard methods based on PCR (artificially created restriction site (ACRS) and restriction fragment length polymorphism (RFLP)) were used for genotyping the following five SNPs: g.129,635,007A>G at LPL; g.32,148,856A>G at the CSN1S1; g.31,917,000A>G at the CSN3; g.81,685,203G>A at DGAT1 and g. 21,066,603C>A at SCD. SNP positions refer to UOA_WB_1 (GCF_003121395.1) reference genome. Chromosomal location, SNP positions in the gene, genotyping methods, and references are reported in Table 1.
PCR amplifications were carried out using BioRad T100 thermocyclers (BioRad, Hercules, CA, USA) in a total volume of 15 µL containing 50 ng of genomic DNA, 1 × PCR buffer (Promega, Madison, WI, USA), 2.5 mM of MgCl2, 5 pmol of each primer, dNTPs, each at 200 µM, and 1 U of Taq DNA Polymerase (Promega). Digestion products were loaded on 2.5% agarose gels for electrophoresis analysis in 0.5 × TBE buffer and stained with SYBR green nucleic acid stain (Lonza Rockland Inc., Rockland, ME, USA).

2.3. Statistical Analysis

Within-population variability was estimated by the allele frequencies, gene diversity per locus and population, and observed and expected heterozygosity using the PopGen32 and FSTAT 2.9.3.2 software. The Hardy−Weinberg equilibrium tests were computed for all five loci, and minor allele frequencies were indicated. Using the same software, the Wright’s fixation index (Fis) as a measure of heterozygote deficiency or excess was computed, as well as the diversity between populations was assessed by the pairwise fixation index (FST) and the global FST (GST). The sequential Bonferroni correction was applied to correct for the effects of multiple tests. Nei’s genetic distances were calculated by 100 permutations. To analyze the population differentiation in more detail, principal component analysis (PCA) was computed grouping both the buffalo types and using the genetic distances matrix combined with clustering analysis by k-means (K = 3 to K = 7) using Phyton software ver 3.13.1. Silhouette and Davies−Bouldin indices were compared for each k-means and used to establish the best clustering separation. Silhouette score measures the ratio between intra-cluster cohesion and inter-cluster separation for each data point, with values ranging from −1 to 1 (where 1 indicates well-defined clusters and −1 indicates poor clustering). In contrast, the Davies−Bouldin index provides a more global measure by averaging similarity between each cluster and its most similar counterpart; DBI values range from 0 to 1, with lower values indicating better clustering. Phyton software was also used to generate the dendogram by UPGMA (Unweighted Pair Group Method with Arithmetic mean), complete linkage, and Ward’s method cluster analysis (1000 bootstraps) [31].

3. Results and Discussion

The genetic diversity of 11 buffalo populations was estimated taking into consideration 5 loci of economic interest belonging to milk protein (CSN1S1 and CSN3) and fat metabolism in milk and meat (LPL, DGAT1, and SCD). Polymorphism was detected at all the investigated loci, although the degree of variability differed notably among populations.
Overall, considering all populations in their respective groups (river, swamp, and wild type), the allele frequencies cleraly showed differences among between the river and swamp buffaloes (Figure S1). The χ2 test showed extremely low p-values (all p-values: < 0.001), indicating highly significant differences in allele frequencies among the types and a strong genetic differentiation among the types (Table S2). In particular, the χ2 values were very high for the LPL g.129,635,007G>A (352.52; p = 5.00 × 10−75) and DGAT1 g.81,685,203G>A (324.19; p = 6.53 × 10−69), thus suggesting that for these genes, the differences between the buffalo populations are likely more marked than for the others. Principal component analysis did not show a very clear clusterization among the subpopulations, although a separation between river (green ellipse) and swamp (blu ellipse) types was evident, whereas the wild type overlapped both river and swamp buffalo populations (Figure 2).
The results were expected. The literature reports that wild buffalo showed ancestry from both river and swamp types, but a much closer relationship to the latter [32]. On the other hands, it was already observed that putative wild buffalo from Kosi Tappu Wildlife Reserve in Nepal [33] clustered strongly with river-type breeds [11]. More recently, a complete genomic landscape study revealed the phylogenetic position of wild Nepalese water buffaloes as a sister taxon of the river type, and similar results were reported also in other studies [34]. Our PCA is consistent with the latter observation since most of our wild types from Nepal (putative Bubalus arnee) overlapped with river cluster. Furhermore, the presence of similar allele frequencies in populations like NPL and BGD suggest admixture zones where crossbreeding may have occurred, either naturally in overlapping habitats or through human-mediated practices. Therefore, relatively more recent introgression events from domestic river buffalo in Nepal cannot be excluded, despite the geographical isolation of wild types. This is further supported by the evidence of a spatial hybridization zone documented for some isolated indigenous Nepalese populations (Lime and Parkote) that have shared grazing and breeding areas with possibilities of intercrosses [14].
The PC1 explaind 30% of genetic variance, and the PC2 explained 22% of it. From the PCA loading scores values (Table S2), the DGAT1 had the highest absolute loading on PC1, meaning among the considered markers it is the most influential gene in explaining the primary genetic variation among the buffalo types, thus confirming the χ2 observation. DGAT1 is well known to be a QTL for fat deposit in carcasses [24] and fat content in milk [23,25], demonstrated in many dairy breeds. This could mean that fat synthesis pathways vary significantly among buffalo populations. Association studies between SNPs at DGAT1 and milk traits (yield and fat) have been reported in Italian Mediterranean buffalo, Chinese and Anatolian buffalo breeds, and Indian and Brazilian Murrah [35]. This confirms DGAT1 as a target gene for improvement of milk and meat traits in this species. Similarly, CSN3 had the highest absolute loading on the PC2, indicating that this gene plays a dominant role in the secondary genetic variation. CSN3 is a milk protein gene associated with milk quality, coagulation properties, and cheese-making potential [18,20]. In particular, the CSN3 X2 variant in combination with CSN1S1 A allele produced higher curd yields primarily due to increased protein content rather than overall milk yield, along with a higher ratio of real to estimated curd yield (RCY/ECY) [19,20]. Therefore, DGAT1 and CSN3 can be considered as distinguishing factors playing a role in separating the buffalo types. For instance, casein genes were recently identified as key genes expressed in luminal cells of buffalo mammary gland, which were demonstrated to be the key cell types underlying the divergence between the dairy and non-dairy buffaloes. In this context, caseins exhibit significantly higher expressions in the luminal cells of the dairy than non-dairy buffaloes [15].
Allele frequencies specific for each buffalo population are reported in Table 2. The lowest MAF was 0.113 for the SCD g.21,066,603 C allele. A deviation from the HW equilibrium was found in five cases out of 55 calculations (9.09%) (Table 2). This result reflects real population dynamics and can potentially be interpreted due to historical inbreeding (e.g., Indonesia and Romania), small sample sizes (e.g., Vietnam), or selection pressure, as seen at LPL (e.g., south-eastern populations like CHN, IDN, THA, and VNM showed inverted MAF).
On average, low genetic diversity (Ho and He) was found in the IDN, VNM, and CHN populations (Table 3). In details, the VNM population resulted monomorphic for four loci (CSN1S1, CSN3, DGAT1, and SCD) out of five analyzed, and in general, the caseins (CSN1S1 and CSN3) and SCD also did not show polymorphism in the Chinese and Indonesian populations; therefore, their He was zero (Table 2 and Table 3).
Conversely, all other populations were polymorphic for all five loci under investigation, and ITA showed the highest Ho (0.464) and He (0.454) (Table 3).
The reduced heterozygosity or even monomorphism could suggest the influence of a selection pressures or past bottleneck events. While the second hypothesis has been excluded by other authors for a gradual and modest decrease in the microsatellite heterozygosity [10], more likely, the natural selection pressure and a strong geographic differentiation with a lack of gene flow can be a possible reason, especially in the swamp type [10]. For instance, the lack of gene flow is likely in the Indonesian buffaloes (IDN) that, coming from the Sulawesi and Sumbawa Island, is an isolated population characterized by a long period of inbreeding. The data confirms the relatively low mutational load recently found by Si et al. [15] in the Indonesian buffalo population.
In contrast, the Italian (ITA) Mediterranean buffalo population showed higher observed (Ho = 0.464) and expected (He = 0.454) heterozygosity, in a general agreement with data of Colli et al. [12] (Ho = 0.381 and He = 0.385) based on SNP array data, but in contrast with data observed nearly 25 years ago (Ho = 0.135 and He = 0.173) using microsatellites [7].
The lower heterozygosity values found by the latter authors was explained with a loss of biodiversity in the Mediterranean buffaloes as a consequence of the decline in the number and increased inbreeding. However, we have to consider that in the last decades, the systematic application of paternity tests and kinship analysis aiming at more performant selection schemes in Italian buffalo also enabled a more efficient control of the consanguinity with a consequent increase of heterozygosity. Moioli et al., [7] and Colli et al., [12] also analyzed Egyptian buffaloes, and our data align more with data based on the SNP array (Ho = 0.395 and He = 0.400) [12].
The global FST value (GST) was 0.2143 (p = 0.001), which means that about 21.5% of the total genetic variability is due to differences between populations and the remaining part is attributed to differences between individuals. This value is consistent with moderate genetic differentiation seen in other livestock studies [36].
Almost all the pairwise FST values were significant (p = 0.001), confirming a considerable level of genetic differentiation among the buffalo populations (Table 4). The lowest values were observed for the comparisons between NPL and BGD populations (0.0025) and between NPL and PAK populations (0.0067), whereas the highest values (>0.60) were found for comparisons involving EGY and IDN (0.6830) or EGY and CHN (0.6351) populations. Pairwise FST analyses further clarified the genetic landscape. Closely located populations (e.g., those from Nepal, Bangladesh and Pakistan) showed minimal differentiation, while comparisons involving populations such as Egypt and Indonesia revealed markedly higher divergence. These results support the notion that geographic isolation, together with divergent selection pressures, has contributed to the observed genetic structure [11].
In this respect, the Indonesian buffalo population (IDN), coming from the islands of Sulawesi and Sumbawa, has been described as highly inbred and exhibiting higher levels of runs of homozygosity (ROH) compared to other swamp buffalo populations [15].
Analyses of genetic distances are represented by a heatmap reported in Figure 3A. To have a better representation of the genetic differences and structure among the populations, a principal component analysis was carried out with a clustering analysis for k-means values ranging from 2 to 7, where each cross represents a population cluster evidenced by a different color.
PCA analysis evidenced that the first component (PC1) explained almost 90% of variance among buffalo populations, with PC2 accounting for an additional 8.55%. Some populations (e.g., VNM, CHN, and IDN) clustered together, suggesting a strong genetic similarity. Others, such as ROM, were more distant, indicating a greater genetic divergence. ITA appeared to be positioned close to other Asian populations, which could indicate historical genetic influences (Figure S2A). Moving form a 2D representation to a 3D representation and including also a third component (PC3), a better view of genetic variability was obtained.
Structure analysis within the PCA with K = 2 to K = 5 evidenced that populations, although geographically distant, were often clustered together in more heterogeneous groups (Figure S2B–E). Clastering validity metrics by the comparison of the silhouette score (SS) and the Davis−Bouldin index (DBI) showed that K = 6 gave an optimal balance between internal population cohesion (SS = 0.465) and cluster separability (DBI = 0.268); therefore, populations are better represented (Figure 3B).
Instead, K = 7 showed an over-segmentation of data and a worse performance of the SS and DBI (Table S3 and Figure S2F).
The phylogentic analysis was carried out by three different methods (Figure 4 and Figure S3). All dendograms showed consistent results and validated the PCA and k-means (K = 6) analysis, thus confirming the genetic structure and differences among the analysed buffalo populations. In the complex, the genetic diversity observed aligned with the geographic distances separating the different populations from their respective centers of domestication. The high divergence between certain populations (e.g., Egypt vs. Indonesia) and the clustering patterns reinforce the idea of parallel domestication and limited interbreeding between types [12]. A similar observation has also been made using a deeper genomic approach, but with a smaller number of animals, to describe the diversity and the divergence between river and swamp buffaloes [15]. Our data also suggest regional adaptation post-domestication, with selection acting on loci such as DGAT1 and CSN3, which are critical for dairy traits. The consistency of our result confirms that even a few number of SNPs in key genes is useful for structure population studies.
In the UPGMA consensus tree, CHN, VNM, and IDN were always genetically closer and BGD, NPL, and PAK were a joined group, whereas ROM was constantly the most divergent population (Figure 4).
The Romanian buffaloes analyzed in the present study were reared in the Transylvania region (Brașov district) in the area of Șercaia village. They phenotypically exhibited Murrah buffalo traits, especially in the curly horns shape (Figure S4). Although the literature reports that the origin of this buffalo remains uncertain, it is known that, from 1962 to 1990, Murrah buffaloes from India were imported into Bulgaria and spread in Romania to create a new population and upgrade the local buffaloes population after the dramatic decline generated by the Second World War [37].
Currently, the breed is considered as belonging to the Mediterranean type, but it has adapted to colder climates and the regional environment. Therefore, it is indicated as the Mediterranean Carpathian breed [38].
In a recent pilot study on the genetic diversity of German buffaloes, potential relations to herds in other parts of Europe have been investigated, including Romanian buffalo populations (Rom_Serc and Rom_Mera) from Șercaia and Mera villages, respectively [13]. No mixed ancestry was found for Rom_Serc, which was instead separated from the other population located in a former Hungarian village (Mera), now part of Romania. In addition, Rom_Serc has characterized by ancient inbreeding, showing a higher proportion of runs of homozygosity (ROH) compared to the other investigated populations [13]. Therefore, our data confirm the unique genetic position of the ROM population.
Considering the results as a whole, our study confirms that the observed genetic structure of the 11 buffalo populations is the result of a combination of factors including geographic barriers, such as that of island populations (IDN), and environmental adaptation to local agro-ecological conditions (for example, the altitude in Nepal, arid zones in Egypt, or high-humidity in Southeast Asia). Limited gene flow (IDN, VNM, and CHN), structured breeding practices (ITA), and selective pressures for dairy or draft purposes also contributed to differentiate between dairy and non-dairy buffaloes as recently demonstrated by Si et al. [15].

4. Conclusions

The use of only five SNPs may appear as a limitation for the resolution of population structure analyses overlooking rare variants or complex genomic regions under selection. However, the use of a gene-targeted strategy in functionally relevant genes offers insights into selection-driven differentiation, especially for applied breeding contexts, and is also cost-effective. This study provides insights into the genetic diversity and differentiation of 11 buffalo populations based on five loci considered as key genes for milk and meat traits. Significant genetic differentiation was observed between river and swamp buffaloes, with DGAT1 and CSN3 emerging as key markers of genetic variation. Low genetic diversity was noted in specific populations (IDN, VNM, and CHN), likely due to selection pressures, geographic isolation, and unique genetic position for the Romanian population. These findings contribute to the understanding of buffalo genetic diversity and may be useful for exploiting the potential of this species in the light of future breeding and conservation strategies specific for each buffalo type.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16040400/s1, Figure S1: Allele frequencies at the loci for the buffalo populations grouped per type; Figure S2: 2D and 3D representations of the principal component analysis of genetic distances among the 11 buffalo populations with different k-means (from K = 2 to K = 7); Figure S3: Consensus trees showing the phylogenetic relationship among the 11 buffalo populations performed by complete linkage and Ward’s methods; Figure S4: Example of Romanian buffalo. Samples collected at Șercaia—Research and Development Station for Buffalo Breeding (Brașov district, Transylvania region); Table S1: Populations and subpopulations of analyzed buffaloes; Table S2: Chi-square and relative p-values and loading scores of the PCA; Table S3. Clustering validity metrics from K = 2 to K = 7 by the silhouette score (SS) and the Davis−Bouldin index (DBI).

Author Contributions

A.P. conceived, designed and performed the experiments. M.G., C.V. and D.D. performed the experiments. A.P., G.G. and J.S. (Jingfang Si) analyzed the data. A.P. wrote the manuscript. A.P., G.C., A.C. and Y.Z. contributed reagents, materials, and analysis tools. T.H., A.S., I.N., A.K., N.P., A.C., Q.L. and J.S. (Jianghua Shang) contributed to sample collection. G.C., Y.Z., J.S. (Jingfang Si), J.S. (Jianghua Shang) and Q.L. revised the article critically for important intellectual content. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Italian Ministry of University and Research (D.D. 1332) from the project PON01_00486 (Sequenziamento del genoma bufalino per il miglioramento quali-quantitativo delle produzioni agro-alimentari—GENOBU) and by The University of Turin from Miglioramento genetico della bufala Mediterranea Italiana; grant number: PAUA_RIC_N_COMP_21_01).

Institutional Review Board Statement

The animal study protocol was approved by the Ethics Committee of the University of Torino (Protocol code: 0239748; date of approval: 17 May 2022).

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

The authors thank Liliana Di Stasio for her guidance, the buffalo owners over all the countries involved in the sampling and all technicians involved in laboratory activities for their excellent assistance.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
DBIDavies−Bouldin index
MAFminor allele frequency
PCAprincipal component analysis
PCRpolymerase chain reaction
QTLquantitative trait locus
RFLPrestriction fragment length polymorphism
ROHruns of homozygosity
SNPsingle nucleotide polymorphism
SSsilhouette score
UPGMAunweighted pair group method with the arithmetic mean

References

  1. Borghese, A. Buffalo Production and Research. REU Technical Series; Food and Agriculture Organization of the United Nations: Rome, Italy, 2005; pp. 1–315. [Google Scholar]
  2. Zicarelli, L. Buffalo milk: Its properties, dairy yield and mozzarella production. Vet. Res. Commun. 2004, 28, 127. [Google Scholar] [PubMed]
  3. Di Stasio, L.; Brugiapaglia, A. Current knowledge on river buffalo meat: A critical analysis. Animals 2021, 11, 2111. [Google Scholar] [CrossRef] [PubMed]
  4. Barker, J.S.F.; Moore, S.; Hetzel, D.; Evans, D.; Byrne, K.; Tan, S. Genetic diversity of Asian water buffalo (Bubalus bubalis): Microsatellite variation and a comparison with protein-coding loci. Anim. Genet. 1997, 28, 103–115. [Google Scholar] [CrossRef]
  5. Kumar, S.; Nagarajan, M.; Sandhu, J.; Kumar, N.; Behl, V.; Nishanth, G. Mitochondrial DNA analyses of Indian water buffalo support a distinct genetic origin of river and swamp buffalo. Anim. Genet. 2007, 38, 227–232. [Google Scholar] [PubMed]
  6. Kumar, S.; Nagarajan, M.; Sandhu, J.S.; Kumar, N.; Behl, V. Phylogeography and domestication of Indian river buffalo. BMC Evol. Biol. 2007, 7, 1–8. [Google Scholar]
  7. Moioli, B.; Georgoudis, A.; Napolitano, F.; Catillo, G.; Lucioli, S.; Ligda, C.; Boyazoglu, J. Genetic diversity between Italian and Greek buffalo populations. Anim. Genet. Resour. Resour. Génétiques Anim. Recur. Genéticos Anim. 2001, 29, 31–40. [Google Scholar]
  8. Yindee, M.; Vlamings, B.; Wajjwalku, W.; Techakumphu, M.; Lohachit, C.; Sirivaidyapong, S.; Thitaram, C.; Amarasinghe, A.; Alexander, P.; Colenbrander, B. Y-chromosomal variation confirms independent domestications of swamp and river buffalo. Anim. Genet. 2010, 41, 433–435. [Google Scholar]
  9. Zhang, Y.; Colli, L.; Barker, J.S.F. Asian water buffalo: Domestication, history and genetics. Anim. Genet. 2020, 51, 177–191. [Google Scholar]
  10. Zhang, Y.; Lu, Y.; Yindee, M.; Li, K.Y.; Kuo, H.Y.; Ju, Y.T.; Ye, S.; Faruque, M.O.; Li, Q.; Wang, Y. Strong and stable geographic differentiation of swamp buffalo maternal and paternal lineages indicates domestication in the China/Indochina border region. Mol. Ecol. 2016, 25, 1530–1550. [Google Scholar] [CrossRef]
  11. Zhang, Y.; Vankan, D.; Zhang, Y.; Barker, J.S. Genetic differentiation of water buffalo (Bubalus bubalis) populations in China, Nepal and south-east Asia: Inferences on the region of domestication of the swamp buffalo. Anim. Genet. 2011, 42, 366–377. [Google Scholar]
  12. Colli, L.; Milanesi, M.; Vajana, E.; Iamartino, D.; Bomba, L.; Puglisi, F.; Del Corvo, M.; Nicolazzi, E.L.; Ahmed, S.S.; Herrera, J.R. New insights on water buffalo genomic diversity and post-domestication migration routes from medium density SNP chip data. Front. Genet. 2018, 9, 53. [Google Scholar] [CrossRef] [PubMed]
  13. Noce, A.; Qanbari, S.; González-Prendes, R.; Brenmoehl, J.; Luigi-Sierra, M.G.; Theerkorn, M.; Fiege, M.-A.; Pilz, H.; Bota, A.; Vidu, L. Genetic diversity of Bubalus bubalis in Germany and global relations of its genetic background. Front. Genet. 2021, 11, 610353. [Google Scholar]
  14. Dhakal, A.; Si, J.; Sapkota, S.; Pauciullo, A.; Han, J.; Gorkhali, N.A.; Zhao, X.; Zhang, Y. Whole-genome sequencing reveals genetic structure and adaptive genes in Nepalese buffalo breeds. BMC Genom. 2024, 25, 1082. [Google Scholar] [CrossRef] [PubMed]
  15. Si, J.; Dai, D.; Gorkhali, N.A.; Wang, M.; Wang, S.; Sapkota, S.; Kadel, R.C.; Sadaula, A.; Dhakal, A.; Faruque, M.O. Complete Genomic Landscape Reveals Hidden Evolutionary History and Selection Signature in Asian Water Buffaloes (Bubalus bubalis). Adv. Sci. 2025, 12, 2407615. [Google Scholar] [CrossRef]
  16. Wang, S.; Chen, N.; Capodiferro, M.R.; Zhang, T.; Lancioni, H.; Zhang, H.; Miao, Y.; Chanthakhoun, V.; Wanapat, M.; Yindee, M. Whole mitogenomes reveal the history of swamp buffalo: Initially shaped by glacial periods and eventually modelled by domestication. Sci. Rep. 2017, 7, 4708. [Google Scholar] [CrossRef]
  17. Bonfatti, V.; Giantin, M.; Gervaso, M.; Coletta, A.; Dacasto, M.; Carnier, P. Effect of CSN1S1-CSN3 (αS1-κ-casein) composite genotype on milk production traits and milk coagulation properties in Mediterranean water buffalo. J. Dairy Sci. 2012, 95, 3435–3443. [Google Scholar] [CrossRef]
  18. Pauciullo, A.; Gaspa, G.; Zhang, Y.; Liu, Q.; Cosenza, G. CSN1S1, CSN3 and LPL: Three validated gene polymorphisms useful for more sustainable dairy production in the Mediterranean river buffalo. Animals 2024, 14, 1414. [Google Scholar] [CrossRef]
  19. Pauciullo, A.; Martorello, S.; Carku, K.; Versace, C.; Coletta, A.; Cosenza, G. A novel duplex ACRS-PCR for composite CSN1S1–CSN3 genotype discrimination in domestic buffalo. Ital. J. Anim. Sci. 2021, 20, 1264–1269. [Google Scholar] [CrossRef]
  20. Zicarelli, L.; Di Palo, R.; Napolano, R.; Tonhati, H.; De Carlo, E.; Gagliardi, R.; Di Luccia, A.; la Gatta, B. Influence of αS1-casein and κ-casein polymorphism on the curd yield of Italian Mediterranean buffalo (Bubalus bubalis L.) milk. Int. Dairy J. 2020, 100, 104559. [Google Scholar] [CrossRef]
  21. Cosenza, G.; Pauciullo, A.; Macciotta, N.; Apicella, E.; Steri, R.; La Battaglia, A.; Jemma, L.; Coletta, A.; Di Berardino, D.; Ramunno, L. Mediterranean river buffalo CSN1S1 gene: Search for polymorphisms and association studies. Anim. Prod. Sci. 2015, 55, 654–660. [Google Scholar] [CrossRef]
  22. Liu, J.; Wang, Z.; Li, J.; Li, H.; Yang, L. Genome-wide identification of Diacylglycerol Acyltransferases (DGAT) family genes influencing Milk production in Buffalo. BMC Genet. 2020, 21, 1–14. [Google Scholar] [CrossRef] [PubMed]
  23. Schennink, A.; Stoop, W.M.; Visker, M.W.; Heck, J.M.; Bovenhuis, H.; Van Der Poel, J.J.; Van Valenberg, H.J.; Van Arendonk, J.A. DGAT1 underlies large genetic variation in milk-fat composition of dairy cows. Anim. Genet. 2007, 38, 467–473. [Google Scholar] [CrossRef]
  24. Thaller, G.; Kühn, C.; Winter, A.; Ewald, G.; Bellmann, O.; Wegner, J.; Zühlke, H.; Fries, R. DGAT1, a new positional and functional candidate gene for intramuscular fat deposition in cattle. Anim. Genet. 2003, 34, 354–357. [Google Scholar] [CrossRef] [PubMed]
  25. Winter, A.; Krämer, W.; Werner, F.A.; Kollers, S.; Kata, S.; Durstewitz, G.; Buitkamp, J.; Womack, J.E.; Thaller, G.; Fries, R. Association of a lysine-232/alanine polymorphism in a bovine gene encoding acyl-CoA: Diacylglycerol acyltransferase (DGAT1) with variation at a quantitative trait locus for milk fat content. Proc. Natl. Acad. Sci. USA 2002, 99, 9300–9305. [Google Scholar] [CrossRef]
  26. Gu, M.; Cosenza, G.; Iannaccone, M.; Macciotta, N.; Guo, Y.; Di Stasio, L.; Pauciullo, A. The single nucleotide polymorphism g. 133A> C in the stearoyl CoA desaturase gene (SCD) promoter affects gene expression and quali-quantitative properties of river buffalo milk. J. Dairy Sci. 2019, 102, 442–451. [Google Scholar] [CrossRef] [PubMed]
  27. Schennink, A.; Heck, J.M.; Bovenhuis, H.; Visker, M.H.; van Valenberg, H.J.; van Arendonk, J.A. Milk fatty acid unsaturation: Genetic parameters and effects of stearoyl-CoA desaturase (SCD1) and acyl CoA: Diacylglycerol acyltransferase 1 (DGAT1). J. Dairy Sci. 2008, 91, 2135–2143. [Google Scholar] [CrossRef]
  28. Guo, Y.; Wei, Z.; Zhang, Y.; Cao, J. Research progress on the mechanism of milk fat synthesis in cows and the effect of conjugated linoleic acid on milk fat metabolism and its underlying mechanism: A Review. Animals 2024, 14, 204. [Google Scholar] [CrossRef]
  29. Gu, M.; Cosenza, G.; Gaspa, G.; Iannaccone, M.; Macciotta, N.; Chemello, G.; Di Stasio, L.; Pauciullo, A. Sequencing of lipoprotein lipase gene in the Mediterranean river buffalo identified novel variants affecting gene expression. J. Dairy Sci. 2020, 103, 6374–6382. [Google Scholar] [CrossRef]
  30. Gu, M.; Cosenza, G.; Nicolae, I.; Bota, A.; Guo, Y.; Di Stasio, L.; Pauciullo, A. Transcript analysis at DGAT1 reveals different mRNA profiles in river buffaloes with extreme phenotypes for milk fat. J. Dairy Sci. 2017, 100, 8265–8276. [Google Scholar] [CrossRef]
  31. Everitt, B.S.; Landau, S.; Leese, M. Cluster Analysis; Wiley: Hoboken, NJ, USA, 2001. [Google Scholar]
  32. Sarataphan, N.; Narongwanichgarn, W.; Maneerat, S. Phylogenetic analysis of a Thai wild water buffalo (Bubalus arnee) through mitochondrial control region. Int. J. Conserv. Sci. 2017, 8, 105. [Google Scholar]
  33. Flamand, J.; Vankan, D.; Gairhe, K.; Duong, H.; Barker, J. Genetic identification of wild Asian water buffalo in Nepal. In Animal Conservation Forum; Cambridge University Press: New York, NY, USA, 2003; pp. 265–270. [Google Scholar]
  34. Sun, T.; Shen, J.; Achilli, A.; Chen, N.; Chen, Q.; Dang, R.; Zheng, Z.; Zhang, H.; Zhang, X.; Wang, S. Genomic analyses reveal distinct genetic architectures and selective pressures in buffaloes. Gigascience 2020, 9, giz166. [Google Scholar] [CrossRef] [PubMed]
  35. Khan, M.Z.; Ma, Y.; Ma, J.; Xiao, J.; Liu, Y.; Liu, S.; Khan, A.; Khan, I.M.; Cao, Z. Association of DGAT1 with cattle, buffalo, goat, and sheep milk and meat production traits. Front. Vet. Sci. 2021, 8, 712470. [Google Scholar] [CrossRef] [PubMed]
  36. Excoffier, L.; Hofer, T.; Foll, M. Detecting loci under selection in a hierarchically structured population. Heredity 2009, 103, 285–298. [Google Scholar] [CrossRef] [PubMed]
  37. Moioli, B.; Borghese, A. Buffalo breeds and management systems. Buffalo Prod. Res. 2005, 51–76. [Google Scholar]
  38. Borghese, A.; Chiariotti, A.; Barile, V.L. Buffalo in the world: Situation and perspectives. In Biotechnological Applications in Buffalo Research; Springer: New York, NY, USA, 2022; pp. 3–31. [Google Scholar]
Figure 1. Distribution of buffalo populations under analysis.
Figure 1. Distribution of buffalo populations under analysis.
Genes 16 00400 g001
Figure 2. Principal component analysis plot showing the genetic differentiation among buffalo types (river, swamp, and wild). Each point represents an individual, and the ellipses indicate the 95% confidence interval for each group.
Figure 2. Principal component analysis plot showing the genetic differentiation among buffalo types (river, swamp, and wild). Each point represents an individual, and the ellipses indicate the 95% confidence interval for each group.
Genes 16 00400 g002
Figure 3. (A) Heat map of Nei’s genetic distances. (B) 3D representation of the principal component analysis of genetic distances with k-means (K = 6).
Figure 3. (A) Heat map of Nei’s genetic distances. (B) 3D representation of the principal component analysis of genetic distances with k-means (K = 6).
Genes 16 00400 g003
Figure 4. UPGMA consensus tree showing the phylogenetic relationships among the 11 buffalo populations.
Figure 4. UPGMA consensus tree showing the phylogenetic relationships among the 11 buffalo populations.
Genes 16 00400 g004
Table 1. List of SNPs with relative positions (chromosome, genome, and gene) and genotyping methods applied for the buffalo population’s study.
Table 1. List of SNPs with relative positions (chromosome, genome, and gene) and genotyping methods applied for the buffalo population’s study.
Chr.Genome PositionGeneLocationReference Allele 5′→3′MutationGenotyping MethodEndonucleaseReference
3129,635,007LPLExon 1GARFLPDdeIGu et al. [29]
732,148,856CSN1S1Exon 17AGACRSMbo IPauciullo et al. [19]
731,917,000CSN3Exon 4AGACRSHinf IPauciullo et al. [19]
1581,685,203DGAT1Exon 13GARFLPDdeIGu et al. [30]
2321,066,603SCDPromoterCARFLPTaqIGu et al. [26]
Table 2. Allele frequencies, chi-square (χ2), and p-values (PHW) of the Hardy−Weinberg (HW) calculations. The deviation from the HW equilibrium is indicated in bold. Population’s sizes are indicated in brackets.
Table 2. Allele frequencies, chi-square (χ2), and p-values (PHW) of the Hardy−Weinberg (HW) calculations. The deviation from the HW equilibrium is indicated in bold. Population’s sizes are indicated in brackets.
SNPAllelePopulation
BGDCHNEGYIDNIRNITANPLPAKROMTHAVNMAll
(45)(149)(19)(56)(132)(232)(53)(27)(98)(91)(15)(918)
LPL g.129,635,007G>AG0.8220.2250.9470.0960.9430.5670.8680.9440.9540.4230.2330.607
A0.1780.7750.0530.9040.0570.4330.1320.0560.0460.5770.7670.393
χ23.0800.5370.02810.0610.4452.0951.9330.0220.20014.4040.168152.165
PHW0.079250.463330.865770.001510.504480.147790.164410.881490.654080.000150.681320.00000
CSN1S1 g.32,148,856A>GA0.2110.0000.4740.0000.3330.3060.2080.2220.1940.1150.0000.196
G0.7891.0000.5261.0000.6670.6940.7920.7780.8060.8851.0000.804
χ21.042-0.634-0.0302.7680.4550.0032.4260.028-30.015
PHW0.30739-0.42587-0.861540.096150.499660.956990.119320.86715-0.00000
CSN3 g.31,917,000A>GA0.1780.0000.2110.0000.3560.4220.2830.2590.5970.3240.0000.291
G0.8221.0000.7891.0000.6440.5780.7170.7410.4030.6761.0000.709
χ20.111-0.119-2.0310.0380.0090.19913.6612.738-14.925
PHW0.73939-0.72935-0.154050.845150.922610.654970.000220.09795-0.00011
DGAT1 g.81,685,203G>AG0.7110.9900.7110.9910.7080.4330.7260.6670.5150.9121.0000.705
A0.2890.0100.2890.0090.2920.5670.2740.3330.4850.0880.0000.295
χ23.9610.0100.3360.0040.9020.8332.1960.1700.58919.978-58.428
PHW0.046560.919400.561801.000000.342150.361480.138340.679740.442830.00004-0.00000
SCD g.21,066,603C>AC0.0440.0000.0790.0000.1400.2460.1230.1670.1070.0330.0000.113
A0.9561.0000.9211.0000.8600.7540.8770.8330.8930.9671.0000.887
χ20.070-0.091-1.2451.0670.9480.8251.3380.088-158.077
PHW0.79056-0.76322-0.264430.301620.330200.363530.247390.76717-0.00000
Table 3. Average heterozygosity calculated for the five loci and gene diversity parameters per locus and population.
Table 3. Average heterozygosity calculated for the five loci and gene diversity parameters per locus and population.
Average heterozygosity
BGDCHNEGYIDNIRNITANPLPAKROMTHAVNM
Observed0.2440.0700.2740.0210.3200.4640.2940.3410.3550.2350.067
Expected0.2860.0740.3060.0460.3340.4540.3180.3170.3160.2720.074
Expected heterozygosity per locus and population
BGDCHNEGYIDNIRNITANPLPAKROMTHAVNM
LPL g.129,635,007G>A0.2960.3500.1020.1770.1080.4920.2320.1070.0880.4920.371
CSN1S1 g.32,148,856A>G0.3370.0000.5150.0000.4460.4260.3320.3520.3140.2050.000
CSN3 g.31,917,000A>G0.2950.0000.3420.0000.4610.4890.4100.3900.4830.4400.000
DGAT1 g.81,685,203G>A0.4170.0200.4240.0520.4150.4920.4020.4530.5020.1620.000
SCD g.21,066,603C>A0.0860.0000.1490.0000.2420.3710.2170.2820.1920.0640.000
Wright’s fixation index (Fis) as a measure of heterozygote deficiency or excess
BGDCHNEGYIDNIRNITANPLPAKROMTHAVNM
LPL g.129,635,007G>A0.2540.060−0.0290.403−0.056−0.0950.186−0.040−0.0430.3970.103
CSN1S1 g.32,148,856A>G0.148-0.182-0.0150.1090.092−0.0530.156−0.017-
CSN3 g.31,917,000A>G−0.048-0.077-0.1240.013−0.013−0.139−0.374−0.173-
DGAT1 g.81,685,203G>A0.291−0.0070.1310.0000.083−0.0600.2020.019−0.0780.456-
SCD g.21,066,603C>A−0.034-−0.059-−0.096−0.068−0.130−0.182−0.115−0.029-
Table 4. Pairwise FST values (below the diagonal) and related p-values (above the diagonal) among the 11 domestic buffalo populations.
Table 4. Pairwise FST values (below the diagonal) and related p-values (above the diagonal) among the 11 domestic buffalo populations.
BGDCHNEGYIDNIRNITANPLPAKROMTHAVNM
BGD ***NS*********NSNS*********
CHN0.4562 ***NS******************NS
EGY0.03820.6351 ***NS***NSNS*********
IDN0.49120.05620.6830 ******************NS
IRN0.03500.46400.01040.4669 ***NSNS*********
ITA0.09920.33080.11630.31830.0983 ***************
NPL0.00250.47220.03310.48790.00950.0886 NS*********
PAK0.00950.57860.02510.61220.00240.08900.0067 *******
ROM0.12590.57710.13450.56540.06190.09300.08130.0732 ******
THA0.14100.17770.23760.20590.19180.14200.14150.19710.2598 **
VNM0.29410.01940.44950.07850.34620.25210.30570.38820.44400.0999
NS—non significant; * p < 0.001; ** p < 0.0001; *** p < 0.00001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pauciullo, A.; Gaspa, G.; Versace, C.; Cosenza, G.; Piscopo, N.; Gu, M.; Coletta, A.; Hussain, T.; Seidavi, A.; Nicolae, I.; et al. New Insights into Genetic Diversity and Differentiation of 11 Buffalo Populations Using Validated SNPs for Dairy Improvement. Genes 2025, 16, 400. https://doi.org/10.3390/genes16040400

AMA Style

Pauciullo A, Gaspa G, Versace C, Cosenza G, Piscopo N, Gu M, Coletta A, Hussain T, Seidavi A, Nicolae I, et al. New Insights into Genetic Diversity and Differentiation of 11 Buffalo Populations Using Validated SNPs for Dairy Improvement. Genes. 2025; 16(4):400. https://doi.org/10.3390/genes16040400

Chicago/Turabian Style

Pauciullo, Alfredo, Giustino Gaspa, Carmine Versace, Gianfranco Cosenza, Nadia Piscopo, Meichao Gu, Angelo Coletta, Tanveer Hussain, Alireza Seidavi, Ioana Nicolae, and et al. 2025. "New Insights into Genetic Diversity and Differentiation of 11 Buffalo Populations Using Validated SNPs for Dairy Improvement" Genes 16, no. 4: 400. https://doi.org/10.3390/genes16040400

APA Style

Pauciullo, A., Gaspa, G., Versace, C., Cosenza, G., Piscopo, N., Gu, M., Coletta, A., Hussain, T., Seidavi, A., Nicolae, I., Kovitvadhi, A., Liu, Q., Shang, J., Si, J., Dai, D., & Zhang, Y. (2025). New Insights into Genetic Diversity and Differentiation of 11 Buffalo Populations Using Validated SNPs for Dairy Improvement. Genes, 16(4), 400. https://doi.org/10.3390/genes16040400

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop