Next Article in Journal
Correlation Between Fixed-Luminance Flicker Full-Field Electroretinogram Response and Macular Cone Density in Healthy Individuals
Previous Article in Journal
On an RNA-Membrane Protogenome
Previous Article in Special Issue
Comparative Cranial Joint Variation in Three Different Lizards: Impact of Feeding Habit
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of Body Morphometry to Classify Two Colombian Creole Pigs Using Statistical and Machine Learning Methods

by
Arcesio Salamanca-Carreño
1,*,
Mauricio Vélez-Terranova
2,
Pere M. Parés-Casanova
3,
Paula A. Toalombo-Vargas
4,
David E. Rangel-Pachón
1 and
Andrés F. Castillo-Pérez
1
1
Facultad de Medicina Veterinaria y Zootecnia, Universidad Cooperativa de Colombia, Villavicencio 50001, Colombia
2
Facultad de Ciencias Agropecuarias, Universidad Nacional de Colombia, Palmira 763531, Colombia
3
Departamento de Bromatología, Universitat Oberta de Catalunya, 08018 Barcelona, Spain
4
Programa de Zootecnia, Escuela Superior Politécnica de Chimborazo, Riobamba 060150, Ecuador
*
Author to whom correspondence should be addressed.
Life 2025, 15(5), 693; https://doi.org/10.3390/life15050693
Submission received: 21 February 2025 / Revised: 15 April 2025 / Accepted: 23 April 2025 / Published: 24 April 2025
(This article belongs to the Special Issue Feature Papers in Animal Science: 4th Edition)

Abstract

:
Creole pigs (Sus scrofa domestica), descendants of Iberian breeds, possess significant genetic and cultural importance but are under-researched and at risk due to the dominance of improved breeds for commercial production. The aim of this study was to identify the most representative body morphometric measurements for the differentiation of two Creole pig breeds, using statistical and machine learning methods. A sample of “Casco de Mula” (n = 54) and San Pedreño (n = 30) Creole pigs, aged between 2 and 6 months, belonging to seven traditional farms located in the department of Meta (Colombia), was studied. A total of 14 morphometric variables were recorded, as well as the animal’s sex. Four algorithms—linear discriminant analysis, quadratic discriminant analysis, logistic regression, and classification trees—were used to classify the breeds. The results indicated that head width, height at the withers, and right ear length measurements could be used to differentiate the “Casco de Mula” and San Pedreño Creole pigs. The decision tree was the most accurate algorithm (accuracy = 92%, sensitivity = 96%, specificity = 83%, and Matthews correlation coefficient = 0.82), and its performance can be improved by increasing the number of animals. Non-parametric supervised learning methods like decision trees can be used to morphometrically differentiate Creole pigs raised in the same or different environments in order to characterize animal genetic resources.

1. Introduction

Creole pigs (Sus scrofa domestica) are descended from Iberian pigs that were brought by Cristobal Colon on his second voyage (1493), arriving at the island of Hispaniola, to subsequently be introduced to the American continent (1495) [1]. These animals, with possible origin in the Iberian and Celtic breeds [2], served as initiators of the animal industry in different colonies formed in the New World [3,4]. Creole pigs have usually been managed traditionally and mostly with no special facilities, which has made them recognized for their lower productivity compared to improved pigs raised in more technical production systems [1,5]. This genetic resource has been undervalued, and efforts for its conservation and use have been limited [6].
In Latin America, Creole pigs are diverse and genetically similar, with an important cultural heritage. The Creole breeds are highly adaptable to different ecological conditions with a natural capacity to take advantage of the available resources. These characteristics make them a valuable reservoir of genetic variability useful for sustainable exploitation or to be used in crosses with commercial breeds [7]. Adaptive characteristics are especially important, since adverse environmental conditions could lead to morphometric variations within the populations, with distinctive adaptation traits between races, useful for consideration in recovery and preservation programmes [8,9,10].
In Colombia, the San Pedreño (SP) and “Casco de Mula” (CM) Creole pigs are commonly found, especially in small-scale production systems, representing an important source of energy, protein, and income for rural family units [1,11]. Nowadays, these Creole breeds are underutilized and at risk of disappearing due to the preferential use of foreign improved breeds in the national swine industry [12,13]. In the last decade, this situation has motivated increased awareness about the importance of preserving these genetic resources, in order to maintain the tradition and culture of rural families [5]. Both breeds represent a heritage of biodiversity, since they have lived and naturally adapted to different environments and nutritional limitations [11].
The “Casco de Mula” pig is found in the Eastern Plains and the foothills of the Llanos of Colombia (Figure 1a). This breed is medium-sized, with black skin and red or black hair. It has a concave profile, with large ears that fall slightly forward. The legs are strong and short and present syndactyly or union of the toes [4,14]. The San Pedreño pig (Figure 1b) can be found in several regions of Colombia and is characterized by a concave profile, small head, straight and medium-sized ears, and abundant black hair [4].
Racial characterization based on morphometry and morphology is one of the first phases for maintenance, conservation, and knowledge of local zoogenetic resources [15,16]. Animals’ morphological structure and functional qualities define the population characteristics, mark the productive, functional, and ethnological trends of the breeds [5,16,17], and allow phenotypic comparisons between domestic species of different breeds in different environments [17,18].
In recent years, morphometric techniques associated with multivariate statistical analysis have been shown to be useful for the study of animal’s phenotypic characteristics [18,19]. Classical statistical techniques, many of them now used in machine learning (ML) to fit data either for regression or classification analysis, have proven to be efficient for morphometric analysis in different scientific disciplines [19,20,21,22]. White-box supervised multivariate statistical algorithms, such as linear discriminant analysis, quadratic discriminant analysis, logistic regression, and classification trees, are commonly used for binary classification. In the first method, the covariance matrices within groups are assumed to be homogeneous. Even when this assumption is not true, it is still possible to develop linear functions with good classification capabilities or to try to fit quadratic functions to improve the classification task. In the logistic regression and classification trees, greater flexibility can be found, since it is possible to work with non-normal and even categorical characteristics. In animal science, multivariate algorithms are increasingly used to develop models in different areas such as animal nutrition, health, welfare, and reproduction [23]. The algorithms have been used to predict body weight in pigs and foals [23,24], the results of horse racing [25], mastitis infections [26], clinical ketosis [27], heat stress in cows [28], and parturition events in grazing sheep [29]. It can be generally assumed that logistic regression and discriminant analysis are more “statistical” methods, and trees can be placed closer to machine learning. Considering the importance of multivariate statistical methods to adjust high-performance predictive models, the aim of this study was to identify the most representative body morphometric measurements to differentiate two Colombian Creole pig breeds (“Casco de Mula” and San Pedreño) using statistical and machine learning methods.

2. Materials and Methods

2.1. Study Site

This study was carried out in the Meta department, Colombian Orinoquia (latitude: 01°36′29″ and 04°54′24″ N; longitude 71°04′42″ and 74°54′09″ W). The department is located in the tropical zone, with a flat and undulating topography, at an altitude between 200 and 450 m, and annual precipitation ranging between 2700 and 3500 mm. The average ambient temperature and relative humidity were 26 °C and 71%, respectively [30,31].

2.2. Animals Sampled

Body morphometric measurements of 84 Creole pigs (54 “Casco de Mula”-CM and 30 San Pedreño-SP) were taken, including both sexes and with ages between 2 and 6 months. The animals belonged to seven different traditional farms selected for convenience. Data collection lasted 12 months (September 2023 to September 2024).

2.3. Measured Variables

In total, 14 body measurements were recorded, including body weight (BW), thoracic perimeter (TP), body length (BL), height at the withers (HaW), sternum height (SH), knee perimeter (KP), head length (HL), head width (HW), left ear length (LEL), left ear width (LEW), right ear length (REL), right ear width (REW), posterior cannon perimeter (PCP), and hock height (HH). Body measurements were taken with a swine measuring tape and body weight with a digital hanging scale (Penesanio, CO). All measurements were taken by a single person, after training on the anatomical points for measurements.

2.4. Statistical Analysis

The morphometric variables were subjected to descriptive statistics, the Shapiro– Wilk normality test, and Spearman correlations (Sc). Statistical significance was declared at p ≤ 0.05. In highly correlated variables (>0.90), only one was chosen for further analysis to avoid multicollinearity effects. Four white-box multivariate statistical algorithms (linear discriminant analysis—LDA; quadratic discriminant analysis—QDA; logistic regression—LR; and classification trees—CTs) were analysed for the classification of the “Race” target variable (“Casco de Mula” or San Pedreño) based on the corporal morphometric characteristics, including the sex effects.
In the LDA and QDA analyses, variables were transformed (log10 or square root) to ensure an approximately normal distribution, when necessary. Later, they were centred and scaled. In these algorithms, thirteen morphometric characteristics (TP, BL, HaW, SH, KP, HL, HW, LEL, LEW, REL, REW, PCP, and HH) were used to predict the “Race” target variable. In the LR and CT algorithms, the same model mentioned above (with the variables on their original scale) plus the sex variable was used. In the LR, only significant characteristics (p < 0.05) were retained in the final model, while in the CT, the classification and regression tree algorithm (CART) was used, and pruning to reduce model complexity was performed, using the “complexity parameter (CP)” criterion, which imposes a penalty on trees with many branches. Ten CP values were evaluated, and the most accurate was estimated using 10-fold cross-validation.
The models were subjected to leave-one-out cross-validation, and their performance was analysed by constructing confusion matrices and estimating the accuracy (proportion of correct predictions in both races), sensitivity (proportion of CM animals correctly classified among the observed CM sample), specificity (proportion of SP animals correctly classified among the observed SP sample), precision or positive predictive value (proportion of CM animals correctly classified among the predicted CM animals), negative predictive value (NPV; proportion of SP animals correctly classified among the predicted SP animals), model error rate, and the Matthews correlation coefficient to account for the imbalance in the data.
Receiver operating characteristic curves (ROC curves) were created, and the areas under the curve (AUCs) were estimated. With the best classification model, the cut-off point with the highest sensitivity and specificity were obtained. Analyses were performed in Rstudio 4.3.0, using the Mass, Caret, rpart, pRoc, and boot libraries [32].

3. Results

Descriptive statistics of the morphometric variables are shown in Table 1. Most of the characteristics presented an acceptable variation (CV = 11.5 to 25.6%), except for body weight (CV = 73.8% on average for both races) and posterior cannon perimeter in the “Casco de Mula” animals (CV = 43.3%). The other morphometric measurements were similar between breeds. The Shapiro–Wilk test indicated that 11 of the 14 morphometric variables (BW, TP, BL, KP, HL, HW, LEW, REL, REW, PCP, and HH) were not normally distributed (p < 0.05). The Spearman correlation analysis showed that BW and TP characteristics were positively associated (rs = 0.95; p < 0.0001); so, to avoid multicollinearity, only TP was selected for further analyses, since BW was highly variable. In this way, 13 morphometric variables and the sex effect were used for analysis.
Confusion matrices and model performance measurements during the leave-one-out cross-validation are presented in Table 2 and Table 3, respectively. According to the classical confusion matrix parameters, the studied models classified both breeds appropriately, with proportions of correct predictions (accuracy) between 0.79 and 0.92, and errors ranging between 0.08 and 0.21. The model with the best performance was the CT, with a proportion of correct predictions of 92%, and it was able to adequately classify the “Casco de Mula” and San Pedreño pigs with a sensitivity of 96% and a specificity of 83%, respectively. To assess the model’s classification considering the imbalance of the data (CM = 54 and SP 30), the MCC was estimated, which ranged from 0.53 to 0.82 between models, with the CT algorithm performing best.
The ROC curves of the models and their respective AUCs are shown in Figure 2. This graph relates sensitivity vs. 1-specificity and allowed us to analyse the trade-off between correct and incorrect predictions. The areas under the curve (AUCs) were used to summarize the model’s performance during the classification, considering all the possible probability thresholds or cut-offs, and served to evaluate the model’s discriminatory capacity between “Casco de Mula” and San Pedreño animals. The higher the AUC values (>0.80), the better the classification capacity of the model [33]. The results showed that all models presented high classification capacities, with AUC values ranging from 0.836 to 0.911. Among the studied algorithms, the CT model was the best-performing, reaching maximum values of sensitivity and 1-specificity of 0.83 and 0.037, respectively, with a cut-off point of 0.55 (Figure 2).
The resulting classification tree with the ability to discriminate between “Casco de Mula” and San Pedreño Creole pigs, as well as the most important variables to its construction, is shown in Figure 3.
The most important morphometric measurements that contributed to differentiate the Creole pigs were head width (HW; 100%) and height at the withers (HaW; 89.02%). The right ear length (REL) variable had a 45.12% importance and, together with HW and HaW, improved the Creole pig’s classification. Figure 3 shows the description of the classification tree for the “Casco de Mula” and San Pedreño Creole pig breeds.

4. Discussion

Morphometric analysis provides information on the species morphological structure and the factors influencing their functional abilities and ethnological characteristics [34,35]. This information is required to define the species in the face of geographic dispersion and sexual differentiation [36]. In commercial scenarios, morphometric measurements as well as animal productive records constitute the base information to identify populations, races, and animals’ productive potential, in order to efficiently use the porcine genetic resources [37,38].
In the present study, the morphometric measurements that contributed to classifying the “Casco de Mula” and San Pedreño Creole pigs were the head width (HW), height at the withers (HaW), and right ear length (REL). The San Pedreño animals showed greater HW and REL, while the HaW was slightly higher in the “Casco de Mula” animals. These differences can be related to age differences among the sampled animals in both breeds, similar to what was reported in Creole pigs from Ecuador [39], Ch’orti pigs from Guatemala [40], and the Pampa Rocha pig from Uruguay [41]. The head width (HW) and right ear length (REL) are measurements related to ethnological characters and are important in the characterization and comparison between breeds, as well as to establish differences between sexes [42].
In the Casco de Mula and San Pedreño Creole breeds, the head width values were less than those reported in Creole pigs from the province of Loja, Ecuador [39], Ch’ortí pigs from Guatemala [40], Creole pigs from Nicaragua [43], Pampa Rocha pigs from Uruguay [41], Venezuelan Creole pigs [7], Pillareño pigs from Ecuador [44], pigs from Chocó, Colombia [45], and Cuban Creole pigs [46]. However, this cephalic measurement was greater than the value reported for the Araucanian Creole pig [47].
Ear length is a trait that correlates with the animal profile. The “Casco de Mula” and San Pedreño pigs have a concave profile, meaning that their ears are large and at the same time fall to the sides or forward. This contrasts with subconcave profiles, where the ears are medium-sized and point forward in an almost horizontal direction, and ultra-concave profiles, where the ears are small and point upwards [42]. The ear length of the San Pedreño and “Casco de Mula” pigs was shorter than the values reported for other Ibero-American Creole pigs [39,41,44,45], except for the Araucanian Creole pig [47].
The height at the withers is a measure that determines the animal size and is useful to define lines within breeds [48]. This character is almost not influenced by environmental conditions, so it is suitable to delimit differences or similarities between pigs of different origins [49]. Since the height at the withers is not associated with ethnological characters, the observed differences between Creole pigs could be attributed mainly to breeding and artificial selection rather than the ecological conditions of the region they inhabit. The height at the withers of “Casco de Mula” and San Pedreño animals was lower compared to other Ibero-American Creole pigs from Mexico [37,50], Ecuador [39,44], Nicaragua [43], Uruguay [41], Venezuela [7], Colombia [45], and Cuba [46]. This body measurement was higher compared to the Ch’ortí pig from Guatemala [40] and similar to the values reported for the Araucanian Creole pig [47].
According to the accuracies values obtained (0.79 to 0.92), the multivariate statistical models used in the present study classified both Creole pigs appropriately. These accuracies are similar to the 0.77 to 0.96 values reported in other studies, where algorithms such as support vector machine, decision tree, linear discriminant analysis, and artificial neural networks have been successfully used for population classification of carp [51], fruit fly species [52], and mosquito species [53].
The decision tree algorithm was the best classification model with the observed and predicted data. The model showed the lowest error rate, and according to the sensitivity and specificity, 96% and 83% of the CM and SP animals, respectively, were correctly classified in the observed data, while the precision and NPV suggested greater model efficiency with predicted data, since 91% and 93% of the CM and SP animals, respectively, were correctly classified. Theses metrics suggest an optimal performance of the DT model; however, imbalance between categories must be considered. The MCC is useful since it accounts for all the entries of the confusion matrix and provides a robust metric to evaluate the classifiers performances. MCC values close to 1 indicate a perfect classification, 0 is a random classification, and −1 indicates disagreement between the observed and predicted categories [54]. In the study, the highest MCC was also observed in the DT algorithm (0.82), nonetheless with a lower value than the other metrics, indicating that data imbalance influenced the model performance. An MCC of 0.82 is still close to 1, which allows a strong association between the observed and predicted categories to be inferred.
The decision tree algorithm has also been used successfully in a carp classification study [51]. This model is often used in biological data processing for its simplicity, interpretability, and the ability to provide solutions in classification, prediction, and pattern identification processes [51]. This result suggests that in morphometric studies that aim to characterize animal genetic resources, besides the use of traditional statistical methods such as the linear models, machine learning algorithms constitute another alternative to address this issue, providing in some situations more consistent, accurate, interpretable, and automated results than the standard geometric morphometric analysis [55]. The results of the present study constitute one of the first efforts to characterize and differentiate the “Casco de Mula” and San Pedreño Creole pigs, considering that it is difficult to carry out this type of research since, as they are animals at risk, their population is small and difficult to find. It is hoped that further, similar studies will be carried out, hopefully with a larger sample size.

5. Conclusions

This study indicates that “Casco de Mula” and San Pedreño Creole pigs could be differentiated using morphometric measures such as head width, height at the withers, and right ear length measurements. Studies with a larger sample size are required to confirm these results. The decision tree was the most accurate algorithm, and its performance was consistent even with unbalanced data. Non-parametric supervised learning methods like decision trees can be used to morphometrically differentiate Creole pigs raised in the same or different environments, in order to characterize the animals’ genetic resources.

Author Contributions

Writing—preparation of the original manuscript, A.S.-C. and M.V.-T.; methodology and formal analysis, M.V.-T., A.S.-C. and P.M.P.-C.; statistical analysis, M.V.-T.; review—editing, P.M.P.-C., A.F.C.-P., D.E.R.-P. and P.A.T.-V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Research Committee—CONADI (ID3353)—of the Universidad Cooperativa de Colombia.

Institutional Review Board Statement

Bioethical Concept No. BIO394, issued by the Research Bioethics Subcommittee of the Bucaramanga Sectional Universidad Cooperativa de Colombia (Act No. 4, 26 October 2023).

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the second author. The data have not been published because we are going to carry out further analysis.

Acknowledgments

We would like to thank the owners of the Mata de Agua, Vegón, and Casa Blanca farms and the SENA, AGROSAVIA, and UNILLANOS institutions for providing animals for data collection, as well as the students who helped with data collection and the anonymous referees for their valuable comments on the manuscript. All persons included in this section have consented to their acknowledgment.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Benítez, O.W.; Sánchez, M.D.; Los Cerdos Locales en Los Sistemas Tradicionales de Producción. Producción y Sanidad Animal. Estudio FAO 148. 2002. Available online: http://www.fao.org/3/a-y2292s.pdf (accessed on 4 November 2024).
  2. Ramirez, O.; Ojeda, A.; Tomas, A.; Gallardo, D.; Huang, L.S.; Folch, J.M.; Clop, A.; Sánchez, A.; Badaoui, B.; Hanotte, O.; et al. Integrating Y-chromosome, mitochondrial, and autosomal data to analyze the origin of pig breeds. Mol. Biol. Evol. 2009, 26, 2061–2072. [Google Scholar] [CrossRef] [PubMed]
  3. Pinzón, E. El cerdo ibérico, ascentro del porcino criollo. Rev. Carta Ganad. 1993, 30, 18–22. [Google Scholar]
  4. Espinosa, C.; Ly, J. Cerdos criollos colombianos y agricultura sostenible. Rev. Comput. Produc. Porc. 2015, 22, 1–9. [Google Scholar]
  5. Linares, V.; Linares, L.; Mendoza, G. Caracterización etnozootécnica y potencial carnicero de Sus scrofa“cerdo criollo” en Latinoamérica. Sci. Agropec. 2011, 2, 97–110. [Google Scholar] [CrossRef]
  6. Hurtado, E.; Gonzalez, C.; Vecchionacce, H. Los Sistemas de Producción del Cerdo Criollo en Los Estados Llaneros de Venezuela. In Proceedings of the VII Encuentro de Nutrición y Producción de Animales Monogástricos, Merida, Yucatán, México, 2–3 May 2003. [Google Scholar]
  7. Hurtado, E.; González, C.; Vecchionacce, H. Estudio morfológico del cerdo criollo del estado Apure, Venezuela. Zootec. Trop. 2005, 23, 17–25. [Google Scholar]
  8. Cortés, O.; Martínez, A.M.; Cañon, J.; Sevane, N.; Gama, L.T.; Ginja, C.; Landi, V.; Zaragoza, P.; Carolino, N.; Vicente, A.; et al. Conservation priorities of Iberoamerican pig breeds and their ancestors based on microsatellite information. Heredity 2016, 117, 14–24. [Google Scholar] [CrossRef]
  9. Espinosa Pullaguari, J.D. Caracterización Fenotípica del Cerdo Criollo en Los Cantones Zapotillo y Puyangode la Provincia de Loja; Título Médico Veterinario y Zootecnista; Universidad Nacional de Loja: Loja, Ecuador, 2016. [Google Scholar]
  10. Galíndez, R.; Ramis, C.; Martínez Galíndez, R.; Ángulo, L.; Bedoya, A.; de Farías, Y. Variabilidad genética de seis poblaciones del cerdo Criollo venezolano usando marcadores microsatélites. Rev. Fac. Agron. (UCV) 2016, 42, 91–100. [Google Scholar]
  11. Barrera, G.P.; Martínez, R.A.; Ortegón, J.; Ortiz, A.; Moreno, F.; Velázquez, H.; Pérez, J.E.A.Y. Cerdos Criollos Colombianos, Caracterización Racial, Productiva y Genética; CORPOICA. 2007. Available online: https://repositorio.minciencias.gov.co/entities/publication/5b28e29e-f471-4b2c-90a6-6e56343647c6 (accessed on 23 April 2025).
  12. Ocampo-Gallego, R.J.; Tobón-Castaño, J.A.; Martínez-Oquendo, P.Y.; Ramírez-Toro, J.; Lucero-Casanova, C.E. Análisis de diversidad genética en cerdo criollo san pedreño utilizando datos de pedigrí. Ecosist. Rec. Agropec. 2019, 6, 333–341. [Google Scholar] [CrossRef]
  13. Food and Agriculture Organization of the United Nations (FAO). Domestic Animal Diversity Information System (DAD-IS). Breed Data Sheet. Colombia, Pig. Available online: https://www.fao.org/dad-is/browse-by-country-and-species/en/ (accessed on 4 November 2024).
  14. Poveda, H.C.A.; Moncada Bueno, A. Cerdo Casco de Mula; Los Animales Domésticos Criollos y Colombianos en la Producción Pecuaria Nacional; ICA: Bogotá, Colombia, 2001. [Google Scholar]
  15. Food and Agriculture Organization of the United Nations (FAO). Phenotypic Characterization of Animal Genetic Resources. Animal Production and Health Guidelines No. 11. Rome. 2012. Available online: https://www.fao.org/4/i2686e/i2686e00.htm (accessed on 23 April 2025).
  16. Parés i Casnaova, P.M. Zoometría. In Valoración Morfológica de Los Animales Domésticos. Ministerio de Medio Ambiente y Medio Rural y Marino; Sañudo Astiz, C., Ed.; Artegraf, Industrias Graficas S.A: Madrid, Spain, 2009; pp. 167–198. [Google Scholar]
  17. Cattáneo, A.; Trigo, M.; Arias, R.; Antonini, A. Uso de Indices Zoométricos en un Hato Caprino Criollo Cruza Como HerramCienta Para evaluar Biotipos Según Categoría. In XIV Jornadas de Divulgación Técnico-Científicas; Jornada Latinoamericana; Universidad Nacional de Rosario: Rosario, Argentina, 2013. [Google Scholar]
  18. Salako, A. Application of morphological indices in the assessment of type and function in Sheep. Int. J. Morphol. 2006, 24, 13–18. [Google Scholar] [CrossRef]
  19. Domínguez-García, A.C.; Moclán, A.; Stoetzel, E.; Cucchi, T.; Sevilla, P.; Laplana, C. Morfometría Geométrica y Aprendizaje Automático Aplicados a la Diferenciación de Especies de Roedores del Género Mus Linnaeus, 1758. In Proceedings of the XXXVIII Jornadas de la Sociedad Española de Paleontología, Valencia, Spain, 4–7 October 2023; p. 70. [Google Scholar]
  20. Jordan, M.I.; Mitchell, T.M. Machine learning: Trends, perspectives, and prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef]
  21. Verma, V.K.; Verma, S. Machine learning applications in healthcare sector: An overview. Mater. Today Proc. 2022, 57, 2144–2147. [Google Scholar] [CrossRef]
  22. Aracena, C.; Villena, F.; Arias, F.; Dunstan, J. Aplicaciones de aprendizaje automático en salud. Rev. Med. Clin. Las Condes 2022, 33, 568–575. [Google Scholar] [CrossRef]
  23. Satola, A.; Łuszczyński, J.; Petrych, W.; Satoła, K. Body Weight Prediction from Linear Measurements of Icelandic Foals: A Machine Learning Approach. Animals 2022, 12, 1234. [Google Scholar] [CrossRef] [PubMed]
  24. He, Y.; Tiezzi, T.; Howard, J.; Maltecca, C. Predicting body weight in growing pigs from feeding behavior data using machine learning algorithms. Comput. Electron. Agric. 2021, 184, e106085. [Google Scholar] [CrossRef]
  25. Borowski, P.; Chlebus, M. Machine Learning in the Prediction of Flat Horse Racing Results in Poland; Working Papers N° 13/2021 (361); University of Warsaw: Warszawa, Poland, 2021. [Google Scholar]
  26. Abdul Ghafoor, N.; Sitkowska, B. MasPA: A machine learning application to predict risk of mastitis in cattle from AMS sensor data. AgriEngineering 2021, 3, 575–583. [Google Scholar] [CrossRef]
  27. Satoła, A.; Bauer, E.A. Predicting subclinical ketosis in dairy cows using machine learning techniques. Animals 2021, 11, 2131. [Google Scholar] [CrossRef]
  28. Becker, C.A.; Aghalari, A.; Marufuzzaman, M.; Stone, A.E. Predicting dairy cattle heat stress using machine learning techniques. J. Dairy Sci. 2021, 104, 505–524. [Google Scholar] [CrossRef]
  29. Fogarty, E.S.; Swain, D.L.; Cronin, G.M.; Moraes, L.E.; Bailey, D.W.; Trotter, M. Developing a simulated online model that integrates GNSS, accelerometer and weather data to detect parturition events in grazing sheep: A machine learning approach. Animals 2021, 11, 303. [Google Scholar] [CrossRef]
  30. Climate Data. Clima: Meta, Colombia. Available online: https://es.climate-data.org/america-del-sur/colombia/meta-77/ (accessed on 24 November 2024).
  31. Rincón Castillo, A.; Bueno Guzmán, G.A.; Álvarez de León, M.; Pardo Barbosa, O.; Pérez López, O.; Caicedo Guerrero, S. Establecimiento, Manejo y Utilización de Recursos Forrajeros en Sistemas Ganaderos de Suelos Ácidos; Manual Técnico No 15; Produmedios: Bogotá, Colombia, 2010. [Google Scholar]
  32. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2024; Available online: https://www.R-project.org (accessed on 27 December 2024).
  33. Cerda, J.; Cifuentes, L. Uso de curvas ROC en investigación clínica: Aspectos teórico-prácticos. Rev. Chil. Infectol. 2012, 29, 138–141. [Google Scholar] [CrossRef]
  34. Sañudo Astiz, C. Valoración morfologica de los animales domésticos. In Ministerio de Medio Ambiente y Medio Rural y Marino; Artegraf, Industrias Graficas S.A: Madrid, Spain, 2009; p. 865. [Google Scholar]
  35. Francesch, A.; Villalba, I.; Cartañà, M. Methodology for morphological characterization of chicken and its application to compare Penedesenca and Empordanesa breeds. Anim. Genet. 2011, 48, 79–84. [Google Scholar] [CrossRef]
  36. Benítez, H.A.; Püschel, T.A. Modelando la varianza de la forma: Morfometría Geométrica aplicaciones en Biología Evolutiva. Int. J. Morphol. 2014, 32, 998–1008. [Google Scholar] [CrossRef]
  37. Martínez Velázquez, G.; Román Ponce, S.I.; Vélez Izquierdo, A.; Cabrera Torres, E.; Cantú Covarrubias, A.; Colín, L.D.C.; Durán Aguilar, M.; Maldonado Jaquez, J.A.; Martínez Silva, F.E.; Ríos Utrera, A.; et al. Morfometría del cerdo de traspatio en áreas rurales de México. Rev. Mex. Cienc. Pecu. 2016, 7, 431–440. [Google Scholar] [CrossRef]
  38. Montenegro, M.; Castro, G.; Llambí, S.; Barlocco, N.; Vadell, A.; Delgado, J.V.; Barba, C. Comparación de medidas zoométricas entre el cerdo Pampa Rocha y variedades oficialmente reconocidas del cerdo ibérico. Actas Iberoam. Conserv. Anim. 2014, 4, 86–88. [Google Scholar]
  39. Esmeraldas, R.D.; Armijos, J.R.; Maldonado, D.F.; Delgado, J.V.; Navas, F.J.; Toalombo, P.A. Comparación de las variables morfológicas de cerdos criollos en la provincia de Chimborazo y Loja. Arch. Zootec. 2023, 72, 236–243. [Google Scholar] [CrossRef]
  40. Jáuregui, R.; Parés-Casanova, P.M.; Lorenzo, C.R. Ausencia de dimorfismo sexual y biotipos en el cerdo criollo de la región de Ch’ortí de Guatemala. Actas Iberoam. Conserv. Anim. 2019, 14, 32–36. [Google Scholar]
  41. Castro, G.; Montenegro, M.; Barlocco, N.; Vadell, A.; Gagliardi, R.; Llambi, S. Caracterización zoométrica en el cerdo Pampa Rocha de Uruguay (Descriptiva primaria). Actas Iberoam. Conserv. Anim. 2012, 2, 83–86. [Google Scholar]
  42. Rodero Serrano, E.; González Martínez, A. Las regiones corporales de los animales domésticos. In Valoración Morfológica de los Animales Domésticos. Ministerio de Medio Ambiente y Medio Rural y Marino; Sañudo Astiz, C., Ed.; Artegraf, Industrias Graficas S.A: Madrid, Spain, 2009; pp. 102–137. [Google Scholar]
  43. Hernández-Baca, M.; Gámez-Rivas, A.; Zeledón-Ortega, Y. Caracterización morfológica del cerdo criollo (Sus scrofa domesticus) en el municipio de Nueva Guinea, RACCS. La Calera-Cienc. Anim. 2017, 17, 21–27. [Google Scholar] [CrossRef]
  44. Olmedo, W.J.; Toalombo, P.A.; Flores, L.G.; Delgado, J.V.; Navas, F.J.; Duchi, N.A. Caracterización morfológica del cerdo criollo Pillareño del cantón Guamote de Ecuador. Arch. Zootec. 2021, 70, 160–170. [Google Scholar] [CrossRef]
  45. Arredondo Botero, J.V.; Muñoz Florez, J.E.; Arenas, L.E.; Mosquera, M.; Pacheco, E.; Álvarez, L.Á. Morfometría del cerdo criollo del Pacífico Colombiano. Arch. Latinoam. Produc. Anim. 2021, 29, 215–223. [Google Scholar]
  46. Reyes, Z.; Yusimy, C.; Robert, R.; Sánchez, N.; Santana, I. Estudio morfométrico de una muestra poblacional de cerdos criollo cubano. Rev. Comput. Produc. Porc. 2018, 25, 79–85. [Google Scholar]
  47. Salamanca-Carreño, A.; Parés-Casanova, P.M.; Vélez Terranova, O.M.; Castro Rosa, G.; Jáuregui, R. Valoración morfométrica de una población de cerdos criollos araucanos (Colombia). Rev. Investig. Vet. Peru 2022, 33, e23794. [Google Scholar] [CrossRef]
  48. Herrera, M.; Luque, M. Morfoestructura y sistemas para el futuro en la valoración morfológica. In Valoración Morfológica de los Animales Domésticos. Ministerio de Medio Ambiente y Medio Rural y Marino; Sañudo, C., Ed.; Artegraf, Industrias Graficas S.A: Madrid, Spain, 2009; pp. 83–101. [Google Scholar]
  49. Revidatti, M.A. Caracterización de Cerdos Criollos del Nordeste Argentino. Ph.D. Thesis, Universidad de Córdoba, Córdoba, Spain, 2009. [Google Scholar]
  50. Sierra-Vásquez, A.C.; Ortiz-Ortiz, J.R.; Bojórquez-Cat, J.C.; Canul-Solís, M.A.; Tamayo-Canul, J.R.; Rodríguez-Pérez, J.C.; Sanginés-García, J.R.; Magaña-Magaña, M.A.; Montes-Pérez, R.C.; Segura-Correa, J.C. Conservación y uso sustentable del cerdo pelón en Yucatán. Quehacer Cient. Chiapas 2016, 11, 13–28. [Google Scholar]
  51. Jafari, O.; Ebrahimi, M.; Hedayati, S.A.-A.; Zeinalabedini, M.; Poorbagher, H.; Nasrolahpourmoghadam, M.; Fernandes, J.M.O. Integration of Morphometrics and Machine Learning Enables Accurate Distinction between Wild and Farmed Common Carp. Life 2022, 12, 957. [Google Scholar] [CrossRef] [PubMed]
  52. Salifu, D.; Ibrahim, E.A.; Tonnang, H.E.Z. Leveraging machine learning tools and algorithms for analysis of fruit fly morphometrics. Sci. Rep. 2022, 12, 7208. [Google Scholar] [CrossRef]
  53. Bellin, N.; Calzolari, M.; Callegari, E.; Bonilauri, P.; Grisendi, A.; Dottori, M.; Rossi, V. Geometric morphometrics and machine learning as tools for the identification of sibling mosquito species of the Maculipennis complex (Anopheles). Infect. Genet. Evol. 2021, 95, e105034. [Google Scholar] [CrossRef]
  54. Chicco, D.; Jurman, G. The Matthews correlation coeffi-cient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification. BioData Min. 2023, 16, 4. [Google Scholar] [CrossRef]
  55. Macleod, N. On the Use of Machine Learning in Morphometric Analysis. Biol. Shape Anal. 2017, 134–171. [Google Scholar] [CrossRef]
Figure 1. Colombian Creole pigs. (a) Casco de Mula; photography: CI Agrosavia. (b) San Pedreño; photography: Casa Blanca Farm.
Figure 1. Colombian Creole pigs. (a) Casco de Mula; photography: CI Agrosavia. (b) San Pedreño; photography: Casa Blanca Farm.
Life 15 00693 g001
Figure 2. The ROC curve and area under the curve (AUC) of the studied models. LDA: linear discriminant analysis; QDA: quadratic lineal discriminant analysis; LR: logistic regression; CT: classification tree.
Figure 2. The ROC curve and area under the curve (AUC) of the studied models. LDA: linear discriminant analysis; QDA: quadratic lineal discriminant analysis; LR: logistic regression; CT: classification tree.
Life 15 00693 g002
Figure 3. (a) Final classification tree. Information within nodes from top to bottom corresponds to predictions, predicted probability of SP breed, and percentage of observation in the node, respectively. (b) Variable importance graph. CM: “Casco de Mula”; SP: San Pedreño; TP: thoracic perimeter; BL: body length; HaW: height at withers; SH: sternum height; KP: knee perimeter; HL: head length; HW: head width; LEL: left ear length; LEW: left ear width; REL: right ear length; REW: right ear width; PCP: posterior cannon perimeter; HH: hock height.
Figure 3. (a) Final classification tree. Information within nodes from top to bottom corresponds to predictions, predicted probability of SP breed, and percentage of observation in the node, respectively. (b) Variable importance graph. CM: “Casco de Mula”; SP: San Pedreño; TP: thoracic perimeter; BL: body length; HaW: height at withers; SH: sternum height; KP: knee perimeter; HL: head length; HW: head width; LEL: left ear length; LEW: left ear width; REL: right ear length; REW: right ear width; PCP: posterior cannon perimeter; HH: hock height.
Life 15 00693 g003
Table 1. Descriptive statistics of body morphometry (cm) from “Casco de Mula” and San Pedreño Creole pigs. Body weight is given in kg.
Table 1. Descriptive statistics of body morphometry (cm) from “Casco de Mula” and San Pedreño Creole pigs. Body weight is given in kg.
Creole PigVariableMeanSDCV (%)MinimumMaximum
“Casco de Mula”BW (kg)30.421.972.07.1150.0
TP (cm)69.414.621.044.0131.0
BL (cm)58.211.920.438.0120.0
HaW (cm)48.68.717.931.078.0
SH (cm)24.84.518.312.034.0
KP (cm)15.02.818.99.025.0
HL (cm)21.63.817.515.040.0
HW (cm)8.31.922.45.517.0
LEL (cm)15.12.617.49.922.0
LEW (cm)10.62.019.07.019.0
REL (cm)15.12.516.610.023.0
REW (cm)10.51.918.17.016.0
PCP (cm)16.06.943.310.062.0
HH (cm)19.42.915.015.029.0
San PedreñoBW (kg)40.830.875.614.7125.0
TP (cm)75.117.323.057.0120.0
BL (cm)63.113.521.441.091.0
HaW (cm)47.212.025.414.068.0
SH (cm)22.05.625.612.533.0
KP (cm)18.34.323.413.031.0
HL (cm)23.54.418.717.033.0
HW (cm)9.91.313.07.013.0
LEL (cm)15.93.219.810.022.0
LEW (cm)11.41.715.09.017.0
REL (cm)16.03.320.911.023.0
REW (cm)11.61.714.79.017.0
PCP (cm)17.73.821.613.028.0
HH (cm)19.72.311.516.023.0
SD: standard deviation; CV: coefficient of variation; BW: body weight; TP: thoracic perimeter; BL: body length; HaW: height at withers; SH: sternum height; KP: knee perimeter; HL: head length; HW: head width; LEL: left ear length; LEW: left ear width; REL: right ear length; REW: right ear width; PCP: posterior cannon perimeter; HH: hock height.
Table 2. Confusion matrices obtained with the studied models during the leave-one-out cross-validation.
Table 2. Confusion matrices obtained with the studied models during the leave-one-out cross-validation.
LDAObservedLRObserved
PredictedCMSPPredictedCMSP
CM467CM4810
SP823SP620
QDAObservedCTObserved
PredictedCMSPPredictedCMSP
CM459CM525
SP921SP225
CM: “Casco de Mula”; SP: San Pedreño; LDA: linear discriminant analysis; LR: logistic regression; QDA: quadratic discriminant analysis; CT: classification tree.
Table 3. Performance measurements of the studied models during the leave-one-out cross-validation.
Table 3. Performance measurements of the studied models during the leave-one-out cross-validation.
ModelLDAQDALRCT
Accuracy0.820.790.810.92
Sensitivity0.850.830.890.96
Specificity0.770.700.670.83
Precision0.870.830.830.91
NPV0.740.70.770.93
MCC0.610.530.580.82
Error rate0.180.210.190.08
LDA: linear discriminant analysis; QDA: quadratic discriminant analysis; LR: logistic regression; CT: classification tree; MCC: Matthews correlation coefficient. NPV: Negative predictive value.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Salamanca-Carreño, A.; Vélez-Terranova, M.; Parés-Casanova, P.M.; Toalombo-Vargas, P.A.; Rangel-Pachón, D.E.; Castillo-Pérez, A.F. Assessment of Body Morphometry to Classify Two Colombian Creole Pigs Using Statistical and Machine Learning Methods. Life 2025, 15, 693. https://doi.org/10.3390/life15050693

AMA Style

Salamanca-Carreño A, Vélez-Terranova M, Parés-Casanova PM, Toalombo-Vargas PA, Rangel-Pachón DE, Castillo-Pérez AF. Assessment of Body Morphometry to Classify Two Colombian Creole Pigs Using Statistical and Machine Learning Methods. Life. 2025; 15(5):693. https://doi.org/10.3390/life15050693

Chicago/Turabian Style

Salamanca-Carreño, Arcesio, Mauricio Vélez-Terranova, Pere M. Parés-Casanova, Paula A. Toalombo-Vargas, David E. Rangel-Pachón, and Andrés F. Castillo-Pérez. 2025. "Assessment of Body Morphometry to Classify Two Colombian Creole Pigs Using Statistical and Machine Learning Methods" Life 15, no. 5: 693. https://doi.org/10.3390/life15050693

APA Style

Salamanca-Carreño, A., Vélez-Terranova, M., Parés-Casanova, P. M., Toalombo-Vargas, P. A., Rangel-Pachón, D. E., & Castillo-Pérez, A. F. (2025). Assessment of Body Morphometry to Classify Two Colombian Creole Pigs Using Statistical and Machine Learning Methods. Life, 15(5), 693. https://doi.org/10.3390/life15050693

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop