A Machine Learning-Based Method for Pig Weight Estimation and the PIGRGB-Weight Dataset

Ji, Xintong; Li, Qifeng; Guo, Kaijun; Ma, Weihong; Li, Mingyu; Xu, Zhankang; Yang, Simon X.; Ren, Zhiyu

doi:10.3390/agriculture15080814


by Xintong Ji ^1,2, Qifeng Li ^2,3, Kaijun Guo ^1,*, Weihong Ma ^2,3,*, Mingyu Li ^2, Zhankang Xu ^2, Simon X. Yang ^4 and Zhiyu Ren ^2

^1 College of Animal Science and Technology, Beijing University of Agriculture, Beijing 100096, China

^2 Information Technology Research Centre, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China

^3 National Innovation Centre of Digital Technology in Animal Husbandry, Beijing 100097, China

^4 Advanced Robotics and Intelligent Systems Laboratory, School of Engineering, University of Guelph, Guelph, ON N1G 2W1, Canada

^* Authors to whom correspondence should be addressed.

Agriculture 2025, 15(8), 814; https://doi.org/10.3390/agriculture15080814

Submission received: 24 February 2025 / Revised: 3 April 2025 / Accepted: 4 April 2025 / Published: 9 April 2025

(This article belongs to the Section Farm Animal Production)


Abstract

Traditional pig weighing methods are costly, require driving pigs onto electronic scales, and cannot collect real-time data without interference. Pig weight estimation using deep learning often demands significant computational resources and lacks real-time capabilities, highlighting the need for a more efficient method. To overcome these challenges, this study proposes a machine learning-based approach for real-time pig weight estimation by extracting image features. The method reduces computational demands while maintaining high accuracy. The SAM2-Pig model is employed for instance segmentation of pig RGB images to extract features such as relative projection area, body length, and body width, which are crucial for accurate weight prediction. Regression models are then used to predict pig weight from the extracted features, with the BPNN trained with Trainlm achieving the best performance in our experiments. This study demonstrates that machine learning methods using RGB image features provide accurate and adaptable results, offering a viable solution for real-time pig weight estimation. This study also publicly releases the PIGRGB-Weight dataset, consisting of 9579 RGB images of pigs in a free-moving state, annotated with weight information, enabling future research and model testing. The method demonstrates remarkable stability, low computational demand, and practical applicability, making it a lightweight and effective approach for estimating pig weight in real time.

Keywords: machine learning; pig weight estimation; pig dataset

1. Introduction

The pig farming and pork industry is a crucial component of the global food supply chain, with profound impacts on the economy, society, and the environment [1]. Pig weight measurement is essential in livestock management, as it reflects growth conditions, helps optimize feeding strategies [2], and is closely related to feeding costs and feed conversion efficiency, thereby aiding in cost reduction. Weight data support scientific market-out planning, ensuring optimal market weight to maximize economic benefits [3]. Additionally, weight measurement helps in the timely identification of potential health risks, facilitating effective preventive measures. The application of intelligent weight measurement systems allows farms to record weight data in real time, improving decision-making accuracy and farming efficiency [4]. Timely and accurate monitoring of pig weight throughout the farming process is essential for effective farm management.

Traditional pig weight monitoring methods typically employ mechanical scales or weighing cages [5], which involve direct contact and can easily induce stress responses. The main types of stress include group stress, behavioral stress, and physiological stress. These stresses not only affect measurement accuracy but can also harm pig health, production performance, and economic returns [6]. In contrast with traditional weighing methods, the study of modern weight estimation techniques in animal husbandry has made significant advancements through various technological means [7,8]. Compared to traditional contact-based weighing methods, these non-contact technologies not only reduce stress responses but also avoid high costs and operational inconvenience. For example, Pezzuolo et al. [9] used the Kinect v1 depth camera for non-contact pig weight estimation and validated the effectiveness of the Kinect sensor under various lighting conditions; Hansen et al. [10] utilized 3D imaging technology to capture cattle images and were able to extract data on weight and body condition unconstrained; Zhang et al. [11] achieved rapid and accurate weight estimation by extracting features of pig height, body shape, and contour using a regression-based CNN model, with R² values ranging from 0.9879 to 0.9973. Furthermore, He et al. [12] proposed a method for predicting pig weight based on depth images, using an improved BotNet regression network and a series of preprocessing algorithms, achieving an MAE value of 6.37 kg on 5326 test images. Although deep learning methods perform well in terms of accuracy, they often require high-performance computational resources. The adoption of lightweight machine learning for edge computing weight estimation is, therefore, a very important approach.

RGB images, as a low-cost and easily accessible data source, can provide rich information for tasks such as object detection, pose estimation, and 3D Gaussian generation. For instance, Liu et al. [13] proposed an image segmentation-based point cloud generation method that can produce high-quality point cloud data from RGB images, suitable for the 3D modeling of complex objects such as animals. Shi et al. [14,15] developed a 3D surface reconstruction and body size measurement system based on multi-view RGB-D cameras. Liu et al. [16] introduced a partial convolution-based image completion method that can handle irregular missing regions, making it applicable for local repairs of animal body images. Current systems often face challenges such as immobile equipment, high costs, and a lack of open datasets, making it difficult for research to scale. For instance, Reza et al. [17] used RGB images for pig posture monitoring and early detection of welfare issues, optimizing farm management practices. Jorquera-Chavez et al. [18] applied computer vision techniques for continuous animal monitoring and rapid detection of physiological changes associated with market-weight pigs, providing crucial support for the development of intelligent farming systems. Hou et al. [19] employed the PointNet++ deep learning model to estimate cattle weight using LiDAR-acquired 3D point cloud data, achieving 95.1% accuracy with an RMSE of 10.2 kg. While highly precise, the method’s high computational cost and complex data processing limit its practical application. This highlights the necessity of lightweight, low-cost machine learning approaches for weight estimation. Although these studies are valuable, they often rely on complex setups that are difficult to adapt to small-scale operations. Moreover, there has been limited focus on integrating image feature extraction methods to improve the accuracy of weight estimation. 
Therefore, to overcome the limitations of existing technologies regarding immobile equipment and high costs, we propose a self-developed fixed-view lifting acquisition trolley that is not only lightweight and portable but also effectively integrates with weight estimation software, addressing the issue of fixed, immobile devices. This innovative solution enables real-time and flexible pig weight monitoring, offering greater practicality and scalability.

This study utilizes features extracted from the segmented mask images of pig RGB images (such as the relative projection area (SR), contour length (LC), body length (BL), body width (BW), and eccentricity (E)) to predict live weight. Compared to existing technologies, the machine learning methods employed not only improve the accuracy of weight estimation but also overcome the limitations of deep learning methods, which require high computational resources. Through this study, we have identified optimal machine learning models for estimating pig weight in a free-moving state. Additionally, we share a dataset of RGB images of pigs in a free-moving state captured at a fixed height, along with corresponding weight information.

Main Contributions:

A machine learning-based method for weight prediction using image feature extraction, integrated into a lightweight and portable acquisition system, that significantly improves prediction accuracy through hyperparameter optimization and model configuration selection. This innovative approach enables real-time, on-site weight estimation without the need for large-scale, fixed equipment, providing a more flexible and cost-effective solution for practical farm management.

This research provides a publicly available RGB image and weight dataset for researchers in the fields of computer vision and precision livestock farming. The pig images in the dataset were captured in a natural standing position and daily living conditions, ensuring data authenticity and representativeness [20].

2. Materials and Methods

The overall weight estimation process of the method is illustrated in Figure 1. Using collected pig body measurements, weight, and back image data from the farm, weight estimation tests were conducted. The collected data were preprocessed and then divided to create the required dataset. Next, manually segmented image data were used for transfer learning [21], where the SAM2 [22] model, an instance segmentation algorithm based on the Segment Anything Model (SAM), generates a large number of mask images for weight estimation. SAM2 enables accurate object segmentation without requiring extensive labeled training data, making it well-suited for our pig weight estimation task. After removing image portions that could affect the experimental results, image feature extraction was performed, followed by weight estimation testing using machine learning regression models. Finally, the model was evaluated.

The specific information regarding data collection is shown in Table 1.

The specific process of data collection is as follows: First, the pigs are driven out of the group pens and guided into a cage scale for weight measurement and recording. Then, a tape measure is used to collect and record the body measurements. Next, the pigs are driven into an empty pen where the image collection equipment is located, and the collection vehicle is adjusted to the appropriate position, ensuring that the camera is perpendicular to the pig’s back before starting data collection. During the collection process, the pigs are followed as they move freely, and data are continuously collected until the target number is reached. Finally, the pigs are driven back to the group pen.

2.1. Dataset Construction

2.1.1. Data Acquisition

Body Scale Data Acquisition

The main body measurement parameters for pigs include body length (Length), shoulder width (shoulder), withers height (Height), chest girth (Chest), abdominal girth (Abdominal), hip girth (Hip), and cannon bone circumference (Cannon Bone), as detailed in Table 2. Measuring personnel used a tape measure to collect data on the pigs, with the measurement schematic shown in Figure 2. Each pig was measured twice for body dimensions, and the average value was recorded.

Image Data Acquisition

The image collection equipment is a self-developed, fixed-view lifting acquisition trolley designed by the team (Figure 3). It carries the main controller, the depth camera, and a mobile power supply that powers both the camera and the terminal simultaneously, so that data collection and weight estimation can be performed on a single platform. The device is constructed using aluminum profiles and features a crane-like structure for camera installation. The camera is the Orbbec Femto Bolt, and the terminal is a Microsoft Surface Book 2 with an Intel Core i7-8650U CPU. The camera is mounted parallel to the ground, and the distance between its lens and the ground is kept fixed during acquisition. This equipment enhances the comprehensiveness and convenience of data collection, reduces pig stress, and provides better space efficiency and operational flexibility. It enables the tracking of pig behavior during free movement while maintaining a safe distance, facilitating efficient data collection. This functionality enriches the diversity of pig morphology in the dataset and minimizes the negative impact of stress on the growth of multiple pigs.

2.1.2. Data Pre-Processing

The camera operates in the WFOV 2X2BINNED mode, which effectively enhances the sensor’s signal strength, improves imaging quality under low-light conditions, and reduces image noise (Figure 4). It has a wide field of view of 120° × 120°, capable of effective measurements within a range of 0.25 m to 2.88 m. The system captures images at a rate of 30 frames per second, with the RGB camera resolution set to 1920 × 1080 pixels. Throughout the experiment, we collected a comprehensive dataset of back images from 73 pigs with varying body weights.

The next step is data cleaning. Images in which the pig is lying down, images in which the pig’s body is incompletely captured and therefore unsuitable for weight estimation, and images that are overexposed or underexposed are deleted. Only images where the pig is standing, with complete body features and appropriate lighting, are retained. The corresponding depth images are then obtained based on the retained RGB images.

2.1.3. Description of the Dataset

After data cleaning, the final dataset retained images where the pigs were standing with complete body features and appropriate lighting. These images meet the requirements for subsequent weight estimation and provide a high-quality foundation for dataset construction. Since no human intervention was made in the pigs’ behavior during the collection process (except for lying down), the images reflect a rich variety of pig morphology. Additionally, the collection pens were spacious and closely simulated the pigs’ daily living environment, allowing them to freely engage in activities such as eating and drinking. Therefore, the collected images accurately reflect the pigs’ natural morphology and behaviors, aligning with their daily life habits. The behavior states of the pigs included in the dataset are shown in Figure 5.

Detailed information on the dataset structure is provided in Appendix A: Dataset Structure.

2.1.4. Value of the Dataset

Advancing Computer Vision Algorithm Development: This dataset provides a rich set of training samples for the computer vision field, particularly in tasks like animal behavior analysis and weight estimation. It promotes the optimization and application of machine learning algorithms, fostering innovations in unsupervised learning, object detection, and instance segmentation technologies.

Accurate Weight Estimation and Health Monitoring: Through non-contact weight estimation of pigs using RGB images and combining behavior analysis, this method offers an efficient, low-cost solution for pig management.

This dataset can be used for training and validating machine learning models, particularly for 3D Gaussian-based applications [23]. It facilitates the generation of 3D models of pigs, enabling accurate weight estimation and further exploration of animal behavior in three-dimensional space.

The PIGRGB-Weight dataset information is shown in Table 3.

2.1.5. Dataset Classification

This study employed a five-fold cross-validation method to evaluate the performance of the live pig weight prediction model, with a training-to-test set ratio of 4:1 (Figure 6). To ensure that the weight distribution in both the training and test sets was representative, the weights and related feature values of all pigs were sorted in ascending order. The purpose of this approach was to ensure that each fold in the dataset presented a uniform increase in weight, allowing each fold to cover pigs from different weight ranges and ensuring that the model could effectively predict data across various weight ranges.
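The fold construction described above can be sketched as follows. This is a minimal illustration, assuming that after sorting by weight the samples are dealt to the five folds in round-robin order; the text does not state the exact assignment rule, and the function names and example weights are hypothetical.

```python
def assign_folds(weights, n_folds=5):
    """Return a fold index per sample so that every fold spans the weight range.

    Assumption: samples are ranked by weight in ascending order and dealt
    to folds in round-robin order (rank 0 -> fold 0, rank 1 -> fold 1, ...).
    """
    # Indices of the samples ordered from lightest to heaviest.
    order = sorted(range(len(weights)), key=lambda i: weights[i])
    folds = [0] * len(weights)
    for rank, idx in enumerate(order):
        folds[idx] = rank % n_folds
    return folds


def split_for_fold(folds, test_fold):
    """Split sample indices into training and test sets (4:1 when n_folds=5)."""
    train = [i for i, f in enumerate(folds) if f != test_fold]
    test = [i for i, f in enumerate(folds) if f == test_fold]
    return train, test


# Hypothetical live weights in kg, for illustration only.
example_weights = [92.5, 61.0, 110.2, 75.4, 88.0, 130.6, 55.3, 101.7, 69.9, 120.1]
example_folds = assign_folds(example_weights)
train_idx, test_idx = split_for_fold(example_folds, test_fold=0)
```

Because each fold receives every fifth sample of the sorted list, both light and heavy pigs appear in every test set. Whether the split is made per image or per pig is not specified here; the sketch works on whatever unit the `weights` list represents.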

2.2. Segmentation of Pig Images in the Environment

To achieve precise segmentation of individual target pigs from the images, we first manually annotated the dataset using the EISeg tool [24], ensuring the quality of the training data. The manually annotated dataset provided a solid foundation for training the segmentation model, ensuring that the model could accurately learn the morphological features of the pigs. Subsequently, we employed the pre-trained SAM2 segmentation model, which performs excellently in image segmentation tasks in complex environments and can effectively handle target segmentation under various poses and backgrounds [25]. We chose to directly use this model for segmenting individual pigs, making minor adjustments to create SAM2-Pig, so as to fully leverage the general features learned by the model from large-scale datasets.

All input images were standardized before segmentation to ensure image consistency and the stability of model input. When using the SAM2 model for segmentation, the model could accurately identify and segment the pig regions from the images in a short time, especially maintaining high segmentation accuracy even under complex backgrounds or when the pigs’ postures varied significantly. The segmented images were primarily used for the subsequent weight estimation task.

2.3. Image Feature Extraction

After image segmentation, to improve the accuracy of weight estimation, a continuous morphological opening operation method with adaptive kernel size was used for image processing. This method is typically employed to exclude unnecessary parts of the image, particularly to eliminate noise or irrelevant areas in the background of the target object. In this experiment, it was used to remove irrelevant parts such as the ears, tail, and legs from the pig mask images [26]. These areas occupy a large portion of the image but are not directly related to the actual weight of the pig, so they needed to be excluded from the mask to avoid interference with the subsequent analysis. The result after segmentation is shown in Figure 7. Additionally, to confirm the reliability of this approach, in the Discussion section, we will perform feature extraction on images without removing these body parts and conduct weight estimation tests under the same experimental conditions, except for the data.
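The effect of a morphological opening on thin appendages can be illustrated with a small, dependency-free sketch. It assumes a single opening with a fixed square kernel on a tiny synthetic mask; the adaptive kernel-size rule and the repeated application described above are not reproduced, and all names here are hypothetical.

```python
def _window(mask, y, x, r):
    """Yield values in the (2r+1) x (2r+1) window around (y, x); outside the image counts as 0."""
    h, w = len(mask), len(mask[0])
    for dy in range(-r, r + 1):
        for dx in range(-r, r + 1):
            yy, xx = y + dy, x + dx
            yield mask[yy][xx] if 0 <= yy < h and 0 <= xx < w else 0


def erode(mask, k):
    """Binary erosion: a pixel survives only if its whole k x k window is foreground."""
    r = k // 2
    return [[int(all(_window(mask, y, x, r))) for x in range(len(mask[0]))]
            for y in range(len(mask))]


def dilate(mask, k):
    """Binary dilation: a pixel is set if any pixel in its k x k window is foreground."""
    r = k // 2
    return [[int(any(_window(mask, y, x, r))) for x in range(len(mask[0]))]
            for y in range(len(mask))]


def opening(mask, k):
    """Opening = erosion followed by dilation; removes parts thinner than the kernel."""
    return dilate(erode(mask, k), k)


# Synthetic mask: a 5 x 5 "body" with a one-pixel-thick "tail" attached on the right.
toy_mask = [[0] * 9 for _ in range(7)]
for row in range(1, 6):
    for col in range(1, 6):
        toy_mask[row][col] = 1
for col in range(6, 9):
    toy_mask[3][col] = 1

opened_mask = opening(toy_mask, 3)
```

In this toy case the 3 × 3 opening deletes the tail while restoring the body to its original extent, which is the behaviour relied on when ears, tail, and legs are stripped from the pig mask.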

Regarding the selection of feature values for image feature extraction, elements such as the relative projection area on the pig’s back (SR), contour length of the pig (LC), body length (BL), and body width (BW) were chosen. Since the images were collected during the pig’s movement, the pigs’ postures were relatively free and could either be straight or curved, so eccentricity (E) was introduced as a feature to correct for this variation [5].

2.3.1. Relative Projection Area (SR)

The relative projection area is the ratio of the pig’s back area in the mask image to the total area of the entire binary image (1):

SR = \frac{\text{non-zero pixels}}{\text{total pixels}}

(1)

where “non-zero pixels” refers to the number of pixels in the image with a value of 255 (white) and “total pixels” refers to the total number of pixels in the image.
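As a small worked example of Equation (1), the sketch below counts foreground pixels in a binary mask stored as a nested list. The function name and the 4 × 4 mask are hypothetical and serve only to make the ratio concrete.

```python
def relative_projection_area(mask):
    """Fraction of non-zero (foreground) pixels in a binary mask, as in Equation (1)."""
    total_pixels = sum(len(row) for row in mask)
    non_zero_pixels = sum(1 for row in mask for value in row if value != 0)
    return non_zero_pixels / total_pixels


# Toy 4 x 4 mask with a 2 x 2 white region (value 255).
sr_mask = [
    [0, 0, 0, 0],
    [0, 255, 255, 0],
    [0, 255, 255, 0],
    [0, 0, 0, 0],
]
sr_value = relative_projection_area(sr_mask)  # 4 foreground pixels out of 16
```

Since SR is a ratio to the full image area, it is only comparable between images taken at the same resolution and camera height, which matches the fixed-height acquisition described earlier.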

2.3.2. Contour Perimeter (LC)

Contour length is the boundary length of the pig’s back in the mask image, that is, the perimeter of the closed contour of the pig’s body measured in pixel units (2):

LC = \text{cv2.arcLength}(\text{contours}, \text{True})

(2)

where “contours” are obtained using cv2.findContours and “True” indicates that the contour is closed [27].
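To make the quantity in Equation (2) concrete, the sketch below computes the perimeter of a closed contour as the sum of Euclidean distances between consecutive points, including the segment from the last point back to the first. This is a plain-Python stand-in for the OpenCV call, written on the understanding that the closed-contour arc length is this same sum; the function name and the example points are hypothetical.

```python
import math


def closed_contour_length(points):
    """Perimeter of a closed polygonal contour given as a list of (x, y) points."""
    n = len(points)
    # The modulo wraps the last point back to the first, closing the contour.
    return sum(math.dist(points[i], points[(i + 1) % n]) for i in range(n))


# A 4 x 3 axis-aligned rectangle: perimeter 4 + 3 + 4 + 3.
rectangle_contour = [(0, 0), (4, 0), (4, 3), (0, 3)]
lc_value = closed_contour_length(rectangle_contour)
```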

2.3.3. Body Length (BL) and Body Width (BW)

Body length is the length of the longer side of the minimum bounding rectangle enclosing the pig’s back in the mask image, and body width is the length of its shorter side (3) and (4):

BL = \sqrt{(box[2]_{x} - box[1]_{x})^{2} + (box[2]_{y} - box[1]_{y})^{2}}

(3)

BW = \sqrt{(box[1]_{x} - box[0]_{x})^{2} + (box[1]_{y} - box[0]_{y})^{2}}

(4)

where box consists of the four corner coordinates (box[0], box[1], box[2], box[3]) of the minimum bounding rectangle. The rotated rectangle itself is returned by cv2.minAreaRect(contours), and its corners are obtained with cv2.boxPoints. Here, box[1] and box[2] span the longer side of the rectangle, and box[0] and box[1] span the shorter side; because the corner order depends on the rectangle’s orientation, the longer of the two adjacent sides should in general be taken as BL. By explicitly using the Euclidean distance formula, we ensure an accurate computation of body length and body width.
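Equations (3) and (4) can be checked on a small rotated rectangle. The sketch assumes the four corners are listed consecutively around the rectangle, as corner-extraction routines such as cv2.boxPoints are expected to return them. It takes the longer adjacent side as BL regardless of which corner comes first, which is a choice made for this illustration rather than a statement about the original code. The function name and coordinates are hypothetical.

```python
import math


def body_length_width(box):
    """Return (BL, BW) from four rectangle corners given in consecutive order."""
    side_a = math.dist(box[0], box[1])  # one pair of adjacent corners
    side_b = math.dist(box[1], box[2])  # the neighbouring side
    # Order-independent: the longer side is the body length.
    return max(side_a, side_b), min(side_a, side_b)


# Rotated rectangle with side lengths 5 and 10 (a 3-4-5 triangle scaled by 1 and 2).
example_box = [(0.0, 0.0), (3.0, 4.0), (11.0, -2.0), (8.0, -6.0)]
bl_value, bw_value = body_length_width(example_box)
```

Both values are in pixels; converting them to physical lengths would require the camera height and intrinsics, which this sketch does not model.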

2.3.4. Eccentricity (E)

The contour of the pig’s back in the mask image is extracted, and an ellipse is fitted to describe the contour shape using the least squares method. The eccentricity of the pig’s body is then calculated from the ratio of the ellipse’s minor axis to its major axis, as the square root of one minus the square of this ratio. This method effectively describes the degree of deviation in the pig’s body shape and provides an important geometric feature for weight estimation and other analyses (5),

\text{Eccentricity} = \sqrt{1 - \left(\frac{\text{minor axis}}{\text{major axis}}\right)^{2}}

(5)

where “major axis” is the major axis of the fitted ellipse and “minor axis” is its minor axis.
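A short numeric illustration of Equation (5) follows. It assumes the two axis lengths of the fitted ellipse are already available (for instance from an ellipse-fitting routine such as cv2.fitEllipse) and may arrive in either order; only their ratio matters, so full or semi-axis lengths give the same result. The function name and values are hypothetical.

```python
import math


def eccentricity(axis_a, axis_b):
    """Eccentricity of an ellipse from its two axis lengths, as in Equation (5)."""
    if axis_a <= 0 or axis_b <= 0:
        raise ValueError("axis lengths must be positive")
    major_axis = max(axis_a, axis_b)
    minor_axis = min(axis_a, axis_b)
    return math.sqrt(1.0 - (minor_axis / major_axis) ** 2)


e_elongated = eccentricity(100.0, 60.0)  # elongated shape, e.g. a pig standing straight
e_circle = eccentricity(50.0, 50.0)      # equal axes give zero eccentricity
```

Values close to 1 indicate an elongated outline, while a curled or turning posture shortens the fitted major axis and lowers the value, which is why E can help correct for posture.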

2.4. Regression Models

This study uses multiple regression algorithms for feature training, including Ordinary Least Squares (OLS), Support Vector Regression (SVR), Backpropagation Neural Networks (BPNNs, trained with trainbr, trainlm, trainscg, and traincgb), AdaBoost, CatBoost, XGBoost, and Random Forest (RF). OLS regression is a classic linear regression method that fits the data by minimizing the sum of squared errors; it is suitable for scenarios with a strong linear relationship and offers good interpretability [28]. SVR performs regression using the Support Vector Machine algorithm, effectively handling nonlinear relationships and utilizing the kernel trick to find the optimal regression hyperplane in high-dimensional space, thereby enhancing the robustness of the model [29]. The BPNN is a multilayer feedforward neural network that uses the backpropagation algorithm to update network weights and can handle complex nonlinear data [30]. In the BPNN, trainlm (Levenberg–Marquardt algorithm) and trainbr (Bayesian regularization) are two commonly used training functions that accelerate convergence and help prevent overfitting, respectively. In addition, the trainscg (Scaled Conjugate Gradient) optimization algorithm is used for training large-scale datasets, providing a more stable training process by extending the conjugate gradient method and effectively avoiding convergence problems in traditional gradient descent methods [31]. On the other hand, traincgb (conjugate gradient backpropagation with Powell–Beale restarts) is another variant of the conjugate gradient method, in which the search direction is reset to the negative gradient whenever a restart condition is met [32]. AdaBoost is an ensemble learning method that sequentially trains multiple weak regressors, increasing the weight of poorly predicted samples in each iteration to improve model accuracy [33]. CatBoost is based on gradient boosting trees and has strong capabilities in handling categorical features, improving model performance through loss function optimization and feature ranking [34]. XGBoost improves traditional gradient boosting trees by introducing regularization terms and a second-order Taylor expansion of the loss function, improving fitting accuracy and computational efficiency, with strong generalization ability [35,36]. Random Forest (RF) integrates multiple decision trees and, for regression, averages their predictions, reducing the variance of a single decision tree and enhancing the model’s robustness and stability [37]. By comparing these regression algorithms, this study can select the most suitable model for weight estimation across different data types and task scenarios, maximizing the model’s predictive performance.

In all models, we consistently used the following parameter settings to ensure the stability and efficiency of the training process: we employed an Early Stopping strategy to prevent overfitting. The batch size was set to 100, and the specific training parameters were configured as follows: the learning rate (net.trainParam.lr) was set to 0.01, the goal error (net.trainParam.goal) was set to 1 × 10⁻⁶, and the minimum gradient (net.trainParam.min_grad) was set to 1 × 10⁻⁷. These settings ensured consistency across all models during the training process, helping the models achieve optimal performance and effectively prevent overfitting.

We used the coefficient of determination (R²) (6), the mean absolute error (MAE) (7), the mean squared error (MSE) (8), and the root mean square error (RMSE) (9) as measures to evaluate quality. They are defined as follows:

R^{2} = 1 - \frac{\sum_{i=1}^{n} (y_{i} - \hat{y}_{i})^{2}}{\sum_{i=1}^{n} (y_{i} - \bar{y})^{2}}

(6)

MAE = \frac{1}{n} \sum_{i=1}^{n} |y_{i} - \hat{y}_{i}|

(7)

MSE = \frac{1}{n} \sum_{i=1}^{n} (y_{i} - \hat{y}_{i})^{2}

(8)

RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_{i} - \hat{y}_{i})^{2}}

(9)

where n is the sample size, y_{i} is the actual value, \hat{y}_{i} is the predicted value, and \bar{y} is the mean of the actual values.
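The four evaluation measures in Equations (6) to (9) can be computed directly from paired actual and predicted weights. The sketch below is a plain-Python illustration; the function name and the four example weights are hypothetical and are not results from this study.

```python
import math


def regression_metrics(y_true, y_pred):
    """Compute R2, MAE, MSE, and RMSE for paired actual and predicted values."""
    n = len(y_true)
    mean_true = sum(y_true) / n
    residuals = [yt - yp for yt, yp in zip(y_true, y_pred)]
    ss_res = sum(r * r for r in residuals)                # residual sum of squares
    ss_tot = sum((yt - mean_true) ** 2 for yt in y_true)  # total sum of squares
    mse = ss_res / n
    return {
        "R2": 1.0 - ss_res / ss_tot,
        "MAE": sum(abs(r) for r in residuals) / n,
        "MSE": mse,
        "RMSE": math.sqrt(mse),
    }


# Hypothetical weights in kg, for illustration only.
actual_kg = [80.0, 90.0, 100.0, 110.0]
predicted_kg = [82.0, 88.0, 101.0, 109.0]
metrics = regression_metrics(actual_kg, predicted_kg)
```

MAE and RMSE are in kilograms and MSE in squared kilograms, so RMSE is usually the easier of the two squared-error measures to compare against MAE.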

3. Results

3.1. Image Segmentation Results

Based on 1309 manually annotated samples, we applied the pre-trained SAM2 segmentation model to segment the single-target pigs. The model successfully extracted the pig regions from the input images, even in complex backgrounds and diverse postures, demonstrating good segmentation performance. In total, the model generated 9579 mask images (Figure 8) with precise segmentation results, which shows its good adaptability and stability across scenes and pig postures. These segmentation results provided accurate image data for the subsequent weight estimation task, ensuring the reliability of the analysis. Overall, the SAM2 segmentation model completed the target pig segmentation task effectively even with a relatively small annotated dataset, providing high-quality input data for the subsequent weight estimation and verifying its feasibility and effectiveness in practical applications (Figure 9).

3.2. Feature Extraction Results

After removing features such as the ears, tail, and legs, which occupy a large area in the back image but contribute little to the actual weight, the extraction and calculation of various features from the mask images resulted in a total of 9579 data entries for 73 pigs.

3.3. Weight Prediction Results

After the feature values were extracted, they were standardized (10), rescaling each feature to zero mean and unit variance in order to eliminate the influence of inconsistent units between different features during model training.

X_{\text{StandardScaler}} = \frac{X - \mu}{\sigma}

(10)

where X is the original value of the data point to be standardized, \mu is the mean of the entire dataset, calculated as the average of all data points, and \sigma is the standard deviation of the dataset, which measures the spread or dispersion of the data points.
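Equation (10) can be illustrated on a single feature column. The sketch assumes the population standard deviation (dividing by n), which is an assumption about the scaler's convention rather than something stated in the text; the function name and feature values are hypothetical.

```python
import statistics


def standardize(values):
    """Rescale one feature column to zero mean and unit variance, as in Equation (10)."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)  # population standard deviation (assumed convention)
    if sigma == 0:
        raise ValueError("cannot standardize a constant feature")
    return [(v - mu) / sigma for v in values]


# Hypothetical values of one extracted feature, for illustration only.
feature_column = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z_scores = standardize(feature_column)
```

The transformed values are unitless and are not confined to a fixed interval: here the mean is 5 and the standard deviation is 2, so the largest value, 9, maps to 2.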

In terms of feature selection, this study used the relative projection area, contour length, body width, body length, and eccentricity as input variables for the regression algorithms. These features help the model better capture the relationship between body shape and weight, thereby improving prediction accuracy. Next, the experiments were conducted in the Python 3.12.3 and PyTorch 2.5.1 environments, with a system supporting NVIDIA GPUs, specifically with NVIDIA driver version 475.14 and CUDA version 11.4. In this environment, we used several regression models, including OLS, SVR, AdaBoost, CatBoost, XGBoost, and RF, for the weight prediction task. The Backpropagation Neural Network (BPNN), including trainbr, trainlm, trainscg, and traincgb, was trained and tested on the MATLAB platform (MATLAB R2024a). By comparing the performance of these algorithms on different training sets, the effectiveness and robustness of each model in predicting live pig weight were evaluated.

To identify the most suitable BPNN configuration, a series of experiments were conducted to fine-tune the network structure, training algorithms, number of hidden neurons, and other key parameters. Various hidden layer configurations were tested, including single-layer and dual-layer combinations. Specifically, configurations such as [10], [20], and [50] for single hidden layers and [10, 10], [20, 10], [50, 30], [100, 50] for dual hidden layers were evaluated. Through experimentation, it was found that smaller single-layer networks (e.g., [10], [20]) are suitable for faster training and fewer computations, whereas dual-layer networks (e.g., [20, 10], [50, 30]) showed improved accuracy and generalization performance. Ultimately, by employing optimization algorithms such as trainbr, trainlm, trainscg, and traincgb and adjusting hyperparameters like the learning rate and training epochs, the optimal configuration for the BPNN network model was determined for this dataset (Table 4).

The BPNN with the Trainlm[20,10] configuration performed the best, achieving the lowest MSE, RMSE, and MAE, and the highest R². This indicates its superior prediction accuracy and fitting performance on this dataset. Following that, CatBoost and XGBoost also showed strong performance, particularly in terms of R² and error metrics, demonstrating strong predictive power. OLS also performed relatively stably, making it suitable for cases with strong linear relationships, and it provided good interpretability. AdaBoost and SVR, however, performed weaker with larger errors, likely due to their inability to fully capture the nonlinear characteristics of the data.

Overall, these results suggest that the model can maintain high accuracy across this dataset. As shown in Figure 10 and Figure 11, each model demonstrated strong prediction performance. The carefully designed dataset facilitates optimal predictive results across various regression algorithms, demonstrating excellent adaptability and efficiency. Figure 10 displays the scatter plots of the predicted values versus the actual values for each model. The models in the figure are arranged in descending order of their R² values, with the highest R² model placed at the top. This arrangement allows for an intuitive comparison of the prediction performance of each model, highlighting their alignment with the actual observed values.

Figure 11 shows the scatter plot of the optimal model, Trainlm. In this plot, the predicted weight values are tightly clustered around the ideal line (1:1 relationship), indicating the model’s excellent prediction accuracy. The closeness between the predicted values and actual values demonstrates that this model has high precision in estimating pig weight, with minimal deviation.

3.4. Supplementary Analysis and Further Research Directions

In this section, we discuss potential areas for improvement and optimization in the experiment. First, we explore the possibility of directly using the collected pig body measurement data to develop the weight estimation model and investigate the relationship between body measurements and weight. Second, we explore the Stacking Regressor ensemble learning method, which could enhance model prediction accuracy by identifying and emphasizing the features most influential for weight prediction (such as chest girth and abdominal girth). Third, we assess the effect of changing the dataset partitioning method to determine whether the dataset demonstrates strong reliability and generalization ability in practical applications. We also investigate whether removing or retaining areas such as the pig’s ears and tail in the mask significantly impacts weight estimation accuracy. Finally, we apply the Pearson correlation coefficient to analyze the potential relationships between image features and body measurement data, offering a theoretical basis for effectively utilizing image features in weight estimation.

3.4.1. Analysis of Manual Measurement Data

We recorded manually collected body size data that included various body measurement characteristics such as body length, shoulder width, shoulder height, etc. These body measurements will be used as input characteristics to provide a practical basis for further research on the significance and impact of body measurement information on pig weight estimation.

Multiple Linear Regression Model

In the present experimental framework, manually measured body measurement data of pigs collected during image acquisition were utilized to construct a weight estimation model. The use of multiple linear regression (MLR) for weight estimation is a classic and effective statistical methodology. By analyzing the linear relationship between various body measurement variables and body weight, a regression equation is derived that enables accurate weight estimation. The multiple linear regression model can be expressed through the following weight estimation formula:

\mathrm{Weight} = -197.27 + 0.7395A + 0.4527B + 0.0075C + 0.7475D + 0.7916E + 0.6780F

where A is the shoulder width, B is the withers height, C is the hip girth, D is the chest girth, E is the abdominal girth, and F is the cannon bone circumference (see Table 2 for the measurement definitions).
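As a worked illustration, the regression equation above can be wrapped in a small function. The coefficients are taken from the equation; the function name, the argument names, and the sample measurements are hypothetical, and the measurement units are assumed to match those used when the model was fitted (the text does not state them here).

```python
def estimate_weight_mlr(shoulder, height, hip, chest, abdominal, cannon_bone):
    """Evaluate the multiple linear regression equation for pig weight.

    Arguments correspond to A-F in the equation; units are assumed to be
    the same as in the manual measurements used for fitting.
    """
    return (
        -197.27
        + 0.7395 * shoulder
        + 0.4527 * height
        + 0.0075 * hip
        + 0.7475 * chest
        + 0.7916 * abdominal
        + 0.6780 * cannon_bone
    )

# Hypothetical measurements, chosen only to exercise the formula.
weight = estimate_weight_mlr(
    shoulder=35, height=70, hip=100, chest=130, abdominal=140, cannon_bone=18
)
print(round(weight, 2))
```

Because the chest and abdominal coefficients are the largest among the girth terms, a one-unit change in either shifts the estimate by roughly 0.75 to 0.79, whereas the hip term barely moves it.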

The weight estimation model was developed using manually measured body dimensions of pigs obtained during image acquisition, employing multiple linear regression techniques. By constructing a multiple linear regression equation, the model provides an effective method for predicting the weight of pigs based on their body measurements. This approach transforms the raw body measurement data into a robust mathematical framework, which not only enhances the accuracy of weight estimation but also offers practical applicability for further research.

Feature Importance Score

To score feature importance for pig weight prediction, we built a Stacking Regressor [38] with a Random Forest Regressor and a Gradient Boosting Regressor as base learners and Linear Regression as the final regressor. Each entry in Figure 12 pairs a feature with its contribution value. The magnitude of the value reflects the degree to which the feature influences the prediction outcome: a higher contribution value indicates a greater impact of that feature on the prediction result.

The evaluation results demonstrate that chest girth and abdominal girth are the most influential features in predicting pig weight, accounting for the majority of the model’s explanatory variance. This observation aligns closely with the growth characteristics of pigs, where these dimensions directly correlate with body weight, thereby reinforcing the central role of these features in weight prediction. In contrast, features such as withers height, body length, hip girth, cannon bone circumference, and shoulder width contribute relatively less and may serve as auxiliary features. These less impactful features can be selectively retained during the model optimization process, depending on practical requirements. This analysis not only reveals the relative importance of each feature in weight prediction but also offers valuable insights for feature engineering, providing a clear strategy for optimizing model performance and improving prediction accuracy in real-world applications.
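The stacked model described above can be sketched with scikit-learn as follows. Two points are assumptions rather than the authors' procedure: the text does not say how the importance scores were derived from the Stacking Regressor, so permutation importance is used here as one plausible choice, and the data are synthetic, with a target deliberately constructed to depend mostly on the chest and abdominal columns.

```python
import numpy as np
from sklearn.ensemble import (GradientBoostingRegressor,
                              RandomForestRegressor, StackingRegressor)
from sklearn.inspection import permutation_importance
from sklearn.linear_model import LinearRegression

feature_names = ["body_length", "shoulder_width", "withers_height",
                 "chest_girth", "abdominal_girth", "hip_girth", "cannon_bone"]

# Synthetic data: the target is built to depend mainly on columns 3 and 4,
# so the ranking below is a property of this toy data, not a finding.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, len(feature_names)))
y = (4.0 * X[:, 3] + 2.0 * X[:, 4] + 0.3 * X[:, 0]
     + rng.normal(scale=0.2, size=200))

# Base learners and final regressor as named in the text.
stack = StackingRegressor(
    estimators=[
        ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
        ("gb", GradientBoostingRegressor(random_state=0)),
    ],
    final_estimator=LinearRegression(),
)
stack.fit(X, y)

# Permutation importance: drop in score when one feature is shuffled.
result = permutation_importance(stack, X, y, n_repeats=5, random_state=0)
ranking = sorted(zip(feature_names, result.importances_mean),
                 key=lambda pair: pair[1], reverse=True)
for name, score in ranking:
    print(f"{name:16s} {score:.3f}")
```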

3.4.2. Train–Test Split

In the data partitioning stage, we compared the Train–Test Split and K-Fold Cross-Validation methods to evaluate the model’s performance under different partitioning strategies. The simple split method is suitable for situations with large datasets or limited computational resources, as it allows for quick validation of model performance. However, its results may have large fluctuations, especially when the dataset is small or unevenly distributed, which can lead to biased evaluation results, as shown in Table 5. In contrast, K-Fold Cross-Validation, by repeatedly partitioning and validating the dataset, provides a more stable and comprehensive model evaluation, making it particularly suitable for small datasets or scenarios requiring high-precision evaluation.

From the experimental results, K-Fold Cross-Validation showed significant superiority in model evaluation. Specifically, in the RF and XGBoost models, the K-Fold method exhibited more consistent performance on the test set, with smaller errors and better generalization ability and stability. On the other hand, the Train–Test Split method, due to the randomness of the partitioning, resulted in larger test set errors and higher variability in evaluation results. This comparison not only reveals the impact of different partitioning strategies on model evaluation but also further validates the reliability and generalization ability of this dataset in real-world applications. K-Fold Cross-Validation has been proven to be a more robust evaluation method in this study, effectively reducing the random effects of data partitioning and providing more reliable performance assessments for the model. This finding also provides an important reference for the choice of data partitioning strategies in future related research, especially in scenarios involving small datasets or requiring high-precision evaluations, where K-Fold Cross-Validation should be the preferred method.
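The difference in stability between the two partitioning strategies can be illustrated with the sketch below. It uses synthetic data and an ordinary least-squares fit rather than the RF and XGBoost models of the study, and all helper names are hypothetical. The evaluation is repeated with different random partitions to compare how much the reported test RMSE fluctuates.

```python
import numpy as np

def fit_ols(X, y):
    """Least-squares coefficients with an intercept term."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def rmse(coef, X, y):
    A = np.column_stack([np.ones(len(X)), X])
    return float(np.sqrt(np.mean((A @ coef - y) ** 2)))

# Small synthetic dataset (assumption: stands in for features and weights).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = 120 + X @ np.array([20.0, 8.0, 3.0]) + rng.normal(scale=5, size=100)

def holdout_rmse(seed, test_fraction=0.2):
    """Test RMSE for one random train-test split."""
    idx = np.random.default_rng(seed).permutation(len(X))
    n_test = int(len(X) * test_fraction)
    test, train = idx[:n_test], idx[n_test:]
    return rmse(fit_ols(X[train], y[train]), X[test], y[test])

def kfold_rmse(seed, k=5):
    """Mean test RMSE over k folds for one random partition."""
    idx = np.random.default_rng(seed).permutation(len(X))
    folds = np.array_split(idx, k)
    scores = []
    for i in range(k):
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        scores.append(rmse(fit_ols(X[train], y[train]),
                           X[folds[i]], y[folds[i]]))
    return float(np.mean(scores))

holdout = [holdout_rmse(seed) for seed in range(20)]
kfold = [kfold_rmse(seed) for seed in range(20)]
print("hold-out spread of RMSE:", round(float(np.std(holdout)), 3))
print("5-fold spread of RMSE:  ", round(float(np.std(kfold)), 3))
```

Because every sample is used for testing exactly once, the K-fold average depends far less on the particular random partition than a single hold-out estimate does.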

3.4.3. Effect of Retaining Large but Low-Weight Features on Weight Prediction

In the study by Xie et al. [39], it was observed that the ears and tail of pigs occupy a significant portion of the image area but contribute minimally to the actual weight prediction. Based on this, we removed the ear, tail, and leg regions from the data images in our experiment. To further validate this hypothesis, we conducted a comparative experiment using the original, unprocessed image data, without any removal of these parts. The results of this experiment, showing the performance of the weight prediction task with the unprocessed images, are presented in Table 6.

The experimental results indicate that removing the regions of the pig images that occupy a large image area but contribute little to body weight (such as the ears, tail, and legs) significantly improves the model’s prediction accuracy. This preprocessing strategy not only makes the model results more interpretable but also helps the model focus on the primary features that are highly correlated with weight, thereby enhancing prediction performance. This finding highlights the critical role of data preprocessing in model training and supports removing such regions during the data processing phase. The results provide practical evidence for future model optimization and data cleaning and a reference for designing data preprocessing strategies in similar tasks.

3.4.4. Pearson Correlation Analysis

The calculation of the Pearson correlation coefficient is significant in weight prediction models. By quantifying the strength of the linear relationship between features and actual weight, it effectively evaluates the predictive ability of each feature and provides a basis for feature selection. Additionally, the Pearson correlation coefficient helps to uncover potential relationships between different features, further deepening the understanding of the data. Based on this, the most valuable features can be selected, reducing the interference of redundant features in model training and ultimately improving model accuracy. Therefore, the calculation of the Pearson correlation coefficient provides data support and a theoretical basis for model optimization, helping to improve the accuracy of weight estimation.
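For concreteness, the Pearson correlation coefficient between a feature and the measured weight can be computed as in this minimal sketch. The function name and the sample values are hypothetical; the formula is the standard one (covariance divided by the product of the standard deviations).

```python
import numpy as np

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()
    dy = y - y.mean()
    return float(np.sum(dx * dy) / np.sqrt(np.sum(dx ** 2) * np.sum(dy ** 2)))

# Hypothetical chest girth and weight values for four pigs.
chest = [100, 110, 120, 130]
weight = [80, 95, 112, 125]
r = pearson_r(chest, weight)
print(round(r, 3))
```

Applying the same function to every pair of columns yields the kind of correlation matrix that the heat maps visualize.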

Manual Body Measurements with Weight Measurements

According to the Pearson correlation coefficient matrix (Figure 13), the correlations between weight and body length, chest girth, and abdominal girth are relatively strong, with values of 0.91, 0.96, and 0.94, respectively, indicating that these features have a significant impact on weight prediction. The correlation between weight and shoulder width and withers height is also strong (0.82 and 0.89) but slightly lower than that of chest girth and abdominal girth. The correlation between weight and hip girth (0.20) and cannon bone circumference (0.58) is relatively weak, so these features may be considered for exclusion during feature selection. Overall, chest girth, abdominal girth, and body length are key features for weight prediction, while shoulder width and withers height can serve as auxiliary features.

Image Extraction of Feature Values with Weight Measurements

By calculating the Pearson correlation coefficient (Figure 14), it was found that weight has a strong positive correlation with SR (0.95), indicating that SR is a key feature for weight estimation. The correlation between weight and LC is 0.66, further supporting its importance in weight prediction. The correlations between weight and BL and BW are relatively weak, especially for BL (0.12). The eccentricity (E) has almost no correlation with weight (0.02). However, eccentricity was selected as a feature to account for the morphological variations of freely moving pigs. Since the dataset includes pigs in different postures and movement states, the shape of the pig’s projection may vary, affecting the stability of feature-based weight estimation. Eccentricity helps quantify these shape variations so that the model remains robust across different pig postures, even though its direct correlation with weight is low. Overall, SR and LC are key features, while BL, BW, and E require further evaluation.
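To make the role of eccentricity concrete, the sketch below computes it from a binary mask using second-order moments, i.e. the eccentricity of the ellipse with the same covariance as the foreground pixels. This moment-based definition is an assumption chosen for illustration; the exact definition used for E in the feature extraction step may differ, and the function name is hypothetical.

```python
import numpy as np

def mask_eccentricity(mask):
    """Eccentricity (0 = circular, near 1 = elongated) of a binary mask."""
    ys, xs = np.nonzero(mask)
    if xs.size < 2:
        return 0.0  # not enough pixels to define a shape
    # Covariance of the foreground pixel coordinates.
    cov = np.cov(np.stack([xs, ys]).astype(float))
    minor, major = np.linalg.eigvalsh(cov)  # eigenvalues in ascending order
    if major == 0:
        return 0.0
    return float(np.sqrt(max(0.0, 1.0 - minor / major)))

# Toy masks: a compact square and an elongated rectangle.
square = np.zeros((20, 20), dtype=bool)
square[5:15, 5:15] = True
elongated = np.zeros((20, 60), dtype=bool)
elongated[8:12, 10:50] = True

print(round(mask_eccentricity(square), 3))
print(round(mask_eccentricity(elongated), 3))
```

A pig that bends or turns its body changes the outline of its projection, which shifts this value even when its weight is unchanged; that is why the feature can carry posture information despite a low correlation with weight.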

Taken together, the two sets of results show that chest girth and abdominal girth in the body measurement data correlate strongly with SR in the image features. This suggests that certain image features can effectively reflect key indicators in the body measurement data, thereby indirectly influencing weight estimation. Because chest girth and abdominal girth correlate strongly with image features such as SR, image-based weight estimation methods can achieve good prediction results by extracting information similar to these body measurements. Image features such as BL and BW have a weak correlation with weight and do not show a significant correlation among the image features either, which may indicate that they are less effective for weight estimation than chest girth and abdominal girth. It also suggests that measurements reflecting three-dimensional body shape are more effective than two-dimensional data. In future experiments, we need to incorporate more three-dimensional data to enhance the performance of the weight estimation model.

3.4.5. Applicability in Different Test Scenarios

Our experimental acquisition device has low requirements for the data collection environment and can adapt to various farming conditions, such as large-scale commercial farms, smallholder operations, and different flooring materials and lighting conditions. This flexibility ensures that the proposed method can effectively capture data and estimate weight across different farming scenarios. However, since our dataset primarily consists of specific pig breeds, variations in body shape, back contour features, and coat color among different breeds may affect the model’s generalization ability. For instance, more compact or elongated breeds may exhibit different relationships between key features—such as relative projection area, body length, and body width—and weight. Additionally, dark-colored or patterned pigs may pose challenges for RGB-based feature extraction, potentially impacting the model’s prediction accuracy.

It is also important to consider that the dataset collection method may introduce some potential biases. Specifically, the fixed camera height and the inclusion of limited pig breeds in the dataset may result in biases that affect the robustness and generalizability of the model’s predictions. These biases, such as variations in pig shape and coat color that influence feature extraction, could lead to limitations in the model’s applicability across a wider range of pigs and environments.

To improve the applicability of our approach, we plan to expand the dataset in future studies by incorporating more pig breeds and varying environmental conditions. This will enhance the model’s robustness and generalization, ensuring better performance across diverse breeding environments and a broader spectrum of pig breeds.

4. Conclusions

The image data used in this study were collected on a pig farm from pigs in a free-moving state, including activity scenes such as drinking, feeding, and running. The data were captured using a self-developed suspended fixed-angle acquisition cart, ensuring the stability and consistency of the image data. Using SAM2-Pig for instance segmentation, mask images of the pigs were extracted, and features such as relative projection area, contour length, body length, body width, and eccentricity were obtained. Based on these feature values, several regression models were trained to estimate the pigs’ body weight. The experimental results showed that the BPNN with the Trainlm configuration performed best across all evaluation metrics, with the smallest MSE, RMSE, and MAE and the highest R² value, demonstrating the superior prediction accuracy of this model on the dataset. The results of this study identified a good machine learning model for estimating pig body weight in a free-roaming state and verified the stability and reliability of the dataset under different models and partitioning strategies. Additionally, the RGB image dataset used in this study has been made publicly available for use by researchers in related fields.

Author Contributions

Conceptualization, W.M.; methodology, X.J.; validation, X.J.; formal analysis, S.X.Y.; investigation, M.L. and Z.X.; resources, M.L., W.M. and Z.R.; data curation, W.M. and X.J.; writing—original draft, Z.X. and W.M.; writing—review and editing, X.J., W.M. and S.X.Y.; project administration, K.G., Q.L. and Z.R.; funding acquisition, K.G. and Q.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Science and Technology Major Project of China (2022ZD0115702), the Beijing Academy of Agriculture and Forestry Sciences (JKZX202214), and the Beijing livestock innovation team (BAIC5-2025). The actual animal experiments underwent animal ethics evaluation and were approved by the Information Technology Research Center of the Beijing Academy of Agriculture and Forestry Sciences (Approval Number: AWM-2025-2-21).

Institutional Review Board Statement

The animal study protocol was approved by the Institutional Review Board (IRB) of the Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences (protocol code AWM-2024-3-28).

Data Availability Statement

The original contributions presented in this study are included in the article material. Further inquiries can be directed to the corresponding author(s).

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to have influenced the work reported in this paper.

Appendix A. Dataset Structure

Appendix A.1. RGB_9579

The images in this folder were captured at a fixed height of 1.88 m. The structure of the dataset is organized as follows: The root directory is named PIGRGB-Weight, which contains all the dataset files. Under the first level, there is a folder named RGB_9579, which holds the RGB image data, comprising a total of 9579 images. The second level includes subfolders such as fold1, which is one of the five folds used for cross-validation, with the images evenly distributed across these folds according to pig weight. Within each fold, the third level contains subfolders named based on the pig’s weight, such as 73.36_124, where “73.36” represents the weight of the pig (in kg) and “124” indicates the number of images in that folder. Finally, the fourth level contains specific RGB image files, such as 73.36kg_1.png, where the filename includes the pig’s weight followed by a sequential number to indicate the order of image capture. This folder structure ensures that the dataset is well-organized, making it easier to manage, access, and use for training and validation purposes.
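The naming convention described above can be parsed with a few lines of code. The sketch below assumes that folder and file names follow exactly the patterns given in the examples (`73.36_124` and `73.36kg_1.png`); the function names are hypothetical.

```python
import re

# Patterns inferred from the examples in the text.
FOLDER_RE = re.compile(r"^(?P<weight>\d+(?:\.\d+)?)_(?P<count>\d+)$")
FILE_RE = re.compile(r"^(?P<weight>\d+(?:\.\d+)?)kg_(?P<index>\d+)\.png$")

def parse_weight_folder(name):
    """Return (weight_kg, image_count) from a folder name like '73.36_124'."""
    match = FOLDER_RE.match(name)
    if match is None:
        raise ValueError(f"unexpected folder name: {name!r}")
    return float(match["weight"]), int(match["count"])

def parse_image_name(name):
    """Return (weight_kg, capture_index) from a name like '73.36kg_1.png'."""
    match = FILE_RE.match(name)
    if match is None:
        raise ValueError(f"unexpected file name: {name!r}")
    return float(match["weight"]), int(match["index"])

print(parse_weight_folder("73.36_124"))
print(parse_image_name("73.36kg_1.png"))
```

Reading the label from the file name keeps each image self-describing, so a loader does not need a separate annotation file to recover the weight.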

Appendix A.2. RGB_MASK_3394

The images in this folder include manually segmented mask images and their original RGB counterparts, with the images captured at a height of 1.78 m. These segmented images were used for training the SAM2 model, and the training data are publicly shared in the dataset to support further research and model validation.

Within the first level, there is a folder named ‘RGB_MASK_3394’, which holds both RGB images and corresponding mask images. Within the second level, the ‘RGB_3394’ subfolder contains the RGB image data, and the ‘MASK_3394’ subfolder contains the corresponding mask images. Within the third level, subfolders such as ‘158.56_6’ are named based on the pig’s weight (e.g., “158.56”) and the number of images in the folder (e.g., “6”). Finally, the fourth level contains specific image files, such as ‘158.56kg_1.png’, where the filename includes the pig’s weight followed by a sequential number based on the order of image capture. This structure ensures that the dataset is organized and easy to navigate, allowing for efficient access to the RGB and mask images for model training and validation.
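Given this layout, an RGB image can be paired with its mask by swapping one path component. The sketch below assumes that the mask under `MASK_3394` mirrors the RGB file's subfolder and file name exactly, which the description suggests but does not state explicitly; the function name is hypothetical.

```python
from pathlib import PurePosixPath

def mask_path_for(rgb_path):
    """Map an RGB image path to the assumed path of its mask image."""
    parts = list(PurePosixPath(rgb_path).parts)
    # Raises ValueError if the path is not inside the RGB_3394 subfolder.
    index = parts.index("RGB_3394")
    parts[index] = "MASK_3394"
    return PurePosixPath(*parts)

rgb = "PIGRGB-Weight/RGB_MASK_3394/RGB_3394/158.56_6/158.56kg_1.png"
print(mask_path_for(rgb))
```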

Figure A1. Dataset structure.

References

1. Tzanidakis, C.; Simitzis, P.; Arvanitis, K.; Panagakis, P. An overview of the current trends in precision pig farming technologies. Livest. Sci. 2021, 249, 104530.
2. Doeschl-Wilson, A.B.; Green, D.M.; Fisher, A.V.; Carroll, S.M.; Schofield, C.P.; Whittemore, C.T. The relationship between body dimensions of living pigs and their carcass composition. Meat Sci. 2005, 70, 229–240.
3. Lee, S.; Ahn, H.; Seo, J.; Chung, Y.; Pan, S. Practical Monitoring of Undergrown Pigs for IoT-based Large-Scale Smart Farm. IEEE Access 2019, 7, 173796–173810.
4. Nyalala, I.; Okinda, C.; Kunjie, C.; Korohou, T.; Chao, Q. Weight and volume estimation of poultry and products based on computer vision systems: A review. Poult. Sci. 2021, 100, 101072.
5. Bhoj, S.; Tarafdar, A.; Chauhan, A.; Singh, M.; Gaur, G.K. Image processing strategies for pig liveweight measurement: Updates and challenges. Comput. Electron. Agric. 2022, 193, 106693.
6. Guevara, R.D.; Pastor, J.J.; Manteca, X.; Tedo, G.; Llonch, P. Systematic review of animal-based indicators to measure thermal, social, and immune-related stress in pigs. PLoS ONE 2022, 17, e0266524.
7. Chen, C.; Zhao, Y.; Wang, H.; Li, B. Research Progress on Livestock Intelligent Grouping Equipment. China Swine Ind. 2024, 19, 59–69.
8. Ma, W.; Qi, X.; Sun, Y.; Gao, R.; Ding, L.; Wang, R.; Peng, C.; Zhang, J.; Wu, J.; Xu, Z. Computer Vision-Based Measurement Techniques for Livestock Body Dimension and Weight: A Review. Agriculture 2024, 14, 306.
9. Pezzuolo, A.; Guarino, M.; Sartori, L.; González, L.A.; Marinello, F. On-barn pig weight estimation based on body measurements by a Kinect v1 depth camera. Comput. Electron. Agric. 2018, 148, 29–36.
10. Hansen, M.F.; Smith, M.L.; Smith, L.N.; Abdul Jabbar, K.; Forbes, D. Automated monitoring of dairy cow body condition, mobility and weight using a single 3D video capture device. Comput. Ind. 2018, 98, 14–22.
11. Zhang, J.; Zhuang, Y.; Ji, H.; Teng, G. Pig Weight and Body Size Estimation Using a Multiple Output Regression Convolutional Neural Network: A Fast and Fully Automatic Method. Sensors 2021, 21, 3218.
12. He, H.; Qiao, Y.; Li, X.; Chen, C.; Zhang, X. Automatic weight measurement of pigs based on 3D images and regression network. Comput. Electron. Agric. 2021, 187, 106299.
13. Liu, Y.; Fan, B.; Meng, G.; Lu, J.; Xiang, S.; Pan, C. Densepoint: Learning densely contextual representation for efficient point cloud processing. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 5239–5248.
14. Shi, S.; Yin, L.; Liang, S.; Zhong, H.; Tian, X.; Liu, C.; Sun, A.; Liu, H. Research on 3D surface reconstruction and body size measurement of pigs based on multi-view RGB-D cameras. Comput. Electron. Agric. 2020, 175, 105543.
15. Zollhöfer, M.; Stotko, P.; Görlitz, A.; Theobalt, C.; Nießner, M.; Klein, R.; Kolb, A. State of the art on 3D reconstruction with RGB-D cameras. Comput. Graph. Forum 2018, 37, 625–652.
16. Liu, G.; Shih, K.; Wang, T.-C.; Tao, A.; Catanzaro, B. Image In-Painting for Irregular Holes Using Partial Convolutions. U.S. Patent 16,360,895, 26 September 2019.
17. Reza, M.N.; Kabir, M.S.; Haque, M.A.; Jin, H.; Kyoung, H.; Choi, Y.K.; Kim, G.; Chung, S.-O. Instance Segmentation and Automated Pig Posture Recognition for Smart Health Management. J. Anim. Sci. Technol. 2024, 1001.
18. Jorquera-Chavez, M.; Fuentes, S.; Dunshea, F.R.; Warner, R.D.; Poblete, T.; Unnithan, R.R.; Morrison, R.S.; Jongman, E.C. Using imagery and computer vision as remote monitoring methods for early detection of respiratory disease in pigs. Comput. Electron. Agric. 2021, 187, 106283.
19. Hou, Z.; Huang, L.; Zhang, Q.; Miao, Y. Body weight estimation of beef cattle with 3D deep learning model: PointNet++. Comput. Electron. Agric. 2023, 213, 108184.
20. Ma, W. PIGRGB-Weight Dataset. Available online: https://github.com/maweihong/PIGRGB-Weight.git (accessed on 20 February 2025).
21. Liu, C.; Li, S.; Chen, H.; Xiu, X.; Peng, C. Semi-supervised joint adaptation transfer network with conditional adversarial learning for rotary machine fault diagnosis. Intell. Robot. 2023, 3, 131–143.
22. Ravi, N.; Gabeur, V.; Hu, Y.-T.; Hu, R.; Ryali, C.; Ma, T.; Khedr, H.; Rädle, R.; Rolland, C.; Gustafson, L. Sam 2: Segment anything in images and videos. arXiv 2024, arXiv:2408.00714.
23. Chen, G.; Wang, W. A survey on 3d gaussian splatting. arXiv 2024, arXiv:2401.03890.
24. Hao, Y.; Liu, Y.; Wu, Z.; Han, L.; Chen, Y.; Chen, G.; Chu, L.; Tang, S.; Yu, Z.; Chen, Z. Edgeflow: Achieving practical interactive segmentation with edge-guided flow. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 1551–1560.
25. Kirillov, A.; Mintun, E.; Ravi, N.; Mao, H.; Rolland, C.; Gustafson, L.; Xiao, T.; Whitehead, S.; Berg, A.C.; Lo, W.-Y. Segment anything. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 2–3 October 2023; pp. 4015–4026.
26. Duan, E.; Hao, H.; Zhao, S.; Wang, H.; Bai, Z. Estimating Body Weight in Captive Rabbits Based on Improved Mask RCNN. Agriculture 2023, 13, 791.
27. Vaishya, A. Mastering OpenCV with Python: Use NumPy, Scikit, TensorFlow, and Matplotlib to Learn Advanced Algorithms for Machine Learning Through a Set of Practical Projects; Orange Education PVT Limited: New Delhi, India, 2023.
28. Zdaniuk, B. Ordinary Least-Squares (OLS) Model; Springer: Dordrecht, The Netherlands, 2024; pp. 4867–4869.
29. Sun, Y.; Ding, S.; Zhang, Z.; Jia, W. An improved grid search algorithm to optimize SVR for prediction. Soft Comput. 2021, 25, 5633–5644.
30. Cun, Y.L. A theoretical framework for back-propagation. Proc. Connect. Models Summer Sch. San Mateo Ca 1988, 1, 21–28.
31. Jusman, Y.; Widyasmoro, W.; Mohamed, Z.; Lubis, J.H. Classification Performance of BFGS Quasi-Newton Backpropagation and Scaled Conjugate Gradient Backpropagation Models for Lung X-Ray Images. In Proceedings of the 2024 International Conference on Artificial Intelligence, Blockchain, Cloud Computing, and Data Analytics (ICoABCD), Bali, Indonesia, 20–21 August 2024; pp. 237–242.
32. Mustafidah, H.; Hartati, S.; Wardoyo, R.; Harjoko, A. Selection of Most Appropriate Backpropagation Training Algorithm in Data Pattern Recognition. Int. J. Comput. Trends Technol. 2014, 14, 4727.
33. Liu, B.; Liu, C.; Xiao, Y.; Liu, L.; Li, W.; Chen, X. AdaBoost-based transfer learning method for positive and unlabelled learning problem. Knowl.-Based Syst. 2022, 241, 108162.
34. Qi, J.; Yang, R.; Wang, P. Application of explainable machine learning based on Catboost in credit scoring. J. Phys. Conf. Ser. 2021, 1955, 012039.
35. Wang, X.; Chen, Q.; Sun, H.; Wang, X.; Yan, H. GMAW welding procedure expert system based on machine learning. Intell. Robot. 2023, 3, 56–75.
36. Wan, B.; Xu, S.; Luo, S.; Wei, L.; Zhang, C.; Zhou, D.; Zhang, H.; Zhang, Y. Prediction of formation fracture pressure based on reinforcement learning and XGBoost. Open Geosci. 2024, 16, 36–39.
37. Adetunji, A.B.; Akande, O.N.; Ajala, F.A.; Oyewo, O.; Akande, Y.F.; Oluwadara, G. House Price Prediction using Random Forest Machine Learning Technique. Procedia Comput. Sci. 2022, 199, 806–813.
38. Ruchay, A.; Gritsenko, S.; Ermolova, E.; Bochkarev, A.; Ermolov, S.; Guo, H.; Pezzuolo, A. A Comparative Study of Machine Learning Methods for Predicting Live Weight of Duroc, Landrace, and Yorkshire Pigs. Animals 2022, 12, 1152.
39. Xie, C.; Cang, Y.; Lou, X.; Xiao, H.; Xu, X.; Li, X.; Zhou, W. A novel approach based on a modified mask R-CNN for the weight prediction of live pigs. Artif. Intell. Agric. 2024, 12, 19–28.

Figure 1. Test procedures.

Figure 2. Manual measurement position of pig body size data.

Figure 3. Acquisition equipment diagram (a) and actual data acquisition equipment (b).

Figure 4. Data types excluded during the cleaning process and their corresponding schematics. (a) Lying or Sitting: This data type refers to pigs that are in a lying or sitting posture. (b) Incomplete Physical Appearance: This data type indicates that part of the pig’s body is missing or incomplete in the image. (c) Brightness Irregularity: This data type refers to abnormal brightness in the image, such as overexposure or underexposure, which causes certain areas of the image to lack clear details, thus affecting image quality and usability.

Figure 5. Overview of Pig Behavioral Data. (a) Stand Upright: This behavior describes the pig standing upright with its body supported by all four limbs, maintaining an upright posture while standing on the ground. (b) Feeding: This state indicates that the pig is feeding, typically observed through its posture or mouth movements, often seen when the pig is foraging or eating from a food trough. (c) Walking: This state describes the pig’s walking behavior where it moves within an activity area, with its limbs alternately touching the ground in a regular, rhythmic pattern. (d) Drinking: This indicates that the pig is drinking water, shown by the pig lowering its head toward a water dispenser or trough, with its mouth in contact with the water source. (e) Standing Abnormally: This behavior describes the pig standing in an abnormal posture, with its body twisted or positioned irregularly. (f) Standing with Head Down: This behavior describes the pig standing while lowering its head, maintaining a standing posture with its head positioned downward. (g) Partially Obscured Multiple: This state refers to multiple pigs in the image whose parts are obscured or incomplete due to the viewing angle or obstruction by other pigs or objects, leading to incomplete visibility of each pig’s features. (h) Fully Visible Multiple: This state indicates that multiple pigs are clearly visible in the image, with each pig fully presented without any obstruction or distortion.

Figure 6. Data partitioning using the five-fold cross-validation method.

Figure 7. Instance segmentation process using SAM2.

Figure 8. Masked image and its individual feature extraction.

Figure 9. Image segmentation results using the SAM2 model.

Figure 10. Model comparison: Predicted vs. true values (sorted based on R²).

Figure 11. Relationship between true and predicted values of the Trainlm model.

Figure 12. Feature importance scores for pig weight prediction using Stacking Regressor.

Figure 13. Heat map of the Pearson correlation coefficients among actual measured body size components and body weight.

Figure 14. Heat map of the Pearson correlation coefficients among image-extracted eigenvalues and body weight.

Table 1. Materials involved in the test.

Type of collection		Specifics
Collection time		Begins in March 2024 and continues until December 2024
Collection place		A pig farm in Longyao, Xingtai City, Hebei Province, China
Collection object	Species	Finishing-stage Yorkshire pigs
	Quantity	73
	Age	150–210 d
	Weight	73–192 kg
Acquisition equipment	Camera	FemtoBolt camera from ORBBEC (Shenzhen, China)
	Equipment	A self-developed, fixed-view lifting acquisition trolley
	Terminal	The experiments were conducted on a Microsoft Surface Book 2 laptop with an Intel Core i7-8650U processor (4 cores, 8 threads) and 8 GB of RAM.
	Operating system	Windows 10
	Power supply	Outdoor Mobile Power Source
	Software	Visual Studio Code 2019 (VS Code 1.97.2)
Acquisition height		1.88 m and 1.78 m
Body weight measurement		Pig weighing scale with holding frame
Body scale measurement		Manual tape measurements

Table 2. Manual measurement of body scale data content.

Content	Definition
Body Length	The straight-line distance from the anterior end of the head (usually the position between the two ears) to the base of the tail.
Shoulder Width	The horizontal distance between the widest points on both sides of the shoulders while the pig is in a standing position.
Withers Height	The vertical distance from the ground to the pig’s shoulders.
Chest Girth	The circumference of the widest part of the chest, typically measured behind the forelimbs at the sternum.
Abdominal Girth	The circumference of the widest part of the abdomen.
Hip Girth	The distance from the anterior edge of the left stifle joint, passing around the anus, to the anterior edge of the right stifle joint.
Cannon Bone Circumference	The circumference measured around the narrowest part of the pig’s forelimb cannon bone.

Table 3. Specifications table of PIGRGB-Weight dataset.

Subject	Fixed-Height RGB Images and Image Segmentation Dataset of Freely Active Pigs’ Backs
Specific Academic Field	Machine learning, pig weight estimation, and computer vision
Data Formats	“.png” RGB images and “.png” mask images
Data Types	RGB image data and mask image data
Data Acquisition	The experimental team used a self-developed and designed fixed-view lifting acquisition trolley, equipped with an RGB camera, maintaining a height of 1.88 m for RGB image collection within the pig farm. The dataset contains 9579 RGB images. Additionally, it includes 3394 manually labeled mask images corresponding to the RGB images, collected at a height of 1.78 m, providing extra support for subsequent research and model testing. The image resolution is 960 × 540, and the compressed dataset size is 9.52 GB.
Data Source Location	Country: China; City: Longyao, Hebei Province.
Data Accessibility	Repository name: PIGRGB-Weight. Direct URL to the data: https://github.com/maweihong/PIGRGB-Weight.git (accessed on 20 February 2025)

Table 4. Performance of each valuation model on the dataset.

Models	MSE/kg²	RMSE/kg	MAE/kg	R²
OLS	41.536	6.423	4.809	0.960
AdaBoost	96.284	9.790	8.002	0.906
CatBoost	38.534	6.179	4.563	0.963
XGBoost	41.063	6.390	4.730	0.960
RF	42.221	6.483	4.705	0.959
SVR	44.780	6.660	5.170	0.956
Trainscg[10]	37.227	6.086	4.549	0.964
Trainlm[20,10]	35.439	5.944	4.421	0.965
Trainbr[20]	38.361	6.136	4.564	0.963
Traincgb[10]	38.128	6.149	4.630	0.963
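
The four columns of Table 4 are standard regression metrics. As a minimal sketch of how they are defined (not the authors' evaluation code), the function below computes MSE, RMSE, MAE, and R² from paired true and predicted weights. The function name `regression_metrics` and the sample weights are hypothetical and chosen only for illustration.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MSE (kg^2), RMSE (kg), MAE (kg) and R^2 for paired weights."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(r * r for r in residuals)          # residual sum of squares
    mse = ss_res / n
    mae = sum(abs(r) for r in residuals) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return {"MSE": mse, "RMSE": math.sqrt(mse), "MAE": mae, "R2": r2}


# Hypothetical weights in kg, made up for illustration (not from the dataset).
true_kg = [100.0, 120.0, 140.0, 160.0]
pred_kg = [102.0, 118.0, 143.0, 157.0]
metrics = regression_metrics(true_kg, pred_kg)
print(metrics)
```

MSE is reported in kg², while RMSE and MAE are in kg and can therefore be compared directly with the 73–192 kg weight range in Table 1.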

Table 5. Performance of each valuation model on the dataset with the Train–Test Split method.

Model	MSE/kg² (Train)	MSE/kg² (Test)	RMSE/kg (Train)	RMSE/kg (Test)	MAE/kg (Train)	MAE/kg (Test)	R² (Train)	R² (Test)
OLS	33.008	478.462	5.745	21.874	4.265	19.076	0.963	0.566
AdaBoost	59.122	119.465	7.689	10.930	6.164	8.531	0.933	0.892
CatBoost	9.757	210.068	3.124	14.494	2.318	11.839	0.989	0.810
XGBoost	13.605	194.191	3.689	13.935	2.664	11.833	0.985	0.824
RF	3.401	176.911	1.844	13.301	1.250	11.197	0.996	0.840
SVR	24.303	269.942	4.930	16.430	3.597	14.620	0.972	0.755
Trainbr	25.305	436.540	5.030	20.893	3.740	17.980	0.971	0.604
Trainlm	25.623	422.305	5.062	20.550	3.778	19.281	0.971	0.617
Trainscg	32.310	320.190	5.684	17.894	4.295	12.734	0.963	0.710
Traincgb	27.817	305.935	5.274	17.491	3.959	15.572	0.969	0.723
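
One way to read Table 5 is through the gap between test and train error. The sketch below copies the train and test RMSE values from the table and ranks the models by that gap. The gap measure and the variable names are illustrative assumptions, not an analysis taken from the paper.

```python
# (train RMSE, test RMSE) in kg, copied from Table 5.
rmse_kg = {
    "OLS": (5.745, 21.874),
    "AdaBoost": (7.689, 10.930),
    "CatBoost": (3.124, 14.494),
    "XGBoost": (3.689, 13.935),
    "RF": (1.844, 13.301),
    "SVR": (4.930, 16.430),
    "Trainbr": (5.030, 20.893),
    "Trainlm": (5.062, 20.550),
    "Trainscg": (5.684, 17.894),
    "Traincgb": (5.274, 17.491),
}

# Generalization gap: how much larger the test RMSE is than the train RMSE.
gaps = {model: round(test - train, 3) for model, (train, test) in rmse_kg.items()}

# Models ordered from smallest to largest gap.
ranked = sorted(gaps, key=gaps.get)

for model in ranked:
    print(f"{model:<9} gap = {gaps[model]:.3f} kg")
```

Under this measure AdaBoost has the smallest gap and OLS the largest, which matches AdaBoost also having the highest test R² in the table.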

Table 6. Performance of each valuation model on the dataset without re-segmentation.

Models	MSE/kg²	RMSE/kg	MAE/kg	R²
OLS	273.440	16.079	14.031	0.728
AdaBoost	189.051	13.505	11.151	0.814
CatBoost	218.177	13.857	11.902	0.780
XGBoost	214.781	13.783	11.738	0.784
RF	198.639	13.372	11.207	0.801
SVR	268.921	15.648	13.573	0.729
Trainscg	189.405	13.213	11.218	0.814
Trainlm	262.418	14.665	12.364	0.732
Trainbr	337.177	16.496	14.363	0.658
Traincgb	384.170	17.839	15.925	0.609

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ji, X.; Li, Q.; Guo, K.; Ma, W.; Li, M.; Xu, Z.; Yang, S.X.; Ren, Z. A Machine Learning-Based Method for Pig Weight Estimation and the PIGRGB-Weight Dataset. Agriculture 2025, 15, 814. https://doi.org/10.3390/agriculture15080814