Next Article in Journal
Advancements in Hyaluronic Acid Effect in Alveolar Ridge Preservation: A Narrative Review
Next Article in Special Issue
Virtual Biopsy for the Prediction of MGMT Promoter Methylation in Gliomas: A Comprehensive Review of Radiomics and Deep Learning Approaches Applied to MRI
Previous Article in Journal
Impact of Likelihood Ratios of Rheumatoid Factor and Anti-Cyclic Citrullinated Peptide Antibody in Clinical Diagnosis of Rheumatoid Arthritis by Two Available Platforms
Previous Article in Special Issue
Diagnostic Models for Differentiating COVID-19-Related Acute Ischemic Stroke Using Machine Learning Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Marker-Less Video Analysis of Infant Movements for Early Identification of Neurodevelopmental Disorders

1
Italian National Research Council, Institute for Biomedical Research and Innovation, Via Leanza, Istituto Marino, 98164 Messina, Italy
2
Research Coordination and Support Service, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Diagnostics 2025, 15(2), 136; https://doi.org/10.3390/diagnostics15020136
Submission received: 3 December 2024 / Revised: 29 December 2024 / Accepted: 30 December 2024 / Published: 8 January 2025

Abstract

:
Background/Objectives: The early identification of neurodevelopmental disorders (NDDs) in infants is crucial for effective intervention and improved long-term outcomes. Recent evidence indicates a correlation between deficits in spontaneous movements in newborns and the likelihood of developing NDDs later in life. This study aims to address this aspect by employing a marker-less Artificial Intelligence (AI) approach for the automatic assessment of infants’ movements from single-camera video recordings. Methods: A total of 74 high-risk infants were selected from the Italian Network for Early Detection of Autism Spectrum Disorders (NIDA) database and closely observed at five different time points, ranging from 10 days to 24 weeks of age. Automatic motion tracking was performed using deep learning to capture infants’ body landmarks and extract a set of kinematic parameters. Results: Our findings revealed significant differences between infants later diagnosed with NDD and typically developing (TD) infants in three lower limb features at 10 days old: ‘Median Velocity’, ‘Area differing from moving average’, and ‘Periodicity’. Using a Support Vector Machine (SVM), we achieved an accuracy rate of approximately 85%, a sensitivity of 64%, and a specificity of 100%. We also observed that the disparities in lower limb movements diminished over time points. Furthermore, the tracking accuracy was assessed through a comparative analysis with a validated semi-automatic algorithm (Movidea), obtaining a Pearson correlation (R) of 93.96% (88.61–96.60%) and a root mean square error (RMSE) of 9.52 pixels (7.29–12.37). Conclusions: This research highlights the potential of AI movement analysis for the early detection of NDDs, providing valuable insights into the motor development of infants at risk.

1. Introduction

Neurodevelopmental disorders (NDDs) are a heterogeneous group of conditions that typically manifest during early childhood and involve impairments in the growth and development of the nervous system. These conditions can affect various aspects of a person’s functioning, including behavior, cognition, communication, and social interaction. They include a wide range of conditions, such as autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), intellectual disability, specific learning disorders, and communication disorders, among others [1]. Timely diagnosis of NDD plays a crucial role in facilitating effective intervention and support for children and their families, having a profound impact on their long-term developmental outcomes [2].
Recent research has unveiled a deepening comprehension of the intricate interplay among developmental systems during infancy, highlighting how advancements in one area can trigger cascading effects, influencing an infant’s progress in other domains. Referred to as the “developmental cascades” perspective, this viewpoint has been the subject of scrutiny, with notable reviews [3,4,5]. The “developmental cascades” perspective holds significant implications for understanding the consequences of subtle early differences and delays in child development. It suggests that even minor disruptions in early behaviors, especially in seemingly unrelated domains, such as motor skills and language development, can set in motion a sequence of direct and indirect consequences [6]. In essence, slight variations or delays in early motor behaviors can set off a series of events that impact the development of language and communication skills and lead to developmental divergences over time. This point of view highlights the intricate and interlinked nature of child development, emphasizing how important early intervention and support are in addressing even minor developmental differences. Furthermore, extensive research indicates that the early phases of motor development can be a pivotal lens for assessing the broader neurodevelopmental trajectory in children. Delays in achieving motor milestones may suggest that the child’s development is not progressing as expected and may be indicative of underlying neurodevelopmental concerns [7]. Therefore, assessing motor skills in the first weeks of life takes on significant importance.
Clinical assessments of motor performance in infants within the scope of neurodevelopment evaluation typically employ standardized tools and observational methods. These assessments help healthcare professionals in evaluating an infant’s motor development and identifying potential delays or abnormalities. Widely used assessment tools include the General Movements (GMs) Assessment [8,9], the Alberta Infant Motor Scale [10], and the Peabody Developmental Motor Scales [11]. Specifically, the assessment of GMs has emerged as a clinical method to evaluate early motor abilities and has been extensively employed in the context of early neurodevelopment assessment of preterm newborns [12,13,14]. Several studies have highlighted the relevance of GMs in predicting developmental outcomes and the potential for early intervention. However, these methods have some limitations, including subjective assessment, time-consuming procedures, susceptibility to environmental factors, and the need for specialized expertise. Additionally, the capacity for continuous monitoring and predictive precision stands as potential challenges within clinical assessments. In response to these challenges, researchers are actively exploring technology-driven solutions to address some of these limitations and provide more objective and continuous insights into infants’ motor development.
Many studies have employed wearable sensors, such as the work conducted by Smith et al. [15], who investigated sample entropy from tri-axial accelerometer and gyroscope data as a metric for assessing the variability of spontaneous leg movements in infants, with the potential to serve as an indicator of neuromotor control development. Another line of research has focused on applying computer vision and machine learning techniques to develop innovative tools for directly assessing movements from video recordings of children. A scoping review of studies exploiting computer vision and machine learning is reported in [16]. Among these, Caruso et al. [17] presented Movidea, a novel semi-automatic software designed to support movement tracking and GMs Assessment in high-risk infants for ASD from video recordings. Another contribution comes from Tsuji et al. [18], who introduced a marker-less movement measurement method designed to evaluate 25 indices linked to GMs in infants with the aim of enabling early disability diagnosis. Video analysis includes background subtraction, frame difference computations, and movement classification and is executed using an artificial neural network characterized by a stochastic structure.
Furthermore, recent studies have explored complementary approaches, focusing on gamified platforms and machine learning to analyze behavioral features in older children for diagnosing neurodevelopmental disorders such as ASD and ADHD. For example, a study by [19] developed a system integrating crowdsourcing and machine learning to extract multimodal features, highlighting the scalability and adaptability of such approaches in clinical and home environments. While these methodologies address older age groups, they underscore the potential of computational tools to enhance neurodevelopmental assessments and inform real-time decision-making.
Notably, the introduction of AI stands out, presenting significant potential for the early and objective evaluation of infants’ motor skills. This approach not only enhances the efficiency of the assessment process but also introduces the prospect of continuous monitoring and in-depth analysis of motor development. A key advantage is that these methods obviate the need for sensors, affording infants the freedom to move naturally during the assessment [20]. Several AI-driven solutions have been proposed and employed for automated movement tracking. In a recent study by Shin et al. [21], automatic movement tracking based on AlphaPose was applied to high-risk preterm infants in the context of cerebral palsy. The research involved kinematic analysis of joint angles and angular velocities using sample entropy and Pearson correlation. The aim was to examine inter-limb synchronization and movement complexity of both upper and lower extremities during spontaneous movements for assessing developmental outcomes. Reich et al. [22], in a different study, used OpenPose to identify age-specific movement patterns known as “fidgety movements (FMs)” in TD infants. The infants’ complete body poses were extracted from videos, resulting in a 25-point skeleton, which served as the input data for a shallow multilayer neural network (SMNN). DeepLabCut was recently employed in another study by Moro et al. [23] to analyze spontaneous movements in preterm infants and detect abnormal motion patterns indicative of neurological disorders. Their paper introduces a computer-aided pipeline for processing 2D video recordings to characterize and classify infants’ motion. The pipeline involves detecting key body points to extract quantitative motion parameters for subsequent classification, distinguishing between normal and abnormal motion patterns. In a longitudinal study, Doi et al. [24] employed machine learning models to explore the connection between spontaneous bodily movements in 4-month-old infants and the development of ASD-like behaviors at 18 months by extracting 26 movement features through interframe difference images from video recordings. Subsequently, in another recent study [25], they applied the same marker-less approach to investigate the association between spontaneous movements in infants at approximately two days of age, particularly during sleep, and their risk of developing ASD at 18 months. The findings revealed that a higher risk of ASD is correlated with specific motor alteration patterns during sleep and awake states, pertaining to the rhythm and movement frequency of both the lower and upper body. Beyond these works, neural networks have been widely employed in various studies for the automated tasks of pose estimation and movement classification [26,27].
In the context of an escalating interest within the scientific community, this study introduces an innovative application of AI to automatically analyze spontaneous movements in newborns identified as high-risk for NDD. Employing video footage acquired at five discrete time points spanning from 10 days to 24 weeks postnatally, our objective was to identify and underscore novel early motor markers indicative of potential developmental delays. The employed methodology is marker-less, eliminating the need for direct operator intervention and thereby emphasizing its efficiency and non-intrusiveness. This research contributes valuable insights to the understanding of early neurodevelopmental indicators and holds implications for non-invasive monitoring in high-risk populations.

2. Materials and Methods

2.1. Participants

For this study, 74 high-risk (HR) infants (11F/9M) were recruited from the Italian Network for Early Detection of Autism Spectrum Disorders, also known as the NIDA network [28]. The NIDA network is the largest Italian cohort of infants at risk for NDDs and includes siblings of children diagnosed with ASD, preterm newborns, and small-for-gestational-age newborns, as well as low-risk (LR) infants, who serve as a comparison group. This network was established to detect early signs of NDD, including ASD, through a multidisciplinary approach that integrates motor and cry pattern analysis with comprehensive clinical assessments. For each participant, five videos capturing the infant in a state of natural and unrestricted movements at five specific time points corresponding to 10 days and 6, 12, 18, and 24 weeks of age were recorded and analyzed. These time points were chosen to capture crucial moments in motor development from birth onward. Furthermore, at 36 months of age, a comprehensive clinical/diagnostic assessment of the infants and toddlers using standardized tools/tests and parental structured interviews was conducted to confirm the presence/absence of NDD. Within this cohort, 5 infants dropped out of the study, 28 received a diagnosis of NDD, and 33 were assessed as children with typical development (TD). Additionally, 8 infants are still pending evaluation since they have not yet reached the appropriate age for a stable diagnosis. Videos of no-label or drop-out infants were used exclusively for validating our deep learning-based automatic tracking approach and not for the subsequent analysis steps aimed at identifying predictive variables for NDD. The dataset (summarized in Table 1) lacks complete sets of five videos for each participant, as not all individuals could be recorded at every designated time point. Furthermore, in addition to the videos reported in the dataset and recorded as described in the following paragraph, we used four additional amateur-recorded videos to test our tracking algorithm under completely unstructured conditions. Further details about the dataset can be found in Supplemental Table S1.
The study was approved by the ethics committee of the National Institute of Health (Istituto Superiore di Sanità, ISS) (Approval Number: Pre 469/2016). All the families that voluntarily participated in the study signed a written informed consent, and all methods were performed in accordance with the relevant guidelines and regulations. Informed consent for publication of identifying information/images in an online open-access publication was also obtained from the parents of the subjects whose images appear in this manuscript.

2.2. Data Collection & Video Editing

Infants were recorded in a home setting, lying on a green blanket supplied by the NIDA network. The camera was positioned at a distance of 50 cm above the infant’s chest. Each recording lasted for a minimum of 5 min, aiming to capture spontaneous movements of the infant’s entire body. Recorded videos were reviewed, revealing that in most cases, segments of high quality (i.e., without external disturbances such as operator intervention to soothe the infant) did not exceed 3 min in duration. Consequently, the decision to retain a 3 min video segment representing the highest-quality portion for each recording was made. To ensure consistency in the dataset, one author manually edited the videos to meet the following specific criteria: a duration of three minutes and the infant in a supine position, displaying a state of well-being and spontaneous motor activity, and without any episodes of crying. In cases where the videos contained more than 3 min of high-quality footage, we opted to analyze the initial 3 min of high-quality content. Any video frames that exhibited operator or parental interferences, as well as accidental camera movements, were excluded from the subsequent analysis. All the analysis presented in the following sections was performed using Python 3 [29] and Matlab R2022b [30].

2.3. Tracking Procedure

The trajectories of infants’ body landmarks during their spontaneous movements were automatically extracted from all the collected videos using a customized application of the Mediapipe Pose Landmark Detection solution [31]. MediaPipe is an open-source framework developed by Google for creating AI and machine learning pipelines, providing a suite of libraries and tools for multimedia processing across multiple platforms. The MediaPipe Pose Landmarker, a specific component of this framework, facilitates the detection of human body landmarks within images or videos, allowing for the identification of key body points, posture analysis, and movement categorization. The architecture, based on BlazePose [32], a specifically modified version of the MobileNet [33] convolutional neural network (CNN), outputs 33 body pose landmarks, expressed in both image coordinates (pixel values) and 3-dimensional world coordinates. These landmarks correspond to significant body components, including key joints and extremities, specifically hands and feet. The comprehensive list of landmarks is provided in Figure 1. For each reference point, we stored an Nx2 matrix containing the x and y coordinates in the image for each of the N frames in the respective video.
We selected the MediaPipe Pose Landmark Detection as the body pose estimation solution for our study due to several advantages. Notably, it has undergone extensive training on large datasets and fine-tuning to ensure precise landmark detection. Additionally, it offers real-time pose estimation with minimal latency. Furthermore, the CNN architecture is easily customizable to meet specific project requirements.
This AI approach in our study has demonstrated remarkable robustness also in conditions such as low resolution (Figure 2a), inadequate lighting (Figure 2b), inadvertent operator intrusion into the frame (Figure 2c), infants wearing socks or clothing that covered their limbs (Figure 2d,e), or variations in skin tones (Figure 2f). In all these scenarios, motion tracking was performed accurately, as further confirmed by the model validation described in the next sections.

2.4. Signal Processing

Using the outlined methodology, we identified the x and y coordinates of 33 body landmarks for each frame of each video in our dataset. The AI model ensures consistency across diverse frame sizes by normalizing coordinate values within the range of 0 to 1. The output for each processed video was a file containing the x and y coordinates of each landmark across every frame in which the model successfully performed body pose estimation. For subsequent analyses aimed at identifying early motor predictors of NDDs, we specifically focused on infants’ limbs. The centroid coordinates of each end-effector were computed for each frame, mediating the x and y coordinates of the wrist, pinky, and index for the hands and of the ankle, heel, and foot index for the feet (Figure 3). Each signal was then preprocessed by applying a zero-phase moving average filter with a selected window size of 5 samples in order to smooth noisy fluctuations.

2.5. Model Validation: Comparison with Movidea

To evaluate the performance of our automatic tracking approach, we conducted a comparative analysis between the trajectories derived from our procedure, described in the previous sections, and those obtained using Movidea [35], a validated software package purposefully designed for the semi-automatic extraction of kinematic features in assessing infants’ motor skills. Movidea is software that facilitates the tracking of infants’ end-effectors during the free movement of each limb from single-camera video footage. However, it requires the presence of an operator for the selection of the infant’s reference measures, specifically the head length and the symmetry line of the body. The operator is also responsible for selecting the central point of each limb to initialize the tracking algorithm, and he must oversee the entire process, intervening to reset the points in the event of errors and skip frames in cases where a limb is not visible.
During the validation procedure, in addition to the videos listed in Table 1 acquired through the method described in Section 2.2, we also tested four additional amateur videos. These videos were recorded under conditions different from the structured procedure established for the study. To assess the similarity between our tracking results and those from the operator-dependent software Movidea, we computed the Pearson’s correlation coefficient (R), the p-value testing the hypothesis of no relationship between the two signals, and the root mean square error (RMSE). For each signal, similarity measures were computed within 10 s windows and then averaged across all these windows. Subsequently, we checked the normality of the data distribution for each measure using the Shapiro–Wilk statistical test. Upon identifying non-normal distributions (p < 0.001), we expressed the aggregated results across all the signals using the median value (Q1–Q3).

2.6. Feature Extraction

To characterize the spontaneous movements of infants in terms of synchronicity, smoothness, repetitive movements, and symmetry, we used the trajectories of each end-effector’s centroid extracted from videos of NDD and TD children at each time point to compute the following features and kinematic parameters:
  • Velocity and Acceleration:
The velocity of each limb was calculated as the Euclidean distance between the location of the corresponding reference point in every two subsequent frames. The obtained signal was then multiplied by the frame rate to normalize any difference due to videos captured using different devices. To reduce the fast oscillations in the velocity profiles, a third-order low-pass Butterworth filter with a cutoff frequency equal to 95% of the Nyquist frequency was then applied. Analogously, the acceleration of each limb was calculated as the difference between two consecutive velocity samples [35]. Descriptive statistics, including mean, median, standard deviation, variance, mode, skewness, kurtosis, maximum, and minimum, were then computed for both the velocity and acceleration of each limb. Furthermore, the features obtained for the right and left hands, as well as for the right and left feet, were averaged to aggregate the kinematics of the upper body and lower body, respectively.
  • Cross-correlation (CC):
The calculation of the zero-lag cross-correlation between the velocity of each pair of limbs was performed by applying the following equation [36]:
C C v 1 v 2 =   σ v 1 v 2 σ v 1 2 · σ v 2 2
where C C v 1 v 2 is the cross-correlation between the velocity v 1   and the velocity v 2 , σ v 1 v 2 is the covariance of v 1 and v 2 , σ v 1   2 is the variance of v 1 , and σ v 2 2 is the variance of v 2 .
Cross-correlation (CC) serves as a metric to evaluate the synchronization of limb movements, fundamental information for describing the distinct motor patterns exhibited by infants. Consequently, it holds significant importance in the early assessment of NDD [37].
  • Area differing from moving average (Ama):
The smoothness is another crucial parameter for the assessment of infants’ proper development, as typical GMs are characterized by continuous, flowing patterns without jerky or abrupt transitions. To quantify this aspect, we computed the moving average for both the x and y components of each limb’s trajectory across the entire recording using a 2 s window size [37], as described by the following equation:
x i = 1 k j = i k 2 i + k 2 x j
where x i is the moving average computed at the i-th frame, k is the window’s size, and x j is the point position in the j-th frame.
Successively, for every sample in the trajectory, we calculated the deviation from the moving average by subtracting the trajectory value from the corresponding moving average value, as shown in the following equation:
A m a x = i = k 2 l k 2 x i x _ i
where A m a x is the area differing from the moving average of the x component and l is the total number of frames of the recording.
Finally, the total area differing from moving average (Ama) was computed for both the lower and upper limbs. This was accomplished by summing the areas that deviated from the moving average for the two components of the feet and the two components of the hands, respectively.
  • Periodicity:
Periodicity, as defined by Meinecke L. et al. [37], is a parameter designed to assess the presence of repetitive movements in limb motion. To measure the periodicity of infants’ spontaneous movements, we computed a high-order moving average for both the x and y components of each limb’s trajectory across the entire recording, and we segmented the recordings using a 1000-sample window size. This window length was chosen to capture the overall movement trend without closely tracking the trajectory, allowing us to detect fast movements with high amplitude [37]. Within each window, we computed the mean trajectory for each limb’s movement component, and we then identified the points where the trajectory intersected with the mean. Subsequently, we calculated the mean distance ( d ) and standard deviation ( σ d ) between consecutive intersections. Finally, the periodicity (P) was determined by combining the aforementioned parameters as follows:
P = 1 d + σ d
  • Maximum displacement along the x and y axes:
This was calculated for each limb by subtracting the minimum coordinate from the maximum coordinate occupied by the reference point during the recording. This measurement was employed to quantify motion amplitude, another important characteristic given that typical GMs exhibit high range and extent.
  • The smallest and largest eigenvalues of the 95% error ellipse between the x and y components of the body centroid trajectory:
A covariance error ellipse in two dimensions is a graphical representation of the spread or dispersion of a bivariate distribution. It is centered on the mean value of the two variables, and its shape and orientation depend on the covariance between them. The major and minor axes of the ellipse correspond to the directions of greatest variation in the data (eigenvectors), as determined by the covariance matrix. The orientation of the ellipse indicates the correlation between the variables. A longer major axis implies greater variability in that direction and vice versa.
The eigenvalues of the covariance matrix represent the variance of the data along the eigenvectors. In the case of correlated data, the eigenvectors indicate the direction of the largest spread, while the eigenvalues indicate the magnitude of that spread.
Therefore, covariance error ellipses are helpful in understanding the distribution of data points and, within our specific application, can be used to quantify the magnitude of motion [38].
  • Percentage of covered space:
For each limb, we computed the ratio between the sum of all the different positions occupied by that limb and the total number of pixels in the frame to quantify the area of movement.
  • Mean Pearson Correlation Coefficient [39]:
This was computed between the x and y components of the left and right hands and between the left and right feet trajectories to also measure the symmetry between the two sides.
  • The difference between the mean velocity of upper body and the mean velocity of lower body:
This has been calculated with the aim of comparing the amount of movement between the upper and lower parts of the body according to the following formula:
v d i f f = v l e f t h a n d + v r i g h t h a n d v l e f t f o o t + v r i g h t f o o t
where v d i f f   is the difference between the mean velocity of upper body and the lower body, v l e f t h a n d   is the mean velocity of left hand, v r i g h t h a n d   is the mean velocity of right hand, v l e f t f o o t   is the mean velocity of left foot, and v r i g h t f o o t   is the mean velocity of right foot.
Features were normalized by the infant’s bust size to remove differences due to children’s size. Infants’ bust measure was computed by calculating the Euclidean distance between the midpoint of the shoulders and the midpoint of the hips for each frame and subsequently extracting the median of these values.

2.7. Feature Selection

Applying the feature extraction procedure detailed in the preceding section, we derived a comprehensive set of parameters that characterize the spontaneous movements of each infant in our dataset across both the NDD and TD groups and throughout all time points. These features comprised 45 parameters, encompassing a range of kinematic variables tailored to assess the gross motor skills of infants. Specifically, they included 18 statistical descriptors related to velocity and acceleration for both lower and upper limbs, 6 cross-correlations among limb pairs, 2 parameters capturing the periodicity of upper and lower body movements, and 2 metrics representing deviations from the moving average for the upper and lower limbs. Additionally, our extraction process encompassed 8 features associated with the maximum displacement of each limb along the x and y axes, 2 eigenvalues of the centroid, 4 indicators measuring the percentage of space occupied by each limb, 2 Pearson correlation coefficients for upper and lower body movements, and 1 feature expressing the difference in velocity between hands and feet. All these variables were examined to identify potential early predictors of clinical outcomes, specifically distinguishing between children diagnosed with NDD and those with typical development. The following procedure was independently executed for each time point after applying min-max scaling normalization, which accounted for differences in measurement units and value ranges of features. As the initial step in reducing the high number of variables, we conducted a correlation analysis using the Pearson method. When variable pairs showed a correlation exceeding 80%, we systematically removed the one displaying lower variability within our sample. Subsequently, we utilized the ANOVA F-value between the labels (NDD versus TD) and each of the remaining features to rank them and identify the optimal subset of variables for each time point [40,41]. In the context of feature selection, the one-way ANOVA assesses the significance of a feature in explaining the variance in the data and distinguishing between the NDD and TD groups. Specifically, the F-value used in this test quantifies the ratio of the variance between groups to the variance within groups. A high F-value indicates that the feature is more relevant for distinguishing between groups, making it a strong candidate for feature selection [42]. The F-value is calculated as the ratio of the mean squares between groups (MSB) to the mean squares within groups (MSW):
F = M S B M S W
The variance between the group means (MSB) is calculated by dividing for the degrees of freedom for between groups (dfB), equal to k − 1, where k is the number of groups, the Sum of Squares Between Groups ( S S B ):
S S B = n i ( μ i μ ¯ ) 2  
where n i is the number of observations in the i-th group, μ i is the mean of the i-th group, and μ ¯ is the overall mean.
The variance within each group (MSW) is instead measured by dividing for the Degrees of freedom for within groups (dfW), equal to N − k, where N is the total number of observations, the Sum of Squares Within Groups ( S S W ):
S S W i = ( x i μ i ) 2
Here, S S W i is the sum of squared differences between individual data points x i and their group mean μ i for each group i. The total S S W is the sum of these S S W i values across all groups.
To select the optimal number of features for the subset at each time point, we employed a recursive algorithm. Starting with the subset comprising the two features that obtained the highest-ranking scores, the algorithm computed classification performance at each iteration while sequentially adding features in descending order of their ranking. Ultimately, we chose the optimal subset that achieved the best performance with the fewest features. The selected variables at each time point were then compared between the two groups using the non-parametric unpaired two-sample Wilcoxon statistical test.

2.8. Mixed-Effects Model for Selected Features

Given the longitudinal nature of our dataset, with repeated measurements for each subject at different time points, we decided to conduct an additional analysis focused only on the features selected as described in the preceding paragraph to address the potential correlation among repeated measures within the same individual. We used a mixed-effects model, which effectively handles this complex data structure, including missing data [43]. This approach enabled us to thoroughly examine our variables by incorporating fixed effects for Group (TD vs. NDD), Time Point, and the Group–Time Point interaction. Random effects were also included to account for within-subject variability (1|Subject). We used Restricted Maximum Likelihood (REML) as the estimation method. The model applied to analyze each feature was defined as follows:
Selected Feature ~ 1 + Time Point + Group + Time Point: Group + (1|Subject)
When significant results were found, we performed post hoc analyses to interpret them.

2.9. Classification Model

The selected and normalized features were employed as input to train a C-Support Vector Classifier (C-SVC) [44] for the early identification of NDD infants at a specific time point. C-SVC is a specific type of Support Vector Machine (SVM) designed to find an optimal decision boundary or hyperplane for effectively separating different classes of data in a dataset, ensuring the maximum margin between the classes while still allowing for some classification errors based on the chosen value of C. C represents a regularization parameter that controls the trade-off between maximizing the margin (the distance between the decision boundary and the nearest data points of each class) and minimizing classification errors. A smaller value of C will result in a larger margin but may allow some misclassification of data points, while a larger value of C will lead to a narrower margin but fewer misclassifications. This algorithm is particularly useful when dealing with non-linearly separable data, as it can map the data into a higher-dimensional space to find a linearly separable hyperplane; moreover, several studies have demonstrated the effectiveness of this classification model, even with small-scale datasets. In our work, we set C = 1, and we chose a radial basis function kernel since data are not linearly separable. Leave-one-out Cross-Validation (LOOCV) [45] was employed to train and test the SVM classifier, considering the limited size of the dataset [44]. The performance was assessed using the following metrics [46]:
  • Accuracy:
This metric evaluates the ratio of correct predictions to the total number of instances.
  • Precision:
Precision represents the ratio of true positive instances to the total instances classified as positive. It assesses the classifier’s capability to refrain from mislabeling a negative sample as positive. A high precision value indicates a low rate of false positives, suggesting that when the model predicts a positive result, it is likely correct.
  • Recall or Sensitivity:
This metric measures the proportion of positive instances that are accurately classified. Recall reflects the classifier’s ability to identify all positive samples.
  • Specificity:
Specificity gauges the proportion of true negatives correctly identified by a classification model out of the total number of actual negatives. Essentially, it quantifies the model’s ability to avoid false positives.
  • F1 Score:
The F1 score is the harmonic mean of precision and recall. It provides a balanced assessment where the relative contributions of precision and recall are equal.

3. Results

3.1. Model Validation with Movidea

The Pearson correlation coefficient between each signal extracted using our automatic tracking approach and its corresponding value obtained from Movidea is 93.96 (88.61–96.60)%. The RMSE was 9.52 (7.29–12.37) pixels. The p-value < 0.0001 evidenced a significative relationship between the two signals. An example comparing a trajectory extracted using our deep learning-based automatic approach with the same trajectory obtained from Movidea is provided in Figure 4.
Testing the four amateur videos, we achieved an R of 95.75% and an RMSE of 8.31 px, even when the background was not green (Figure 5a). Our approach also demonstrated robust tracking capabilities, achieving an R of 91.44% and an RMSE of 10.62 px, even when the child was not in a supine position (Figure 5b). However, for accurate tracking, it was essential that the entire body of the infant remained within the camera’s view, as also evidenced by the evaluation of missing data. The algorithm’s performance when the infant moved outside the frame was R = 42.56% and RMSE = 19.7 px (Figure 5c). Additionally, if the infants were dressed, their clothing needed to have a distinct color from the bedsheet or surface they were lying on. Tests that did not adhere to these constraints resulted in poorer performance, with an R of 64.58% and an RMSE of 11.69 px (Figure 5d).

3.2. Feature Selected and Mixed-Effects Model

The most important features identified through the feature selection procedure from the total set of 45 include the ‘Median Velocity of the Feet’, ‘Area differing from moving average’, and ‘Periodicity’, all within the lower-body domain.
A non-parametric unpaired two-sample Wilcoxon statistical test between TD vs. NDD showed that the median values of these three variables were significantly lower in NDD infants than in TD at 10 days. No significant effects were observed for the remaining time points (Figure 6, Figure 7 and Figure 8). Descriptive statistics for these three variables are reported in Table 2.
The mixed-effects model applied across the multiple time points (repeated measures) revealed a significant impact of both time and group (p < 0.05) for all three variables. However, the interaction between group and time did not reach statistical significance. Post hoc analyses confirmed that these three variables are significantly different between the two groups only at the initial time point (10 days). The results of post hoc tests are reported in Table 3. To verify that the assumptions were met, we examined the residual plots to confirm the absence of deviations from homoscedasticity or normality.

3.3. Classification

The combination of the three selected features was used as input for the SVM classifier to distinguish between NDD high-risk infants and TD infants, achieving an accuracy of 84.6%. Details of the metrics are reported in Table 4, and the confusion matrix is shown in Figure 9.
Figure 10 provides a detailed graphical summary of the developed model, illustrating its workflow and highlighting its intended applications.

4. Discussion

In this study, we employed AI to automatically track infants’ movements and analyze their trajectories to characterize their kinematic behaviors and general movements (GMs). We also investigated how motor skills during the early months of development can predict the onset of NDD in later stages of growth. Our objective was to identify early biomarkers that could differentiate between children who later develop NDD and those who do not, thereby facilitating timely therapeutic intervention for improved outcomes.
To the best of our knowledge, this is one of the first studies to assess motor performance in such young infants using AI techniques to identify early biomarkers of NDD. The method presented in this paper is entirely automatic, requiring no operator intervention. It enables the evaluation of children’s movements, even when they are clothed, and can be applied to videos that may not be perfectly captured. This makes it particularly well suited for integration into applications designed for remote monitoring of children from the comfort of their homes directly by their parents.
Such accessibility could broaden early motor skill assessments, especially in underserved or rural areas where healthcare facilities may be less accessible, thereby reducing disparities in early detection and intervention for NDD. The automatic tracking method eliminates the need for physical markers on the child, which can restrict their natural movements. This non-intrusive approach is essential for studying infants, as it allows for a more ecological observation of their spontaneous motor behaviors without the risk of discomfort or distraction. As a result, this non-contact tracking method preserves the integrity of observed movements while providing a more natural and comfortable environment for the infant’s motor exploration and development.
At the initial time point, precisely 10 days, our analysis revealed a robust correlation between three parameters—‘Median Velocity of Feet’, ‘Area differing from moving average’ of the lower body, and ‘Periodicity’ of the lower body—with the clinical outcome. Using these features, we achieved an accuracy of approximately 85% in discriminating children with NDD versus typically developing (TD) children. Notably, all parameters that showed significant correlations with clinical outcomes pertain to the feet. These findings underscore the diagnostic relevance of focusing on lower limb movements and highlight the need for further research in this direction.
Our study expands on previous work, such as that by Ref. [23], which identified the acceleration of the right foot as the most crucial parameter for the early identification of abnormal motor patterns. We demonstrate that lower limb parameters are critical as early as 10 days of life, a temporal window not thoroughly explored in prior studies. Additionally, our longitudinal analysis revealed that the significant differences observed between the two groups at 10 days gradually diminished over time. By 6 weeks, these differences were less pronounced, and by 12 weeks, they were negligible. This trend suggests that infants who exhibited delays in spontaneous lower limb movements at birth may gradually compensate as weeks progress but eventually manifest NDD. Further investigations are needed to understand the progression and potential mechanisms underlying these compensations. As in our previous investigations, the time window from birth to 6 weeks is particularly suitable for exploring motor performance and predicting NDD risk [17].
The predominance of lower-limb movement differences at such an early stage can be explained by the developmental trajectory of motor skills. Spontaneous lower-limb movements are among the earliest motor patterns to stabilize in neonates, while upper-limb movements related to environmental exploration emerge later as developmental milestones. Studies by Refs. [47,48] have shown that meaningful differences in hand movements predictive of NDD onset typically appear around six months of age.
This finding aligns with the “neurodevelopmental cascade” theory [5,49]. The motor delay identified at 10 days, assessed through the characterization of simple GMs, may have implications for the development of other motor or social domains in later stages. These aspects, however, were not captured by the indices employed in this study, and additional analyses are necessary to explore the potential implications of early motor delay on broader developmental trajectories.
The deep learning-based tracking methodology used in our study exhibited excellent performance when compared to operator-dependent, validated tracking software. Moreover, it has proven to be highly robust in extracting trajectories, even from amateur videos recorded under non-optimal conditions. This robustness underscores the utility of such approaches in real-world settings, enabling parents and caregivers to participate in the assessment process without requiring extensive training or specialized equipment.
The ability to automatically track and analyze infants’ movements from naturalistic video clips represents a significant advancement. This approach paves the way for the development of systems that support families in conducting preliminary assessments of their children’s motor skills and development at home without the need for healthcare facility visits. As widely documented in the literature, early motor skills have a direct correlation with the subsequent development of communication and social abilities, highlighting the potential value of such systems for the widespread early screening of infants for NDDs.
While prior studies have primarily focused on controlled clinical environments, our validation in both structured and unstructured settings demonstrates the broader applicability of this methodology, making it a practical option for remote monitoring. Such systems could also enable large-scale data collection. This would foster new research opportunities and collaborations across clinical and academic institutions, ultimately enhancing the understanding of neurodevelopmental disorders.
A limitation of this study is the relatively small dataset. However, we are actively in the process of collecting additional data and awaiting confirmation of children’s ages at diagnosis, which will allow us to expand the dataset. This limitation arises from challenges such as the rarity of the high-risk population, the need for longitudinal follow-up to confirm diagnoses, and logistical difficulties in accessing participants, as well as the ethical approvals required. Increasing the dataset will improve the generalization capacity of our models and provide a more comprehensive characterization of infants’ motor performance, including additional features such as fluidity, sudden movements, tremors, and rotations—clinically relevant indicators of NDDs. These features are derived from both existing literature and clinical parameters used in neonatal assessments, ensuring a robust approach to feature extraction.
We are also conducting further analyses at time points where significant variables have not yet been identified. Additionally, leveraging the longitudinal nature of our data, we are performing trend analysis of various parameters across the five time points. This study aims to elucidate how motor skills and characteristics evolve during the early weeks of life, facilitating the identification of patterns that differentiate children with NDD from TD children. In line with the neurodevelopmental cascade theory, we hypothesize that early delays in spontaneous movements may progress to impact more complex motor domains and other developmental areas over time. This longitudinal perspective is critical for understanding how early motor performance relates to later developmental outcomes and for paving the way for more targeted early interventions.

5. Conclusions

In summary, our paper introduces an AI-based approach for the automatic tracking of infants’ movements using recorded videos in the initial weeks of life across five different time points, spanning from 10 days to 24 weeks. Through the analysis of parameters derived from the trajectories to characterize infants’ movements, we identified a developmental delay in motor skills at the first time point (10 days after birth) for infants later diagnosed with NDD compared to those with typical development. These delays were notably associated with lower body movements, revealing significant distinctions between the two groups. The novel emphasis on lower limb features, identified as key predictors of NDD at such an early stage, underscores the unique contributions of our study. The observation that such differences emerge shortly after birth, diminish during development, and later manifest as delays in other domains underscores the importance of attentiveness to even subtle anomalies in children’s behavioral patterns from birth. This highlights the critical need for systems capable of promptly and effectively supporting both healthcare providers and families in identifying such anomalies. By validating performance in both structured and unstructured settings, our approach demonstrates its versatility, making it suitable for diverse contexts, including home environments. The proposed system stands out for its transformative potential, as it can be implemented using widely accessible tools like smartphones. Offering a low-cost, scalable solution for early assessment, it provides a practical alternative for early detection and intervention. The fully automatic approach described in this paper, requiring only a simple video recording device such as a smartphone, aligns perfectly with these needs.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics15020136/s1, Table S1: Comprehensive dataset overview providing details on the videos available for each participant, along with their corresponding labels.

Author Contributions

Conceptualization: G.T., R.B. and M.L.S.; Methodology: G.T., R.B., M.L.S. and A.C.; Formal analysis: R.B., G.T., A.C. and S.C.; Investigation: R.B., A.C., M.M., S.C., G.T., G.P. and M.L.S.; Writing—original draft preparation: R.B., G.T., A.C., M.M. and S.C.; Writing—review and editing: M.L.S., G.P. and G.T.; Supervision: M.L.S., G.P. and G.T.; Funding acquisition: M.L.S. and G.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the READS project (MIMIT, Prog. n. F/180026/01-04/X43) and the MSCA-ITN-2018—European Training Networks under Grant agreement ID: 814302 (SAPIENS) “Shaping the social brain through early interactions”.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the ethics committee of the National Institute of Health (Istituto Superiore di Sanità, ISS) (Approval Number: Pre 469/2016, 21 June 2016).

Informed Consent Statement

Informed consent was obtained from all the families involved in the study. Written informed consent has also been obtained from the families to publish this paper.

Data Availability Statement

The dataset presented in this study is available on request from the corresponding author due to ethical restrictions.

Acknowledgments

Roberta Bruschetta is a PhD student enrolled in the National PhD in Artificial Intelligence, XXXVII cycle, course on Health and life sciences, organized by Università Campus Bio-Medico di Roma; Simona Campisi is a PhD student enrolled in the National PhD in Artificial Intelligence, XXXVIII cycle, course on Health and life sciences, organized by Università Campus Bio-Medico di Roma.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders; DSM-5-TR; American Psychiatric Association Publishing: Washington, DC, USA, 2022; ISBN 978-0-89042-575-6. [Google Scholar]
  2. Zwaigenbaum, L.; Bauman, M.L.; Stone, W.L.; Yirmiya, N.; Estes, A.; Hansen, R.L.; McPartland, J.C.; Natowicz, M.R.; Choueiri, R.; Fein, D.; et al. Early Identification of Autism Spectrum Disorder: Recommendations for Practice and Research. Pediatrics 2015, 136 (Suppl. S1), S10–S40. [Google Scholar] [CrossRef] [PubMed]
  3. Adolph, K.E.; Tamis-LeMonda, C.S. The Costs and Benefits of Development: The Transition From Crawling to Walking. Child Dev. Perspect. 2014, 8, 187–192. [Google Scholar] [CrossRef] [PubMed]
  4. Bradshaw, J.; Schwichtenberg, A.J.; Iverson, J.M. Capturing the Complexity of Autism: Applying a Developmental Cascades Framework. Child Dev. Perspect. 2022, 16, 18–26. [Google Scholar] [CrossRef]
  5. Iverson, J.M.; West, K.L.; Schneider, J.L.; Plate, S.N.; Northrup, J.B.; Roemer Britsch, E. Early Development in Autism: How Developmental Cascades Help Us Understand the Emergence of Developmental Differences. Adv. Child Dev. Behav. 2023, 64, 109–134. [Google Scholar] [CrossRef]
  6. Thelen, E. The Central Role of Action in Typical and Atypical Development. In Movement and Action in Learning and Development; Academic Press: Cambridge, MA, USA, 2004; pp. 49–73. ISBN 978-0-12-671860-7. [Google Scholar]
  7. Campbell, S.K.; Hedeker, D. Validity of the Test of Infant Motor Performance for Discriminating among Infants with Varying Risk for Poor Motor Outcome. J. Pediatr. 2001, 139, 546–551. [Google Scholar] [CrossRef] [PubMed]
  8. Einspieler, C.; Prechtl, H.F.; Ferrari, F.; Cioni, G.; Bos, A.F. The Qualitative Assessment of General Movements in Preterm, Term and Young Infants—Review of the Methodology. Early Hum. Dev. 1997, 50, 47–60. [Google Scholar] [CrossRef] [PubMed]
  9. Prechtl, H.F. Qualitative Changes of Spontaneous Movements in Fetus and Preterm Infant Are a Marker of Neurological Dysfunction. Early Hum. Dev. 1990, 23, 151–158. [Google Scholar] [CrossRef] [PubMed]
  10. Eliks, M.; Gajewska, E. The Alberta Infant Motor Scale: A Tool for the Assessment of Motor Aspects of Neurodevelopment in Infancy and Early Childhood. Front. Neurol. 2022, 13, 927502. [Google Scholar] [CrossRef] [PubMed]
  11. Peabody Developmental Motor Scales-Second Edition | RehabMeasures Database. Available online: https://www.sralab.org/rehabilitation-measures/peabody-developmental-motor-scales-second-edition (accessed on 24 November 2023).
  12. Pires, C.d.S.; Marba, S.T.M.; Caldas, J.P.d.S.; Stopiglia, M.d.C.S. Predictive Value of The General Movements Assessment in Preterm Infants: A Meta-Analysis. Rev. Paul. Pediatr. 2020, 38, e2018286. [Google Scholar] [CrossRef]
  13. Einspieler, C.; Sigafoos, J.; Bölte, S.; Bratl-Pokorny, K.D.; Landa, R.; Marschik, P.B. Highlighting the First 5 Months of Life: General Movements in Infants Later Diagnosed with Autism Spectrum Disorder or Rett Syndrome. Res. Autism Spectr. Disord. 2014, 8, 286–291. [Google Scholar] [CrossRef]
  14. Einspieler, C.; Bos, A.F.; Libertus, M.E.; Marschik, P.B. The General Movement Assessment Helps Us to Identify Preterm Infants at Risk for Cognitive Dysfunction. Front. Psychol. 2016, 7, 406. [Google Scholar] [CrossRef] [PubMed]
  15. Smith, B.A.; Vanderbilt, D.L.; Applequist, B.; Kyvelidou, A. Sample Entropy Identifies Differences in Spontaneous Leg Movement Behavior between Infants with Typical Development and Infants at Risk of Developmental Delay. Technologies 2017, 5, 55. [Google Scholar] [CrossRef] [PubMed]
  16. Silva, N.; Zhang, D.; Kulvicius, T.; Gail, A.; Barreiros, C.; Lindstaedt, S.; Kraft, M.; Bölte, S.; Poustka, L.; Nielsen-Saines, K.; et al. The Future of General Movement Assessment: The Role of Computer Vision and Machine Learning—A Scoping Review. Res. Dev. Disabil. 2021, 110, 103854. [Google Scholar] [CrossRef] [PubMed]
  17. Caruso, A.; Gila, L.; Fulceri, F.; Salvitti, T.; Micai, M.; Baccinelli, W.; Bulgheroni, M.; Scattoni, M.L. Early Motor Development Predicts Clinical Outcomes of Siblings at High-Risk for Autism: Insight from an Innovative Motion-Tracking Technology. Brain Sci. 2020, 10, 379. [Google Scholar] [CrossRef] [PubMed]
  18. Tsuji, T.; Nakashima, S.; Hayashi, H.; Soh, Z.; Furui, A.; Shibanoki, T.; Shima, K.; Shimatani, K. Markerless Measurement and Evaluation of General Movements in Infants. Sci. Rep. 2020, 10, 1422. [Google Scholar] [CrossRef] [PubMed]
  19. Jaiswal, A.; Kruiper, R.; Rasool, A.; Nandkeolyar, A.; Wall, D.P.; Washington, P. Digitally Diagnosing Multiple Developmental Delays Using Crowdsourcing Fused With Machine Learning: Protocol for a Human-in-the-Loop Machine Learning Study. JMIR Res. Protoc. 2024, 13, e52205. [Google Scholar] [CrossRef]
  20. Zago, M.; Kleiner, A.F.R.; Federolf, P.A. Editorial: Machine Learning Approaches to Human Movement Analysis. Front. Bioeng. Biotechnol. 2021, 8, 638793. [Google Scholar] [CrossRef] [PubMed]
  21. Shin, H.I.; Shin, H.-I.; Bang, M.S.; Kim, D.-K.; Shin, S.H.; Kim, E.-K.; Kim, Y.-J.; Lee, E.S.; Park, S.G.; Ji, H.M.; et al. Deep Learning-Based Quantitative Analyses of Spontaneous Movements and Their Association with Early Neurological Development in Preterm Infants. Sci. Rep. 2022, 12, 3138. [Google Scholar] [CrossRef]
  22. Reich, S.; Zhang, D.; Kulvicius, T.; Bölte, S.; Nielsen-Saines, K.; Pokorny, F.B.; Peharz, R.; Poustka, L.; Wörgötter, F.; Einspieler, C.; et al. Novel AI Driven Approach to Classify Infant Motor Functions. Sci. Rep. 2021, 11, 9888. [Google Scholar] [CrossRef]
  23. Moro, M.; Pastore, V.P.; Tacchino, C.; Durand, P.; Blanchi, I.; Moretti, P.; Odone, F.; Casadio, M. A Markerless Pipeline to Analyze Spontaneous Movements of Preterm Infants. Comput. Methods Programs Biomed. 2022, 226, 107119. [Google Scholar] [CrossRef]
  24. Doi, H.; Iijima, N.; Furui, A.; Soh, Z.; Yonei, R.; Shinohara, K.; Iriguchi, M.; Shimatani, K.; Tsuji, T. Prediction of Autistic Tendencies at 18 Months of Age via Markerless Video Analysis of Spontaneous Body Movements in 4-Month-Old Infants. Sci. Rep. 2022, 12, 18045. [Google Scholar] [CrossRef] [PubMed]
  25. Doi, H.; Furui, A.; Ueda, R.; Shimatani, K.; Yamamoto, M.; Sakurai, K.; Mori, C.; Tsuji, T. Spatiotemporal Patterns of Spontaneous Movement in Neonates Are Significantly Linked to Risk of Autism Spectrum Disorders at 18 Months Old. Sci. Rep. 2023, 13, 13869. [Google Scholar] [CrossRef]
  26. Groos, D.; Adde, L.; Støen, R.; Ramampiaro, H.; Ihlen, E.A.F. Towards Human-Level Performance on Automatic Pose Estimation of Infant Spontaneous Movements. Comput. Med. Imaging Graph. 2022, 95, 102012. [Google Scholar] [CrossRef]
  27. Marschik, P.B.; Pokorny, F.B.; Peharz, R.; Zhang, D.; O’Muircheartaigh, J.; Roeyers, H.; Bölte, S.; Spittle, A.J.; Urlesberger, B.; Schuller, B.; et al. A Novel Way to Measure and Predict Development: A Heuristic Approach to Facilitate the Early Detection of Neurodevelopmental Disorders. Curr. Neurol. Neurosci. Rep. 2017, 17, 43. [Google Scholar] [CrossRef] [PubMed]
  28. Caruso, A.; Micai, M.; Gila, L.; Fulceri, F.; Scattoni, M.L. The Italian Network for Early Detection of Autism Spectrum Disorder: Research Activities and National Policies. Psychiatr. Danub. 2021, 33, 65–68. [Google Scholar] [PubMed]
  29. Welcome to Python.Org. Available online: https://www.python.org/ (accessed on 23 November 2023).
  30. MathWorks—Makers of MATLAB and Simulink. Available online: https://www.mathworks.com/ (accessed on 18 June 2024).
  31. Lugaresi, C.; Tang, J.; Nash, H.; McClanahan, C.; Uboweja, E.; Hays, M.; Zhang, F.; Chang, C.-L.; Yong, M.G.; Lee, J.; et al. MediaPipe: A Framework for Building Perception Pipelines. arXiv 2019, arXiv:1906.08172. [Google Scholar]
  32. Bazarevsky, V.; Grishchenko, I.; Raveendran, K.; Zhu, T.; Zhang, F.; Grundmann, M. BlazePose: On-Device Real-Time Body Pose Tracking. arXiv 2020, arXiv:2006.10204. [Google Scholar]
  33. Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.-C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018. [Google Scholar]
  34. Pose Landmark Detection Guide | MediaPipe. Available online: https://developers.google.com/mediapipe/solutions/vision/pose_landmarker (accessed on 23 November 2023).
  35. Baccinelli, W.; Bulgheroni, M.; Simonetti, V.; Fulceri, F.; Caruso, A.; Gila, L.; Scattoni, M.L. Movidea: A Software Package for Automatic Video Analysis of Movements in Infants at Risk for Neurodevelopmental Disorders. Brain Sci. 2020, 10, 203. [Google Scholar] [CrossRef]
  36. Anstey, N.A. Correlation Techniques —A Review. Geophys. Prospect. 1964, 12, 355–382. [Google Scholar] [CrossRef]
  37. Meinecke, L.; Breitbach-Faller, N.; Bartz, C.; Damen, R.; Rau, G.; Disselhorst-Klug, C. Movement Analysis in the Early Detection of Newborns at Risk for Developing Spasticity Due to Infantile Cerebral Palsy. Hum. Mov. Sci. 2006, 25, 125–144. [Google Scholar] [CrossRef] [PubMed]
  38. Manly, B.; Navarro Alberto, J. Multivariate Statistical Methods. A Primer, 4th ed.; Routledge: London, UK, 2016; ISBN 978-1-4987-2896-6. [Google Scholar]
  39. Pearson’s Correlation Coefficient. In Encyclopedia of Public Health; Kirch, W., Ed.; Springer: Dordrecht, The Netherlands, 2008; pp. 1090–1091. ISBN 978-1-4020-5614-7. [Google Scholar]
  40. Elssied, N.; Ibrahim, O.; Hamza Osman, A. A Novel Feature Selection Based on One-Way ANOVA F-Test for E-Mail Spam Classification. Res. J. Appl. Sci. Eng. Technol. 2014, 7, 625–638. [Google Scholar] [CrossRef]
  41. Sawilowsky, S. Fermat, Schubert, Einstein, and Behrens-Fisher: The Probable Difference Between Two Means When Σ_1^2≠σ_2^2. J. Mod. Appl. Stat. Methods 2002, 1, 461–472. [Google Scholar] [CrossRef]
  42. Liu, H.; Motoda, H. Feature Selection for Knowledge Discovery and Data Mining; Kluwer Academic Publishers: New York, NY, USA, 1998; ISBN 978-0-7923-8198-3. [Google Scholar]
  43. Two-Stage Analysis versus Linear Mixed-Effects Models for Longitudinal Data [The Metafor Package]. Available online: https://www.metafor-project.org/doku.php/tips:two_stage_analysis (accessed on 18 June 2024).
  44. Bishop, C.M. Pattern Recognition and Machine Learning; Information Science and Statistics; Springer: New York, NY, USA, 2009; ISBN 978-0-387-31073-2. [Google Scholar]
  45. Berrar, D. Cross-Validation. In Encyclopedia of Bioinformatics and Computational Biology; Ranganathan, S., Gribskov, M., Nakai, K., Schönbach, C., Eds.; Academic Press: Oxford, UK, 2019; pp. 542–545. ISBN 978-0-12-811432-2. [Google Scholar]
  46. Hossin, M.; Sulaiman, M.N. A Review on Evaluation Metrics for Data Classification Evaluations. Int. J. Data Min. Knowl. Manag. Process 2015, 5, 1–11. [Google Scholar] [CrossRef]
  47. Iverson, J.M.; Shic, F.; Wall, C.A.; Chawarska, K.; Curtin, S.; Estes, A.; Gardner, J.M.; Hutman, T.; Landa, R.J.; Levin, A.R.; et al. Early Motor Abilities in Infants at Heightened versus Low Risk for ASD: A Baby Siblings Research Consortium (BSRC) Study. J. Abnorm. Psychol. 2019, 128, 69–80. [Google Scholar] [CrossRef] [PubMed]
  48. Adolph, K.E.; Franchak, J.M. The Development of Motor Behavior. Wiley Interdiscip. Rev. Cogn. Sci. 2017, 8, e1430. [Google Scholar] [CrossRef]
  49. Hadders-Algra, M. Early Human Motor Development: From Variation to the Ability to Vary and Adapt. Neurosci. Biobehav. Rev. 2018, 90, 411–427. [Google Scholar] [CrossRef]
Figure 1. List of the 33 body landmark locations tracked using the Mediapipe Pose solution [34].
Figure 1. List of the 33 body landmark locations tracked using the Mediapipe Pose solution [34].
Diagnostics 15 00136 g001
Figure 2. Examples of videos with low resolution (a), poor lighting (b), hand operator intrusion in the frame (c), and children wearing socks and clothes that covered their limbs (d,e) or had different skin tones (f).
Figure 2. Examples of videos with low resolution (a), poor lighting (b), hand operator intrusion in the frame (c), and children wearing socks and clothes that covered their limbs (d,e) or had different skin tones (f).
Diagnostics 15 00136 g002
Figure 3. Examples of two frames with the detected reference points and the corresponding skeleton overlaid. The hands and feet centroids are indicated by larger dots. Figure (a) shows the overlaid skeleton when the limbs are fully extended, whereas Figure (b) illustrates the case where the limbs are bent.
Figure 3. Examples of two frames with the detected reference points and the corresponding skeleton overlaid. The hands and feet centroids are indicated by larger dots. Figure (a) shows the overlaid skeleton when the limbs are fully extended, whereas Figure (b) illustrates the case where the limbs are bent.
Diagnostics 15 00136 g003
Figure 4. Overlay of a foot trajectory extracted using our deep learning-based automatic approach (blue line) and the same trajectory obtained with the Movidea software used as the gold standard for validating our approach (red line).
Figure 4. Overlay of a foot trajectory extracted using our deep learning-based automatic approach (blue line) and the same trajectory obtained with the Movidea software used as the gold standard for validating our approach (red line).
Diagnostics 15 00136 g004
Figure 5. Examples of amateur-recorded videos in unstructured conditions: (a) no green background, (b) no supine position, (c) infant outside the camera’s view, and (d) infant clothed in a color similar to the bedsheet.
Figure 5. Examples of amateur-recorded videos in unstructured conditions: (a) no green background, (b) no supine position, (c) infant outside the camera’s view, and (d) infant clothed in a color similar to the bedsheet.
Diagnostics 15 00136 g005
Figure 6. Median Velocity of Feet—(a) violin plot for the first time point (10 days) and (b) trend across the 5 time points with mean values and the standard errors (SEs) for the two groups: NDD (light blue) and TD (red). The p-values related to the comparison between the two groups were computed using the unpaired two-sample Wilcoxon test. * indicates significance levels as follows: * p < 0.05, ** p < 0.01, *** p < 0.001.
Figure 6. Median Velocity of Feet—(a) violin plot for the first time point (10 days) and (b) trend across the 5 time points with mean values and the standard errors (SEs) for the two groups: NDD (light blue) and TD (red). The p-values related to the comparison between the two groups were computed using the unpaired two-sample Wilcoxon test. * indicates significance levels as follows: * p < 0.05, ** p < 0.01, *** p < 0.001.
Diagnostics 15 00136 g006
Figure 7. Area differing from moving average of lower body—(a) violin plot for the first time point (10 days) and (b) trend across the 5 time points with the mean values and the standard errors (SEs) for the two groups: NDD (light blue) and TD (red). The p-values related to the comparison between the two groups were computed using the unpaired two-sample Wilcoxon test. * indicates significance levels as follows: * p < 0.05, ** p < 0.01, *** p < 0.001.
Figure 7. Area differing from moving average of lower body—(a) violin plot for the first time point (10 days) and (b) trend across the 5 time points with the mean values and the standard errors (SEs) for the two groups: NDD (light blue) and TD (red). The p-values related to the comparison between the two groups were computed using the unpaired two-sample Wilcoxon test. * indicates significance levels as follows: * p < 0.05, ** p < 0.01, *** p < 0.001.
Diagnostics 15 00136 g007
Figure 8. Periodicity of lower body—(a) violin plot for the first time point (10 days) and (b) trend across the 5 time points with the mean values and the standard errors (SEs) for the two groups: NDD (light blue) and TD (red). The p-values related to the comparison between the two groups were computed using the unpaired two-sample Wilcoxon test. * indicates significance levels as follows: * p < 0.05, ** p < 0.01, *** p < 0.001.
Figure 8. Periodicity of lower body—(a) violin plot for the first time point (10 days) and (b) trend across the 5 time points with the mean values and the standard errors (SEs) for the two groups: NDD (light blue) and TD (red). The p-values related to the comparison between the two groups were computed using the unpaired two-sample Wilcoxon test. * indicates significance levels as follows: * p < 0.05, ** p < 0.01, *** p < 0.001.
Diagnostics 15 00136 g008
Figure 9. Confusion matrix showing the percentages of subjects accurately and mistakenly associated with each class (children who developed NDD and TD children) for the first time point (10 days). Correctly classified data are shown in green, while misclassified data are shown in red. The rightmost column, denoted as precision, indicates how many infants assigned to a particular group by the classifier truly belong to that class. Similarly, the bottom row of the matrix shows the recall or sensitivity, indicating for each class how many of the total subjects of that class were correctly recognized by the classifier and providing valuable insights into the accuracy and reliability of the classification process. The total accuracy is displayed in the lower right corner.
Figure 9. Confusion matrix showing the percentages of subjects accurately and mistakenly associated with each class (children who developed NDD and TD children) for the first time point (10 days). Correctly classified data are shown in green, while misclassified data are shown in red. The rightmost column, denoted as precision, indicates how many infants assigned to a particular group by the classifier truly belong to that class. Similarly, the bottom row of the matrix shows the recall or sensitivity, indicating for each class how many of the total subjects of that class were correctly recognized by the classifier and providing valuable insights into the accuracy and reliability of the classification process. The total accuracy is displayed in the lower right corner.
Diagnostics 15 00136 g009
Figure 10. Graphical summary of the model.
Figure 10. Graphical summary of the model.
Diagnostics 15 00136 g010
Table 1. Overall dataset with the number of infant videos available for each time point. Subsequently, for analyses aimed at identifying early motor features predictive of clinical outcomes, only data from children who had received a diagnosis (NDD and TD) were included.
Table 1. Overall dataset with the number of infant videos available for each time point. Subsequently, for analyses aimed at identifying early motor features predictive of clinical outcomes, only data from children who had received a diagnosis (NDD and TD) were included.
Time PointNDDTDNo LabelDrop-Out
10 days111524
6 weeks182264
12 weeks222562
18 weeks142652
24 weeks181661
Table 2. Descriptive statistics and results of the non-parametric unpaired two-sample Wilcoxon statistical test between the two groups for the three selected variables at 10 days.
Table 2. Descriptive statistics and results of the non-parametric unpaired two-sample Wilcoxon statistical test between the two groups for the three selected variables at 10 days.
VariableGroupMedianMinMax95% Confidence IntervalUnpaired Wilcoxon Test
LowerUpperUp-Value
Median Velocity of the Feet [px/s]NDD0.1450.01400.3850.08480.249240.002
TD0.3370.16411.0600.27420.536
Area differing from moving average (lower body) [px]NDD576.86686.19311361.686368.1953878.608290.004
TD1003.886589.67731798.358864.56231253.916
Periodicity (lower body) [1/px]NDD0.1040.03230.1490.07760.123310.006
TD0.1440.10200.2370.12610.167
Table 3. Post hoc test of the mixed-effects models.
Table 3. Post hoc test of the mixed-effects models.
Comparison
VariableTime PointGroupTime PointGroupDifferenceSEtDfp
Median Velocity of Feet10 daysNDD10 daysTD−0.255320.1262−2.02341770.045
6 weeksNDD6 weeksTD−0.147620.1019−1.44841730.149
12 weeksNDD12 weeksTD−0.026270.0942−0.27901690.781
18 weeksNDD18 weeksTD−0.072170.1061−0.68011750.497
24 weeksNDD24 weeksTD0.008520.10980.07761760.938
Area differing from moving average (lower body)10 daysNDD10 daysTD−488.51232−2.10361770.037
6 weeksNDD6 weeksTD−329.69190−1.73631670.084
12 weeksNDD12 weeksTD−51.00177−0.28881580.773
18 weeksNDD18 weeksTD−240.54197−1.22011700.224
24 weeksNDD24 weeksTD−89.90203−0.44181730.659
Periodicity (lower body)10 daysNDD10 daysTD−0.043380.0260−1.67031770.097
6 weeksNDD6 weeksTD−0.014170.0211−0.67091700.503
12 weeksNDD12 weeksTD−0.001210.0196−0.06161630.951
18 weeksNDD18 weeksTD−0.010110.0219−0.46051730.646
24 weeksNDD24 weeksTD0.012510.02270.55151740.582
Table 4. Performance metrics achieved for the first time point (10 days) by the SVC classifier.
Table 4. Performance metrics achieved for the first time point (10 days) by the SVC classifier.
AccuracyPrecisionSensitivityF1 ScoreSpecificity
10 days84.62%100%63.64%77.78%100%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bruschetta, R.; Caruso, A.; Micai, M.; Campisi, S.; Tartarisco, G.; Pioggia, G.; Scattoni, M.L. Marker-Less Video Analysis of Infant Movements for Early Identification of Neurodevelopmental Disorders. Diagnostics 2025, 15, 136. https://doi.org/10.3390/diagnostics15020136

AMA Style

Bruschetta R, Caruso A, Micai M, Campisi S, Tartarisco G, Pioggia G, Scattoni ML. Marker-Less Video Analysis of Infant Movements for Early Identification of Neurodevelopmental Disorders. Diagnostics. 2025; 15(2):136. https://doi.org/10.3390/diagnostics15020136

Chicago/Turabian Style

Bruschetta, Roberta, Angela Caruso, Martina Micai, Simona Campisi, Gennaro Tartarisco, Giovanni Pioggia, and Maria Luisa Scattoni. 2025. "Marker-Less Video Analysis of Infant Movements for Early Identification of Neurodevelopmental Disorders" Diagnostics 15, no. 2: 136. https://doi.org/10.3390/diagnostics15020136

APA Style

Bruschetta, R., Caruso, A., Micai, M., Campisi, S., Tartarisco, G., Pioggia, G., & Scattoni, M. L. (2025). Marker-Less Video Analysis of Infant Movements for Early Identification of Neurodevelopmental Disorders. Diagnostics, 15(2), 136. https://doi.org/10.3390/diagnostics15020136

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop