2.2.1. Data Collection
All fifteen patients in their twenties, thirties, and forties, after signing a consent form, were instructed to wear the coded cap with eight stickers attached to their faces, as shown in
Figure 4.
A more thorough definition of the chosen landmarks, according to [
35,
36], would be as follows:
Nasion: the anatomical point located at the midline of the skull, where the frontal bone and the two nasal bones intersect;
Tragus: the cartilaginous projection located in front of the external ear canal;
Zygion: the most lateral point on the zygomatic arch, a prominent bony structure on the side of the skull;
Gonion: the most lateral, inferior point on the angle of the mandible;
Pogonion: the most forward-projecting point on the anterior surface of the mandible.
The placement of the five additional stickers served two primary purposes. First, it aimed to replicate the distribution of Control Points (CPs) in a photogrammetric block by covering the edges of the surface to be imaged, thereby ensuring a more robust bundle block adjustment. Additionally, stickers with unique markings assist algorithms in identifying homologous points in regions of the face where the skin is smooth and lacks contrast. Conversely, areas near the lips and eyes naturally provide sufficient contrast, offering plenty of identifiable homologous points in stereoscopic images, making additional stickers unnecessary in those regions. Therefore, the proposed arrangement was considered sufficient to provide a robust set of easily identifiable homologous points for photogrammetric processing.
Additional round stickers were primarily applied to the coded cap to enhance the alignment functionality of the 3D scanner. Their purpose was to improve the quality of the mesh generated by the scanner but also served as additional high-contrast marks for photogrammetric processing.
Once fully equipped, each patient was invited to sit in a chair and stand still for data collection (
Figure 5). Firstly, their faces and necks were scanned using the Academia 50 3D scanner, which was calibrated at the beginning of each scanning session. Afterwards, cell phone pictures were taken from different angles around their face and neck. Later, a single-shot video was taken, following a pre-determined path around each patient, similar to the cell phone pictures.
For both images and videos, a standardized protocol was established to ensure consistency across patients. A minimum of forty-five images was required for each subject: fifteen following a semicircular path around the upper part of the head, fifteen along a semicircular path around the individual's head, and fifteen more along a semicircular path around the individual's jaw and neck (
Figure 6). The video followed the same path as the images, captured continuously without interruption. All cell phone images and videos were taken at a resolution of 4000 × 3000 pixels for static photos and 1920 × 1080 pixels for video frames, avoiding the highest resolutions supported by the smartphone used for this experiment: 4 K and 8 K.
Table 2 summarizes the key data collected for each patient. For video processing, the frames were extracted at a rate of three frames per second, resulting in a number of photos three times the length of each video in seconds. The procedure went smoothly for thirteen out of the fifteen patients, with scanning times ranging from four to seven minutes. However, Patient 11's voluminous beard posed challenges for the scanner (though not for the image and video processing), even with round markers placed in various areas of the beard. Additionally, Patient 8 exhibited significant involuntary facial movements, leading to some artifacts in the scanned model and a few blurred images and video frames (
Figure 7).
2.2.2. Data Processing
This process resulted in three 3D models per volunteer: a scanner-generated mesh, a mesh derived from photogrammetry, and another from videogrammetry. Each mesh was then trimmed in CloudCompare to retain only the region corresponding roughly to the person's face. Since the photogrammetric meshes were referenced to arbitrary coordinate systems, they were registered to the scanner mesh by applying a scale transformation, ensuring that all three meshes for each subject shared a consistent coordinate system (
Figure 8).
Once all meshes were properly trimmed and registered, Hausdorff distances were calculated as a potential measure of asymmetry between the two halves of a face. The Hausdorff distance is a measure of the maximum of the closest in two sets of points, commonly used to assess shape similarity by measuring errors in creating a triangular mesh to approximate a surface [
37,
38]. There are alternatives for calculating distances between objects, such as Root Mean Squared Error (RMSE), Chamfer distance [
39], and Procrustes analysis [
40]. However, the Hausdorff distance was chosen because it serves as a measure of dissimilarity between two sets of points, hence being a useful computer graphics tool for determining degrees of asymmetry between two halves of the same face. Ref. [
41] provides the following definition:
Let
. The unilateral Hausdorff distance between
A and
B is calculated as follows:
And the Hausdorff distance is defined by the following:
For the calculation of Hausdorff distances, all meshes were cut into two halves with a vertical plane. The calculation was carried out using Meshlab 2023.12 software and its built-in Hausdorff distance algorithm [
42]. According to [
43], ideally, the top of the head and the center of the chin would be used. However, since all volunteers wore the cap, their hairline and part of their forehead were covered. Thus, the references used for splitting faces into two halves were the pogonion, pronasale, and glabella, as shown in (
Figure 9). This step was performed using Blender 4.2 software [
34] with native algorithms.
The second stage involved reflecting the left, mirroring it according to the vertical plane and then contrasting both for dissimilarity comparisons. Finally, the Hausdorff distance was calculated for each pair of face halves with a graphical representation of the shortest distances between two points of the meshes as a heatmap (
Figure 10). This step was also entirely developed using Meshlab 2023.12 software [
42].
These procedures, as described above, produce datasets composed of heatmaps describing which areas of the individual's face are more asymmetrical than their equivalent on the other side. These heatmaps were then converted into a series of histograms containing frequencies of each distance and heatmaps on the specific dissimilar areas. For each histogram, statistics such as mean, median, and standard deviation were also calculated.
In both cases, the approach included analyzing their linear association through the Pearson Product–Moment Correlation and conducting hypothesis tests on the mean, median, and variance of such histograms.
2.2.3. Statistical Analysis
The Pearson Product–Moment Correlation can be understood as a measure of the degree of linear relationship between two random variables. Therefore, the correlation coefficient emphasizes predicting the degree of dependence between two random variables [
44,
45]. Its estimator is defined by the following:
where
is the sample covariance;
is the sample standard variation for the independent variable (in other words, variable x);
is the sample standard variation for the dependent variable (in other words, variable y as a function of x: ).
A strong positive correlation, reflecting a high degree of dependency between variables (and their similarity), results in values close to 1. Values near zero suggest little to no linear relationship between the variables. Negative values, down to −1, indicate an inverse relationship, meaning that as one variable increases, the other decreases. For this procedure, Pearson's Product–Moment Correlation coefficient was computed to evaluate pairwise correlations between the means of Hausdorff distances of meshes generated via photogrammetry, videogrammetry, and 3D scanning. High correlations were anticipated under the assumption that comparable measurement techniques would yield consistent results.
The Paired
t-Test is employed to compute mean differences when observations from two populations of interest are collected in pairs [
46,
47,
48]. This test examines the differences between two observations from the same subject taken under similar conditions. It is a specific instance of the two-sample
t-Test, which applies to samples with unknown population means and standard deviations but is assumed to follow a normal distribution. It produces a
T-Stat that must be smaller than the
T-Critical. It also produces a
p-value which must be larger than the alpha value and reflects the probability of obtaining the observed results, assuming that the null hypothesis is true. In this scenario, the analysis was conducted to determine whether statistically significant differences exist between the means of Hausdorff distances of meshes produced by photogrammetry, videogrammetry, and 3D scanning, evaluated pairwise.
When testing means of two groups, for similarity, its statistic can be simplified as follows:
The hypotheses are formulated as follows:
;
;
;
,
where
and are the hypothetized means of the two paired groups;
is the mean of the differences between the two samples;
is the standard deviation of the differences between the two samples;
n refers to the size of the sample.
The Repeated Measures Analysis of Variance (rANOVA) is employed to determine if there is a statistically significant difference between the means of three or more groups related to measures taken from the same subjects [
49,
50,
51]. It is, therefore, an extension of the Paired
t-Test. Similarly, it yields a
F-Stat which must be smaller than the
F-Critical, and a
p-value that must be larger than the alpha value. Therefore, it complements the Paired
t-Test in that it helps determine whether statistically significant differences exist between the means of Hausdorff distances of meshes produced by photogrammetry, videogrammetry, and 3D scanning, evaluated together as a group. Its statistic is defined by the following:
Its statistic is given below:
The hypotheses are formulated as follows:
;
At least two hypothesized means are statistically different,
where
is the mean squared error of between-group variance;
is the mean squared error of within-group variance;
The Wilcoxon–Mann–Whitney (or simply Mann–Whitney) test may be used when two samples are different from the normal distribution but have similar distribution shapes and variances. It is especially useful for assessing the medians of two groups. For this test, the
U-Stat must be larger than the
U-Critical. For this study, it was employed to determine whether statistically significant differences exist between the medians of Hausdorff distances of meshes produced by photogrammetry, videogrammetry, and 3D scanning, evaluated pairwise. The test uses the ranks of measurements in the following manner [
52]:
The hypotheses are formulated as follows:
The hypothetised measures are not statistically different;
The hypothetised measures are statistically different,
where
and are the number of observations in samples 1 and 2;
and are the sum of the ranks of the observations in samples 1 and 2.
Finally, the Kruskal–Wallis test is, in a certain way, a non-parametric equivalent to ANOVA. According to [
53], it determines if independent groups have the same mean on ranks, instead of the data themselves. For that reason, it may be used to assess medians of more than two samples. Parallel to the Mann–Whitney test, it was suggested to verify whether statistically significant differences exist between the medians of Hausdorff distances of meshes produced by photogrammetry, videogrammetry, and 3D scanning, this time, evaluated all at once. It requires a sample size of 5 or more and provides a
H-Stat and a
p-value that must be larger than the alpha value. Its statistic is given by the following:
The hypotheses are formulated as follows:
The hypothesized measures are not statistically different;
At least two hypothesized measures are statistically different,
where
is the size of sample i;
N is the total sample size;
k is the number of groups being compared;
is the sum of the ranks of the observations in sample i.
The four aforementioned tests, in that regard, extensively compare meshes of the same subject in pairs and as a group. They provide statistically significant answers that help evaluate if meshes obtained from photogrammetry and videogrammetry are just as effective as meshes obtained from the 3D scanner counterpart—but with a special emphasis on determining facial asymmetries among two face halves.