3.1. Performance of the Original Color-Difference Formulas
Table 3 shows the
STRESS values [
25] for the eight color-difference formulas considered in the current study (CIELAB, CIEDE2000, CAM02-(LCD/SCD/UCS), and CAM16-(LCD/SCD/UCS)), for each of the eight phases (see
Table 1, column 1) in the experiments using 3D objects [
24] as well as for the combined visual results of such experiments, which included a total of 440 color pairs (see
Table 1, phase 9). In
Table 3, the lowest
STRESS values for each phase are in bold to indicate the best color-difference formula.
For the combined dataset (
Table 3, phase 9), the CAM16-LCD and CIELAB formulas had the best and worst performances, respectively. However, we noted that for phase 9 the
STRESS values of the eight formulas were in a narrow range (27.3–31.9
STRESS units), which meant that all of the formulas performed similarly. In addition, this
STRESS range was close to the inter-observer variability (30.3–31.2
STRESS units) reported in [
24].
Table 4 shows the numbers of the phases in which the color-difference formula shown in the first row was significantly better statistically than the color-difference formula shown in the first column based on F-tests with two-tailed distributions and 95% confidence levels [
25]. The empty cells in
Table 4 signify no statistically significant improvements for the corresponding formulas in any phase. In agreement with the fact that the
STRESS values for the eight color-difference formulas were quite similar (
Table 3),
Table 4 shows that very few formulas (10/56) were significantly better statistically than others in some phase/s. Specifically, for the combined dataset (phase 9), we can see that only CAM02-(LCD-SCD-UCS) and CAM16-(LCD-SCD-UCS) were significantly better statistically than CIELAB, and that CAM16-LCD was significantly better statistically than CIEDE2000.
3.2. Optimizations of Color-Difference Formulas
Table 3 illustrates the accuracy of the predictions of the visually perceived color differences of 3D objects [
24] made by eight color-difference formulas developed using 2D objects. In the current subsection, we will improve the performance of each one of these formulas. That is, we will minimize the
STRESS values shown in
Table 3 using the Excel Solver GRG non-linear method for the following three methods:
- (1)
Optimization of the value of the lightness parametric factor (
kL) described in
Section 2.3.1. For each phase and color-difference formula,
Table 5 shows the optimal
kL values that provided the minimum
STRESS values, which are shown in
Table 6.
Table 5 indicates that most of the optimal
kL values were slightly higher than 1.0. In any case, based on the comparison of the
STRESS values in
Table 3 and
Table 6, the most important conclusion we reached was that the improvement achieved via the introduction of a lightness parametric factors was slight. Specifically, no color-difference formula achieved a statistically significant improvement in any phase with respect to the original formula via the introduction of the lightness parametric factor. For example, for phase 9 (the combined dataset), the average decrease in
STRESS values for the eight color-difference formulas was only 2.6% (5.9% when considering the average of phases 1–8).
- (2)
Optimization of the value of exponent
b (power function) described in
Section 2.3.2. For each phase and color-difference formula,
Table 7 shows the optimal
b values that provided the minimum
STRESS values, which are shown in
Table 8. In
Table 7, all of the optimal exponents have values lower than 1.0, which was in agreement with the previous literature [
26,
27] and were slightly higher than those shown in
Table 2 for 2D objects. After comparing
Table 3 and
Table 8, we concluded that the introduction of exponents (power functions) produced a very important decrease in the
STRESS values, which may be in part related to the use of color differences in a wide range of magnitude (see
Figure 1). For example, for phase 9 (the combined dataset), the average decrease in the
STRESS values for the eight color-difference formulas with the exponents shown in
Table 7 was as high as 26.9% (29.7% when considering the average of phases 1–8). As a consequence, any of these eight color-difference formulas that was modified by the corresponding exponent was much better than the original formula.
- (3)
Simultaneous optimizations of the values of the lightness parametric factor (
kL) and exponent (
b).
Table 9 shows the optimal
kL and
b values that provided the minimum
STRESS values, which are shown in
Table 10. In
Table 9, we can note that most of the values of
kL were slightly higher than those shown in
Table 5, while the values for exponent
b were almost the same as those shown in
Table 7 (e.g., differences below 0.1 units for all phases and formulas), which were only slightly higher than those found for 2D objects (see
Table 2). The
STRESS values shown in
Table 10 were slightly lower than those in
Table 8, which meant that the simultaneous optimization of
kL and
b slightly improved the good results obtained by only optimizing exponent
b. Specifically, after comparing
Table 3 and
Table 10, we concluded that based on the simultaneous optimization of
kL and b, the average decrease in the
STRESS values for the eight formulas in phase 9 (the combined dataset) was 29.6% (34.8% when considering the average of phases 1–8), and, once again, any of these modified color-difference formulas was much better than the original formula for phase 9 (and also for most cases in phases 1–8). We thus concluded that the
kL and
b values in
Table 9 provided the best optimized color-difference formulas that can be proposed based on the visual results previously reported in [
24].
For phase 9 (the combined dataset), the lowest
STRESS values (i.e., best performances) among the tested color-difference formulas were found for CAM16-LCD in the case of the original formulas (
Table 3) and formulas optimized by parametric factors
kL (
Table 6), CAM16-SCD for formulas optimized by exponents
b (
Table 8), and CAM16-UCS for formulas simultaneously optimized by
kL and
b (
Table 10). These results were encouraging because they may suggest that CIECAM16 [
9], the latest color appearance model proposed by the CIE, is a good basis for proposing future successful color-difference formulas for 3D objects.
Table 11 shows the results of the F-tests for the
STRESS values [
25] achieved by the best-optimized color-difference formulas (
Table 10) with respect to the original formulas (
Table 3). Although
Table 11 shows that the number of cases with statistically significant differences was higher than in
Table 4, the main conclusions we reached based on
Table 4 and
Table 11 were similar: (1) in most cases, the original (or modified) CIELAB color-difference formula was significantly worse statistically than any of the seven remaining original (or modified) color-difference formulas; and (2) in most cases, the differences among the original (or modified) CIEDE2000, CAM02-(LCD/SCD/UCS), and CAM16-(LCD/SCD/UCS) color-difference formulas were not statistically significant.
Figure 2 shows another way to compare the improvements achieved by each of the eight original color-difference formulas when modified using the three methods described above: the lightness parametric factor
kL (
Table 5); exponent
b (
Table 7); and the lightness parametric factor plus exponent
kL +
b (
Table 9). Specifically,
Figure 2 shows the average scores (phases 1–8) for each modified formula arbitrarily assigning 10, 7.5, 5, 2.5, and 0 points when a modified formula was statistically significantly better, better, identical, worse, or significantly worse, respectively, than the original formula. In
Figure 2, it can be noted that maximum scores (10 points) were only obtained for the
kL +
b modified CIEDE2000 and CAM16-SCD formulas (see their corresponding
kL and
b parameters in
Table 9). However, we must add that unfortunately, a general recommendation of either of these two color-difference formulas for 3D objects was not completely justified based on the current visual data [
24]. As indicated above, a comparison among all of the
kL +
b-optimized formulas indicated no statistically significant differences between the CIEDE2000, CAM02-(LCD/SCD/UCS), and CAM16-(LCD/SCD/UCS) formulas in most cases (see
Table 11). We could therefore only conclude that for the current experiments using 3D objects [
24], the CIELAB color-difference formula (original or modified) was significantly worse statistically than the seven remaining (original or modified) formulas we tested.
3.3. Parametric Effects
In the current section, we will discard the results from the CIELAB formula because in most cases it was found to be significantly worse statistically than the remaining seven color-difference formulas, as explained previously (see
Section 3.1 and
Section 3.2). We will therefore analyze the effect on the
STRESS values of the CIEDE2000, CAM02-(LCD/SCD/UCS), and CAM16-(LCD/SCD/UCS) color-difference formulas according to changes in four parametric factors: color-difference magnitude, shape, gloss, and size of the 3D objects. The magnitude of color differences is an interesting factor because the CIEDE2000 formula was recommended for color-differences below 5.0 CIELAB units [
4], while in the current experiments [
24] there was a high percentage (45.1%) of color pairs with larger color differences (see
Figure 1). Specific color-difference formulas for large color differences have been proposed in the literature [
39]. We will also analyze the influence of shape (cones, spheres, and cylinders), gloss (3.6 vs. 96.6 GU on the average), and size (4 cm and 2 cm) on the
STRESS values for the 3D samples studied.
Table 12 shows the average
STRESS changes for seven (original and
kL +
b-optimized) color-difference formulas for each of the four above-mentioned parametric factors. The footnote in
Table 12 indicates the specific differences in the phases that we averaged to achieve the values shown in this table. All of the results in
Table 12 came from differences in the
STRESS values for pairs of phases (the order of such phases was relevant) in which only one parametric factor was changed at a time. For example, for the results in
Table 12 regarding size, we computed the
STRESS differences between phase 1 (Sp-4-m) and phase 3 (Sp-2-m) in that order, as well as the
STRESS differences between phase 2 (Sp-4-g) and phase 4 (Sp-2-g) in that order; finally, we averaged both
STRESS differences. That is to say, in this example of the effect of size, we considered differences between pairs of 3D samples with the same shape and gloss and with only a change from 4 cm to 2 cm (in that order). Therefore, the positive values for the parameter size shown in
Table 12 for all color-difference formulas (except CAM16-LCD) mean higher
STRESS values (i.e., lower performances) of the color-difference formulas for samples using 4 cm.
In
Table 12, we can note that in most cases the values for the
kL +
b-optimized formulas were lower than those corresponding to the original formulas. These lower
STRESS differences may have been a consequence of the
STRESS values for
kL +
b-optimized formulas (
Table 10), which were considerably lower than those for the original formulas (
Table 3), as discussed in
Section 3.2. Based on the values in the last column of
Table 12, we also concluded that the most important effects on the performance of original or
kL +
b-optimized formulas were the color-difference magnitude, shape, gloss, and size (in that order). We will make some comments on each one of these parametric factors in the paragraphs that follow.
Regarding the magnitude of color differences,
Table 12 indicates that the seven original (or
kL +
b-optimized) formulas performed worse (i.e., had higher
STRESS values) for color pairs with color differences below 5.0 CIELAB units. This result was not expected for CIEDE2000 because it was recommended for color pairs with color differences below 5.0 CIELAB units [
4], although in fact pairs with color differences up to 18.2 CIELAB units were employed in the development of CIEDE2000.
The shape of 3D objects seemed to be another relevant parametric factor that produced variations in color in the samples, which were related to specific geometrical lighting and viewing conditions in the visual experiments in [
24]. In the case here, cones and cylinders led to the highest and lowest
STRESS values, respectively, with spheres adopting intermediate
STRESS values. Specifically, for cone–cylinder pairs, we obtained on average 8.1 and 4.4
STRESS units of difference for the original and
kL +
b-optimized color-difference formulas, respectively; while for cone–sphere pairs, the average difference was 1.5
STRESS units for both types of formulas. A potential explanation for this result was that the visual contours for cylinders were rectangles (circles for spheres and nearly triangles for cones), which was the shape most similar to the 2D pairs of samples. The color gradients for the cones and spheres were also perhaps higher than for cylinders.
Gloss is an important perceptual factor that can strongly influence the visual perception of color differences [
40,
41]. The results for gloss given in
Table 12 were the average of four pairs of phases (interested readers may find more detailed results in the
STRESS values shown in
Table 3 and
Table 10); they suggested that the color-difference formulas tested performed slightly worse (i.e., had higher
STRESS values) for the matte samples. However, interactions between the gloss and shape may have existed.
Finally, based on the data in
Table 12, the size of the samples seemed to be the least influential parametric factor in the performance of the tested color-difference formulas. Perhaps the change in size from 4 to 2 cm at a viewing distance of 40 cm was too small to produce relevant changes in the perception of color differences. In any case, there also seemed to be some kind of interaction between the size and gloss, as our results (not shown in
Table 12) indicated that for the matte samples, the performance of most of the color-difference formulas was worse for the 4 cm size than for the 2 cm size, with the opposite being true for the gloss samples.
To evaluate the statistical significance of the four above-mentioned parametric factors on the performance of the color-difference formulas tested, we carried out a statistical analysis that was different from the conventional F-tests [
25]. The usual F-tests [
25,
30] require the same visual data be evaluated using two different color-difference formulas, while here we had pairs of phases rather than the same visual data. Using Student’s
t-tests, we obtained no statistically significant differences for any of the four tested parametric factors using any of the (original or
kL +
b-optimized) color-difference formulas, which was in part expected because the values shown in
Table 12 are small and the amount of data (pairs of phases) was very small in our study.