1. Introduction
The Gestalt psychologists Max Wertheimer and Wolfgang Metzger [
1,
2] formulated and discussed “laws of perception” to predict how perceptual grouping operates under specific conditions of visual configuration. Their work was translated into English in 2012 and 2009, respectively, by Lothar Spillmann and colleagues [
1,
2], making this important early conceptual work available to a broader audience. In physical science, a law is a prediction that can be proven true and, ideally, the limits of which can also be clearly determined. In perceptual science, the Gestalt laws are used to express principles or conditions of visual configuration to explain why we see the world as we do. It is argued that specific principles of, or conditions for, “good Gestalt” need to be fulfilled to enable what is called perceptual grouping, i.e., a perceptual solution that yields the most plausible interpretation of a given physical configuration. Since all physical stimuli are by nature ambiguous to our perception, they need to be interpreted by the brain to produce coherent and unambiguous percepts that allow us to act on the physical world effectively. The “Law of Symmetry” is a major Gestalt law. It predicts that visual elements that are symmetrical would be more likely to form a group, i.e., to be perceived as a “good Gestalt”, in comparison with asymmetrical ones. Visual symmetry has, indeed, proven a determining factor in shape perception [
3,
4,
5,
6]. In particular, vertical mirror symmetry has proven an important cue to shape extraction from abstract, non-familiar visual elements presented in conditions of heightened ambiguity (noise). Across different noise levels, symmetric elements form perceptually more salient shapes than asymmetric ones and are, therefore, more readily detected [
5].
The Italian Gestalt psychologist Gaetano Kanizsa [
7] discussed a series of ambiguous planar configurations that give rise to powerful figure-ground percepts, with apparent shapes emerging in the foreground, delineated by contours that are perceptually completed beyond physically specified contrast edges. The Gestalt school and Kanizsa himself considered these phenomena as marginal cases of perception (
“margini quasi-percettivi”) and argued that they provide insight into the fundamentals of perceptual organization because they put underlying processes to the test under extreme conditions at the capacity limits of the perceptual system. Later on, the figures seen in such configurations were termed “illusory” by cognitive psychologists; the Gestalt psychologists themselves never used this term, which is misleading. An illusion, by definition, cannot be verified by independent observation; it exists only in the mind of the person experiencing it. The phenomena described by Kanizsa have clearly defined physical correlates, with measurable systematic effects on perception. One of these configurations is the famous Kanizsa triangle (
Figure 1). The Kanizsa figures have been studied extensively to single out factors of physical variation that affect the subjective brightness or darkness of the figures and/or the figure contours. The results from these studies, based on a variety of different experimental measures, are reviewed in sufficient detail elsewhere [
8,
9,
10,
11,
12,
13]. They are not the object of the present study. Here, we measured the perceptual salience of figure-ground percepts irrespective of the relative darkness, brightness, or clarity of either the induced surfaces or their boundaries, as was made clear in the instructions given to subjects. As noted previously by others, the response criterion of subjects in judgment tasks using this type of ambiguous figure [8] depends directly on the semantic precision of the instructions given. Formulating the instructions appropriately ensures that the perceptual phenomenon under study, and not a related one, is reflected in the psychophysical data.
The influence of variations in the intensity of the local luminance contrast of physical inducers producing the perceptual filling-in [
14] of bright or dark surfaces, leading to figure-ground percepts at the centre of the configurations in the case of the Kanizsa triangle and similar figures, was demonstrated for the first time by Heinemann’s pioneering studies [
15]. These were extended later on by others [
12] to various configurations including those of the Kanizsa type, where observers had to adjust the luminance of the central figure region until it matched the phenomenal appearance of the general background. This is equivalent to a cancellation of the phenomenal appearance of figure and ground. Configurations produced filling-in consistent with classic simultaneous contrast effects, where a figure appears darker than the general background when it is surrounded by phenomenally white inducers, and brighter when surrounded by phenomenally black inducers. The perceptual salience of such figures increases consistently with the luminance contrast intensity of the inducers, up to some optimal limit. When that optimal limit is reached and the contrast intensity of inducers increases further, the figure intensity is diminished again and may be annulled completely at the highest physical contrast levels [
15]. The simultaneous contrast filling-in that leads to the figure-ground percepts in Kanizsa configurations is therefore predicted by specific physical parameters of the inducers. In the present study, these parameters were all held constant across the conditions created to single out an effect of symmetry.
When the physical contrast intensity of the configurational elements is optimal [
12,
13,
14,
15] and not varied between displays, the next most important physical parameter that straightforwardly determines the subjective strength of the figure-ground percepts in Kanizsa configurations is the physically specified-to-total contour ratio, or support ratio. This was shown in a series of experiments by Shipley and Kellman [
11] using subjective magnitude estimation, a classic psychophysical rating procedure similar to the one applied in this study. Here, the Kanizsa triangle is exploited to probe for effects of symmetry on figure-ground from occlusion cues. The Kanizsa triangle is one of the most cited examples of a specific class of Gestalt configurations where perceived surface depth arises from local occlusion cues. In this specific shape class, figure-ground results directly from a process of surface completion through boundary interpolation across the physically specified edge contours of the inducers providing these local occlusion cues [
16,
17,
18,
19,
20]. Occluded object completion thus reflects the workings of fundamental visual mechanisms for recovering object percepts from fragmented input, and the ability of human perception to read structure into an apparently chaotic physical world [
21,
22]. The functional interactions between configurational symmetry and other structural factors in this important perceptual process are still unknown. The dependency of figure-ground salience on the support ratio (
Figure 1), a scale-invariant metric, is associated with the ecologically desirable consequence that perceptual salience will not change with variations in viewing distance [
11].
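The scale invariance of the support ratio can be illustrated with a minimal sketch. It assumes, as a simplification not stated in the text, that each inducer is a disc of radius `inducer_radius` centred on a vertex of the interpolated figure, so that every side is physically specified over one radius at each of its two ends. The function name and the numerical values are hypothetical and serve illustration only.

```python
import math


def support_ratio(side_lengths, inducer_radius):
    """Physically specified-to-total contour ratio of an interpolated polygon.

    Assumption (hypothetical geometry): one disc-shaped inducer is centred
    on each vertex, so each side is physically specified over
    `inducer_radius` at both of its ends.
    """
    if inducer_radius <= 0 or any(s <= 0 for s in side_lengths):
        raise ValueError("all lengths must be positive")
    if any(2 * inducer_radius > s for s in side_lengths):
        raise ValueError("inducers would overlap along at least one side")
    specified = 2 * inducer_radius * len(side_lengths)  # two ends per side
    total = sum(side_lengths)                            # full perimeter
    return specified / total


# Illustrative values only (arbitrary units, not taken from Table 1).
near = support_ratio([10.0, 10.0, 10.0], 2.0)
# Doubling every length mimics halving the viewing distance.
far = support_ratio([20.0, 20.0, 20.0], 4.0)
assert math.isclose(near, far)
```

Because the numerator and the denominator scale by the same factor, the ratio is unchanged when the whole configuration is magnified or reduced, which is the property referred to above.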
At constant physical contrast intensity of configurations with a constant support ratio, the contrast polarity of inducers, i.e., whether they are dark on lighter backgrounds, bright on darker backgrounds or a mixture of both on a medium grey background, does not affect the salience or subjective strength of the resulting figure-ground percept, provided the contrast polarity is homogeneous within each of the inducing elements [
12,
23,
24,
25,
26,
27,
28,
29]. When contrast polarity is not homogeneous within elements, then, and only then, may the perceptual salience of the figure-ground percept be reduced. This effect may be strongly dependent on the task instructions [
8]. Perceptual figure–ground organization is determined by visual mechanisms that integrate contrast intensity and spatial information carried by the configurational elements while mostly discarding information on contrast polarity. This is predicted by sign-invariance models based on functional properties of cortical neurons of the complex type, which are orientation selective but insensitive to contrast polarity [
14,
23,
24,
25,
29]. Such sign-invariant visual mechanisms have the ecologically desirable consequence that the simplest plausible representation of figure and ground can be achieved when the signal input from local contrast regions is particularly ambiguous.
The classic version of the Kanizsa triangle is a configuration with perfect vertical mirror symmetry (
Figure 1, top, and bottom left). Whether physical display variations producing asymmetric configurations (
Figure 1, bottom right) would affect figure-ground salience in this specific case is not known. The motivation of this study was to test whether symmetry contributes to figure-ground strength in this classic Gestalt configuration, where surface depth results from visual interpolation across fragments. Two variations of the Kanizsa triangle with identical support ratio, as defined by Shipley and Kellman [
11], and identical triangular area size were generated; one with perfect bilateral symmetry (
Figure 1, bottom left), the other asymmetric (
Figure 1, bottom right). To test for possible interactions between symmetry and the orientation of the configurations in the plane, the axis of bilateral symmetry was either vertical or horizontal, varied in random order across presentations. In earlier work with abstract shapes presented in noisy contexts [5], vertical mirror symmetry significantly increased the probability that a shape was seen as a figure against the ground. Bilateral symmetry about the vertical axis may therefore also generate stronger effects on the figure-ground salience of surfaces completed by interpolation. The physical inducers were either dark on grey, light on grey, or light and dark on grey; contrast polarity thus varied across inducers, but never within them, in both the symmetric and the asymmetric configurations. In the light of previous findings, these variations should not affect figure-ground strength, given that contrast polarity was always homogeneous within inducers in the different configurations [
8,
10,
12,
26,
29].
In a first experiment, psychophysical magnitude estimation was used to measure the salience, or subjective strength, of the figure-ground percepts in the different conditions. In a second experiment, a choice response time task was run with a selected set of configurations to test whether more salient figure-ground percepts consistently produce, as would be expected, shorter response times with consistent “foreground” response probabilities.
2. Materials and Methods
Triangular Kanizsa configurations with and without bilateral symmetry (
Figure 1 bottom left and right respectively, for illustration), identical support ratio and surface area, variable orientation (vertical base down, as in
Figure 1 bottom left, vertical base up, sideways base left, sideways base right) and variable inducer polarity (three black inducers on grey or ‘− − −’, three white inducers on grey or ‘+ + +’, two black and one white inducer on grey or ‘− − +’, two white and one black inducer on grey or ‘+ + −’) were presented in random order to ten human observers in a single-presentation-per-figure subjective magnitude estimation (rating) experiment with moduli. Four of these configurations with a single orientation (vertical base down, as in
Figure 1 bottom left) and uniformly positive (+ + +) or uniformly negative (− − −) contrast polarity, two with vertical symmetry and two without, were presented to six additional human observers in a repeated measures (four presentations per configuration) choice response time experiment with two additional control stimuli (triangles with minimally visible line contours (“ghosts”) only, no surface contrast).
2.1. Stimuli
The image configurations were computer generated using an HP Zbook 15 G2 Mobile Workstation equipped with a 4th generation Intel Core i7-6700 processor and an NVIDIA Quadro K5100 graphics card.
Configurational dimensions in terms of size (in pixels and in cm on screen) of triangle base and triangle sides, the inducer radius, which was identical in all the configurations, and the overall physical-to-total contour ratio, also identical in all the configurations, are summarized in
Table 1. Luminance values of the different configural elements were determined photometrically using an OPTICAL photometer (Cambridge Research Systems). The ADOBE RGB coordinates of the phenomenally grey background (RGB: 140, 140, 140) yield a background luminance of 55 cd/m². The phenomenally black inducers (RGB: 5, 5, 5) have a luminance of 4 cd/m², and the phenomenally white inducers (RGB: 240, 240, 240) a luminance of 98 cd/m². The moduli from the subjective rating task (RGB: 135, 135, 135 for the phenomenally darker ones and RGB: 145, 145, 145 for the phenomenally lighter ones) had a luminance of 52 cd/m² or 58 cd/m², respectively. The line contour control configurations (RGB: 120, 120, 120) from the choice response time task had a luminance of 48 cd/m². The physically specified contrast intensities with positive and negative signs may be calculated using the Weber Contrast (Weber Ratio, W) formula:

$$W = \frac{L - L_b}{L_b}$$

where $L$ denotes the luminance of the configural element and $L_b$ the luminance of the background.
As a consequence, we have a positive W of +0.92 for the phenomenally white inducers, a negative W of −0.78 for the phenomenally black inducers, a positive and a negative W of +0.09 and −0.09 respectively for the minimal-contrast moduli from the subjective rating task, and a negative W of −0.13 for the minimal contrast line contour control configurations from the choice response time experiment.
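As a hedged illustration of the Weber contrast computation, the sketch below applies the standard definition (element luminance minus background luminance, divided by background luminance) to the line contour control configuration. The function and constant names are hypothetical. Because the luminances quoted above are rounded, values recomputed in this way are approximate and may differ from the reported contrasts.

```python
def weber_contrast(luminance, background_luminance):
    """Signed Weber contrast of an element against its background.

    Both arguments are luminances in cd/m^2. A positive result means the
    element is lighter than the background, a negative one darker.
    """
    if background_luminance <= 0:
        raise ValueError("background luminance must be positive")
    return (luminance - background_luminance) / background_luminance


# Rounded luminances quoted in the text (cd/m^2).
BACKGROUND = 55.0
LINE_CONTOUR_CONTROL = 48.0

w_control = weber_contrast(LINE_CONTOUR_CONTROL, BACKGROUND)
print(f"{w_control:+.2f}")  # prints -0.13
```

The sign convention follows the text: elements darker than the grey background yield a negative W, lighter ones a positive W.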
2.2. Presentation of Configurations
The configurations were presented in random order on the screen of the HP Zbook 15 G2 Mobile Workstation, which has a pixel resolution of 1920 × 1080 and a 60 Hz refresh rate. Random selection, presentation, and response coding were computer controlled using Python for Windows. The duration of presentation of each single configuration was observer controlled in both experiments; the subsequent presentation was always initiated 800 milliseconds after the observer had typed his/her response on the computer keyboard. The 32 configurations from the single-presentation subjective magnitude estimation (rating) task, with the different variations in orientation and in local contrast polarity, are shown in
Figure 2a,b, for illustration. Illustrations of the 6 configurations from the repeated measures choice response time task are shown in
Figure 3.
2.3. Experimental Procedure
Subjects were seated in front of the workstation at a distance of about 90 cm from the screen in a semi-dark room. In the subjective magnitude estimation task, they were shown a set of moduli consisting of minimal-contrast triangular surfaces of the same spatial dimensions as the symmetric and asymmetric triangular centers of the test configurations. These moduli are shown in
Figure 2c for illustration. Subjects were told to associate the phenomenal strength of the moduli with a rating score of ‘11’. It was then made clear to them that they would be shown different triangular configurations, with black and white patches around them. They were then asked to rate the subjective strength of the figure-ground percept at the centre of the test configurations, in terms of the strength of the segregation into foreground and background, regardless of the direction of the perceived contrast (i.e., subjectively darker or subjectively lighter), by a number between ‘0’ and ‘10’, bearing in mind that the highest number was to reflect a figure salience closest to that of the real-contrast moduli they had seen just before. Each of the 16 asymmetric and the 16 symmetric configurations (
Figure 2a,b, respectively, for illustration) was presented only once to each of the subjects in a single random order session. In the choice response time task, subjects were asked to judge as swiftly as possible whether the triangle displayed on the screen seemed to stand out as foreground against the grey general background, or to lie behind the general grey background. In this experiment, the outlined triangular shape control configurations (
Figure 3, on the right) were presented at a minimal, just visible negative line contrast intensity and no surface contrast. This renders a highly ambiguous (one subject mentioned “ghost-like”) appearance on the screen with no clear figure-ground assignment. Each of the six configurations (
Figure 3, for illustration) was shown four times, in random order, to each of six subjects in a single individual session.
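The single-presentation random-order procedure of the rating task can be sketched as follows. This is not the author's presentation code, which is not given in the text; it is a minimal illustration, with hypothetical names and labels, of how the 2 (symmetry) × 4 (orientation) × 4 (polarity) factor levels yield the 32 configurations, each shown once per subject in a freshly shuffled order.

```python
import itertools
import random

# Factor levels as described in the text; the label strings are illustrative.
SYMMETRY = ("symmetric", "asymmetric")
ORIENTATION = ("base down", "base up", "base left", "base right")
POLARITY = ("- - -", "+ + +", "- - +", "+ + -")


def rating_session(seed=None):
    """Return one subject's trial list: every configuration once, shuffled."""
    rng = random.Random(seed)  # a seed makes the order reproducible
    trials = list(itertools.product(SYMMETRY, ORIENTATION, POLARITY))
    rng.shuffle(trials)
    return trials


# Ten subjects, one session each (seeds are arbitrary placeholders).
sessions = [rating_session(seed=subject) for subject in range(10)]
total_ratings = sum(len(session) for session in sessions)
print(len(sessions[0]), total_ratings)  # prints 32 320
```

Shuffling the full factorial list, rather than sampling with replacement, guarantees that each configuration appears exactly once per session.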
2.4. Subjects
Ten individuals (six men, four women) between 20 and 31 years old, all of them with normal or corrected-to-normal vision, participated in the subjective magnitude estimation experiment. Six further individuals (five men, one woman), also young and with normal or corrected-to-normal vision, participated in the choice response time experiment. Participants were mostly undergraduates involved in medical or language studies. None of them was familiar with the configurations presented to them, and all of them were naïve to the purpose of the study. The experiments were conducted in accordance with the Declaration of Helsinki (1964) and in full conformity with the requirements of the ethical standards committee of the author’s host institution (CNRS). Informed consent was obtained from each of the participants.
2.5. Data Analysis
The data from the subjective rating experiment, with a Cartesian design plan written in terms of Subject₁₀ × Symmetry₂ × Orientation₄ × Polarity₄, produced a total of 320 subjective ratings. These data were fed into a Three-Way ANOVA. Means, standard errors, effect sizes, and F statistics with probability limits were determined.
The data from the choice response time experiment, with a Cartesian design plan written in terms of Subject₆ × Symmetry₂ × Polarity₃ × RepeatedMeasures₄, produced a total of 144 choice data and a total of 144 response times. In the experimental design plan, the control configuration represents the third modality of the “polarity” factor, with the three factor levels “positive” or ‘+ + +’, “negative” or ‘− − −’, and “control”. The response times were fed into a Two-Way Repeated Measures ANOVA with individual data averaged over the four levels of the repetition factor R₄ and without the third level of the “polarity” factor; the analysis plan therefore reads Subject₆ × Symmetry₂ × Polarity₂.
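The reduction from the design plan to the analysis plan for the response times can be sketched as below. The function name, the tuple layout, and all response time values are hypothetical placeholders, not data from the study; the sketch only shows how averaging over the four repetitions and dropping the “control” level turns 144 trials into 6 × 2 × 2 = 24 cell means.

```python
from collections import defaultdict
from statistics import mean


def cell_means(trials):
    """Average response times over repetitions, excluding control trials.

    `trials` is an iterable of (subject, symmetry, polarity, rt_ms) tuples.
    Returns a dict mapping (subject, symmetry, polarity) to the mean RT.
    """
    cells = defaultdict(list)
    for subject, symmetry, polarity, rt_ms in trials:
        if polarity == "control":
            continue  # third polarity level is left out of the analysis
        cells[(subject, symmetry, polarity)].append(rt_ms)
    return {cell: mean(rts) for cell, rts in cells.items()}


# Placeholder trials: 6 subjects x 2 symmetry x 3 polarity x 4 repetitions.
# The response times are made-up numbers used only to exercise the function.
placeholder_trials = [
    (subject, symmetry, polarity, 500.0 + 10.0 * repetition)
    for subject in range(6)
    for symmetry in ("symmetric", "asymmetric")
    for polarity in ("+ + +", "- - -", "control")
    for repetition in range(4)
]
means = cell_means(placeholder_trials)
print(len(placeholder_trials), len(means))  # prints 144 24
```

The resulting 24 cell means correspond to the Subject × Symmetry × Polarity layout that a two-way repeated measures analysis takes as input.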
4. Discussion
Although symmetry has been discussed in terms of a major grouping principle or law of good Gestalt since Wertheimer and Metzger [
1,
2], the specific effects symmetry may produce on feature grouping, figure-ground segregation, visual discrimination, or time to respond to visual configurations have become subject to systematic quantitative investigation in perceptual science only recently. In the case of visual perception, symmetry may be conceived as a geometric property that yields configurational simplicity and, therefore, represents an ecological advantage for information processing [
30]. Symmetry may also be seen as a perceptual feature that attracts attention, enhances configural salience, and facilitates grouping [
5,
31,
32,
33,
34,
35].
The present results show that bilateral configurational symmetry, i.e., mirror symmetry within the whole configuration, strengthens the perceptual salience of figure against ground in triangular Kanizsa configurations. In the magnitude estimation (rating) experiment, the subjective strength of the foreground at the center of the configurations was significantly higher when the configurations had bilateral symmetry. This holds for triangular configurations when mirror symmetric configurations are compared to asymmetric configurations with the same number of inducers, and the same support ratio as defined by Shipley and Kellman [
11]. This symmetry effect can be exploited to further quantify critical interactions between occlusion-based surface properties, symmetry, and figure-ground salience. Variations in symmetry could be tested against variations in the support ratio in the first instance. Figure-ground segregation from interpolation is an early-stage process in perceptual grouping [
16,
36,
37,
38,
39,
40,
41,
42,
43,
44,
45], as is symmetry detection [
39,
43]. In particular, as shown by Erlikhman and Kellman [
44,
45], the human perceptual system uses critical spatial cues of local position and alignment within a restricted spatiotemporal window (~165 ms) for the rapid extraction of co-oriented edge fragments from the visual input. As predicted by the Gestalt Law of Good Continuation, the fragments then connect by known neural interpolation processes [
16,
25,
35], producing larger surfaces that will stand out as figures against ground. The results from the experiments here show that symmetry contributes to this early process of perceptual organization.
The results suggest no influence of the orientation (vertical versus horizontal) of the axis of symmetry on the salience of the figure-ground percepts. Vertical and horizontal mirror symmetry produced equally strong phenomenal salience of figure-ground. Since Mach [
33], it has been suggested that symmetry about the vertical axis may be more effectively processed by the visual system than symmetry about the horizontal, or any other, axis in the plane. Some studies have confirmed this prediction [
5]. However, more recent reviews indicate that there may be no systematic functional advantage of vertical symmetry [
34]. Effects of axis of symmetry on perception may be dependent on what Bertamini termed “objectness” [
31], i.e., whether the cognitive interpretation of the visual shape changes with translational or rotational changes of the latter. Psychophysical data on shape perception [
43], using radial frequency patterns and other objects, indeed suggest that variations in the location and orientation of relevant (with respect to the perceptual task) object features may generate effects of axis of symmetry. In the two experiments here, relevant perceptual features within and across objects (
symmetric vs. asymmetric) can be considered invariant, since there was no effect of contrast polarity and no interaction between contrast polarity and symmetry. This could explain why the axis of symmetry had no effect here either. Also, earlier psychophysical studies have shown that bilateral symmetry is significantly more salient within objects than between objects [
39,
40]. The layout characteristics, including symmetry, of complex figure-ground solutions are more easily processed within single perceptual objects [
40]. Symmetry detection becomes harder with complex shape configurations where other factors, such as positional uncertainty or convexity, interact with the symmetry factor, especially when the psychophysical task requires comparing across objects. It may be that vertical symmetry generates a measurable advantage for perception only under such conditions.
The results from the choice response time task show a consistently higher percentage of “foreground” responses to the symmetric configurations, which is accompanied by significantly shorter response times. Bilateral symmetry, therefore, represents a measurable functional advantage in the perceptual processing of figure-ground. Variations in the contrast polarity of the inducing elements had no marked effect on either the subjective strength of the figure-ground percepts or the response times. This observation is consistent with results from earlier work with similar configurations where symmetry was not varied [
9,
12,
29], and predicted by non-linear neural models of figure-ground based on the long-range integration of antagonistic brightness signals [
12,
13,
14,
23,
24]. Interestingly, when inducers of both positive and negative contrast signs, i.e., phenomenally white and black inducers, are present in the same configuration, the latter may be perceived as phenomenally asymmetric with respect to brightness. The perceptual system, however, is not influenced by the symmetry/asymmetry in contrast signals, only by geometrically defined symmetry. Although the present study was not specifically aimed at singling out the hierarchical level of perceptual processing at which the symmetry effect arises, it is unlikely that conscious processing was involved. After the experiments, subjects were asked whether they had noticed anything in particular in the configurations or used a particular strategy to respond. Most of them stated that “some were the same, some were different”, or “some had white parts, some had black parts, some had both”, but none of them was able to specify any particular structural difference or response strategy. We may therefore infer that subjects were not immediately aware of the systematic variations in symmetry.
Bilateral symmetry is identified as a key prior for three-dimensional shape perception in humans [
41,
42]. The perceptual integration of symmetry in this process does not necessarily happen consciously and, as explained above, may vary with shape complexity and shape interpretation [
40] without subjects being aware of it. Therefore, configurational complexity and shape interpretation need to be controlled for to single out the effects of symmetry per se on any particular aspect of perceptual organization. This is clarified further by some of the additional illustrations in
Figure 6, showing configurations where the manipulation of symmetry inevitably implies changing also the complexity of the configuration as a whole, and the resulting shape interpretation. In the square version of the Kanizsa figure, breaking the configurational symmetry inevitably requires changing the shape borders. This produces a new, far more complex, qualitatively different shape geometry leading to a radically different shape interpretation. The Kanizsa square is therefore ill-suited for singling out effects of symmetry without ambivalence. The advantage of the triangular configuration used in this study is that the geometric transformations needed to manipulate mirror symmetry affect neither the structural complexity of the configurations, nor the resulting shape interpretation: with or without bilateral symmetry, the perceptual solution is always and only a “triangle”.
The new effect found here is fully consistent with the adaptive logic of visual preference for symmetry, where symmetry strengthens the figure-ground salience of surfaces from occlusion cues, formed through visual spatial interpolation across fragments within a narrow temporal window of processing [
44,
45]. Symmetry strongly influences perception-based decision making in humans and in many animals, and survival-relevant responses to symmetry, or lack thereof, are found not only in primates but also in other species [
37]. This highlights the wider biological significance of symmetry as a visual signal, with further implications for higher order adaptive human behavior, such as structural design [
46], or image-guided precision tasks [
47,
48]. The early Gestalt theories intuitively captured this fundamental importance in a large body of observations on phenomena of human perceptual organization. Their intuitions were astute, pointing towards functional aspects of symmetry which perception science has only just begun to quantify and predict.