Article

A Model of Information Visualization Interpretation

by
Michael G. Hilgers
Business and Information Technology Department, Kummer College of Innovation, Entrepreneurship, and Economic Development, Missouri University of Science and Technology, 106D Fulton Hall, 301 W. 14th St., Rolla, MO 65409, USA
Appl. Sci. 2024, 14(15), 6731; https://doi.org/10.3390/app14156731
Submission received: 28 June 2024 / Revised: 22 July 2024 / Accepted: 26 July 2024 / Published: 1 August 2024
(This article belongs to the Special Issue Recent Applications of Information Visualization and Graphics)

Abstract

Since the groundbreaking work by Cleveland and McGill in 1984, studies have revealed the difficulties humans have extracting quantitative data from visualizations as simple as bar graphs. As a first step toward understanding this situation, this paper proposes a mathematical model of the interpretation effort of a bar graph using concepts drawn from eye tracking. First, three key areas of interest (AOIs) are identified, and fixations are modeled as random point clouds within the AOIs. Stochastic geometry is introduced via random triangles connecting fixations within the adjacent key visual regions. The so-called landmark methodology provides the basis for the probabilistic analysis of the constructed system. It is found that the random length of interest in a stochastic triangle has a noncentral chi distribution with a known mean. Unique to this model, in terms of previous landmark applications, is the inclusion of a correlation between fixations, which is justified by physiological studies of the eyes. This approach introduces several model parameters, such as the noncentrality parameter, variance of the fixation cloud, correlation between fixations, and a visualization scale. A detailed parametric analysis examining the dependence of the mean on these parameters is conducted. The paper ties this work to the visualization via a definition of the expected visual measurement error. An asymptotic analysis of the visual error is performed, and a simple expression is found to relate the expected visual measurement error to the key model parameters. From this expression, the influence these parameters have on a visualization’s interpretation is considered.

1. Introduction

The visual display of quantitative data requires the viewer to decode numerical information from visual objects [1], as shown in Figure 1. As the volume and dimension of data increase, the associated visualizations become increasingly more complex [2], making the decoding process more challenging. Much attention has been given to the success or failure of the decoding process. Research agendas have had a variety of objectives and have utilized various methodologies. A discussion of topics relevant to this paper follows.
An information visualization communicates numeric data using objects such as lines, rectangles, bars, circles, and so forth [1]. Cleveland and McGill, in their classic work on graphical perception [3], studied how the viewer assigns numbers to these objects using their geometric properties of points, lengths, areas, and so forth. Their results were repeated and extended on a larger experimental scale using crowd sourcing on Amazon’s Mechanical Turk platform by Heer and Bostock [4]. The systemic difference they found between the user-perceived value and the desired numeric value is called the “expected visual measurement error” in this paper, and modeling it is one of the goals.
All models are based on assumptions. In order to build ours, we need to clarify the situation and context in which the decoding occurs. This requires an understanding of how people approach a visualization to read it.
Shah and Hoeffner published a survey of graph comprehension research [5]. They found in the literature three major factors influencing a student’s comprehension of graphs: the visual characteristics of the graph, prior experience with visualizations, and knowledge of and expectations about the content of the data in a graph. While measuring comprehension is not an objective of this research, it is assumed the viewer understands how to read a bar graph, which is not universally true.
Here we encounter a subtle point, which is the reason the viewer is inspecting the graph. Is the person seeking an answer to a specific question, or is the viewer free to explore and deduce meaning? This introduces the concept of top-down versus bottom-up viewing of a visualization [6,7]. Top-down viewing is goal-oriented, and visual attention is associated with the viewer’s goals and expectations, whereas in bottom-up viewing, visual attention is driven by the graphical characteristics of the image, such as color and contrast. Bottom-up viewing is free visual search. Matzen et al. have explored bottom-up situations in several studies using eye tracking [8,9]. This can be complicated due to the matter of memory. If the viewer is given a limited period to investigate the visualization, then memory must be used to fulfill a task [10]. In this paper, the concern is with a viewer who is given a specific task to fulfill, making this a top-down situation.
In moving from related cognitive issues to focus on the eye-tracking methodology, we find rich literature. Jacob and Karn [11] offer a detailed historical review of eye tracking starting in the 1800s. Much early research tried to tie eye tracking to the cognitive process. That is, how eye movement relates to the thought process [12,13,14]. A thread of recent research seeks to use eye tracking to discover how a viewer seeks out the salient features of the visualization [15]. This can be tied to visualization design via a so-called saliency map [8,16], which is used to predict user behavior. These maps are most applicable in a bottom-up situation.
Important to the use of eye-tracking data in the development of models is the capability to quantify it. The most common approach is through various metrics. In [11], 21 usability studies utilizing eye tracking were surveyed, and the metrics were classified and counted. Some of this is relevant herein. First, the concept of a fixation is needed. The authors defined it as follows [11]:
“Fixation: A relatively stable eye-in-head position within some threshold of dispersion (typically 2°) over some minimum duration (typically 100–200 ms), and with a velocity below some threshold (typically 15–100 degrees per second).”
Next the concept of the area of interest (AOI) is needed, quoting [11]:
“Area of interest: Area of a display or visual environment that is of interest to the research or design team and thus defined by them (not by the participant).”
Finally, their notion of the scanpath is needed, so again quoting [11]:
“scanpath: Spatial arrangement of a sequence of fixations.”
Acknowledged as an important metric [17], the scanpath contains information about the arrangement of elements in the image being viewed.
Eye tracking has also been used to study the physical motion of the eye during the reading of text or viewing of an image. Rather than give a complete summary of all that is known in this interesting area, attention is restricted to differences between horizontal and vertical motion of the eye and any potential correlation. Collewijn et al. studied these issues using eye tracking in a pair of papers [18,19]. It had already been established by Bahill and Stark that the horizontal and vertical channels of eye movement are independent [20]. That is, there are different muscles, motor neurons, and brain stem staging areas controlling the two types of eye movements. The horizontal and vertical motions do not happen simultaneously, so that an oblique scanpath is curved. Horizontal movement, with its associated saccadic behavior (rapid jumps), is dominant and more accurate. Vertical saccades are less accurate, often undershooting the target followed by a correction. Furthermore, parameters describing upward saccades are heavily dependent on the position of the eye, while downward saccades are almost independent of eye position. Similarly, the authors found that the parametric dependence of horizontal motion is affected by the size, direction, and initial position of the motion.
Tying these threads to this paper, several observations are in order. The visualization under consideration will be a simple bar graph. It is assumed the viewer understands the graph and the underlying data structure. The visual characteristics of the graph are important. All elements are supposed to be of equal salience so that none draws unwarranted attention due to color or contrast. The viewer has a simple task to perform: determine the height of a particular bar using the y-axis (or ruler) provided. This makes for a top-down investigation. There is no time restriction that forces the viewer to rely on memory. In this stage of the model’s development, the visual search factor is minimized by considering only one bar.
The developed model attempts to capture aspects of the physiological motion of the eyes using stochastic geometry. It is like the landmark theory by Bookstein [21]. However, it is a novel contribution to both eye tracking and applied stochastic geometry in that the random horizontal and vertical motions of the eye have different variances and are correlated. It should be noted that though the horizontal and vertical saccades are physiologically independent, they need not be statistically independent as suggested by the oblique [20] scanpaths that are often observed. Hence, the model is developed in its fullest generality; then, several examples under simplifying assumptions are examined.
The mathematical model is based on eye-tracking concepts in the following manner. A viewer is tasked with finding the numerical value associated with bar A. See Figure 2. The viewer must look at the base of the bar to identify it as the correct bar and to verify that its bottom aligns with the horizontal axis. The top of the bar is studied as part of the sighting process to match it with the associated location on the ruler. This defines three areas of interest. It should be noted that these three AOIs have been identified on bar graphs in other eye-tracking studies on information visualization [22]. During eye tracking, fixations occur in the AOIs. A type of mean case analysis is performed by assuming the fixations are normally distributed in the plane about the points ( b , 0 ) , ( b , h ) , and ( 0 , h ) . The nature of this normal distribution will be much discussed below. This is, of course, an abstraction of what happens in an eye-tracking experiment, but all of the results to follow will involve the mean of these fixations. Spatial statistics estimate this with the average of the fixation locations.

2. Materials and Methods

In this section, we will develop a model of eye tracking as applied to a bar graph, followed by extensive parametric analysis. Before this, we will consider the purpose and goals of parametric analysis. Once a sense of direction is established, we will construct the eye-tracking model in a general form. With this in hand, the model is examined under various restrictions that prove useful for a rich analysis of the errors in reading the graphs.

2.1. Form and Purpose of Parametric Analysis

The parametric analysis of the probability density function mean before us is involved, and it might help to work through the principles in a simpler setting. Consider the normal distribution
$\varphi(\chi; m, s) = \frac{1}{\sqrt{2\pi s^2}}\, e^{-\frac{(\chi - m)^2}{2 s^2}}.$
Throughout this paper, symbols in argument lists of functions will often have a semi-colon separating them. Symbols to the left of the semi-colon are variables, and those to the right are parameters. When working with model parameters, the first step is usually to identify them in some situation-specific way. For example, it can be shown that for φ , m is the mean of the probability distribution and s 2 is the variance. The next step in the analysis is to systematically vary a parameter while holding the others constant. In this setting, if we hold s constant and increase m, the familiar bell-shaped curve of the normal distribution shifts right. Decreasing m makes it shift left. Holding m constant and decreasing s makes the “bell” become tall and narrow. Increasing s causes the bell-shape to flatten out.
The situation in this paper is one step more complicated: the model parameters themselves depend on parameters. In our example, that would be the same as
$m(\,\cdot\,; \alpha, \beta, \gamma) = f(\alpha, \beta, \gamma), \qquad s(\,\cdot\,; \alpha, \beta, \gamma) = g(\alpha, \beta, \gamma).$
We know that increasing m shifts the curve, but what causes m to increase? Now we must perform a parametric analysis on m in terms of the other model parameters. As part of this process, we might observe
$\frac{\partial m}{\partial \alpha} > 0.$
Therefore, we would conclude m increases as α increases. Hence, increasing α shifts the curve to the right. This type of analysis is typical of what follows.
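To make this two-level parametric dependence concrete, the short Python sketch below evaluates the normal density while a single underlying parameter α is varied. The functions f and g and all numeric values are hypothetical illustrations, not quantities from the model.

```python
import numpy as np

def normal_pdf(x, m, s):
    """Normal density with mean m and standard deviation s."""
    return np.exp(-(x - m) ** 2 / (2 * s ** 2)) / np.sqrt(2 * np.pi * s ** 2)

# Hypothetical dependence of the model parameters on a deeper parameter alpha:
# m = f(alpha) with dm/dalpha > 0 (curve shifts right as alpha grows),
# s = g(alpha) with ds/dalpha > 0 (curve flattens as alpha grows).
f = lambda alpha: 2.0 * alpha
g = lambda alpha: 1.0 + 0.5 * alpha

for alpha in (0.0, 1.0, 2.0):
    m, s = f(alpha), g(alpha)
    print(f"alpha={alpha:.1f}  m={m:.1f}  s={s:.2f}  peak height={normal_pdf(m, m, s):.3f}")
```

As α increases, the printed center m moves right while the peak height falls, which is exactly the shift-and-flatten behavior described above.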

2.2. Overview of the Eye-Tracking Model and Parametric Analysis

The method of development of our mathematical model occurs in several steps. The approach detailed is based on Bookstein’s landmark model [21] adapted for eye tracking. First, we consider the geometry of the visualization, identifying important areas of interest. A fixation point P i is selected from each AOI. A vector z i points to the center of the AOI. We then decompose P i as
P i = d i + z i .
Here, z i is fixed and is determined by the geometry of the visualization, and d i is a random vector marking the displacement about the center of the AOI (see Figure 3 and Figure 4). One point P i is selected from each AOI, and these points are connected by line segments, forming a triangle. Since these points are randomly located, this forms a stochastic triangle (See Figure 3). Next, we find the probability distribution of a side length and take its mean. With that in hand, a parametric analysis of the expected visual measurement error is performed.

2.3. Geometrical Considerations

Following the notation Stoyan and Stoyan [23] used in their section on the Bookstein model, we label the fixed reference points on the corners of the triangle as
$z_1 = \begin{pmatrix} b \\ 0 \end{pmatrix}, \quad z_2 = \begin{pmatrix} b \\ h \end{pmatrix}, \quad \text{and} \quad z_3 = \begin{pmatrix} 0 \\ h \end{pmatrix}.$
Relative to the fixed points, the fixations P i are P i = d i + z i , where
$d_i = \begin{pmatrix} d_{ix} \\ d_{iy} \end{pmatrix}.$
The original side lengths are
$d_{12} = \|z_2 - z_1\| = \sqrt{(b-b)^2 + (h-0)^2} = h$
$d_{23} = \|z_3 - z_2\| = \sqrt{(0-b)^2 + (h-h)^2} = b$
$d_{13} = \|z_3 - z_1\| = \sqrt{(0-b)^2 + (h-0)^2} = \sqrt{b^2 + h^2}.$
Figure 3 shows the layout of the sides of the stochastic triangle. We begin with D 12 :
$D_{12} = \|P_2 - P_1\| = \|(z_2 + d_2) - (z_1 + d_1)\| = \sqrt{\big((b + d_{2x}) - (b + d_{1x})\big)^2 + \big((h + d_{2y}) - (0 + d_{1y})\big)^2} = \sqrt{(d_{2x} - d_{1x})^2 + \big(h + (d_{2y} - d_{1y})\big)^2}.$
Let $\eta = d_{2x} - d_{1x}$ and $\zeta = d_{2y} - d_{1y}$; we obtain
$D_{12} = \sqrt{\eta^2 + (h + \zeta)^2}.$
Similarly,
$D_{23} = \|P_2 - P_3\| = \sqrt{(b + \alpha)^2 + \beta^2}$
where $\alpha = d_{2x} - d_{3x}$ and $\beta = d_{2y} - d_{3y}$. And for $\gamma = d_{1x} - d_{3x}$ and $\omega = d_{1y} - d_{3y}$,
$D_{13} = \|P_1 - P_3\| = \sqrt{(b + \gamma)^2 + (h - \omega)^2}.$
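As a concrete illustration of the three random side lengths, the Python sketch below draws fixation displacements around the three fixed points and computes $D_{12}$, $D_{23}$, and $D_{13}$. For simplicity the displacements here are independent and isotropic (the general model allows unequal variances and correlations); the bar dimensions and spread are arbitrary illustrative values.

```python
import numpy as np

rng = np.random.default_rng(0)
b, h, sigma = 4.0, 10.0, 0.3          # illustrative bar width, height, fixation spread

z1, z2, z3 = np.array([b, 0.0]), np.array([b, h]), np.array([0.0, h])  # AOI centers

n = 100_000
# Simplified case: displacements d_i are i.i.d. isotropic normal (rho_i = 0).
d1, d2, d3 = (rng.normal(0.0, sigma, size=(n, 2)) for _ in range(3))
P1, P2, P3 = z1 + d1, z2 + d2, z3 + d3

D12 = np.linalg.norm(P2 - P1, axis=1)   # random counterpart of the height h
D23 = np.linalg.norm(P2 - P3, axis=1)   # random counterpart of the width b
D13 = np.linalg.norm(P3 - P1, axis=1)   # random counterpart of the diagonal

print("E[D12] ~", D12.mean(), " (h =", h, ")")
print("E[D23] ~", D23.mean(), " (b =", b, ")")
print("E[D13] ~", D13.mean(), " (sqrt(b^2+h^2) =", np.hypot(b, h), ")")
```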

2.4. General Case

The side length D 12 is a random variable. The end objective is to find its probability density function and associated mean. This will involve the so-called Gaussian distributions [24]. To obtain these, we must be very precise about the model used for η and ζ . In his book [25], Miller has 45 different pdfs for Gaussian distribution processes based on subtle variations in the joint mean and correlation of these two random variables. We will explore this first under minimal assumptions and then for various specific cases.

2.4.1. Fixation Components

It is reasonable to assume that the fixations are balanced about the fixed points, such as z 1 . This makes, for i = 1 , 2 , 3 ,
E ( d i x ) = E ( d i y ) = 0 .
Maintaining generality, for the variance,
$\mathrm{Var}(d_{ix}) = \sigma_{ix}^2$
$\mathrm{Var}(d_{iy}) = \sigma_{iy}^2$
for $i = 1, 2, 3$. It is traditional in landmark models to assume the horizontal and vertical components are uncorrelated; however, such an assumption is questionable for eye motion. As discussed in [18,19], horizontal and vertical movements behave differently. Horizontal movement of the eye is stronger and more accurate, whereas vertical movement is weaker. It behaves differently for up and down directions and often misses its mark by overshooting or undershooting. It frequently “fishtails” at the end to correct its location. Hence, in this model, it is allowed that a correlation exists, and the impact of the parameter is studied. To be specific, we name
$\rho_i = \mathrm{corr}(d_{ix}, d_{iy}) \quad \text{for } i = 1, 2, 3.$
It is now assumed that the fixations are normally distributed about the fixed points $z_i$, resulting in
$\begin{pmatrix} d_{ix} \\ d_{iy} \end{pmatrix} \sim N_2\!\left( \begin{pmatrix} 0 \\ 0 \end{pmatrix},\; \begin{pmatrix} \sigma_{ix}^2 & \rho_i \sigma_{ix}\sigma_{iy} \\ \rho_i \sigma_{ix}\sigma_{iy} & \sigma_{iy}^2 \end{pmatrix} \right).$
The variance matrix must be invertible, leading to the requirement
$\det \begin{pmatrix} \sigma_{ix}^2 & \rho_i \sigma_{ix}\sigma_{iy} \\ \rho_i \sigma_{ix}\sigma_{iy} & \sigma_{iy}^2 \end{pmatrix} = \sigma_{ix}^2 \sigma_{iy}^2 (1 - \rho_i^2) > 0$
for $i = 1, 2, 3$, which is satisfied if the correlation is $-1 < \rho_i < 1$.
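A minimal sketch of how the displacement for one AOI can be sampled from this bivariate normal, with the determinant condition checked explicitly; the values of $\sigma_{ix}$, $\sigma_{iy}$, and $\rho_i$ are assumed for illustration only.

```python
import numpy as np

rng = np.random.default_rng(1)

def sample_fixation_displacements(sigma_x, sigma_y, rho, n):
    """Draw n displacements (d_ix, d_iy) ~ N2(0, Sigma) with the stated covariance."""
    Sigma = np.array([[sigma_x**2,              rho * sigma_x * sigma_y],
                      [rho * sigma_x * sigma_y, sigma_y**2             ]])
    det = sigma_x**2 * sigma_y**2 * (1.0 - rho**2)
    assert det > 0, "need -1 < rho < 1 for an invertible covariance matrix"
    return rng.multivariate_normal(mean=[0.0, 0.0], cov=Sigma, size=n)

d = sample_fixation_displacements(sigma_x=0.3, sigma_y=0.5, rho=-0.4, n=50_000)
print("sample corr(d_ix, d_iy) ~", np.corrcoef(d[:, 0], d[:, 1])[0, 1])
```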

2.4.2. Side Lengths

Recalling Equation (4), in order to find the probability distribution of D 12 , the distributions of η and ζ must be calculated. First,
$E(\eta) = E(d_{2x} - d_{1x}) = 0.$
Similarly,
$E(\zeta) = E(d_{2y} - d_{1y}) = 0.$
For the variance,
$\mathrm{Var}(\eta) = \mathrm{Var}(d_{2x} - d_{1x}) = \mathrm{Var}(d_{2x}) + \mathrm{Var}(d_{1x}) - 2\,\mathrm{Cov}(d_{2x}, d_{1x}) = \mathrm{Var}(d_{2x}) + \mathrm{Var}(d_{1x}) - 2\,\mathrm{corr}(d_{2x}, d_{1x})\sqrt{\mathrm{Var}(d_{2x})\,\mathrm{Var}(d_{1x})} = \sigma_{2x}^2 + \sigma_{1x}^2 - 2\,\mathrm{corr}(d_{2x}, d_{1x})\,\sigma_{2x}\sigma_{1x}.$
The term $\mathrm{corr}(d_{2x}, d_{1x})$ requires discussion. The first correlation term encountered, $\rho_i$, involves the horizontal and vertical components of the fixation about the same fixed point. This term is the correlation between the horizontal components at adjacent fixed points. On the one hand, it could be argued that the fixations in the AOIs about different fixed points are independent, which was considered in [26]. In this case, $\mathrm{corr}(d_{2x}, d_{1x}) = 0$. On the other hand, considering the scanpath of the eye about the triangle, the model views the eye moving rapidly from one AOI to another. It seems possible that the horizontal offsets between the fixed points would influence each other, leading to a nonzero correlation. In the spirit of generality, the correlation is maintained in the model, at least in this case, so that
$\rho_{12x} = \mathrm{corr}(d_{2x}, d_{1x})$
and
$\mathrm{Var}(\eta) = \sigma_{2x}^2 + \sigma_{1x}^2 - 2\,\rho_{12x}\,\sigma_{2x}\sigma_{1x}.$
Denoting
$\rho_{12y} = \mathrm{corr}(d_{2y}, d_{1y})$
similar calculations show
$\mathrm{Var}(\zeta) = \sigma_{2y}^2 + \sigma_{1y}^2 - 2\,\rho_{12y}\,\sigma_{2y}\sigma_{1y}.$
The covariance presents different issues. It is
$\mathrm{Cov}(\eta, \zeta) = E(\eta\zeta) - E(\eta)E(\zeta) = E(\eta\zeta) = E\big((d_{2x} - d_{1x})(d_{2y} - d_{1y})\big) = E(d_{2x} d_{2y}) - E(d_{2x} d_{1y}) - E(d_{1x} d_{2y}) + E(d_{1x} d_{1y}).$
The first and last terms have been encountered. They are
$E(d_{1x} d_{1y}) = \rho_1 \sigma_{1x}\sigma_{1y}, \qquad E(d_{2x} d_{2y}) = \rho_2 \sigma_{2x}\sigma_{2y}.$
The remaining terms again require modeling consideration. Again, the correlation around adjacent fixed points is encountered. This time it is the horizontal component at one point and the vertical component at the other. The previous discussion remains relevant, and the general case is considered. Here, we introduce the correlations
$\rho_{1y2x} = \mathrm{corr}(d_{1y}, d_{2x})$
$\rho_{1x2y} = \mathrm{corr}(d_{1x}, d_{2y})$
allowing one to write the covariance as
$\mathrm{Cov}(\eta, \zeta) = \rho_1 \sigma_{1x}\sigma_{1y} + \rho_2 \sigma_{2x}\sigma_{2y} - \rho_{1y2x}\,\sigma_{1y}\sigma_{2x} - \rho_{1x2y}\,\sigma_{1x}\sigma_{2y}.$
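The variance and covariance formulas for η and ζ can be checked numerically. The sketch below assembles a joint (4-dimensional) normal for the two fixations from assumed correlation and spread values, samples it, and compares the Monte Carlo moments of $\eta = d_{2x} - d_{1x}$ and $\zeta = d_{2y} - d_{1y}$ with the closed forms above. All numeric parameter values are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)

# Illustrative parameter values (assumptions, not taken from the paper).
s1x, s1y, s2x, s2y = 0.30, 0.40, 0.35, 0.25
rho1 = rho2 = rho12x = rho12y = rho1y2x = rho1x2y = 0.2

# Joint covariance of (d1x, d1y, d2x, d2y).
C = np.array([
    [s1x**2,              rho1   * s1x * s1y, rho12x  * s1x * s2x, rho1x2y * s1x * s2y],
    [rho1    * s1x * s1y, s1y**2,             rho1y2x * s1y * s2x, rho12y  * s1y * s2y],
    [rho12x  * s1x * s2x, rho1y2x * s1y * s2x, s2x**2,             rho2    * s2x * s2y],
    [rho1x2y * s1x * s2y, rho12y  * s1y * s2y, rho2   * s2x * s2y, s2y**2             ],
])

d = rng.multivariate_normal(np.zeros(4), C, size=500_000)
eta, zeta = d[:, 2] - d[:, 0], d[:, 3] - d[:, 1]   # eta = d2x - d1x, zeta = d2y - d1y

var_eta  = s2x**2 + s1x**2 - 2 * rho12x * s2x * s1x
var_zeta = s2y**2 + s1y**2 - 2 * rho12y * s2y * s1y
cov_ez   = rho1 * s1x * s1y + rho2 * s2x * s2y - rho1y2x * s1y * s2x - rho1x2y * s1x * s2y

print("Var(eta):  formula", var_eta,  "  MC", eta.var())
print("Var(zeta): formula", var_zeta, "  MC", zeta.var())
print("Cov:       formula", cov_ez,   "  MC", np.cov(eta, zeta)[0, 1])
```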

2.5. Special Cases

With the general case stated, several useful specializations can be enumerated. As will be elaborated, the problem in its full generality is, as of now, intractable. The first simplifying assumption to be made is that all the correlations have the same value. Namely,
$\rho = \rho_1 = \rho_2 = \rho_{12x} = \rho_{12y} = \rho_{1y2x} = \rho_{1x2y}.$
Under this assumption, the general case becomes
$\hat{\Lambda} = \mathrm{Cov}(\eta, \zeta) = \begin{pmatrix} \hat{\sigma}_1^2 & \hat{\sigma}_{12} \\ \hat{\sigma}_{12} & \hat{\sigma}_2^2 \end{pmatrix}$
where
$\hat{\sigma}_1^2 = \sigma_{1x}^2 + \sigma_{2x}^2 - 2\rho\,\sigma_{1x}\sigma_{2x}$
$\hat{\sigma}_2^2 = \sigma_{1y}^2 + \sigma_{2y}^2 - 2\rho\,\sigma_{1y}\sigma_{2y}$
$\hat{\sigma}_{12} = \rho\,(\sigma_{1x}\sigma_{1y} + \sigma_{2x}\sigma_{2y} - \sigma_{1y}\sigma_{2x} - \sigma_{1x}\sigma_{2y}).$
In order to proceed with further analysis, simplifications of this system are required. The most progress has been made on positive definite diagonal variance matrices. This means one should examine cases in which η and ζ are not correlated. Note, in situations in which $\mathrm{corr}(\eta, \zeta) = 0$, the eye motion associated with the fixations can still be correlated. That is, $\rho \neq 0$.
A further common restriction on the problem is
$\Lambda = \begin{pmatrix} \psi_0^2 & 0 \\ 0 & \psi_0^2 \end{pmatrix} = \psi_0^2 I$
where $I$ is the identity matrix. That is, $\Lambda$ is a diagonal matrix with the same positive number as its diagonal elements. (At the moment, $\psi_0^2$ is just a label and has no interpretation in terms of model parameters. In the following, specific examples will be given.) Although Equation (23) is a common assumption, it may seem surprisingly strong; however, Miller says in [25] that determining the probability density function for an arbitrary positive definite diagonal covariance matrix proves to be “extremely difficult”.
Shortly, we will work through several cases for Λ by populating it with model parameters. In each case, restrictions are put into place to simplify Equation (19). We will see that various restrictions applied in combination can produce a diagonal matrix with identical elements on the diagonal, like Equation (23), as well as unequal elements on the diagonal. Furthermore, we will demonstrate that we can recover the same model as Stoyan and Stoyan’s [23] by using restrictions similar to theirs.
Progress has been made on the 2 × 2 correlated case, but an interesting snag arises. To explain this, material from the next section must be briefly considered. In that section, a normalization process is performed. Namely,
$X_1 = \frac{\eta}{\hat{\sigma}_1}, \qquad X_2 = \frac{\zeta + h}{\hat{\sigma}_2}.$
The problem is that $E(X_1) = 0$ and $E(X_2) = h/\hat{\sigma}_2$. Finding the joint pdf for two variables having different means in conjunction with a non-diagonal covariance matrix is still an open question. The author has some results on this and will submit them in future publications.
Now, various cases will be considered that remove the correlation between η and ζ , which allows nontrivial relations for the mean.

2.5.1. Case A: Diagonal Covariance Matrix with the Same Diagonal Elements

This is the case typically considered in the literature on the Bookstein model [21]. In this situation,
$\sigma = \sigma_{1x} = \sigma_{2x} = \sigma_{1y} = \sigma_{2y}.$
This means
$\hat{\sigma}_1^2 = \sigma^2 + \sigma^2 - 2\rho\,\sigma\sigma = 2\sigma^2(1-\rho)$
$\hat{\sigma}_2^2 = \sigma^2 + \sigma^2 - 2\rho\,\sigma\sigma = 2\sigma^2(1-\rho)$
$\hat{\sigma}_{12} = \rho\,(\sigma\sigma + \sigma\sigma - \sigma\sigma - \sigma\sigma) = 0.$
Writing this in matrix notation,
$\Lambda_A = \begin{pmatrix} 2\sigma^2(1-\rho) & 0 \\ 0 & 2\sigma^2(1-\rho) \end{pmatrix} = \psi_A^2 I$
where $\psi_A^2 = 2\sigma^2(1-\rho)$.
A few observations are in order. The correlation between the various fixations satisfies $-1 < \rho < 1$. This means $\psi_A^2 > 0$. However, as ρ nears 1 from below, the variance becomes nearly singular. The determinant is $\det(\Lambda_A) = 4\sigma^4(1-\rho)^2 > 0$ as long as $\rho < 1$. The role of ρ is significant: no other investigations have included a correlation between the horizontal and vertical components. It will be seen that there is a difference in behavior for positive and negative correlation.

2.5.2. Case B: Diagonal Covariance Matrix with Different Diagonal Elements

We present three situations in which this can happen.
Case B1: Set $\sigma_y = \sigma_{1y} = \sigma_{2y}$.
In this case,
$\hat{\sigma}_1^2 = \sigma_{1x}^2 + \sigma_{2x}^2 - 2\rho\,\sigma_{1x}\sigma_{2x}$
$\hat{\sigma}_2^2 = \sigma_y^2 + \sigma_y^2 - 2\rho\,\sigma_y\sigma_y = 2\sigma_y^2(1-\rho)$
$\hat{\sigma}_{12} = \rho\,(\sigma_{1x}\sigma_y + \sigma_{2x}\sigma_y - \sigma_y\sigma_{2x} - \sigma_{1x}\sigma_y) = 0$
or in matrix notation,
$\Lambda_{B1} = \begin{pmatrix} \sigma_{1x}^2 + \sigma_{2x}^2 - 2\rho\,\sigma_{1x}\sigma_{2x} & 0 \\ 0 & 2\sigma_y^2(1-\rho) \end{pmatrix}.$
Case B2: Set $\sigma_x = \sigma_{1x} = \sigma_{2x}$.
With calculations analogous to those above, the covariance matrix is found to be
$\Lambda_{B2} = \begin{pmatrix} 2\sigma_x^2(1-\rho) & 0 \\ 0 & \sigma_{1y}^2 + \sigma_{2y}^2 - 2\rho\,\sigma_{1y}\sigma_{2y} \end{pmatrix}.$
Case B3: Set $\sigma_x = \sigma_{1x} = \sigma_{2x}$ and $\sigma_y = \sigma_{1y} = \sigma_{2y}$.
This case combines the two previous ones to obtain
$\Lambda_{B3} = \begin{pmatrix} 2\sigma_x^2(1-\rho) & 0 \\ 0 & 2\sigma_y^2(1-\rho) \end{pmatrix}.$

2.5.3. Case C: Bookstein’s Model Assumptions

In Stoyan and Stoyan’s book [23], they report the following assumptions with regard to the Bookstein model:
$\sigma_1 = \sigma_{1x} = \sigma_{1y}, \qquad \sigma_2 = \sigma_{2x} = \sigma_{2y}.$
Under these assumptions, the model of this paper becomes
$\hat{\sigma}_1^2 = \sigma_1^2 + \sigma_2^2 - 2\rho\,\sigma_1\sigma_2$
$\hat{\sigma}_2^2 = \sigma_1^2 + \sigma_2^2 - 2\rho\,\sigma_1\sigma_2$
$\hat{\sigma}_{12} = \rho\,(\sigma_1^2 + \sigma_2^2 - 2\sigma_1\sigma_2).$
Hence, the covariance matrix is
$\Lambda_C = \begin{pmatrix} \sigma_1^2 + \sigma_2^2 - 2\rho\,\sigma_1\sigma_2 & \rho\,(\sigma_1^2 + \sigma_2^2 - 2\sigma_1\sigma_2) \\ \rho\,(\sigma_1^2 + \sigma_2^2 - 2\sigma_1\sigma_2) & \sigma_1^2 + \sigma_2^2 - 2\rho\,\sigma_1\sigma_2 \end{pmatrix}.$
It is seen then that η and ζ are correlated under these assumptions in this model, though the model does have the same values on the diagonal. Of course, in their approach, there was no correlation between the horizontal and vertical components. Setting ρ = 0 , their model is recovered.

2.6. Side Length Probability Density Function

In this section, the focus is on the determination of the probability density function of the height of bar D 12 (Equation (4)). As previously mentioned, the next step is to normalize the variables using the transform given in Equation (24). This highlights the need for a simplifying assumption. The elements of the diagonal of the covariance matrix are seen in the denominator. In Case A, these values are equal. So, as has been done in the previous investigations of the Bookstein model, Case A is assumed. Namely,
$\psi_A = \hat{\sigma}_1 = \hat{\sigma}_2 = \sqrt{2\sigma^2(1-\rho)}.$
Hence,
$X_1 = \frac{\eta}{\psi_A}, \qquad X_2 = \frac{\zeta + h}{\psi_A}.$
This allows one to write
$D_{12} = \sqrt{\psi_A^2\left[\left(\frac{\eta}{\psi_A}\right)^2 + \left(\frac{\zeta + h}{\psi_A}\right)^2\right]} = \sqrt{\psi_A^2\left(X_1^2 + X_2^2\right)}$
or
$\frac{D_{12}}{\psi_A} = \sqrt{X_1^2 + X_2^2}.$
It follows that
$E(X_1) = 0, \qquad E(X_2) = h/\psi_A$
which is the aforementioned situation in which the means of the variables are different, prompting the need for a simple covariance structure.
The variable $y = D_{12}/\psi_A$ has the generalized Rayleigh distribution, which is also called the noncentral chi distribution (not to be confused with the chi-squared distribution) or Rice distribution [27,28]. The noncentrality parameter is
$y_0^2 = \sum_{j=1}^{2} E(X_j)^2 = h^2/\psi_A^2.$
According to Park [29], the probability density function is
$g(y; y_0) = y\, \exp\!\left[-(y^2 + y_0^2)/2\right] I_0(y\, y_0)\, H(y)$
$g(y; h, \psi_A) = y\, \exp\!\left[-(y^2 + h^2/\psi_A^2)/2\right] I_0\!\left(y\, h/\psi_A\right) H(y).$
In this expression, $I_0$ is a modified Bessel function of the first kind of order zero, and $H(y)$ is the Heaviside function.
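For two degrees of freedom this density coincides with the Rice distribution implemented in SciPy (scipy.stats.rice, whose documented pdf uses the same parameterization with shape parameter equal to the noncentrality $y_0$), so the expression above can be evaluated and cross-checked as in the sketch below. The values of $h$ and $\psi_A$ are illustrative assumptions.

```python
import numpy as np
from scipy import stats, special

h, psi_A = 10.0, 2.0          # illustrative bar height and fixation-spread parameter
y0 = h / psi_A                # noncentrality parameter

def g(y, y0):
    """Noncentral chi (N = 2) density from the text: y exp(-(y^2 + y0^2)/2) I0(y*y0), y >= 0."""
    return y * np.exp(-(y**2 + y0**2) / 2.0) * special.i0(y * y0)

y = np.linspace(y0 - 3, y0 + 3, 5)
print(np.allclose(g(y, y0), stats.rice.pdf(y, b=y0)))   # True: the two densities agree
```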

3. Results

With the model formulated and the basic probability density function determined, it is appropriate to begin the analysis of key features of interest. Once the mean of D 12 is given, a parametric analysis is performed. This will offer insight into possible sources of the expected visual measurement error as well as confirm basic characteristics of favorable design features.

3.1. The Mean and Useful Properties

By definition,
$E(D_{12}/\psi_A) = \int_0^{\infty} y\, g(y)\, dy.$
Park determined this mean [29]. Multiplying by ψ A , he found
$\mu = E(D_{12}) = \psi_A \sqrt{2}\, \exp\!\left(-\frac{h^2}{2\psi_A^2}\right) \frac{\Gamma(3/2)}{\Gamma(1)}\, {}_1F_1\!\left(\frac{3}{2}; 1; \frac{h^2}{2\psi_A^2}\right)$
where   1 F 1 ( a ; b ; x ) is the confluent hypergeometric function. It is also denoted as M ( a , b , x ) by several important authors. It is labeled this way to be consistent with generalized hypergeometric functions.
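As a quick numerical sanity check of this mean formula (with illustrative, assumed values of $h$ and $\psi_A$), the closed form built from scipy.special.hyp1f1 can be compared against the mean of the corresponding Rice/noncentral chi distribution.

```python
import numpy as np
from scipy import stats, special

h, psi_A = 10.0, 2.0                      # illustrative values
u = h**2 / (2.0 * psi_A**2)

# mu = psi_A * sqrt(2) * exp(-u) * Gamma(3/2)/Gamma(1) * 1F1(3/2; 1; u), Gamma(3/2) = sqrt(pi)/2.
mu = psi_A * np.sqrt(2.0) * np.exp(-u) * (np.sqrt(np.pi) / 2.0) * special.hyp1f1(1.5, 1.0, u)

# E(D12) = psi_A * E(Y), where Y is noncentral chi (Rice) with noncentrality h/psi_A.
mu_rice = psi_A * stats.rice.mean(b=h / psi_A)

print(mu, mu_rice)    # the two values agree
```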
There will be need for the power series expansion of ${}_1F_1(a; b; x)$. For $z \in \mathbb{C}$,
${}_1F_1(a; b; z) = \sum_{s=0}^{\infty} \frac{(a)_s}{(b)_s\, s!}\, z^s = 1 + \frac{a}{b} z + \frac{a(a+1)}{b(b+1)\, 2!} z^2 + \cdots \qquad (b \neq 0, -1, -2, -3, \ldots)$
where ( a ) s is the Pochhammer symbol with the properties
$(a)_0 = 1, \qquad (a)_n = a(a+1)(a+2)\cdots(a+n-1) = \frac{\Gamma(a+n)}{\Gamma(a)}.$
The Gamma function is used repeatedly, so a few useful properties are given for the convenience of the reader:
$\Gamma(n) = (n-1)!, \quad n = 1, 2, 3, \ldots \qquad \Gamma(z+1) = z\,\Gamma(z), \quad z \in \mathbb{C} \qquad \Gamma(1/2) = \sqrt{\pi} \qquad \Gamma(3/2) = \sqrt{\pi}/2.$
As the confluent hypergeometric function is less common and yet critical to the analysis, several of its basic properties are given. Then, a simple lemma encapsulating useful attributes is proven. Many features of this function are available in the NIST Handbook of Mathematical Functions [30]. (The numbers below, such as (13.2.39), locate the given equation in the sizable NIST reference [30] and are provided for ease of reference for those who wish to explore other properties of this remarkable function).
  • The power series in Equation (28) for ${}_1F_1(a; b; z)$ is entire in $z$ and $a$ and meromorphic in $b$ for $b \neq 0, -1, -2, \ldots$;
  • Kummer’s theorem states
    ${}_1F_1(a; b; z) = e^{z}\, {}_1F_1(b - a; b; -z)$ (13.2.39);
  • Recurrence relationship
    $b\, {}_1F_1(a; b; z) - b\, {}_1F_1(a-1; b; z) - z\, {}_1F_1(a; b+1; z) = 0$ (13.3.4);
  • Derivative formulae
    $\frac{d}{dz}\, {}_1F_1(a; b; z) = \frac{a}{b}\, {}_1F_1(a+1; b+1; z)$ (13.3.15)
    and
    $\frac{d}{dz}\!\left(e^{-z}\, {}_1F_1(a; b; z)\right) = (-1)^1\,\frac{(b-a)_1}{(b)_1}\, e^{-z}\, {}_1F_1(a; b+1; z)$ (13.3.20).
Now, we provide a lemma concerning   1 F 1 ( a ; b ; x ) that is used extensively.
Lemma 1.
For $a, b \in \mathbb{R}$ such that $a, b > 0$ and $x \in \mathbb{R}$,
1. ${}_1F_1(a; b; x) \in \mathbb{R}$;
2. ${}_1F_1(a; b; x) > 0$;
3. ${}_1F_1(a; b; x)$ is strictly increasing in $x$;
4. For $x > 0$, ${}_1F_1(a; b; x) > 1$.
Proof. 
(1) This follows immediately from the power series expansion in Equation (28) and that a, b, and x are real numbers and b is positive. Since the real numbers are closed under arithmetic operations ( b > 0 excludes division by zero) and are complete, the power series converges to a real number.
(2) It is given in [30] that for a , b > 0 ,   1 F 1 ( a ; b ; x ) has no real zeros. Again, from the power series, it is seen that   1 F 1 ( a ; b ; 0 ) = 1 . Hence,   1 F 1 ( a ; b ; x ) must remain positive for all real values of x.
(3) By Equation (31),
$\frac{d}{dx}\, {}_1F_1(a; b; x) = \frac{a}{b}\, {}_1F_1(a+1; b+1; x)$
and since a + 1 > a > 0 and b + 1 > b > 0 , part 2 that was just proven can be applied using a + 1 and b + 1 to conclude
$\frac{d}{dx}\, {}_1F_1(a; b; x) = \frac{a}{b}\, {}_1F_1(a+1; b+1; x) > 0.$
Hence,   1 F 1 ( a ; b ; x ) is strictly increasing in x.
(4) For 0 < x , 1 =   1 F 1 ( a ; b ; 0 ) <   1 F 1 ( a ; b ; x ) since it is strictly increasing in x. Hence, the result follows. □

3.2. Parametric Behavior of the Mean

Many parameters have been introduced into this model: so many that their number was reduced to produce a model that was tractable with currently available mathematical machinery. The set that remains includes the height of the bar (h), a correlation of the fixations ( ρ ), a variance of the fixations ( ψ ), and the noncentrality parameter of the chi distribution of the side length ( λ = h / ψ ). (The subscript A is dropped from ψ for simplicity, as it is understood.) Closely related to λ is a scale ( τ = σ / h ). In working with this model, σ appears almost exclusively in ratio with h, allowing relabeling with τ . It has been found in simulations and real data analyses that τ is a critical indicator of the success of the method under consideration. Therefore, the analysis to follow will focus on h, λ , τ , and ρ . If needed, we can always write σ = h τ . No claim is being made that these form a minimal set of parameters.

3.2.1. Parametric Representation of the Mean

Using $\Gamma(3/2) = \sqrt{\pi}/2$ and $\Gamma(1) = 1$ along with the following relationships between parameters
$\psi = \sqrt{2}\,\sigma\sqrt{1-\rho}$
$\phantom{\psi} = \sqrt{2}\,h\tau\sqrt{1-\rho}$
$\lambda = \frac{h}{\psi}$
the mean in Equation (27) can be expressed in terms of the various parameters as
$\mu(\sigma; h, \rho) = \sqrt{\pi}\,\sigma\sqrt{1-\rho}\, \exp\!\left(-\frac{h^2}{4\sigma^2(1-\rho)}\right)\, {}_1F_1\!\left(\frac{3}{2}; 1; \frac{h^2}{4\sigma^2(1-\rho)}\right)$
$\mu(\rho; h, \tau) = h\sqrt{\pi}\,\tau\sqrt{1-\rho}\, \exp\!\left(-\frac{1}{4\tau^2(1-\rho)}\right)\, {}_1F_1\!\left(\frac{3}{2}; 1; \frac{1}{4\tau^2(1-\rho)}\right)$
$\mu(\tau; h, \rho) = h\sqrt{\pi}\,\tau\sqrt{1-\rho}\, \exp\!\left(-\frac{1}{4\tau^2(1-\rho)}\right)\, {}_1F_1\!\left(\frac{3}{2}; 1; \frac{1}{4\tau^2(1-\rho)}\right)$
$\mu(\lambda; h) = h\sqrt{\frac{\pi}{2}}\,\frac{\exp(-\lambda^2/2)}{\lambda}\, {}_1F_1\!\left(\frac{3}{2}; 1; \frac{\lambda^2}{2}\right)$
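These four representations are algebraic rearrangements of the same mean. The sketch below (with arbitrary illustrative parameter values) evaluates $\mu(\rho; h, \tau)$ and $\mu(\lambda; h)$ and confirms that they agree once λ is computed from $h$, τ, and ρ.

```python
import numpy as np
from scipy.special import hyp1f1

def mu_rho(rho, h, tau):
    u = 1.0 / (4.0 * tau**2 * (1.0 - rho))
    return h * np.sqrt(np.pi) * tau * np.sqrt(1.0 - rho) * np.exp(-u) * hyp1f1(1.5, 1.0, u)

def mu_lambda(lam, h):
    return h * np.sqrt(np.pi / 2.0) * np.exp(-lam**2 / 2.0) / lam * hyp1f1(1.5, 1.0, lam**2 / 2.0)

h, tau, rho = 10.0, 0.05, 0.3             # illustrative values
psi = np.sqrt(2.0) * h * tau * np.sqrt(1.0 - rho)
lam = h / psi                             # noncentrality parameter

print(mu_rho(rho, h, tau), mu_lambda(lam, h))   # the same number from both forms
```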

3.2.2. General Framework for Parametric Analysis

For purposes of discussion, let θ be one of the parameters in the model. The analysis proceeds by first finding the derivative of the mean ( μ ) with respect to θ . This is used in conjunction with Lemma 1 to determine if the mean is monotone in θ . Next, the asymptotic properties of both μ and d μ / d θ are examined.
These objectives will be facilitated by the next two theorems. Theorem 1 provides half of a chain rule argument needed to determine the derivative of μ with respect to θ . Unfortunately, the derivatives cannot follow as simple corollaries because their nonlinear, coupled nature makes each case different, and they must be handled separately. With that said, Theorem 2 supplies an asymptotic analysis that can be easily exploited to obtain the large (or small) θ behavior of μ .

3.2.3. An Intermediate Form for Analysis

It is convenient for purposes of analysis to derive an intermediate expression for the mean in terms of the ratio of certain parameters. Namely,
$u(\psi) = \frac{h^2}{2\psi^2} = \frac{h^2}{4h^2\tau^2(1-\rho)} = \frac{1}{4\tau^2(1-\rho)}.$
Equation (27) becomes
$\mu = \sqrt{\frac{\pi}{2}}\,\psi\, \exp\!\left(-u(\psi)\right)\, {}_1F_1\!\left(3/2; 1; u(\psi)\right).$
Note that ψ can be written in terms of u as
$\psi = \frac{h}{\sqrt{2u}}$
and μ becomes a function of u as
$\mu(u; h) = h\,\frac{\sqrt{\pi}}{2}\,\frac{e^{-u}}{\sqrt{u}}\, {}_1F_1(3/2; 1; u) \equiv h\,\Phi(u)$
for u > 0 . In much of the analysis to follow, the function Φ ( u ) plays a critical role. Expressions in one set of parameters will be transformed into Φ ( u ) , and its properties will be exploited. Hence, two fundamental results about it are given.
Theorem 1.
The function
$\Phi(u) = \frac{\sqrt{\pi}}{2}\,\frac{e^{-u}}{\sqrt{u}}\, {}_1F_1(3/2; 1; u)$
is decreasing for positive values of u.
Proof. 
Taking the derivative of both sides of Equation (44) with respect to u gives
$\frac{2}{\sqrt{\pi}}\,\frac{d\Phi}{du}(u) = \frac{d}{du}\!\left(\frac{1}{\sqrt{u}}\right) e^{-u}\, {}_1F_1(3/2; 1; u) + \frac{1}{\sqrt{u}}\,\frac{d}{du}\!\left(e^{-u}\, {}_1F_1(3/2; 1; u)\right).$
Using Equation (32) gives
$\frac{d}{du}\!\left(e^{-u}\, {}_1F_1(3/2; 1; u)\right) = (-1)\,\frac{(1 - 3/2)_1}{(1)_1}\, e^{-u}\, {}_1F_1(3/2; 2; u) = (-1)(-1/2)\, e^{-u}\, {}_1F_1(3/2; 2; u) = \frac{1}{2}\, e^{-u}\, {}_1F_1(3/2; 2; u).$
Combining these into Equation (45) gives
$\frac{2}{\sqrt{\pi}}\,\frac{d\Phi(u)}{du} = -\frac{1}{2}\, u^{-3/2}\, e^{-u}\, {}_1F_1(3/2; 1; u) + \frac{1}{2}\, u^{-1/2}\, e^{-u}\, {}_1F_1(3/2; 2; u)$
$= \frac{e^{-u}}{2u^{3/2}}\left[-\,{}_1F_1(3/2; 1; u) + u\, {}_1F_1(3/2; 2; u)\right].$
Now, the recurrence in Equation (30) is applied with a = 3 / 2 and b = 1 :
${}_1F_1(3/2; 1; u) - {}_1F_1(1/2; 1; u) - u\, {}_1F_1(3/2; 2; u) = 0$
or
$-\,{}_1F_1(3/2; 1; u) + u\, {}_1F_1(3/2; 2; u) = -\,{}_1F_1(1/2; 1; u).$
Substituting Equation (48) into Equation (47) yields for positive values u:
$\frac{d\Phi}{du} = -\frac{\sqrt{\pi}}{4}\,\frac{e^{-u}}{u^{3/2}}\, {}_1F_1(1/2; 1; u) < 0$
by Lemma 1. It follows that Φ ( u ) is a decreasing function of u. □
From this, we have immediately
$\frac{d\mu}{du} = h\,\frac{d\Phi}{du} = -\frac{h\sqrt{\pi}}{4}\,\frac{e^{-u}}{u^{3/2}}\, {}_1F_1(1/2; 1; u).$
Now, a general theorem is proven for the variable u, binding together the key parameters in the way they naturally appear for the expected value of the length of a side of the triangle.
Theorem 2.
For Φ ( u ) , defined in Equation (44),
$\Phi(u) \to 1$
as $u \to \infty$.
Proof. 
In [30], (13.2.23) and (13.2.4) can be combined to give
${}_1F_1(a; b; z) \sim \frac{\Gamma(b)}{\Gamma(a)}\, e^{z} z^{a-b}, \qquad z \to \infty, \quad |\mathrm{ph}\, z| \le \frac{\pi}{2} - \delta$
where δ is an arbitrarily small positive number. (This does not hold for the polynomial cases $a = 0, -1, -2, \ldots$.) In particular,
${}_1F_1(3/2; 1; u) \sim \frac{\Gamma(1)}{\Gamma(3/2)}\, e^{u}\, u^{3/2 - 1}.$
Since Γ ( 1 ) = 1 and Γ ( 3 / 2 ) = π / 2 , this yields
${}_1F_1(3/2; 1; u) \sim \frac{2}{\sqrt{\pi}}\, e^{u} \sqrt{u}.$
Using Equation (53) in Equation (44), one obtains
$\Phi(u) \sim \frac{\sqrt{\pi}}{2}\,\frac{e^{-u}}{\sqrt{u}}\cdot\frac{2}{\sqrt{\pi}}\, e^{u}\sqrt{u} = 1, \qquad u \to \infty. \;\square$
Using this, we have from Equation (43) and Theorem 2 that
$\mu(u; h) \to h \quad \text{as } u \to \infty.$
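Theorem 2 is easy to check numerically: as shown in the sketch below, $\Phi(u)$ evaluated with scipy.special.hyp1f1 decreases toward 1 as $u$ grows. The grid of $u$ values is an arbitrary illustration.

```python
import numpy as np
from scipy.special import hyp1f1

def Phi(u):
    """Phi(u) = (sqrt(pi)/2) * exp(-u)/sqrt(u) * 1F1(3/2; 1; u)."""
    return (np.sqrt(np.pi) / 2.0) * np.exp(-u) / np.sqrt(u) * hyp1f1(1.5, 1.0, u)

for u in (1.0, 10.0, 100.0, 500.0):
    print(f"u = {u:7.1f}   Phi(u) = {Phi(u):.6f}")
# Phi(u) is decreasing in u and tends to 1 from above, so mu(u; h) = h * Phi(u) -> h.
```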

3.2.4. Analysis of the Correlation Parameter

In order to do a parametric analysis of μ ( ρ ; h , τ ) , it is sufficient to use material in the proof of Theorem 1 and do a chain rule argument. The following corollary summarizes the results.
Corollary 1.
$\mu(\rho; h, \tau)$, given in Equation (37), is a decreasing function of ρ over the interval $(-1, 1)$.
Proof. 
The proof proceeds by transforming μ ( ρ ; h , τ ) into the intermediate form using
$u = \frac{h^2}{2\psi^2}, \qquad \psi = \sqrt{2}\,h\tau\sqrt{1-\rho},$
then taking the derivative with respect to ρ and showing it is negative. We rewrite Equation (37) as
$\mu(\rho; h, \tau) = \sqrt{\frac{\pi}{2}}\,\sqrt{2}\,h\tau\sqrt{1-\rho}\, \exp\!\left(-\frac{h^2}{4h^2\tau^2(1-\rho)}\right)\, {}_1F_1\!\left(\frac{3}{2}; 1; \frac{h^2}{4h^2\tau^2(1-\rho)}\right).$
Noting
$\sqrt{2}\,h\tau\sqrt{1-\rho} = \frac{h}{\sqrt{2u}}$
and substituting, we obtain
$\mu(u; h) = h\,\frac{\sqrt{\pi}}{2}\,\frac{e^{-u}}{\sqrt{u}}\, {}_1F_1(3/2; 1; u).$
By the chain rule,
$\frac{d\mu}{d\rho} = \frac{d\mu}{du}\,\frac{du}{d\psi}\,\frac{d\psi}{d\rho}.$
We have
$\frac{du}{d\psi} = -\frac{2u}{\psi}$
and
$\frac{d\psi}{d\rho} = -\frac{h\tau}{\sqrt{2}}\,\frac{1}{\sqrt{1-\rho}}.$
Using Equation (59), Equation (60), and Equation (50) in Equation (58) gives
$\frac{d\mu}{d\rho} = \left(-\frac{h\sqrt{\pi}}{4}\,\frac{e^{-u}}{u^{3/2}}\, {}_1F_1(1/2; 1; u)\right)\left(-\frac{2u}{\psi}\right)\left(-\frac{h\tau}{\sqrt{2}}\,\frac{1}{\sqrt{1-\rho}}\right).$
After canceling cross-terms, we apply
$\frac{1}{\sqrt{u}} = \frac{\sqrt{2}\,\psi}{h}$
and
$u = \frac{h^2}{2\psi^2} = \frac{1}{4\tau^2(1-\rho)}$
to return to the original variables, obtaining
$\frac{d\mu}{d\rho}(\rho; h, \tau) = -\frac{h\sqrt{\pi}}{2}\,\frac{\tau}{\sqrt{1-\rho}}\, e^{-\frac{1}{4\tau^2(1-\rho)}}\, {}_1F_1\!\left(\frac{1}{2}; 1; \frac{1}{4\tau^2(1-\rho)}\right).$
From this, it is seen that $d\mu/d\rho < 0$ by part 2 of Lemma 1 over the interval $-1 < \rho < 1$. Therefore, $\mu(\rho; h, \tau)$ is decreasing as ρ approaches 1 from below. □
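As a numerical cross-check of Corollary 1 (with assumed illustrative values of $h$ and τ), the closed-form derivative in Equation (62) can be compared with a central finite difference of $\mu(\rho; h, \tau)$; both are negative across the interval.

```python
import numpy as np
from scipy.special import hyp1f1

def mu(rho, h, tau):
    u = 1.0 / (4.0 * tau**2 * (1.0 - rho))
    return h * np.sqrt(np.pi) * tau * np.sqrt(1.0 - rho) * np.exp(-u) * hyp1f1(1.5, 1.0, u)

def dmu_drho(rho, h, tau):
    u = 1.0 / (4.0 * tau**2 * (1.0 - rho))
    return -(h * np.sqrt(np.pi) / 2.0) * (tau / np.sqrt(1.0 - rho)) * np.exp(-u) * hyp1f1(0.5, 1.0, u)

h, tau, eps = 10.0, 0.1, 1e-6
for rho in (-0.8, -0.3, 0.0, 0.3, 0.8):
    fd = (mu(rho + eps, h, tau) - mu(rho - eps, h, tau)) / (2 * eps)
    print(f"rho={rho:+.1f}  closed form={dmu_drho(rho, h, tau):.6f}  finite diff={fd:.6f}")
```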
We now know that the expected length of a side of the triangle actually decreases as the fixation points become more strongly correlated in a positive sense. (Note that the particular side connecting the two points labeled one and two enters the model only through the value of the noncentrality parameter. Ultimately, the value we labeled as $h^2$ could become $b^2$ or $h^2 + b^2$. We have chosen the stochastic edge $D_{12}$ because of its association with the eye attempting to measure the height of the bar.)
Corollary 2.
For μ ( ρ ; h , τ ) , given in Equation (37),
$\mu(\rho; h, \tau) \to h$
as $\rho \to 1^-$.
Proof. 
It was shown in the proof of Corollary 1 that μ ( ρ ; h , τ ) can be transformed into the intermediate form using
$u = \frac{h^2}{2\psi^2}, \qquad \psi = \sqrt{2}\,h\tau\sqrt{1-\rho},$
which can be written as
μ ( u ; h ) = h Φ ( u ) .
It is easily seen that $u \to \infty$ as $\rho \to 1^-$ and vice versa, since τ is fixed. In this case, Theorem 2 is applied to show that $\Phi(u) \to 1$ and, therefore, $\mu(\rho; h, \tau) \to h$. □
If it were true that $d\mu/d\rho = 0$ everywhere, then the correlation introduced in the model would have no influence on the mean length of the sides of the stochastic triangles used for estimating the bar height visually. That is not the case, however, as Corollary 1 shows the derivative is nonzero everywhere over $(-1, 1)$. In order to understand the nature of this, the following theorem was proven. Discussion will follow to explain it from an eye-tracking perspective.
Theorem 3.
For $\frac{d\mu}{d\rho}(\rho; h, \tau)$, given in Equation (62),
$\frac{d\mu}{d\rho} \to -h\tau^2$
as $\rho \to 1^-$.
Proof. 
Using (10.6.9) from the NIST handbook [30], one can relate the confluent hypergeometric function to the modified Bessel function of the first kind of order zero. We have
${}_1F_1\!\left(\frac{1}{2}; 1; 2\cdot\frac{1}{8\tau^2(1-\rho)}\right) = e^{\frac{1}{8\tau^2(1-\rho)}}\, I_0\!\left(\frac{1}{8\tau^2(1-\rho)}\right).$
Let
$w = \frac{1}{8\tau^2(1-\rho)}$
so that
$2\sqrt{2}\,\tau\sqrt{w} = \frac{1}{\sqrt{1-\rho}}.$
Substituting in Equation (62) gives
$\frac{d\mu}{d\rho}(w; h, \tau) = -\frac{h\sqrt{\pi}}{2}\,\tau\cdot 2\sqrt{2}\,\tau\sqrt{w}\, e^{-2w}\, e^{w} I_0(w) = -h\sqrt{2\pi}\,\tau^2\sqrt{w}\, e^{-w} I_0(w).$
From the NIST handbook (10.10.4), we have
$I_0(w) \sim \frac{e^{w}}{\sqrt{2\pi w}}, \qquad w \to \infty.$
Substituting this in the above, one finds, as w ,
$\frac{d\mu}{d\rho}(w; h, \tau) \sim -h\sqrt{2\pi}\,\tau^2\sqrt{w}\, e^{-w}\,\frac{e^{w}}{\sqrt{2\pi w}} = -h\tau^2.$
By the definition of $w$, there are three ways it can be made to become infinite: $\rho \to 1^-$, $\tau^2 \to 0$, or both. However, τ is a parameter and is held constant, so we take $w \to \infty$ to be equivalent to $\rho \to 1^-$. Thus, the theorem is proven. □

3.2.5. Analysis of the Noncentrality Parameter

Now, we consider the mean in Equation (27) as a function of the noncentrality parameter λ .
Corollary 3.
$\mu(\lambda; h)$, given in Equation (39), is a decreasing function of λ over the interval $(0, \infty)$.
Proof. 
For μ given in Equation (39) and with u and ψ defined in terms of λ as
$u = \frac{1}{2}\frac{h^2}{\psi^2}, \qquad \psi = \frac{h}{\lambda},$
we can transform Equation (39) into the intermediate form. Combining these transformations, we see that $\lambda = \sqrt{2u}$ and obtain
$\mu(u; h) = h\,\frac{\sqrt{\pi}}{2}\,\frac{e^{-u}}{\sqrt{u}}\, {}_1F_1(3/2; 1; u).$
Now, we proceed as before and use the properties of the intermediate form. By the chain rule,
$\frac{d\mu}{d\lambda} = \frac{d\mu}{du}\,\frac{du}{d\psi}\,\frac{d\psi}{d\lambda}.$
We note
$\frac{d\psi}{d\lambda} = -\frac{\psi^2}{h} = -\frac{h}{\lambda^2}.$
Using this with Equation (59) and Equation (50) in Equation (68) gives
$\frac{d\mu}{d\lambda} = \left(-\frac{h\sqrt{\pi}}{4}\,\frac{e^{-u}}{u^{3/2}}\, {}_1F_1\!\left(\frac{1}{2}; 1; u\right)\right)\left(-\frac{2u}{\psi}\right)\left(-\frac{h}{\lambda^2}\right).$
Now, we use
$\frac{1}{\sqrt{u}} = \frac{\sqrt{2}}{\lambda}$
and
$\frac{1}{\psi} = \frac{\lambda}{h}$
to obtain
$\frac{d\mu}{d\lambda}(\lambda; h) = -h\sqrt{\frac{\pi}{2}}\,\frac{e^{-\lambda^2/2}}{\lambda^2}\, {}_1F_1\!\left(\frac{1}{2}; 1; \frac{\lambda^2}{2}\right).$
From this, it is seen that $d\mu/d\lambda < 0$ by Lemma 1 over the interval $(0, \infty)$. Therefore, $\mu(\lambda; h)$ is decreasing. □
Corollary 4.
For μ ( λ ; h ) , given in Equation (39),
$\mu(\lambda; h) \to h$
as $\lambda \to \infty$.
Proof. 
It was just shown that under appropriate relationships among the parameters that μ ( λ ; h ) can be transformed into the intermediate form μ ( u ; h ) = h Φ ( u ) , where
$u = \frac{1}{2}\lambda^2.$
Clearly, $u \to \infty$ as $\lambda \to \infty$ and vice versa. Hence, we apply Theorem 2 and conclude that $\Phi(u) \to 1$; therefore, $\mu(\lambda; h) \to h$ as $\lambda \to \infty$. □
Now, we consider the derivative of the mean with respect to the noncentrality parameter.
Theorem 4.
For $\frac{d\mu}{d\lambda}(\lambda; h)$, given in Equation (70),
$\frac{d\mu}{d\lambda}(\lambda; h) \sim -\frac{h}{\lambda^3}$
as $\lambda \to \infty$.
Proof. 
We change the variable to
$t = \frac{\lambda^2}{4}$
resulting in
$\frac{d\mu}{d\lambda}(t; h) = -h\sqrt{\frac{\pi}{2}}\,\frac{e^{-2t}}{4t}\, {}_1F_1\!\left(\frac{1}{2}; 1; 2t\right).$
As previously noted, the following relationship holds:
${}_1F_1(1/2; 1; 2t) = e^{t} I_0(t).$
Using this in Equation (73) gives
$\frac{d\mu}{d\lambda}(t; h) = -\frac{h}{4}\sqrt{\frac{\pi}{2}}\,\frac{e^{-t}}{t}\, I_0(t).$
As seen in Equation (66), for $t \to \infty$,
$\frac{d\mu}{d\lambda}(t; h) \sim -\frac{h}{4}\sqrt{\frac{\pi}{2}}\,\frac{e^{-t}}{t}\,\frac{e^{t}}{\sqrt{2\pi t}} = -\frac{h}{8}\,\frac{1}{t^{3/2}}.$
Obviously, $t \to \infty$ if and only if $\lambda \to \infty$. Since $1/t^{3/2} = 8/\lambda^3$, we conclude that
$\frac{d\mu}{d\lambda}(\lambda; h) \sim -\frac{h}{8}\cdot\frac{8}{\lambda^3} = -\frac{h}{\lambda^3}$
as $\lambda \to \infty$. □

3.2.6. Analysis of the Scale Parameter

Now, we consider the influence of the τ parameter. Recall τ = σ / h , which is the ratio of the spread of the fixation points to the height of the bar. It gives the scale of the visualization problem. Let us consider the sensitivity of the mean to it.
Corollary 5.
$\mu(\tau; h, \rho)$, given in Equation (38), is an increasing function of τ over the interval $(0, \infty)$.
Proof. 
Using u and ψ satisfying
$u = \frac{1}{2}\frac{h^2}{\psi^2}, \qquad \psi^2 = 2h^2\tau^2(1-\rho)$
then $\mu(\tau; h, \rho)$ can be transformed into the intermediate form in a fashion identical to what has already been done for $\mu(\rho; h, \tau)$.
By the chain rule,
$\frac{d\mu}{d\tau} = \frac{d\mu}{du}\,\frac{du}{d\psi}\,\frac{d\psi}{d\tau}.$
We need d ψ / d τ :
$\frac{d\psi}{d\tau} = \frac{d}{d\tau}\!\left(\sqrt{2}\,h\tau\sqrt{1-\rho}\right) = \sqrt{2}\,h\sqrt{1-\rho}.$
Using Equation (77), Equation (59), and Equation (50) in Equation (76) gives
$\frac{d\mu}{d\tau} = \left(-\frac{h\sqrt{\pi}}{4}\,\frac{e^{-u}}{u^{3/2}}\, {}_1F_1(1/2; 1; u)\right)\left(-\frac{2u}{\psi}\right)\left(\sqrt{2}\,h\sqrt{1-\rho}\right).$
We must eliminate $u$ and $\psi$ from the expression, leaving only the parameters $h$, ρ, and τ. To this end, we transform it according to
$\frac{1}{\psi} = \frac{1}{\sqrt{2}\,h\tau\sqrt{1-\rho}}, \qquad \frac{1}{\sqrt{u}} = 2\tau\sqrt{1-\rho}.$
After making the appropriate substitutions followed by basic manipulations, we obtain
$\frac{d\mu}{d\tau}(\tau; h, \rho) = h\sqrt{\pi}\,\sqrt{1-\rho}\, e^{-\frac{1}{4\tau^2(1-\rho)}}\, {}_1F_1\!\left(\frac{1}{2}; 1; \frac{1}{4\tau^2(1-\rho)}\right).$
From this, it is seen that $d\mu/d\tau > 0$ by Lemma 1 over the interval $(0, \infty)$. Therefore, $\mu(\tau; h, \rho)$ is increasing. □
Corollary 6.
For μ ( τ ; h , ρ ) , given in Equation (38),
$\mu(\tau; h, \rho) \to h$
as $\tau \to 0^+$.
Proof. 
As has already been demonstrated for μ ( ρ ; h , τ ) , it is possible to transform this expression into the intermediate form. The details are seen in Corollary 1 and shall be omitted here.
Once $\mu(\tau; h, \rho)$ is transformed into $\mu(u; h) = h\,\Phi(u)$, we note that, unlike the other key parameters, as τ becomes small, $u$ becomes large, as can be seen from the definition
$u = \frac{1}{4\tau^2}\,\frac{1}{1-\rho}.$
That is, $u \to \infty$ as $\tau \to 0^+$ and vice versa, since ρ is a parameter that is held constant. Hence, Theorem 2 yields $\Phi(u) \to 1$, and therefore, $\mu(\tau; h, \rho) \to h$. □
Finally, we calculate the derivative with respect to τ and look at its asymptotic properties, as has been done similarly before.
Theorem 5.
For $\frac{d\mu}{d\tau}(\tau; h, \rho)$, given in Equation (78),
$\frac{d\mu}{d\tau}(\tau; h, \rho) \sim 2h(1-\rho)\tau$
as $\tau \to 0^+$.
Proof. 
We change the variables to
$v = \frac{1}{8\tau^2(1-\rho)}.$
Substituting Equation (82) into Equation (78), we obtain
$\frac{d\mu}{d\tau}(v; h, \rho) = h\sqrt{\pi}\,\sqrt{1-\rho}\, e^{-2v}\, {}_1F_1\!\left(\frac{1}{2}; 1; 2v\right).$
Again, we have
${}_1F_1(1/2; 1; 2v) = e^{v} I_0(v)$
which we use in Equation (83):
$\frac{d\mu}{d\tau}(v; h, \rho) = h\sqrt{\pi}\,\sqrt{1-\rho}\, e^{-v} I_0(v).$
Recalling the asymptotic behavior of the modified Bessel function given in Equation (66) and substituting into the previous equation, we obtain
$\frac{d\mu}{d\tau}(v; h, \rho) \sim h\sqrt{\pi}\,\sqrt{1-\rho}\, e^{-v}\,\frac{e^{v}}{\sqrt{2\pi v}} = \frac{h}{\sqrt{2}}\,\frac{1}{\sqrt{v}}\,\sqrt{1-\rho}$
as $v \to \infty$. We invert the mapping from $v$ to τ using
$\frac{1}{\sqrt{v}} = 2\sqrt{2}\,\tau\sqrt{1-\rho}.$
Note that as $v \to \infty$, we have $\tau \to 0^+$, and
$\frac{d\mu}{d\tau}(\tau; h, \rho) \sim \frac{h}{\sqrt{2}}\cdot 2\sqrt{2}\,\tau(1-\rho) = 2h(1-\rho)\tau. \;\square$

4. Discussion

A primary objective of this paper is to understand the so-called expected visual measurement error (EVME): that is, the mistakes people make when using the structure of the visualization to recover the number encoded in the height or length of a geometric mark. In particular, we focus on the error made in measuring the height of a bar in a bar graph.
There is no universally accepted definition of the expected visual measurement error, so this research proposes a simple one to explore it and see what can be learned. The definition of the EVME will be given in the next section. With a definition in hand, we can use all the extensive groundwork that has been prepared to explore its behavior.
Particular attention is given to the correlation parameter ρ, as this is a new contribution to the landmark literature. It can be said that such correlations have no significance when the methodology is applied to, say, discovering the shape of a baboon’s vertebrae, but we believe including the correlation is important, if not required, to model eye motion. It will be shown that the other parameters appear naturally in the analysis of the expected visual measurement error.

4.1. Expected Visual Measurement Error

Error is typically the difference between an actual and a measured value. Knowing how the height of a bar is measured visually is problematic; this may differ from person to person. Hence, we developed the following approach for this course of work. Namely, we make the height of the bar a random variable. Let us work through the concept of this model.
First, the viewer looks at the base, leaving a fixation, then moves rapidly to the top of the bar (the next AOI). This is followed by a glance toward the ruler in an effort to obtain a reading on the height. The process is repeated by the path returning to the base of the bar to again obtain a feel for the height of the bar. This creates a family of fixations that forms a random triangle. See Figure 5. The circled edges are part of the height estimation process.
Figure 3 shows a particular triangle. The edge of the triangle P 1 P 2 has length D 12 , which is a random variable with a noncentral chi distribution with noncentrality parameter λ , as has been discussed. The random variable is a model of the height of the bar. Admittedly, how the viewer uses the ruler to assign a length to D 12 is not clear. From the model’s perspective, each triangle would have associated with it a pairing ( r ( P 3 ) , D 12 ( P 1 , P 2 ) ) , where r ( P 3 ) is the reading taken from the ruler based on vertex P 3 .
In this paper, it is proposed that after viewing the bar graph and creating multiple triangles, the viewer assigns the expected value of D 12 as the measured height of the bar. (The exact mechanics of how this is done are not well explained. We envision that for each fixation by the ruler, a number is captured. This becomes the assigned length of the edge. In some sort of limiting process, the mean is recovered from an unbiased statistic. This is being actively investigated and will be an area of future publication.). That is, r ( P 3 ) = E ( D 12 ) . This makes the expected visual measurement error
$\mathrm{EVME} = E(D_{12} - h) = E(D_{12}) - h.$
This brings us to the following key theorem.
Theorem 6.
For μ, given in Equation (27),
$\mathrm{EVME} = E(D_{12}) - h \sim h(1-\rho)\tau^2 + \frac{1}{2}h(1-\rho)^2\tau^4 + \cdots$
as $\tau \to 0^+$.
Proof. 
We appeal to the paper by Park [29] for the formula for the asymptotic expansion of the mean as the noncentrality parameter becomes large. In his formula, we set $a = 1$ for the first moment, $N = 2$ for the number of degrees of freedom, and $y_0 = \lambda$ for the noncentrality parameter. This yields, after simplification,
$E\!\left(\frac{D_{12}}{\psi}; \lambda\right) \sim \lambda\left(1 + \frac{1}{2}\lambda^{-2} + \frac{1}{8}\lambda^{-4} + \cdots\right) = \lambda + \frac{1}{2}\lambda^{-1} + \frac{1}{8}\lambda^{-3} + \cdots, \qquad \lambda \to \infty$
so that
$E(D_{12}) \sim \psi\lambda + \frac{1}{2}\psi\lambda^{-1} + \frac{1}{8}\psi\lambda^{-3} + \cdots, \qquad \lambda \to \infty.$
Now, we use λ = h / ψ :
$E(D_{12}) \sim \psi(h/\psi) + \frac{1}{2}\psi(h/\psi)^{-1} + \frac{1}{8}\psi(h/\psi)^{-3} + \cdots = h + \frac{1}{2}(\psi^2/h) + \frac{1}{8}(\psi^4/h^3) + \cdots, \qquad \tau \to 0^+$
where $\lambda \to \infty$ corresponds to $\tau \to 0^+$ (since $1/\lambda = \sqrt{2}\,\tau\sqrt{1-\rho}$). We substitute $\psi^2 = 2h^2\tau^2(1-\rho)$ and rearrange terms to obtain
$E(D_{12}) - h = E(D_{12} - h) \sim h\tau^2(1-\rho) + \frac{1}{2}h\tau^4(1-\rho)^2 + \cdots, \qquad \tau \to 0^+.$
Therefore, we conclude
$\mathrm{EVME} \sim h\tau^2(1-\rho) + \frac{1}{2}h\tau^4(1-\rho)^2 + \cdots, \qquad \tau \to 0^+. \;\square$
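A numerical illustration of Theorem 6 (with arbitrary illustrative values of $h$ and ρ): the exact EVME computed from the hypergeometric mean is compared with the two-term asymptotic expansion as τ shrinks.

```python
import numpy as np
from scipy.special import hyp1f1

def evme_exact(tau, h, rho):
    u = 1.0 / (4.0 * tau**2 * (1.0 - rho))
    mu = h * np.sqrt(np.pi) * tau * np.sqrt(1.0 - rho) * np.exp(-u) * hyp1f1(1.5, 1.0, u)
    return mu - h

def evme_asymptotic(tau, h, rho):
    return h * (1.0 - rho) * tau**2 + 0.5 * h * (1.0 - rho)**2 * tau**4

h, rho = 10.0, 0.3
for tau in (0.2, 0.1, 0.05):
    print(f"tau={tau:.2f}  exact={evme_exact(tau, h, rho):.6f}  "
          f"asymptotic={evme_asymptotic(tau, h, rho):.6f}")
```

The agreement improves as τ decreases, which is the regime in which the expansion is derived.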
This is a remarkable expression. It shows the higher-order dependency of the expected visual measurement error in terms of the parameters we have been studying. That the quality of the estimate should depend on τ was already suspected in the literature. Now that dependency is clearly seen. Before we study this matter and others, it is worthwhile to summarize what we have learned in our analysis of the basic parametric dependence of the mean.

4.2. Remarks on the Correlation Parameter

One of the main contributions of this paper is the analysis of the influence of the correlation. Let us look at it more closely. Recall that the general model originally proposed had seven correlations in it. In order to proceed, we had to consider cases that were tractable with the available mathematical machinery. Hence, we made decisions that left η and ζ uncorrelated and gave them identical variances. In particular, we set
$\rho = \rho_1 = \rho_2 = \rho_{12x} = \rho_{12y} = \rho_{1y2x} = \rho_{1x2y}$
with
$\sigma = \sigma_{1x} = \sigma_{2x} = \sigma_{1y} = \sigma_{2y}$
making $\mathrm{Cov}(\eta, \zeta) = 0$ (see Equation (17)) and $\psi = \sqrt{2\sigma^2(1-\rho)}$, which has allowed interesting analysis; but notice what happened in particular. Making $\mathrm{Cov}(\eta, \zeta) = 0$ essentially removed four correlations from the model: namely, $\rho_1$, $\rho_2$, $\rho_{1y2x}$, and $\rho_{1x2y}$. The two correlations that remained in the model under the name of ρ were $\rho_{12x}$ and $\rho_{12y}$. Furthermore, we required them to have the same sign and magnitude.
Figure 6 visualizes the consequences of the assumption we made regarding the correlation. Consider situations for which
$\mathrm{corr}(d_{2x}, d_{1x}) = \mathrm{corr}(d_{2y}, d_{1y}) > 0.$
This corresponds to fixation clouds that are either shifted to the right and up or to the left and down. Since we have assumed
$\sigma = \sigma_{1x} = \sigma_{2x} = \sigma_{1y} = \sigma_{2y},$
the fixation clouds have roughly similar shapes at each end of the bar. This results in what is essentially a rigid body shift of the bar without a significant change of length. This is not the case with negative correlation. As shown in Figure 6, the fixation cloud shifts rotate the bar and either compress or stretch it.
Does the model support these suspicions? Recall Corollary 1, which shows the mean is decreasing in ρ over the interval $(-1, 1)$. The mean is the largest for fixation clouds with correlations near −1. This means the expected visual measurement error will be worse for negative correlations. This seems to be consistent with the foregoing discussion. Similarly, translations with little to no dilation would change the mean little. These correspond to strongly positive correlations. Furthermore, we know from Corollary 2 that $\mu \to h$ as $\rho \to 1^-$, so the EVME will become small as the correlation becomes strongly positive. This would seem to be consistent with the previous observations shown in Figure 6.
What remains to question is whether there is a significant difference caused by including the correlation. We have calculated the derivative of the mean with respect to the correlation in Equation (62). Furthermore, we know that as $\rho \to 1^-$,
$\frac{d\mu}{d\rho} \to -h\tau^2.$
As we normally consider small values of τ , this says the derivative in these cases is small, particularly since τ is squared. Therefore, for positive correlation, there might not be much gained by including this parameter. However, for negatively correlated fixations, the relative size of the derivative is unknown.

4.3. Remarks on the Scale Parameter

The term τ has proven to be an important parameter. We know from Corollary 5 that the mean value of the random variable $D_{12}$ is increasing in τ. This means that the mean decreases as τ approaches zero from above. Furthermore, Corollary 6 shows the mean asymptotically approaches $h$ as $\tau \to 0^+$. From this, it follows that the expected visual measurement error is very small when τ is small.
It is reasonable to explore how the mean behaves as τ 0 + . We found
$\frac{d\mu}{d\tau} \sim 2h\tau(1-\rho).$
This suggests that μ ( τ ) is approximately quadratic in τ as it becomes small. We have already seen this in Equation (87), which was derived in a different manner. We provide more on this in the next section.
The need for small $\tau = \sigma/h$ is pervasive across the literature. In their book [23], Stoyan and Stoyan address the topic several times in a variety of settings, but mostly concerning the quality of statistical estimators. Anderson observes [31,32], for instance, that if $\sigma \ll h$, simple empirical estimators are as good as more complex maximum likelihood estimators. It should be noted that they often use a slightly different τ: namely, σ divided by the triangle’s longest side, which is $d_{13}$. This affects the parametric analysis in interesting ways but will be saved for a future line of work.
Since we have keen interest in the correlation, we notice the role ρ plays. Strongly positive correlation will diminish the effects of scale on the mean, whereas strong negative correlation increases the effects of τ . Recall the interpretation that positive correlation results in less change in D 12 and more rigid displacement. It seems reasonable that there would be less change in μ as τ changes, as indicated in asymptotic Equation (90). However, negative correlation was reasoned to be associated with a potentially greater length change. Therefore, μ ( τ ) having a steeper slope near zero indicates greater change is happening.
The term τ is the parameter that ties the viewer and the visualization together. The term σ is a property of the fixations and their spread about the ends of the bar, while $h$ is a property of the visualization. The ratio of their values determines the quality of the visual assessment of the bar. In studies of figure shapes (which can be subdivided into sets of triangles if needed), τ is estimated [33,34]. This raises the question of whether $h$ and σ are truly individual, separate parameters, or whether their ratio is intrinsic to the viewer. That is, will a viewer increase σ as $h$ increases to keep some unstated value of τ constant? This seems like an interesting question for future experimental research.

4.4. Interesting Relationships

We have defined the expected visual measurement error as the expected value of the difference between the height of the bar and the length of a side of a stochastic triangle and found an asymptotic expansion for it:
$\mathrm{EVME} = E(D_{12} - h) = E(D_{12}) - h \sim h(1-\rho)\tau^2 + \frac{1}{2}h(1-\rho)^2\tau^4 + \cdots$
as $\tau \to 0^+$. A few final observations are in order. Recall Equation (64):
$\frac{d\mu}{d\rho} \sim -h\tau^2$
as $\rho \to 1^-$. If we take the partial derivative of the EVME in Equation (91) with respect to ρ and keep the term of lowest order, we have
$\frac{\partial\,\mathrm{EVME}}{\partial\rho} \sim -h\tau^2.$
(These are just formal manipulations. In order to be able to take derivatives of asymptotic expansions, certain conditions must hold [35]. Our purpose is just to make a casual observation, not to prove a rigorous result.)
Similarly, we have from Equation (81) that
$\frac{d\mu}{d\tau}(\tau; h, \rho) \sim 2h(1-\rho)\tau.$
If this time we formally take the partial derivative of the expected visual measurement error with respect to τ , we have
$\frac{\partial\,\mathrm{EVME}}{\partial\tau} \sim 2h(1-\rho)\tau.$
The significance of this is that these relations were reached from two different directions. In one direction, the derivatives of the mean were calculated directly and the asymptotic properties of the Bessel functions were used to reach the asymptotic approximations. In the other direction, Park used the asymptotic expansion of the confluent hypergeometric function found in [36], then we took the derivative of it. This process could have easily failed. That it did not suggests there is an internal consistency to what has been developed.

5. Conclusions

The purpose of this paper was to develop estimations of the expected error in reading a bar graph. This estimate was a consequence of a mathematical model we constructed of the decoding process based upon concepts within eye tracking. With this accomplished, several questions are encountered. To what extent can this model be adapted to other visualizations? What are its limitations? How can it be applied, and what work remains to be done?

5.1. Limitations of the Model

If handed a dataset and a visualization, a reasonable question is whether the model can be adapted to provide error estimates in the decoding process of the visualization. To answer that, we would first look to see if the visualization represents numeric data. That is a requirement. Next, we would consider how many measurements must be made to decode the data value. A bar graph is one measurement, and this is what the model simulates in its current state. We say this is one-dimensional since there is one measurement. A line chart is two measurements—one for the x coordinate and one for the y coordinate; we call this two-dimensional. Finally, we would inquire which elementary task must be performed to decode that number.
It is worthwhile to review the elementary perceptual tasks as categorized by Cleveland and McGill [3]. They identified ten ways in which a number can be encoded in a visualization and, through a series of experiments, grouped the tasks by how accurately the encoded number could be extracted, producing six accuracy groups.
As can be seen from Table 1, there are three elementary perceptual tasks to which it is believed the model could be applied, three that present research challenges, and four that are simply beyond the model. This lets us list visualizations that could work with a modified model and those that lie beyond the model's boundaries.
Let us first look at some common visualizations that lie outside the boundaries of the model. A popular chart beyond its reach is the pie chart, which uses angles to encode numeric values. A pie chart is essentially a bar graph mapped onto a polar coordinate system, but that passage from linear (rectangular) to radial (circular) geometry undermines the stochastic construction developed here. Nearly any visualization based on a polar grid is beyond the model's limits, including radial bar charts, donut charts, polar area charts, radial histograms, radial line charts, multilevel pie charts, spiral histograms, and so forth.
It should also be noted that the model does not fit relational data, such as the fact that two people are friends or siblings. Relational charts are beyond it: graphs, digraphs, weighted digraphs, arc diagrams, trees, chord diagrams, non-ribbon chord diagrams, and the like.

5.2. Extensions to Other Visualization Types

Before proceeding, it is important to note that this paper developed a model, from which various things were learned and an error estimate produced; it did not produce a method. A method is a tool that could be applied directly to other visualizations. A model is more like a house built on a foundation (the visualization): it has rooms, attributes, and characteristics and provides certain functionality. Applying the results of this paper to a different type of visualization means the house must be built again. The more closely the new visualization resembles a bar graph, the simpler the construction; the less it resembles one, the more effort building the new model will require. In time, perhaps a "general model" will emerge that covers a variety of visualizations without the need to repeat the proofs. This paper, however, was a first step in showing that such a model exists and is useful for a bar graph.
Now let us consider some extensions. The strip plot is a one-dimensional scatter plot (see Figure 7). The data are numeric, only one measurement is required, and the perceptual task is finding a position along an aligned axis. The geometry is linear. Once a point is picked, reading its coordinate on the axis is very similar to reading a bar chart, so the strip plot looks like a good candidate for a successful rebuild of the model. From a certain perspective, it is a many-valued bar chart; how that affects the analysis would be interesting to see.
A step up in complexity is the line graph, which, when reading the coordinates of a point, has the same triangular structure as a bar graph (see Figure 8). One AOI is placed over the point; dropping down vertically, a second AOI covers the x coordinate, and crossing horizontally, a third covers the y coordinate. In this case there are two estimates, one for x and one for y, which differs from the bar graph. The capability to estimate two lengths is already present in the model, since the constructed random triangle has three sides available for length estimation. Our intuition is that, for the analysis to extend, the (x, y) pairs would need to be spaced widely enough that a reasonably sized AOI contains only one data point.
A scatter plot is a further step up in complexity from the line graph because multiple data points may fall within an AOI, which will make the analysis required to build the model more involved.
Putting this together, the natural visualizations open to models similar to the one developed here include vertical bar graphs, horizontal bar graphs, line charts, time series, scatter plots, and so forth. Requiring the visualization to have uniform salience means that coloring or shading cannot be used to distinguish levels of categorical variables; hence, stacked bar charts, clustered bar charts, and the like are currently outside the boundaries of the model.

5.3. Applications

While eye-tracking concepts permeate the development of this model, its application does not require eye tracking. The purpose of the model is to quantify the error inherent in reading a bar graph: anyone reading a bar graph can be expected to make an error on the order of the EVME. Keep in mind, however, that the EVME is an expected value, an average over user behavior; some people will read the graph accurately, others less so, but the error will, on average, be about the EVME. Ideally, visualization designers could attempt to minimize the EVME for a particular chart in order to produce more reliable readings of the data.
Recalling Equation (85) to the lowest order in τ ,
\mathrm{EVME} = E(D_{12}) - h \approx h(1 - \rho)\tau^{2} + \cdots
as τ → 0+. Here we see that the expected error is approximately the product of three parametric factors. The term h is the height of the bar and is under the designer's control, while ρ reflects the viewer's eyes and how fixations in adjacent AOIs are correlated. For discussion purposes, suppose there is no correlation, that is, ρ = 0. Then
\mathrm{EVME} = E(D_{12}) - h \approx h\tau^{2} + \cdots
Suppose we can hold τ small but constant while changing h. Then the expression shows that designers should avoid longer bars at that scale. With that said, we must confront the issue of whether h can be increased while holding τ constant, since τ = σ/h. Will the fixation variation naturally adjust to compensate for the larger h? It seems possible that the more isolated a bar is from other bars, the more tightly the point cloud of fixations will group around its tip. If the bars are tightly grouped, fixations may involve neighboring bars, creating larger variation and thereby increasing σ. This suggests that τ can be influenced by the designer; however, it will likely take experimentation to understand the nature of this dependency.
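The distinction between holding σ fixed and holding τ = σ/h fixed can be made concrete with a small sketch; the pixel values are purely illustrative assumptions. If σ stays constant while h grows, the lowest-order error σ²(1 − ρ)/h actually shrinks, whereas if τ stays constant the error h(1 − ρ)τ² grows in proportion to h.

```python
# Illustrative comparison of the two scenarios using the lowest-order estimate
#   EVME ~ h * (1 - rho) * tau**2 = sigma**2 * (1 - rho) / h.
# The pixel values below are hypothetical, chosen only for illustration.

def evme_lowest_order(h, sigma, rho):
    tau = sigma / h
    return h * (1.0 - rho) * tau**2

for h in (50, 100, 200):                                   # bar heights in pixels
    for rho in (-0.5, 0.0, 0.5):
        fixed_sigma = evme_lowest_order(h, 5.0, rho)       # sigma fixed at 5 px
        fixed_tau = evme_lowest_order(h, 0.05 * h, rho)    # tau fixed at 0.05
        print(f"h={h:3d}  rho={rho:+.1f}  "
              f"EVME(sigma=5px)={fixed_sigma:5.3f}  EVME(tau=0.05)={fixed_tau:5.3f}")
```

Which of the two scenarios better describes real viewers is exactly the experimental question raised above.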
Allowing ρ ≠ 0 means that the correlation influences the EVME. The nature of this dependence was discussed in the previous section, where we observed that negative correlation can increase the EVME by up to a factor of two. This is the first research to include correlation in this fashion, so how it can be controlled remains open; however, it might account for otherwise unexplained error in a particular visualization design.

5.4. Future Work

This paper is the first step down a path requiring further theoretical development and experimental exploration. As is true in most scientific investigation, these are coupled together. It is our intention to pursue both.
On the theoretical side, we need error estimates for the side called D_{23}, which has nominal length b, the distance of the bar from the vertical axis. This will show the influence b has on visualization design and on the error in reading the height of the bar. It would seem that the farther the bar is from the axis, the greater the EVME should be, but this has yet to be established.
Experimentally, there are several items to explore. The most basic experiment is to vary the bar height and record the accuracy of the readings. This would need to be repeated for several test subjects and averaged to obtain an approximate expected-value curve of error versus height, which the model predicts should be approximately linear.
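A minimal sketch of the corresponding analysis follows, using simulated stand-in data since no experimental readings exist yet. Under the model, with τ and ρ held fixed across bar heights, the mean reading error is approximately linear in h with slope (1 − ρ)τ², so fitting a line to the averaged errors would recover that combined quantity (though not ρ and τ separately).

```python
import numpy as np

# Sketch of the proposed error-vs-height analysis on simulated stand-in data.
# Model prediction (lowest order): mean error ~ (1 - rho) * tau**2 * h.

rng = np.random.default_rng(7)

heights = np.array([40.0, 80.0, 120.0, 160.0, 200.0])   # bar heights (px)
true_slope = (1 - 0.2) * 0.05**2                         # hypothetical rho = 0.2, tau = 0.05
# Stand-in "observed" mean errors: model prediction plus averaging noise.
mean_errors = true_slope * heights + rng.normal(0.0, 0.02, heights.size)

slope, intercept = np.polyfit(heights, mean_errors, 1)
print(f"fitted slope     = {slope:.5f}  (implied (1 - rho) * tau^2)")
print(f"fitted intercept = {intercept:.4f}  (should be near zero)")
```

Real data would replace the simulated mean errors, and departures from linearity would themselves be informative about whether τ varies with h.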
The difficulty is that different visualizations may have different values of τ, so a more complete experimental investigation of scale is required. Fortunately, scale has been studied extensively in landmark theories from stochastic geometry, so there is a rich statistical foundation on which to build when applying these ideas to visualizations.
The correlation parameter is new to this setting; there is no established analogue in landmark theories, so further work will be required to obtain the proper statistics and to test whether the design of the visualization influences the correlated movement of the eyes.
In conclusion, this work has presented a novel approach to the error analysis of the decoding process for numerical bar graphs. While it is only a beginning, further theoretical and experimental investigation may open the door to broader settings and a deeper understanding of visualization interpretation.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

The author wishes to acknowledge the Department of Business and Information Technology, which provided funds for the purchase of research books relevant to this project.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AOI   area of interest
pdf   probability density function
EVME  expected visual measurement error

References

1. Tufte, E.R. The Visual Display of Quantitative Information, 2nd ed.; Graphics Press: Cheshire, CT, USA, 2001.
2. Michalos, M.; Tselenti, P.; Nalmpantis, S. Visualization Techniques for Large Datasets. J. Eng. Sci. Technol. Rev. 2012, 5, 72–76.
3. Cleveland, W.S.; McGill, R. Graphical perception: Theory, experimentation, and application to the development of graphical methods. J. Am. Stat. Assoc. 1984, 79, 531–554.
4. Heer, J.; Bostock, M. Crowdsourcing graphical perception: Using mechanical turk to assess visualization design. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Atlanta, GA, USA, 10–15 April 2010; pp. 203–212.
5. Shah, P.; Hoeffner, J. Review of graph comprehension research: Implications for instruction. Educ. Psychol. Rev. 2002, 14, 47–69.
6. Connor, C.E.; Egeth, H.E.; Yantis, S. Visual attention: Bottom-up versus top-down. Curr. Biol. 2004, 14, R850–R852.
7. Pinto, Y.; van der Leij, A.R.; Sligte, I.G.; Lamme, V.A.; Scholte, H.S. Bottom-up and top-down attention are independent. J. Vis. 2013, 13, 16.
8. Matzen, L.E. Using Eye Tracking Metrics and Visual Saliency Maps to Assess Data Visualizations; Technical Report; Sandia National Lab. (SNL-NM): Albuquerque, NM, USA, 2016.
9. Matzen, L.E.; Haass, M.J.; McNamara, L.A. Using Eye Tracking to Assess Cognitive Biases; Technical Report; Sandia National Lab. (SNL-NM): Albuquerque, NM, USA, 2014.
10. Borkin, M.A.; Vo, A.A.; Bylinskii, Z.; Isola, P.; Sunkavalli, S.; Oliva, A.; Pfister, H. What makes a visualization memorable? IEEE Trans. Vis. Comput. Graph. 2013, 19, 2306–2315.
11. Jacob, R.J.; Karn, K.S. Eye tracking in human-computer interaction and usability research: Ready to deliver the promises. In The Mind’s Eye; Elsevier: Amsterdam, The Netherlands, 2003; pp. 573–605.
12. Rayner, K. Eye movements in reading and information processing: 20 years of research. Psychol. Bull. 1998, 124, 372.
13. Rayner, K. Eye movements and attention in reading, scene perception, and visual search. Q. J. Exp. Psychol. 2009, 62, 1457–1506.
14. Higgins, E.; Leinenger, M.; Rayner, K. Eye movements when viewing advertisements. Front. Psychol. 2014, 5, 210.
15. Harel, J.; Koch, C.; Perona, P. Graph-based visual saliency. In Proceedings of the 19th International Conference on Neural Information Processing Systems (NIPS’06), Vancouver, BC, Canada, 4–7 December 2006; MIT Press: Cambridge, MA, USA, 2006.
16. Matzen, L.E.; Haass, M.J.; Tran, J.; McNamara, L.A. Using Eye Tracking Metrics and Visual Saliency Maps to Assess Image Utility; Technical Report; Sandia National Lab. (SNL-NM): Albuquerque, NM, USA, 2016.
17. Haass, M.J.; Matzen, L.E.; Butler, K.M.; Armenta, M. A new method for categorizing scanpaths from eye tracking data. In Proceedings of the Ninth Biennial ACM Symposium on Eye Tracking Research & Applications, Charleston, SC, USA, 14–17 March 2016; pp. 35–38.
18. Collewijn, H.; Erkelens, C.J.; Steinman, R.M. Binocular co-ordination of human horizontal saccadic eye movements. J. Physiol. 1988, 404, 157–182.
19. Collewijn, H.; Erkelens, C.J.; Steinman, R.M. Binocular co-ordination of human vertical saccadic eye movements. J. Physiol. 1988, 404, 183–197.
20. Bahill, A.T.; Stark, L. Oblique saccadic eye movements: Independence of horizontal and vertical channels. Arch. Ophthalmol. 1977, 95, 1258–1261.
21. Bookstein, F.L. Size and shape spaces for landmark data in two dimensions. Stat. Sci. 1986, 1, 181–222.
22. Goldberg, J.; Helfman, J. Eye tracking for visualization evaluation: Reading values on linear versus radial graphs. Inf. Vis. 2011, 10, 182–195.
23. Stoyan, D.; Stoyan, H. Fractals, Random Shapes and Point Fields: Methods of Geometrical Statistics; John Wiley & Sons: Hoboken, NJ, USA, 1994; Volume 302.
24. Miller, K. Distributions involving norms of correlated Gaussian vectors. Q. Appl. Math. 1964, 22, 235–243.
25. Miller, K.S. Multidimensional Gaussian Distributions; Wiley: Hoboken, NJ, USA, 1964.
26. Hilgers, M.G.; Burke, A. Exploring Errors in Reading a Visualization via Eye Tracking Models Using Stochastic Geometry. In HCI in Business, Government and Organizations. Information Systems and Analytics, Proceedings of the 6th International Conference, HCIBGO 2019, Held as Part of the 21st HCI International Conference, HCII 2019, Orlando, FL, USA, 26–31 July 2019; Proceedings, Part II 21; Springer: Cham, Switzerland, 2019; pp. 53–71.
27. Krishnaiah, P.; Hagis, P., Jr.; Steinberg, L. A note on the bivariate chi distribution. SIAM Rev. 1963, 5, 140–144.
28. Lawrence, J. Moments of the noncentral chi distribution. Sankhya A 2021, 85, 1243–1259.
29. Park, J., Jr. Moments of the generalized Rayleigh distribution. Q. Appl. Math. 1961, 19, 45–49.
30. Olver, F.W.; Lozier, D.W.; Boisvert, R.F.; Clark, C.W. NIST Handbook of Mathematical Functions Hardback and CD-ROM; Cambridge University Press: Cambridge, UK, 2010.
31. Anderson, D.A. Maximum likelihood estimation in the non-central chi distribution with unknown scale parameter. Sankhyā Indian J. Stat. Ser. B 1981, 43, 58–67.
32. Anderson, D.A. The circular structural model. J. R. Stat. Soc. Ser. B (Methodol.) 1981, 43, 131–141.
33. Mardia, K.; Dryden, I. The statistical analysis of shape data. Biometrika 1989, 76, 271–281.
34. Mardia, K.; Dryden, I. Shape distributions for landmark data. Adv. Appl. Probab. 1989, 21, 742–755.
35. Olver, F. Asymptotics and Special Functions; CRC Press: Boca Raton, FL, USA, 1997.
36. Jahnke, E.; Emde, F. Tables of Functions with Formulae and Curves; Dover Publications: Mineola, NY, USA, 1945.
Figure 1. Conceptual structure of a simple bar graph. It shows the relationship between the table of data and the visual objects encoding it. The ruler is the key to the decoding process.
Figure 2. Conceptual view of the AOIs needed to read a bar graph. Each AOI contains fixations.
Figure 3. The fundamental stochastic triangle. P_i is a fixation, z_i (blue) is a fixed vector pointing to a key focal point, and d_i (orange) is the random displacement of the fixation about the focal point.
Figure 4. Labeling of the decomposition of the fixations. The dots are fixation points P_i. The solid arrows represent the vectors in the decomposition P_i = d_i + z_i. The dashed arrows are the horizontal and vertical components of d_i.
Figure 5. Conceptual view of the stochastic triangles connecting one fixation in each of the areas of interest. The circled edges have random lengths called D_{12} that possess an expected value called μ.
Figure 6. A visualization of the various cases surrounding values of the correlation.
Figure 7. The fundamental stochastic triangle on a strip plot.
Figure 8. The fundamental stochastic triangle on a line graph.
Table 1. Elementary perceptual tasks ordered by accuracy.

Elementary Task | Ordering | Model Can Interpret This Task
Position along a common scale | 1 | Yes. 1-dimensional length measure.
Positions along nonaligned scales | 2 | Yes. 1-dimensional length measure.
Length | 3 | Yes. 1-dimensional length measure.
Direction | 3 | No, for now.
Angle | 3 | No, for now.
Area | 4 | Maybe.
Volume | 5 | No.
Curvature | 5 | No.
Shading | 6 | No. Model is blind to shading.
Color Saturation | 6 | No. Model is colorblind.