Next Article in Journal
Underwriter Discourse, IPO Profit Distribution, and Audit Quality: An Entropy Study from the Perspective of an Underwriter–Auditor Network
Previous Article in Journal
The Inverse of Exact Renormalization Group Flows as Statistical Inference
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Lévy Flight Model of Gaze Trajectories to Assist in ADHD Diagnoses

by
Christos Papanikolaou
1,
Akriti Sharma
1,
Pedro G. Lind
1,2,3,4 and
Pedro Lencastre
1,2,3,*
1
Department of Computer Science, Oslo Metropolitan University, N-0130 Oslo, Norway
2
OsloMet Artificial Intelligence Lab, Pilestredet 52, N-0166 Oslo, Norway
3
NordSTAR—Nordic Center for Sustainable and Trustworthy AI Research, Pilestredet 52, N-0166 Oslo, Norway
4
Simula Research Laboratory, Numerical Analysis and Scientific Computing, N-0164 Oslo, Norway
*
Author to whom correspondence should be addressed.
Entropy 2024, 26(5), 392; https://doi.org/10.3390/e26050392
Submission received: 3 January 2024 / Revised: 29 March 2024 / Accepted: 25 April 2024 / Published: 30 April 2024
(This article belongs to the Special Issue Stochastic Thermodynamics of Microscopic Systems)

Abstract

:
The precise mathematical description of gaze patterns remains a topic of ongoing debate, impacting the practical analysis of eye-tracking data. In this context, we present evidence supporting the appropriateness of a Lévy flight description for eye-gaze trajectories, emphasizing its beneficial scale-invariant properties. Our study focuses on utilizing these properties to aid in diagnosing Attention-Deficit and Hyperactivity Disorder (ADHD) in children, in conjunction with standard cognitive tests. Using this method, we found that the distribution of the characteristic exponent of Lévy flights statistically is different in children with ADHD. Furthermore, we observed that these children deviate from a strategy that is considered optimal for searching processes, in contrast to non-ADHD children. We focused on the case where both eye-tracking data and data from a cognitive test are present and show that the study of gaze patterns in children with ADHD can help in identifying this condition. Since eye-tracking data can be gathered during cognitive tests without needing extra time-consuming specific tasks, we argue that it is in a prime position to provide assistance in the arduous task of diagnosing ADHD.

1. Introduction

The study of time-series is central to mathematics and physics with an extensive range of interdisciplinary applications. A class of time-series that has gathered increased interest in recent years is the one with rare extreme events, an example of which is the recording of human eye-movements while looking at a screen, i.e., eye-tracking. It is currently an open question what is the most suitable mathematical equation describing gaze trajectories. There are two schools of thought: one that claims that gaze trajectories are two alternating subprocesses and another that claims that gaze trajectories are described by a single, scale-independent process [1].
In the first description, the two alternating types of eye-movements are fixations, during which an individual’s gaze is centered around one specific point with very little deviation and, presumably, visual information is being extracted, and saccades, fast re-locations of the gaze from one point to another [2]. In the second mathematical model, it is posited that the visual and oculomotor systems may conform to the Lévy flight model [3]. This has been empirically supported by the work presented in Ref. [4], with the emergence of superdiffusive dynamics arising in this context [5]. In fact, the superdiffusive nature of gaze trajectories has been presented over two decades ago [6]. In that study, the authors proposed that visual scan-paths generated under natural circumstances bear similarities to Lévy flights, indicating a power-law dependency in their magnitude distribution, a phenomenon that occurs even in tasks with pre-planned positions for the visual targets [7]. In the context of visual scan-paths, this indicates that a viewer’s attention may occasionally make extensive shifts across the scene, intermixed with smaller, local movements. It has also been shown that Lévy flights reproduce human gaze patterns well over a saliency map [8].
This is similar to the foraging subclass of stochastic processes, i.e., time-series where an agent is searching for food, water, or, in our particular case, visual information. Indeed, it is speculated that Lévy flights describe foraging strategies observed in nature in all its contexts [9]. These strategies are manifest in the movement of bacteria [10,11], the search for fish by albatrosses [12], or the foraging for food by sharks in the ocean [13]. This pattern is also present in several human tasks, such as the exploration of the Walt Disney Resort by children [14], the exploration of virtual environments [15], and was even proposed as a mathematical model for the way humans search for memories [16] or the response time in decision making based on the accumulation of evidence [17,18,19]. Using this framework, eye-tracking experiments found the characteristic Lévy exponent ( α ) to be approximately α = 0.9 (or α = 1.9 using a different convention than the one we are using here) [20], regardless of the level of difficulty of the visual task.
While it is still an open question if eye-gaze trajectories follow Lévy-like processes or others, such as intermittent processes [21], this finding supports the notion that the Lévy flight model can be applied to visual scan-paths, and it emphasizes the potential relevance of the scaling exponent α in characterizing eye movements. The question of universality is not new, and it has been argued that an optimal α for search strategies is α = 1 [22].
Besides the theoretical question of the best mathematical description of gaze trajectories, eye-tracking data has been gaining popularity due to its wide range of applications in health sciences. It can be used to determine an individual’s age, gender, ethnicity [23], personality traits such as extroversion, curiosity, neuroticism, openness, agreeableness [24], Intelligence Quotient (IQ) [25], as well as helping detect conditions such as Alzheimer’s [26], autism [27], or dyslexia [28,29].
Due to its diagnostic challenges, one of the conditions where eye-tracking might have the most impact is Attention-Deficit/Hyperactivity Disorder (ADHD). It has been suggested that around 20% of ADHD diagnoses could be false positives [30] with an unknown expected percentage of children classified as false negatives. Indeed, significant differences in gaze dynamics were found in people with ADHD, namely regarding fixation durations in reading tasks [31] and saccade abnormalities during working memory tasks [32].
While the condition is not curable, an early diagnosis is important to mitigate its impact, provide adequate medication, and teach patients coping skills. This is increasingly important when taking into account that it is the most prevailing neurodevelopmental disorder among children and adolescents, impacting academic performance, overall well-being, and social interactions in children [33]. When unmitigated, the impact can be felt even in adulthood, with affected individuals having shorter and more discordant romantic relationships [34].
In the important task of assisting with ADHD diagnoses, eye-tracking has been applied in reading tasks [31], during memory performance tests [35], or while playing in virtual reality [36]. We intend to add to this body of literature by justifying and applying a Lévy flight-inspired methodology. These methods also have the advantage of being scale invariant and, thus, in a time when phone cameras can start tracking eye-movements [37], the robustness of this approach to different scales being employed is a definite advantage.
This paper is structured as follows: in Section 2, the methods applied, and the data used are introduced; in Section 3, the results of our analysis are presented. Finally, in Section 4, we present the conclusions of the paper.

2. Data and Methods

2.1. Data: Eye-Gaze Trajectories and ADHD Cognitive Test

The dataset we used in this study originates from an experiment detailed in Ref. [38], focusing on visuospatial working memory and involving participants both with and without ADHD. ADHD in children was identified by a trained child neurologist following the DSM-IV criteria [39] with Te Conner’s Abbreviated Parent–Teacher Questionnaire [40] being used as an additional metric to count ADHD symptoms. Comorbidities were evaluated using a general medical opinion, parent interview, and the M.I.N.I. Kid test [40]. Further details can be found in Ref. [41]. The experiment included 50 participants engaged in a Sternberg-type delayed visuospatial working memory task, adapted from Ref. [38]. From these 50 participants, 47 had complete data records, with 21 children diagnosed with ADHD and 26 without.
This working memory task showed one or two dots on the screen, with dots positioned in one of sixteen locations within a 4 × 4 grid. The goal of the participants was to memorize the position of the dots, while some distractors (images or videos) were shown between the time when the participants first saw the dots and the time when they were finally evaluated on memorizing the dots’ position.
The assumption nested in this experiment is that children with ADHD are more prone to being distracted and, thus, will fail this test more often. This assumption also arises in the studies from Refs. [42,43], which used the WISC-III cognitive test and found that the ADHD group scored significantly lower on Freedom from Distraction (FD), which, in the context of the WISC-III cognitive test, is composed of the Digit Span and Arithmetic subtests. Results of the WISC-III cognitive test are also provided in this dataset.
Other parameters that might affect a child’s performance in the memory test were controlled for, namely, age ( 10.71 ± 0.54 years for the ADHD group and 11.58 ± 0.50 years for non-ADHD) and IQ ( 100 ± 11.6 for ADHD and 105 ± 7.3 for non-ADHD). Eye movements and pupil diameter were recorded for the duration of the experiment using Eyelink 1000, a powerful eye-tracker with a 1 kHz sampling frequency. Fixation and saccade labeling of data points were not provided. To create such classification, a velocity-based Hidden Markov Model identification algorithm was implemented, the details of which can be found alongside the code of our analysis [44].

2.2. Lévy Flights and Superdiffusion

A Lévy flight is a type of random walk named after the French mathematician Paul Lévy, which has seen a wide range of applications since the 1980s [45]. As a random walk, it describes a series of positions across time via the equation
R ( t ) = R ( 0 ) + i = 1 N ( t ) Δ r i ,
where R ( t ) is the position of the particle at time t, R ( 0 ) the initial position, Δ r i the position increments or steps, and N ( t ) is the number of steps taken up to time t.
The distribution of the increments, Δ r i , is what distinguishes a Lévy flight from other random walks such as Brownian motion [46]. Unlike the latter, where Δ r i are normally distributed, in a Lévy flight, the steps follow a heavy-tailed distribution characterized by its power-law tail.
The right-sided tail of the probability density function (PDF) of a Lévy flight is given by
p ( | Δ r | ) 1 | Δ r | α + 1 ,
where the coefficient 0 < α < 2 determines the heaviness of the tails [47]. When α > 2 , the process converges to a Gaussian random walk [48] by the central limit theorem, while for values α 0 , the process does not yield a normalized probability distribution. For 0 < α 1 , the distribution has no convergent mean and variance, and for 1 < α 2 , the distribution has a finite mean but divergent variance [21]. This is why this type of distribution escapes the range of applications of the central limit theorem (which requires a finite mean and variance) and does not converge to a Gaussian random walk when the steps are measured over large time-windows, τ = N Δ t s , where N is an integer, and Δ t s is the interval of time at which the data is sampled, which, in our case, is Δ t s = 10 3 .
The fact that, asymptotically, Lévy flights lead to a different behavior than the Wiener process is well documented in the literature, with this behavior being known as superdiffusivity [5], a phenomenon characterized by a higher-than-normal rate of spread or dispersion compared to classical diffusion. In a superdiffusive process, the mean-squared displacement (MSD) of a particle grows faster than linearly with time. The MSD is defined as
MSD ( τ ) = | R ( τ ) R ( 0 ) | 2 .
For classical diffusion, the MSD grows linearly with time ( MSD ( τ ) τ ), but, in superdiffusion, the growth is faster than linear with
MSD ( τ ) τ γ ,
with, in the context of Lévy flights,
γ = 4 2 + α .
Multifractal detrended fluctuation analysis (MFDFA) is a technique that can be used to estimate the parameter γ . It is an extension of the detrended fluctuation analysis (DFA) method, allowing for the assessment of the multifractal nature of complex systems. The details of the algorithm can be found in Ref. [49].

2.3. Machine Learning Classification Methods

In order to classify individuals as belonging to the ADHD category or not, based on parameters such as age, test performance, or features of their gaze trajectories, we employed three different machine learning methods for this type of classification task.

2.3.1. Logistic Regression

Logistic regression is one of the most simple binary classification methods. In it, the logistic function, also known as the sigmoid function, is used to map the output of a linear combination of features to a value between 0 and 1.
The logistic function is defined as
σ ( z ) = 1 1 + e z ,
where z is the linear combination of input features and model parameters
z = β 0 + β 1 x 1 + β 2 x 2 + + β n x n .
In our case, the x variables represent metrics such as age, IQ, or eye-tracking-derived metrics, which we will use to distinguish between children with ADHD and those without the condition.
Conversely, the β -coefficients associated with x are estimated via a maximum likelihood estimation and each coefficient β i can be interpreted as the weight of the variable x i to the model, with β 0 being the intercept of the model.
After estimating the β -coefficients, one can calculate the probability of an instance characterized by a variable array x belonging to the positive class:
P ( Y = 1 ) = σ ( z ) ,
where Y is the binary target variable, where 1 corresponds to the case of a child with ADHD and 0 otherwise.
The logistic regression model predicts the class label by thresholding the predicted probability as follows:
y ^ = 1 if P ( Y = 1 ) 0.5 0 if P ( Y = 1 ) < 0.5 .

2.3.2. Support Vector Machines

Support Vector Machines (SVMs) are another class of supervised algorithms used for classification tasks. In this context, this algorithm has the goal of finding the hyperplane that best separates the data into different classes. In our case, that creates two mutually exclusive regions where, given the value of the feature vector x , an instance is assigned a value Y = 0 or Y = 1 .
In the most simple case, when a hyperplane exists that perfectly separates between classes (ADHD and non-ADHD, in our case), the hyperplane defined by the SVM algorithm has the form
f ( x ) = w · x + b ,
where, similarly to the β -coefficients in the logistic regression, the vector w represent the weight vector and b represents the bias term (similarly to β 0 in the logistic regression).
In that case, the classification decision is made based on the sign of f ( x ) :
y ^ = 1 if f ( x ) 0 , 0 if f ( x ) < 0 .
In most practical applications, it is not possible to find a hyperplane that perfectly separates between classes and some overlap exists, even after finding an appropriate hyperplane. In this case, SVMs can use kernel functions to map the input features into a higher-dimensional space. The decision function becomes
f ( x ) = i = 1 N α i y i K ( x i , x ) + b ,
where K ( x i , x ) is a kernel function and α i are the Lagrange multipliers.
Common kernels include the linear kernel K ( x i , x ) = x i · x , the polynomial kernel K ( x i , x ) = ( x i · x + c ) d , and the radial basis function kernel K ( x i , x ) = exp x i x 2 2 s 2 , with s being a width parameter. Here, we used a linear kernel in our analysis. To avoid overfitting, in the case of SVM and logistic regression, we used the Ridge regularization with the strength parameter chosen via cross-validation.

2.3.3. Decision Trees and Random Forests

A decision tree is a non-linear, non-parametric model used for both classification and regression tasks. A decision tree recursively partitions the feature space (where x lies) to create homogeneous subsets until a stopping criterion is met. The decision at each node is typically based on optimizing impurity measures such as Gini impurity or entropy.
Mathematically, a vector x encapsulates the relevant metrics for a classification task and a series of corresponding labels y . A decision tree can be represented by a function f ( x ) that maps feature vectors to predicted labels:
f ( x ) = i = 1 N w i · I ( x R i ) ,
where N is the number of terminal nodes, w i the predicted value at terminal node i, R i the region corresponding to terminal node i, and I ( · ) is the indicator function. Here, we chose N via cross-validation and found 8 to the optimal number for this quantity.
A Random Forest is a method that groups multiple decision trees and combines their predictions. Random Forests improve generalization and reduce overfitting compared to individual decision trees. Each tree is trained on a subset of the data and may use a random subset of features at each split. The final prediction is a majority vote of individual tree predictions.

2.4. Cross-Validation Strategies

Given the limited number of participants, we could not use the common approach of fitting the parameters of the model in 80 % of the data and testing its accuracy in the remaining 20 % . Instead, we used different cross-validation (CV) strategies.
In the first CV strategy, we employed an 80 % 20 % split, choosing a different combination of participants each time to fit the parameters and test the accuracy of the model, in a process that is known as the Stratified K-Fold (SKF) CV ( k = 5 in this case). The final accuracy reported an average of 5000 80 % 20 % combinations.
In the second type of CV, we applied a Leave One Out cross-validation (LOOC), where we fit the parameters of a model with all the data except for 1 entry, which was then left to be tested. In our case, with 47 participants in total, this was done 47 times, each time fitting the parameters of the model on 46 entries and testing the model’s accuracy on 1 participant. Again, the reported accuracy was the average over all 47 realizations.

3. Results: Lévy Flight Exponents to Identify ADHD Children

From here onwards, we will analyze the data from the one-dot memorandum task. In this task, participants saw one dot in a 4 × 4 grid and had to memorize its position for 1.5 s, a period of time during which they were shown pictures and videos that served as distractors. Since these distractors varied from trial to trial, for consistency reasons, the eye-tracking data that we analyzed focused solely on the gaze patterns of participants while looking at the 4 × 4 grid with one dot in it (Figure 1).

3.1. Eye-Gaze Dynamics and the Lévy Flight Exponent

While the Lévy flight framework diverges from the usual picture of two alternating processes, one slow (called fixation in the oculomotor field), one fast (called saccades), in our case, we can see that not only is it adequate, but actually preferable. In Figure 2, we observe two relevant features in the PDF of the distribution of the increment’s norm Δ r : firstly, we observe a power-law decay that is integral to the PDF and not just a small part of the tail of the pdf and, secondly, we do not observe a bi-modal velocity PDF that is often used as a justification for the assumption of two separate subprocesses of fixations and saccades. To estimate α from the PDF, we firstly identified the mode M of the distribution. From that point, we fixed the constant factor k and calculated the linear regression in the log–log PDF plot in the interval [ M , M × k ] . The quantity k is chosen with the goal of maximizing the average R 2 across all participants and, in our case, it was chosen as 5.9 .
We also see, in Figure 3, the values of the mean, standard deviation, skewness, and kurtosis of Δ r . We observe that they are quite disparate across experimental participants, often with variations of two orders of magnitude. That is not surprising, if we consider that these quantities are divergent in a Lévy flight model, with their values being extremely sensitive to small parameter variations. One further advantage of looking at the Lévy exponent α (or for the scaling exponent γ ) is that this approach is scale invariant.
We observe as well, in Figure 4, that the relation MSD ( τ ) τ γ holds to a significant degree. Furthermore, in Figure 5, we computed the coefficient of determination (r-square). With values of r-square over 0.9 , we see that both the estimation of α via the PDF and the estimation using the relation MSD ( τ ) τ γ corroborate the Lévy flight assumption as a valid one. The MFDFA method is often considered the most robust algorithm to estimate the parameter α and, in Figure 5, we see that it generates better quality fits. It is thus this method that we use in what follows.

3.2. Classifying ADHD without Eye-Tracking

Before investigating the impact of including a Lévy description of gaze movements in our classification model, we created a benchmark model using other metrics solely dependent on non-eye-tracking related metrics. These are the age of the children, the specific metrics from their WISC test, and their test performance. The accuracy of this model will subsequently be compared with the model, including eye-tracking metrics.
To create the benchmark model, we used logistic regression with the following variables: age, full-scale IQ, arithmetic and digit span subtests of the WISC-III cognitive test, and the one-dot memory test performance. While we had fifty test participants, three of them had incomplete data, leaving us with forty-seven test subjects.
The results are presented in Table 1. There, we see that, in either case of CV strategies, the logistic regression is the best-performing method, with an accuracy of 72.3 % , regardless of the type of CV. Thus, we will use the LOOC from now onwards, since we can more easily cover all possible CV combinations. The model results can also be seen in Table 2 (left), with other metrics such as a precision of 72.4 % , specificity of 61.9 % , and recall 80.7 % . The IQ variable was not included in the analysis, as using it led to overfitting and a decrease in CV accuracy.

3.3. Classifying ADHD with Eye-Tracking

When we calculated the value of the Lévy exponent α , we found that there was a discrepancy between children with and without ADHD. We see in Figure 6 that the median of α for children without ADHD is around α = 1 , while, for children with ADHD, it is centered between 0.6 and 0.8 . According to the Mann–Whitney U test, this difference is significant with a p-value of around 0.01 .
While there was a difference in the median of the distribution of α , this parameter alone did not yield better accuracy than the benchmark model (around 60 % ). However, when incorporated with the other variables, it did increase the accuracy of the logistic regression from 72.4 % to 76.5 % (see Table 2 on the right). The accuracy increase for all the other models is reported in Table 1.
The increase in accuracy was reflected in the other metrics, namely, the precision (from 72.4 % to 77.7 % ) and specificity (from 61.9 % to 71.5 % ). Even though this increase was not very substantial, we must take into account that we had a limited amount of data. When this limitation takes place, typically, adding more variables can lead to overfitting, which can decrease the overall accuracy. In fact, when using cross-validation and recursive feature elimination, we excluded the participants’ IQ from our model, as it worsened the accuracy of the model. It is expected that, if more data were available, the accuracy increase would also be higher, and it is unclear if IQ would be excluded from the classification model in that circumstance.
Children with ADHD typically had a lower α coefficient, indicating that they were more prone to having large, extreme relocations and exploring a larger area of the screen image. In this context, gaze dynamics mimicked some of the body language of children with ADHD, which often seems uneasy with a tendency for extra movements when engaged in tasks that require concentration. Children without ADHD presented a value of α close to 1. This value is quite significant in the literature, as a conjecture exists stating that α = 1 reflects the optimal strategy for searching processes [22,50,51,52,53]. It is the same value that was found in other contexts, such as memory retrieval [16], and is close to the values of the exponents found in other eye-tracking studies [20,54]
These results were replicated in another dataset [55] with ADHD and typically developed children, and we found a similar α = 1 exponent for typically developed children. Furthermore, while a difference in α exponents was not found when analyzing | Δ r | , a significant difference in the α distribution’s mean ( p = 0.015 ) and median ( p = 0.015 ) was found for the horizontal increments | Δ x | and for the median ( p = 0.015 ) of the vertical increments | Δ y | . In this dataset, without additional cognitive tests, using only α as a variable in a logistic regression model resulted in an accuracy of 66.9 % . All the data and code of our analysis, including the replication of our results in a different dataset, can be found in Figshare [44].

4. Discussion and Conclusions

In this work, we argued that Lévy flights are good mathematical models of gaze dynamics and that their characteristic exponent α is distributed differently between ADHD and non-ADHD children. This is a natural expectation, given that this is a defining quality of gaze patterns, which, in turn, reflect attention patterns. Our findings show that having a typically smaller α exponent, ADHD children explore more remote areas of images, which might be a consequence of their attention deficit. This contrasts with the case of autism, another neurodevelopmental condition, where it was found that children with this condition have gaze trajectories with a larger α Ref. [54].
We observe that the Lévy flight description of gaze dynamics can improve the classification accuracy that is obtained when using a cognitive test. While the improvement is not very substantial, this also stems from the fact that we have a limited amount of data, and, when that is the case, adding extra variables to an already multi-variable model can lead to overfitting or to reduced marginal accuracy gains. In particular, using cross-validation, we eliminated the participants’ IQ as a variable of our classification model, which might prove to be useful in an expanded dataset. Moreover, the classification accuracy that we obtained by analyzing gaze dynamics is in line with the one obtained by performing the pupil diameter analysis [56], which is the quantity of focus in the dataset description [38].
The increase in accuracy when using the Lévy model stands in contrast to the case when one uses traditional eye-tracking metrics, namely the duration of saccade and fixation and their respective average velocities. When average saccadic velocity was added to the logistic regression model, classification accuracy remained stable ( 72.4 % ). However, accuracy decreased to 68.0 % when we included average fixation velocity or fixation and saccade durations in the logistic regression model. Other studies have used metrics based on accurate gaze shifts and gaze duration applied to other datasets, namely autism [54]. In that study, the authors reported an accuracy up to 76 % using standard eye-tracking metrics. For this other dataset, we obtained, with our method, an accuracy of 77.2 % . More recently, we also analyzed the same dataset of autistic children [57] to investigate the ability to detect autism using a method, not based on eye-gaze velocity, but on a sort of visited area of the eye-gaze, measuring how “exploratory” a specific eye-gaze is. There, we obtained an accuracy of 93%, even when using shorter trajectories (less data). Other studies have analyzed reading tasks to investigate dyslexia [28,29], with similar standard metrics; the authors aimed to characterize the eye movements of children with dyslexia.
All in all, our method, based on the exponent of the Lévy flight model, presented an overall good performance, while other traditional metrics also showed a comparable level of performance when applied in other datasets; the Lévy flight model can be regarded as an additional method to assist in diagnosing ADHD. An interesting next step in the context of stochastic statistical physics, which can be explored for an extension of the methodology proposed here, deals with the so-called detailed fluctuation theorem. This theorem expresses the balance between entropy–consumption (positive variations of entropy, Δ S ) and entropy–production (negative variations of entropy) of particular stochastic trajectories, and states that the quotient between the probability of each one, i.e., p ( Δ S ) / p ( Δ S ) , is proportional to exp ( Δ S ) [58]. Therefore, by computing the entropy fluctuations of eye-gaze trajectories, one can derive the distribution of entropy fluctuations for both groups of children, with and without diagnosed ADHD, which might further improve the accuracy of our method.
Our approach also has the advantage of being scale-independent. This means that it is not as sensitive to particular experimental parameters, such as the eye-tracker’s native frequency or the screen distance, as other approaches. Consequently, our approach could be used in eye-trackers with different characteristics and operated outside strict laboratory conditions. This is of particular importance for assisting in the diagnosis of ADHD in children, who typically present challenges for laboratory experimental setups.

Author Contributions

Conceptualization, P.L.; Methodology, P.G.L. and P.L.; Formal analysis, C.P.; Investigation, C.P.; Writing—review & editing, C.P., A.S., P.G.L. and P.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

This study used open available data as stated in the manuscript (Section 2).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Lencastre, P.; Bhurtel, S.; Yazidi, A.; e Mello, G.B.; Denysov, S.; Lind, P.G. EyeT4Empathy: Dataset of foraging for visual information, gaze typing and empathy assessment. Sci. Data 2022, 9, 752. [Google Scholar] [CrossRef]
  2. Takahashi, M.; Sugiuchi, Y.; Na, J.; Shinoda, Y. Brainstem circuits triggering saccades and fixation. J. Neurosci. 2022, 42, 789–803. [Google Scholar] [CrossRef]
  3. Dubkov, A.A.; Spagnolo, B.; Uchaikin, V.V. Lévy flight superdiffusion: An introduction. Int. J. Bifurc. Chaos 2008, 18, 2649–2672. [Google Scholar] [CrossRef]
  4. Rhodes, T.; Kello, C.T.; Kerster, B. Intrinsic and extrinsic contributions to heavy tails in visual foraging. Vis. Cogn. 2014, 22, 809–842. [Google Scholar] [CrossRef]
  5. Błażejczyk, P.; Magdziarz, M. Stochastic modeling of Lévy-like human eye movements. Chaos 2021, 31, 043129. [Google Scholar] [CrossRef] [PubMed]
  6. Brockmann, D.; Geisel, T. The ecology of gaze shifts. Neurocomputing 2000, 32–33, 643–650. [Google Scholar] [CrossRef]
  7. Stephen, D.G.; Mirman, D.; Magnuson, J.S.; Dixon, J.A. Lévy-like diffusion in eye movements during spoken-language comprehension. Phys. Rev. E 2009, 79, 056114. [Google Scholar] [CrossRef]
  8. Boccignone, G.; Ferraro, M. Modelling gaze shift as a constrained random walk. Phys. A Stat. Mech. Its Appl. 2004, 331, 207–218. [Google Scholar] [CrossRef]
  9. Viswanathan, G.M.; Raposo, E.; Da Luz, M. Lévy flights and superdiffusion in the context of biological encounters and random searches. Phys. Life Rev. 2008, 5, 133–150. [Google Scholar] [CrossRef]
  10. Ariel, G.; Rabani, A.; Benisty, S.; Partridge, J.; Harshey, R.; Be’Er, A. Swarming bacteria migrate by Lévy Walk. Nat. Commun. 2015, 6, 1–6. [Google Scholar] [CrossRef]
  11. Yan, X.F.; Ye, D.Y. Improved bacterial foraging optimization algorithm based on Levy flight. Comput. Syst. Appl. 2015, 24, 124–132. [Google Scholar]
  12. Viswanathan, G.; Afanasyev, V.; Buldyrev, S.; Murphy, E.; Prince, P.; Stanley, E. Lévy flight search patterns of wandering albatrosses. Nature 1996, 381, 413–415. [Google Scholar] [CrossRef]
  13. Sims, D.; Humphries, N.; Bradford, R.; Bruce, B. Lévy flight and Brownian search patterns of a free-ranging predator reflect different prey field characteristics. J. Anim. Ecol. 2012, 81, 432–442. [Google Scholar] [CrossRef]
  14. Rhee, I.; Shin, M.; Hong, S.; Lee, K.; Kim, S.; Chong, S. On the levy-walk nature of human mobility. IEEE/ACM Trans. Netw. 2011, 19, 630–643. [Google Scholar] [CrossRef]
  15. Radicchi, F.; Baronchelli, A.; Amaral, L.A. Rationality, irrationality and escalating behavior in lowest unique bid auctions. PLoS ONE 2012, 7, e29910. [Google Scholar] [CrossRef]
  16. Rhodes, T.; Turvey, M.T. Human memory retrieval as Lévy foraging. Phys. A Stat. Mech. Its Appl. 2007, 385, 255–260. [Google Scholar] [CrossRef]
  17. Rasanan, A.H.H.; Rad, J.A.; Sewell, D.K. Are there jumps in evidence accumulation, and what, if anything, do they reflect psychologically? An analysis of Lévy Flights models of decision-making. Psychon. Bull. Rev. 2023, 31, 32–48. [Google Scholar] [CrossRef]
  18. Wieschen, E.M.; Voss, A.; Radev, S. Jumping to conclusion? a lévy flight model of decision making. Quant. Methods Psychol. 2020, 16, 120–132. [Google Scholar] [CrossRef]
  19. Voss, A.; Lerche, V.; Mertens, U.; Voss, J. Sequential sampling models with variable boundaries and non-normal noise: A comparison of six models. Psychon. Bull. Rev. 2019, 26, 813–832. [Google Scholar] [CrossRef]
  20. Credidio, H.; Teixeira, E.; Reis, S.; Moreira, A.A.; Andrade, J.S., Jr. Statistical patterns of visual search for hidden objects. Sci. Rep. 2012, 2, 920. [Google Scholar] [CrossRef]
  21. Bénichou, O.; Loverdo, C.; Moreau, M.; Voituriez, R. Intermittent search strategies. Rev. Mod. Phys. 2011, 83, 81. [Google Scholar] [CrossRef]
  22. Viswanathan, G.M.; Buldyrev, S.V.; Havlin, S.; Da Luz, M.; Raposo, E.; Stanley, H.E. Optimizing the success of random searches. Nature 1999, 401, 911–914. [Google Scholar] [CrossRef]
  23. Kröger, J.L.; Lutz, O.H.M.; Müller, F. What does your gaze reveal about you? On the privacy implications of eye tracking. In Proceedings of the IFIP International Summer School on Privacy and Identity Management, Windisch, Switzerland, 19–23 August 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 226–241. [Google Scholar]
  24. Berkovsky, S.; Taib, R.; Koprinska, I.; Wang, E.; Zeng, Y.; Li, J.; Kleitman, S. Detecting personality traits using eye-tracking data. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, Glasgow, UK, 4–9 May 2019; pp. 1–12. [Google Scholar]
  25. Kasneci, E.; Kasneci, G.; Appel, T.; Haug, J.; Wortha, F.; Tibus, M.; Trautwein, U.; Gerjets, P. TüEyeQ, a rich IQ test performance data set with eye movement, educational and socio-demographic information. Sci. Data 2021, 8, 154. [Google Scholar] [CrossRef]
  26. Holden, J.G.; Cosnard, A.; Laurens, B.; Asselineau, J.; Biotti, D.; Cubizolle, S.; Dupouy, S.; Formaglio, M.; Koric, L.; Seassau, M.; et al. Prodromal Alzheimer’s disease demonstrates increased errors at a simple and automated anti-saccade task. J. Alzheimer’s Dis. 2018, 65, 1209–1223. [Google Scholar] [CrossRef] [PubMed]
  27. Wadhera, T.; Kakkar, D. Eye Tracker: An Assistive Tool in Diagnosis of Autism Spectrum Disorder. In Emerging Trends in the Diagnosis and Intervention of Neurodevelopmental Disorders; IGI Global: Hershey, PA, USA, 2019; pp. 125–152. [Google Scholar]
  28. Wang, H.; Liu, F.; Dong, Y.; Yu, D. Entropy of eye movement during rapid automatized naming. Front. Hum. Neurosci. 2022, 16, 945406. [Google Scholar] [CrossRef] [PubMed]
  29. Wang, H.; Liu, F.; Dong, Y.; Yu, D. Features of eye movements during rapid automatized naming in Chinese children aged 7–11 years. Front. Pediatr. 2022, 10, 1051432. [Google Scholar] [CrossRef]
  30. Elder, T.E. The importance of relative standards in ADHD diagnoses: Evidence based on exact birth dates. J. Health Econ. 2010, 29, 641–656. [Google Scholar] [CrossRef]
  31. Caldani, S.; Acquaviva, E.; Moscoso, A.; Peyre, H.; Delorme, R.; Bucci, M.P. Reading performance in children with ADHD: An eye-tracking study. Ann. Dyslexia 2022, 72, 552–565. [Google Scholar] [CrossRef]
  32. Mohammadhasani, N.; Caprì, T.; Nucita, A.; Iannizzotto, G.; Fabio, R.A. Atypical visual scan path affects remembering in ADHD. J. Int. Neuropsychol. Soc. 2020, 26, 557–566. [Google Scholar] [CrossRef]
  33. Wolraich, M.L.; Hagan, J.F., Jr.; Allan, C.; Chan, E.; Davison, D.; Earls, M.; Evans, S.W.; Flinn, S.K.; Froehlich, T.; Frost, J.; et al. Clinical Practice Guideline for the Diagnosis, Evaluation, and Treatment of Attention-Deficit/Hyperactivity Disorder in Children and Adolescents. Pediatrics 2019, 144, e20192528. [Google Scholar] [CrossRef]
  34. Wymbs, B.T.; Canu, W.H.; Sacchetti, G.M.; Ranson, L.M. Adult ADHD and romantic relationships: What we know and what we can do to help. J. Marital Fam. Ther. 2021, 47, 664–681. [Google Scholar] [CrossRef]
  35. Lev, A.; Braw, Y.; Elbaum, T.; Wagner, M.; Rassovsky, Y. Eye tracking during a continuous performance test: Utility for assessing ADHD patients. J. Atten. Disord. 2022, 26, 245–255. [Google Scholar] [CrossRef]
  36. Stokes, J.D.; Rizzo, A.; Geng, J.J.; Schweitzer, J.B. Measuring attentional distraction in children with ADHD using virtual reality technology with eye-tracking. Front. Virtual Real. 2022, 3, 855895. [Google Scholar] [CrossRef]
  37. Valliappan, N.; Dai, N.; Steinberg, E.; He, J.; Rogers, K.; Ramachandran, V.; Xu, P.; Shojaeizadeh, M.; Guo, L.; Kohlhoff, K.; et al. Accelerating eye movement research via accurate and affordable smartphone eye tracking. Nat. Commun. 2020, 11, 4553. [Google Scholar] [CrossRef]
  38. Rojas-Líbano, D.; Wainstein, G.; Carrasco, X.; Aboitiz, F.; Crossley, N.; Ossandón, T. A pupil size, eye-tracking and neuropsychological dataset from ADHD children during a cognitive task. Sci. Data 2019, 6, 25. [Google Scholar] [CrossRef]
  39. First, M.B.; France, A.; Pincus, H.A. DSM-IV-TR Guidebook; American Psychiatric Publishing, Inc.: Washington, DC, USA, 2004. [Google Scholar]
  40. Sheehan, D.V.; Lecrubier, Y.; Sheehan, K.H.; Amorim, P.; Janavs, J.; Weiller, E.; Hergueta, T.; Baker, R.; Dunbar, G.C. The Mini-International Neuropsychiatric Interview (MINI): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J. Clin. Psychiatry 1998, 59, 22–33. [Google Scholar]
  41. Wainstein, G.; Rojas-Líbano, D.; Crossley, N.; Carrasco, X.; Aboitiz, F.; Ossandón, T. Pupil size tracks attentional performance in attention-deficit/hyperactivity disorder. Sci. Rep. 2017, 7, 8228. [Google Scholar] [CrossRef]
  42. Mealer, C.; Morgan, S.B.; Luscomb, R.L. Cognitive functioning of ADHD and non-ADHD boys on the WISC-III and WRAML: An analysis within a memory model. J. Atten. Disord. 1996, 1, 133–145. [Google Scholar] [CrossRef]
  43. Lopes, R.; Farina, M.; Wendt, G.; Esteves, C.; Argimon, I. WISC-III Sensibility in the identification of Attention Deficit Hyperactivity Disorder (ADHD). Panam. J. Neuropsychol. 2012, 6, 128–140. [Google Scholar] [CrossRef]
  44. Lencastre e Silva, P. Lévy Flight Model of Gaze Trajectories Code and Data. 2024. Available online: https://figshare.com/articles/software/ADHD-for-Github_replication_ipynb/25274065 (accessed on 10 April 2024).
  45. Mandelbrot, B.B.; Mandelbrot, B.B. The Fractal Geometry of Nature; WH Freeman: New York, NY, USA, 1982; Volume 1. [Google Scholar]
  46. Einstein, A. On the motion of small particles suspended in liquids at rest required by the molecular-kinetic theory of heat. Ann. Der Phys. 1905, 17, 208. [Google Scholar]
  47. Klafter, J.; Sokolov, I.M. First Steps in Random Walks: From Tools to Applications; Oxford University Press: Oxford, UK, 2011. [Google Scholar] [CrossRef]
  48. Buldyrev, S.; Goldberger, A.; Havlin, S.; Peng, C.; Simons, M.; Stanley, H. Generalized Levy-Walk Model for DNA Nucleotide Sequences. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Top. 1993, 47, 4514–4523. [Google Scholar] [CrossRef]
  49. Gorjao, L.R.; Hassan, G.; Kurths, J.; Witthaut, D. MFDFA: Efficient multifractal detrended fluctuation analysis in python. Comput. Phys. Commun. 2022, 273, 108254. [Google Scholar] [CrossRef]
  50. Campeau, W.; Simons, A.M.; Stevens, B. The evolutionary maintenance of Lévy flight foraging. PLoS Comput. Biol. 2022, 18, e1009490. [Google Scholar] [CrossRef]
  51. Guinard, B.; Korman, A. Intermittent inverse-square Lévy walks are optimal for finding targets of all sizes. Sci. Adv. 2021, 7, eabe8211. [Google Scholar] [CrossRef]
  52. Levernier, N.; Textor, J.; Bénichou, O.; Voituriez, R. Inverse square Lévy walks are not optimal search strategies for d ≥ 2. Phys. Rev. Lett. 2020, 124, 080601. [Google Scholar] [CrossRef]
  53. Buldyrev, S.; Raposo, E.; Bartumeus, F.; Havlin, S.; Rusch, F.; Da Luz, M.; Viswanathan, G. Comment on “inverse square Lévy walks are not optimal search strategies for d ≥ 2”. Phys. Rev. Lett. 2021, 126, 048901. [Google Scholar] [CrossRef]
  54. Wang, H.; Zhao, X.; Yu, D. Nonlinear features of gaze behavior during joint attention in children with autism spectrum disorder. Autism Res. 2023, 16, 1786–1798. [Google Scholar] [CrossRef]
  55. Deng, S.; Prasse, P.; Reich, D.R.; Dziemian, S.; Stegenwallner-Schütz, M.; Krakowczyk, D.; Makowski, S.; Langer, N.; Scheffer, T.; Jäger, L.A. Detection of ADHD based on eye movements during natural viewing. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Grenoble, France, 19–23 September 2022; Springer: Berlin/Heidelberg, Germany, 2022; pp. 403–418. [Google Scholar]
  56. Das, W.; Khanna, S. A robust machine learning based framework for the automated detection of ADHD using pupillometric biomarkers and time series analysis. Sci. Rep. 2021, 11, 16370. [Google Scholar] [CrossRef]
  57. Lencastre, P.; Lotfigolian, M.; Lind, P.G. Identifying autism from five-second records of eye-gaze trajectories. 2024; submitted. [Google Scholar]
  58. Fuchs, A.; Queirós, S.M.D.; Lind, P.G.; Girard, A.; Bouchet, F.; Wächter, M.; Peinke, J. Small scale structures of turbulence in terms of entropy and fluctuation theorems. Phys. Rev. Fluids 2020, 5, 034602. [Google Scholar] [CrossRef]
Figure 1. Example of one gaze trajectory for ADHD (left) and non-ADHD (middle) children, composed of 1500 data points, corresponding to 1.5 seconds of visual engagement. On the (right), we show a simulated Lévy flight with α = 1 .
Figure 1. Example of one gaze trajectory for ADHD (left) and non-ADHD (middle) children, composed of 1500 data points, corresponding to 1.5 seconds of visual engagement. On the (right), we show a simulated Lévy flight with α = 1 .
Entropy 26 00392 g001
Figure 2. The probability density function of the increment norm for different experiment participants. The power-law decay of the PDF is marked with a red line.
Figure 2. The probability density function of the increment norm for different experiment participants. The power-law decay of the PDF is marked with a red line.
Entropy 26 00392 g002
Figure 3. The probability density for the distribution of the first statistical moments of Δ r across participants: mean (top left), standard deviation (top right), skewness (bottom left), and the kurtosis (bottom right).
Figure 3. The probability density for the distribution of the first statistical moments of Δ r across participants: mean (top left), standard deviation (top right), skewness (bottom left), and the kurtosis (bottom right).
Entropy 26 00392 g003
Figure 4. Representation of the MSD as a function of the time window τ . The line in green represents the best quadratic fit to the data points, while the red one represents the best linear fit. We observe that, on a log–log plot, the empirically estimated points fall along a straight line, which corroborates the relation MSD ( τ ) τ γ .
Figure 4. Representation of the MSD as a function of the time window τ . The line in green represents the best quadratic fit to the data points, while the red one represents the best linear fit. We observe that, on a log–log plot, the empirically estimated points fall along a straight line, which corroborates the relation MSD ( τ ) τ γ .
Entropy 26 00392 g004
Figure 5. The probability density of the R-square for the straight line fits on the log–log plot of the PDF decay, by virtue of which α is estimated (left). On the (right) is shown the coefficient of determination corresponding to the straight line fit on the MFDFA.
Figure 5. The probability density of the R-square for the straight line fits on the log–log plot of the PDF decay, by virtue of which α is estimated (left). On the (right) is shown the coefficient of determination corresponding to the straight line fit on the MFDFA.
Entropy 26 00392 g005
Figure 6. Lévy exponent α distributions for ADHD and normally developed children are calculated via the increment’s PDF (left) and via the MFDFA method (right). The average standard error for the α estimation was 0.08 (maximum of 0.22 ) for the PDF method and around 0.03 (maximum of 0.08 ) for the MFDFA method. A Mann–Whitney U statistical test confirms that the difference in medians is significant, with a p-value of around p = 0.037 (PDF estimation) and p = 0.012 (MFDFA estimation).
Figure 6. Lévy exponent α distributions for ADHD and normally developed children are calculated via the increment’s PDF (left) and via the MFDFA method (right). The average standard error for the α estimation was 0.08 (maximum of 0.22 ) for the PDF method and around 0.03 (maximum of 0.08 ) for the MFDFA method. A Mann–Whitney U statistical test confirms that the difference in medians is significant, with a p-value of around p = 0.037 (PDF estimation) and p = 0.012 (MFDFA estimation).
Entropy 26 00392 g006
Table 1. Accuracy values for different models using SKF and LOOC CV strategies. The two top rows show the accuracy benchmark for models without including α as a model variable, while the bottom row shows the LOOC accuracy of the models including α .
Table 1. Accuracy values for different models using SKF and LOOC CV strategies. The two top rows show the accuracy benchmark for models without including α as a model variable, while the bottom row shows the LOOC accuracy of the models including α .
α in Model Type of CVLogisticSVCDecisionRandom
Variables Regression TreeForest
Excluding α SKF (k = 5)0.724 0.680 0.6110.680
Excluding α LOOC0.724 0.724 0.680 0.724
Including α LOOC 0.765 0.765 0.724 0.744
Table 2. On the left, the confusion matrix of the logistic regression benchmark model with an accuracy of 72.3 % , precision of 72.4 % , specificity of 61.9 % , and recall 80.7 % . On the right, we have the confusion matrix, including the benchmark parameters as well as the Lévy exponent, yielding an accuracy of 76.5 % , precision of 77.7 % , specificity of 71.5 % , and recall of 80.7 % .
Table 2. On the left, the confusion matrix of the logistic regression benchmark model with an accuracy of 72.3 % , precision of 72.4 % , specificity of 61.9 % , and recall 80.7 % . On the right, we have the confusion matrix, including the benchmark parameters as well as the Lévy exponent, yielding an accuracy of 76.5 % , precision of 77.7 % , specificity of 71.5 % , and recall of 80.7 % .
Predicted Label
True label ADHDNon-ADHDADHDNon-ADHD
ADHD138156
non-ADHD521521
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Papanikolaou, C.; Sharma, A.; Lind, P.G.; Lencastre, P. Lévy Flight Model of Gaze Trajectories to Assist in ADHD Diagnoses. Entropy 2024, 26, 392. https://doi.org/10.3390/e26050392

AMA Style

Papanikolaou C, Sharma A, Lind PG, Lencastre P. Lévy Flight Model of Gaze Trajectories to Assist in ADHD Diagnoses. Entropy. 2024; 26(5):392. https://doi.org/10.3390/e26050392

Chicago/Turabian Style

Papanikolaou, Christos, Akriti Sharma, Pedro G. Lind, and Pedro Lencastre. 2024. "Lévy Flight Model of Gaze Trajectories to Assist in ADHD Diagnoses" Entropy 26, no. 5: 392. https://doi.org/10.3390/e26050392

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop