Abstract
Spatial autocorrelation, which describes the similarity between signals on adjacent vertices, is central to spatial science, and Geary’s c is one of the most-prominent numerical measures of it. Using concepts from spectral graph theory, this paper documents new theoretical results on the measure. MATLAB/GNU Octave user-defined functions are also provided.
Keywords:
spatial autocorrelation; Geary’s c; spectral graph theory; graph Laplacian; graph Fourier transform MSC:
62H11; 05C50
1. Introduction
Using concepts from spectral graph theory, in this paper, we document new theoretical results on Geary’s c, which is one of the most-prominent numerical measures of spatial autocorrelation. (Moran’s I is another prominent numerical measure. For the measure, see, e.g., [1,2].) Here, spatial autocorrelation describes the similarity between signals on adjacent vertices and is central to spatial science ([3]). Therefore, the results about the measure contribute to the development of spatial science.
More specifically, we provide a new representation of Geary’s c. It is an expansion of it into a linear combination of variables with different degrees of spatial autocorrelation. By using the distribution of the coefficients, we can characterize spatial data. It is somewhat similar to Fourier series. Subsequently, we develop a way to compute the graph Laplacian eigenvectors needed for the graph Fourier transform. MATLAB/GNU Octave user-defined functions are also provided.
This paper can be considered complementary to [4]. As in this paper, using concepts from spectral graph theory, [4] provided three types of representations for it: (a) graph Laplacian representation, (b) graph Fourier transform representation, and (c) Pearson’s correlation coefficient representation. Our new representation can be regarded as an addition to them. Moreover, the way to compute the graph Laplacian eigenvectors is useful not only for this paper, but also for [4].
We make two remarks on Geary’s c. First, Geary’s c was developed by [5] and modified by [6,7,8,9]) into what it is today. It is a spatial generalization of the von Neumann ratio ([10]). Second, unlike Pearson’s correlation coefficient, there exists the following:
See, e.g., [11] (Equation (6)).
This paper is organized as follows. In Section 2, we provide some preliminaries for the following two sections. In Section 3, we present a new representation of Geary’s c. In Section 4, we develop a way to compute the graph Laplacian eigenvectors needed for the graph Fourier transform. Section 5 concludes. In Appendix A, we provide MATLAB/GNU Octave user-defined functions.
Some Notations
Let , be the identity matrix of order n and be the n-dimensional column vector of ones, i.e., . For an full column rank matrix , denote the column space of and its orthogonal complement by and , respectively. For square matrices , denotes the block diagonal matrix, whose diagonals are .
2. Preliminaries
Let denote the realization of a variable on a spatial unit i for . Here, we exclude the case where , i.e., . Accordingly, , where . In addition, let be the nonnegative spatial weight between the spatial units i and j. Here, we suppose that and for . Accordingly, the spatial weights matrix is an symmetric hollow matrix. In addition, let , which is assumed positive.
Geary’s c for , denoted by , is defined by
Let , where for . Then, the graph Laplacian in spectral graph theory (see, e.g., [12,13,14]) is defined as
Accordingly, as shown in, e.g., [4,14], is a nonnegative definite matrix such that .
Ref. [4] (Proposition 3.1) showed that can be represented using as
where , which is a symmetric idempotent matrix, i.e., and .
Given that is a real symmetric matrix, it can be spectrally decomposed as
where and is an orthogonal matrix. Here, denotes an eigenpair of for , and the eigenvalues, , are in ascending order. Given that is a nonnegative definite matrix and , we can suppose that . Let m denote the number of connected components. Then, it is known that . See, e.g., [14]. Let and . We show how to obtain , as well as from in Section 4.
In spectral graph theory, the linear transformation given by is referred to as the graph Fourier transform of ([15]). In addition, and for are referred to as graph Laplacian eigenvalues and graph Laplacian eigenvectors, respectively. Let .
Given that , for , and is an orthogonal projection matrix onto , it follows that , which yields
In addition, given that is symmetric and , it follows that
Moreover, is an orthogonal matrix. By combining these results, can be represented as
Here, given that , it follows that . Finally, we note that (7) is a part of [4] (Proposition 3.3).
3. A New Representation of Geary’s
Given that for , we can consider
which can be regarded as Geary’s c when . We note that, given , Geary’s c when is excluded. (Actually, it cannot be defined. This is because .)
For , it follows that and , where is the k-th column of . Thus, in (8) can be represented as
Then, from the inequalities, , it follows that
Moreover, given that , it follows that
The next proposition summarizes the above-mentioned results.
Proposition 1.
Remark 1.
Concerning Proposition 1, we make three remarks:
- (i)
- Proposition 1(a) implies that Geary’s c can be represented as a weighted average of . Concerning the weight, , the larger is, the larger is. We note that .
- (ii)
- Proposition 1(b) implies that the graph Laplacian eigenvectors, , can be sorted in the spatial autocorrelation measured by Geary’s c. In other words, is more positively spatially autocorrelated than or equal to for . Accordingly, is positively (respectively negatively) spatially autocorrelated if is a monotonically decreasing (respectively increasing) sequence. can be characterized by the distribution of the coefficients, . It is somewhat similar to the Fourier series.
- (iii)
- If , then regardless of .
Let , where and . Then, equals . That is, for all and , it follows that
Given that , is an example of (14). From (14) and Proposition 1(a), we obtain .
The next corollary summarizes the above result.
Corollary 1.
For all and , equals .
In addition, from Proposition 1, it immediately follows that
Likewise, follows.
The next corollary summarizes the above result.
Corollary 2.
belongs to the closed interval given by .
Remark 2.
Concerning Corollary 2, we make two remarks:
- (i)
- If , then the interval given by reduces to a singleton. For example, if , which is the binary adjacency matrix of the complete graph with n vertices, then and, accordingly, . Then, in this case, regardless of .
- (ii)
- Ref. [11] showed that belongs to the closed interval given by . Given (9), Corollary 2 is its equivalent.
4. A Way to Compute the Eigenvectors in (4)
In this section, we develop a way to compute , which also provides . Here, we explain the reason why it is useful. When there is only one connected component, i.e., , there is no problem. This is because has single 0 eigenvalue and the corresponding normalized eigenvector is . However, when there is more than one connected component, i.e., , has multiple 0 eigenvalues, and then, is not necessarily one of the eigenvectors returned from a computer program. For example, when
there are two connected components and, accordingly, (Figure 1). In this case, both and , where and , and and , where and , are eigenpairs of the corresponding graph Laplacian. The results below can handle such a situation.
Figure 1.
Undirected graph whose binary adjacency matrix is in (16).
Let denote any orthonormal basis of and , where and . Then, is an orthogonal matrix. Accordingly, from (4) and (6), it follows that
where . Here, from , , and , it follows that
where , which is an orthogonal matrix. This is because and . In addition, given that , it follows that
By combining (18) and (19), (17) becomes . Therefore, it follows that
Here, recall that is an orthogonal matrix. In addition, given that is an orthogonal matrix, premultiplying (18) by yields . Therefore, it follows that
The next proposition summarizes the above-mentioned results.
Proposition 2.
Denote the k-th column of by for , i.e., . Then, for are the eigenpairs of . In addition, is obtainable from by (21).
Remark 3.
Concerning Proposition 2, we make two remarks:
- (i)
- The following matrix is an example of :
- (ii)
- MATLAB/GNU Octave user-defined functions required for the calculation of , , and the bounds of Geary’s c are provided in Appendix A.
5. Concluding Remarks
In this paper, we showed new theoretical results on Geary’s c, which included (i) a new representation of Geary’s c and (ii) a way to compute the graph Laplacian eigenvectors. The obtained results are summarized in Propositions 1 and 2 and Corollaries 1 and 2. The required MATLAB/GNU Octave user-defined functions are also provided. Finally, as stated, this paper can be considered complementary to [4].
Funding
The Japan Society for the Promotion of Science supported this work through KAKENHI (grant number: 23K013770A).
Acknowledgments
The author thanks three anonymous referees for their valuable comments. The usual caveat applies.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. MATLAB/GNU Octave User-Defined Functions
In this section, we provide three MATLAB/GNU Octave user-defined functions. Among them, Lam2U2 is a function for calculating , as well as from . Gearycbounds is a function for calculating the bounds of Geary’s c corresponding to . Note that (A+A’)/2 in the functions is to ensure symmetry. Finally, these two functions depend on Fmat, which is a function to make .
- 1
- 2
- function [Lam2,U2]=Lam2U2(W)
- 3
- n=size(W,1);
- 4
- L=diag(sum(W,2))-W;
- 5
- F=Fmat(n); F2=F(:,2:n); A=F2’*L*F2;
- 6
- [X,E]=eig((A+A’)/2);
- 7
- [e,ind]=sort(diag(E),’ascend’);
- 8
- Lam2=diag(e); V2=X(:,ind);
- 9
- U2=F2*V2;
- 10
- end
- 1
- function [c_lb,c_ub]=Gearycbounds(W)
- 2
- n=size(W,1);
- 3
- Omega=sum(sum(W));
- 4
- L=diag(sum(W,2))-W;
- 5
- F=Fmat(n); F2=F(:,2:n); A=F2’*L*F2;
- 6
- eigv=sort(eig((A+A’)/2),’ascend’);
- 7
- c_lb=((n-1)/Omega)*eigv(1);
- 8
- c_ub=((n-1)/Omega)*eigv(n-1);
- 9
- end
- 1
- function [F]=Fmat(n)
- 2
- F=zeros(n,n);
- 3
- F(:,1)=ones(n,1)/sqrt(n);
- 4
- for k=2:n
- 5
- F(:,k)=[ones(k-1,1);-(k-1);zeros(n-k,1)]/sqrt((k-1)*k);
- 6
- end
- 7
- end
References
- Dray, S. A new perspective about Moran’s coefficient: Spatial autocorrelation as a linear regression problem. Geogr. Anal. 2011, 43, 127–141. [Google Scholar]
- Yamada, H. A unified perspective on some autocorrelation measures in different fields: A note. Open Math. 2023, 21, 20220574. [Google Scholar] [CrossRef]
- Getis, A. A history of the concept of spatial autocorrelation: A geographer’s perspective. Geogr. Anal. 2008, 40, 297–309. [Google Scholar] [CrossRef]
- Yamada, H. Geary’s c and spectral graph theory. Mathematics 2021, 9, 2465. [Google Scholar] [CrossRef]
- Geary, R.C. The contiguity ratio and statistical mapping. Inc. Stat. 1954, 5, 115–127+129–146. [Google Scholar] [CrossRef]
- Cliff, A.D.; Ord, J.K. The problem of spatial autocorrelation. In Studies in Regional Science; Scott, A.J., Ed.; Pion: London, UK, 1969; pp. 25–55. [Google Scholar]
- Cliff, A.D.; Ord, J.K. Spatial autocorrelation: A review of existing and new measures with applications. Econ. Geogr. 1970, 46, 269–292. [Google Scholar] [CrossRef]
- Cliff, A.D.; Ord, J.K. Spatial Autocorrelation; Pion: London, UK, 1973. [Google Scholar]
- Cliff, A.D.; Ord, J.K. Spatial Processes: Models and Applications; Pion: London, UK, 1981. [Google Scholar]
- Von Neumann, J. Distribution of the ratio of the mean square successive difference to the variance. Ann. Math. Stat. 1941, 12, 367–395. [Google Scholar] [CrossRef]
- De Jong, P.; Sprenger, C.; van Veen, F. On extreme values of Moran’s I and Geary’s c. Geogr. Anal. 1984, 16, 17–24. [Google Scholar] [CrossRef]
- Bapat, R.B. Graphs and Matrices, 2nd ed.; Springer: London, UK, 2014. [Google Scholar]
- Estrada, E.; Knight, P. A First Course in Network Theory; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
- Gallier, J. Spectral theory of unsigned and signed graphs. Applications to graph clustering: A survey. arXiv 2016, arXiv:1601.04692. [Google Scholar]
- Shuman, D.I.; Narang, S.K.; Frossard, P.; Ortega, A.; Vandergheynst, P. The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Process. Mag. 2013, 30, 83–98. [Google Scholar] [CrossRef]
- Maruyama, Y. An alternative to Moran’s I for spatial autocorrelation. arXiv 2015, arXiv:1501.06260v1. [Google Scholar]
- Lancaster, H.O. The Helmert Matrices. Am. Math. Mon. 1965, 72, 4–12. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).