Linear Regression with Optimal Rotation

Livadiotis, George

doi:10.3390/stats2040028

by

George Livadiotis

Southwest Research Institute, Space Science & Engineering, San Antonio, TX 78238, USA

Stats 2019, 2(4), 416-425; https://doi.org/10.3390/stats2040028

Submission received: 4 September 2019 / Revised: 20 September 2019 / Accepted: 25 September 2019 / Published: 28 September 2019

Abstract

The paper shows how the linear regression depends on the selection of the reference frame. The slope of the fitted line and the corresponding Pearson’s correlation coefficient are expressed in terms of the rotation angle. The correlation coefficient is found to be maximized for a certain optimal angle, for which the slope attains a special optimal value. The optimal angle, the value of the optimal slope, and the corresponding maximum correlation coefficient were expressed in terms of the covariance matrix, but also in terms of the values of the slope, derived from the fitting at the nonrotated and right-angle-rotated axes. The potential of the new method is to improve the derived values of the fitting parameters by detecting the optimal rotation angle, that is, the one that maximizes the correlation coefficient. The presented analysis was applied to the linear regression of density and temperature measurements characterizing the proton plasma in the inner heliosheath, the outer region of our heliosphere.

Keywords:

fitting; linear regression; correlation; heliosheath

1. Introduction

The fitting of a given dataset

{y_{i} \pm σ_{y i}}_{i = 1}^{N}

to the values

{V_{i}}_{i = 1}^{N}

of a statistical model

V (X; α)

in the domain

X \in D_{x} \subseteq ℜ

[1,2,3,4,5] involves finding the optimal parameter value

α = α^{*}

in

α \in D_{α} \subseteq ℜ

that minimizes the sum of squared residuals, also known in physical sciences as total square deviations (TSD) between model and data,

T S D {(α)}^{2} = \sum_{i = 1}^{N} σ_{y i}^{- 2} {[y_{i} - V (x_{i}; α)]}^{2},

(1)

where the inverse of variance of the data measurements

{w_{i} = σ_{y i}^{- 2}}_{i = 1}^{N}

weights the summation. In the case of multiparametrical fitting, we consider that the statistical model depends on n independent parameters,

{α_{k}}_{k = 1}^{n}

. Then, TSD becomes (e.g., [6,7,8,9]):

T S D {({α_{k}}_{k = 1}^{n})}^{2} = \sum_{i = 1}^{N} σ_{y i}^{- 2} {[y_{i} - V (x_{i}; {α_{k}}_{k = 1}^{n})]}^{2},

(2)

or, for a biparametrical (α and β) linear regression, we have:

T S D {(α, β)}^{2} = \sum_{i = 1}^{N} {(y_{i} - α - β x_{i})}^{2} .

(3)

Minimization of TSD leads to the optimal value of the involved parameters (least-square method), that is,

β_{0} = \frac{σ_{x y}^{2}}{σ_{x x}^{2}}, α = \bar{y} - β \cdot \bar{x} .

(4)

The optimal fitting value of the slope β is denoted β₀. The optimal value of the slope β for the inverse linear regression, that is, the regression of x on y expressed as a slope in the (x,y) plane, is:

β_{90} = \frac{σ_{y y}^{2}}{σ_{x y}^{2}},

(5)

where the ratio

β_{0} / β_{90}

equals the squared Pearson correlation coefficient, r², of the data; it attains its maximum value of 1 when

β_{0} = β_{90}

.
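Equations (4) and (5) can be illustrated numerically. The following is a minimal sketch, assuming NumPy and synthetic data; the function name `regression_slopes` and the sample values are illustrative and do not come from the paper.

```python
import numpy as np

def regression_slopes(x, y):
    """Return (beta0, beta90, r2) computed from the sample covariance matrix."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # bias=True divides by N; the slope ratios do not depend on this choice.
    cov = np.cov(x, y, bias=True)
    s_xx, s_xy, s_yy = cov[0, 0], cov[0, 1], cov[1, 1]
    beta0 = s_xy / s_xx    # Equation (4): regression of y on x
    beta90 = s_yy / s_xy   # Equation (5): regression of x on y, written as dy/dx
    return beta0, beta90, beta0 / beta90

# Synthetic example data (assumed for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = 0.7 * x + rng.normal(scale=0.5, size=500)
beta0, beta90, r2 = regression_slopes(x, y)
print(beta0, beta90, r2)
```

The ratio of the two slopes reproduces the squared correlation coefficient, so the two regression lines coincide only for perfectly correlated data.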

In this paper, we focus on the optimal fitting value of the slope β, derived from the linear regression of data expressed on the rotated Cartesian axes. The slope β and the corresponding Pearson’s correlation coefficient are calculated as a function of the rotation angle ϑ, while the optimal fitting value of the slope is derived from the maximization of the correlation coefficient. Finally, we present an application from the discipline of space physics. In particular, the presented analysis is applied to the linear regression of density and temperature measurements characterizing the proton plasma in the inner heliosheath, the outer region of our heliosphere. Those measurements can be fitted with a line of negative slope, which gives the polytropic index of the plasma. The new statistical method improves the applied linear regression and the estimated value of the polytropic index.

2. Rotated Optimal Slope

The orthogonal Cartesian system is rotated by an angle ϑ, according to

(\begin{matrix} \tilde{x} \\ \tilde{y} \end{matrix}) = (\begin{matrix} \cos ϑ & \sin ϑ \\ - \sin ϑ & \cos ϑ \end{matrix}) (\begin{matrix} x \\ y \end{matrix}) .

(6)

Then, the total square deviations are given by:

T S D {(\tilde{α}, \tilde{β})}^{2} = \sum_{i = 1}^{N} {({\tilde{y}}_{i} - \tilde{α} - \tilde{β} {\tilde{x}}_{i})}^{2} .

(7)

Next, we express the parameter values, the slope

\tilde{β}

and intercept

\tilde{α}

, in terms of their values at the nonrotated system. First, we observe that the slope is transformed as follows:

\frac{d \tilde{y}}{d \tilde{x}} = \frac{- \sin ϑ + \cos ϑ \cdot \frac{d y}{d x}}{\cos ϑ + \sin ϑ \cdot \frac{d y}{d x}},

(8)

hence,

\tilde{β} = \frac{- \sin ϑ + \cos ϑ \cdot β}{\cos ϑ + \sin ϑ \cdot β} .

(9)

To find the transformed intercept, we consider the yellow-shaded triangle in Figure 1, where we have:

β = \frac{a - \tilde{a} \cdot \cos ϑ}{\tilde{a} \cdot \sin ϑ},

(10)

thus, we end up with:

\tilde{a} = \frac{a}{\cos ϑ + \sin ϑ \cdot β} .

(11)

Therefore, the total square deviations in the rotated reference frame are given by:

T S D {(\tilde{α}, \tilde{β})}^{2} = {(\frac{T S D (α, β)}{\cos ϑ + \sin ϑ \cdot β})}^{2} = \sum_{i = 1}^{N} {(\frac{y_{i} - α - β \cdot x_{i}}{\cos ϑ + \sin ϑ \cdot β})}^{2},

(12)

(with

\cos ϑ + \sin ϑ \cdot β \neq 0

). Hence, the minimization of TSD can still be expressed in terms of the linear parameters α and β of the nonrotated frame; namely, we first derive the normal equation with respect to α:

\begin{matrix} 0 = \frac{\partial}{\partial α} \sum_{i = 1}^{N} {(\frac{y_{i} - α - β \cdot x_{i}}{\cos ϑ + \sin ϑ \cdot β})}^{2} = - \frac{2}{{(\cos ϑ + \sin ϑ \cdot β)}^{2}} \cdot \sum_{i = 1}^{N} (y_{i} - α - β \cdot x_{i}) \\ = - \frac{2 N}{{(\cos ϑ + \sin ϑ \cdot β)}^{2}} \cdot (\bar{y} - α - β \cdot \bar{x}) = 0, \end{matrix}

(13)

leading to

\bar{y} - α - β \cdot \bar{x} = 0 .

(14)

Next, we derive the normal equation with respect to β:

\begin{matrix} 0 = \frac{\partial}{\partial β} \sum_{i = 1}^{N} {(\frac{y_{i} - α - β \cdot x_{i}}{\cos ϑ + \sin ϑ \cdot β})}^{2} = - \frac{2}{{(\cos ϑ + \sin ϑ \cdot β)}^{3}} \\ \times \sum_{i = 1}^{N} (y_{i} - α - β \cdot x_{i}) [\cos ϑ \cdot x_{i} + \sin ϑ \cdot (y_{i} - α)] \\ = - \frac{2}{{(\cos ϑ + \sin ϑ \cdot β)}^{3}} \\ \times \sum_{i = 1}^{N} [(\cos ϑ - β \cdot \sin ϑ) \cdot x_{i} (y_{i} - α) - β \cdot \cos ϑ \cdot x_{i}^{2} + \sin ϑ \cdot {(y_{i} - α)}^{2}] \\ = - \frac{2 N}{{(\cos ϑ + \sin ϑ \cdot β)}^{3}} \\ \times [(\cos ϑ - β \cdot \sin ϑ) \cdot \bar{x (y - α)} - β \cdot \cos ϑ \cdot \bar{x^{2}} + \sin ϑ \cdot \bar{{(y - α)}^{2}}] = 0, \end{matrix}

(15)

or

\begin{array}{l} (\cos ϑ - β \cdot \sin ϑ) \cdot \bar{x (y - α)} - β \cdot \cos ϑ \cdot \bar{x^{2}} + \sin ϑ \cdot \bar{{(y - α)}^{2}} \\ = (\cos ϑ - β \cdot \sin ϑ) \cdot \bar{(x - \bar{x}) (y - \bar{y})} + (\cos ϑ - β \cdot \sin ϑ) \cdot \bar{x} (\bar{y} - α) \\ - β \cdot \cos ϑ \cdot \bar{x^{2}} + \sin ϑ \cdot \bar{{(y - \bar{y})}^{2}} + \sin ϑ \cdot {(\bar{y} - α)}^{2} \\ = (\cos ϑ - β \cdot \sin ϑ) \cdot \bar{(x - \bar{x}) (y - \bar{y})} + (β \cos ϑ - β^{2} \cdot \sin ϑ) \cdot {\bar{x}}^{2} \\ - β \cdot \cos ϑ \cdot \bar{x^{2}} + \sin ϑ \cdot \bar{{(y - \bar{y})}^{2}} + β^{2} \cdot \sin ϑ \cdot {\bar{x}}^{2} \\ = (\cos ϑ - β \cdot \sin ϑ) \cdot \bar{(x - \bar{x}) (y - \bar{y})} - β \cdot \cos ϑ \cdot \bar{{(x - \bar{x})}^{2}} + \sin ϑ \cdot \bar{{(y - \bar{y})}^{2}} \\ = (\cos ϑ - β \cdot \sin ϑ) \cdot σ_{x y}^{2} - β \cdot \cos ϑ \cdot σ_{x x}^{2} + \sin ϑ \cdot σ_{y y}^{2} = 0 . \end{array}

(16)

Hence, we have

β = β (ϑ) = \frac{σ_{x y}^{2} + σ_{y y}^{2} \cdot \tan ϑ}{σ_{x x}^{2} + σ_{x y}^{2} \cdot \tan ϑ},

(17)

or

β (ϑ) = \frac{1 + \frac{σ_{y y}^{2}}{σ_{x y}^{2}} \cdot \tan ϑ}{\frac{σ_{x x}^{2}}{σ_{x y}^{2}} + \tan ϑ} = \frac{1 + β_{90} \cdot \tan ϑ}{β_{0}^{- 1} + \tan ϑ} .

(18a)

We observe that

β = β_{0}

for ϑ = 0 and

β = β_{90}

for ϑ = 90°.
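Equation (17) can be cross-checked against a direct fit in the rotated frame. The sketch below assumes NumPy and synthetic data; the helper names `beta_of_theta` and `beta_by_rotated_fit` are hypothetical. The second helper performs an ordinary least-squares fit on the rotated coordinates of Equation (6) and maps the fitted direction back with the inverse rotation.

```python
import numpy as np

def beta_of_theta(x, y, theta):
    """Slope expressed in the nonrotated frame, from Equation (17)."""
    cov = np.cov(x, y, bias=True)
    s_xx, s_xy, s_yy = cov[0, 0], cov[0, 1], cov[1, 1]
    t = np.tan(theta)
    return (s_xy + s_yy * t) / (s_xx + s_xy * t)

def beta_by_rotated_fit(x, y, theta):
    """Least-squares fit in the rotated frame, mapped back to the (x, y) frame."""
    c, s = np.cos(theta), np.sin(theta)
    xr = c * x + s * y           # Equation (6), first row
    yr = -s * x + c * y          # Equation (6), second row
    cov_r = np.cov(xr, yr, bias=True)
    slope_r = cov_r[0, 1] / cov_r[0, 0]
    # Direction (1, slope_r) of the fitted line, rotated back by the inverse of Equation (6).
    dx = c - s * slope_r
    dy = s + c * slope_r
    return dy / dx

# Synthetic example data (assumed for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = 0.7 * x + rng.normal(scale=0.5, size=500)
for theta in (0.0, 0.3, -0.5, 1.0):
    print(theta, beta_of_theta(x, y, theta), beta_by_rotated_fit(x, y, theta))
```

Both columns should agree to rounding error, and the value at ϑ = 0 is the ordinary least-squares slope β₀.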

Finally, we note that

\tilde{β} (ϑ)

is the slope of the fitted curve derived from the fitting performed in the rotated frame, while

β (ϑ)

is the expression of the slope

\tilde{β} (ϑ)

in the nonrotated frame. One may find

\tilde{β} (ϑ)

, that is, the slope expressed in the rotated frame, simply by substituting Equation (18a) in Equation (9):

\tilde{β} = \frac{- \tan ϑ + β}{1 + \tan ϑ \cdot β} .

(18b)
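The intercept transformation of Equation (11) and the residual scaling of Equation (12) can be verified numerically. The following is a sketch under stated assumptions: NumPy, arbitrary illustrative values for α, β, and ϑ, and hypothetical helper names. The line is carried into the rotated frame by rotating two of its points with Equation (6), so no closed-form slope transformation is assumed.

```python
import numpy as np

def rotate(x, y, theta):
    """Apply the rotation of Equation (6)."""
    c, s = np.cos(theta), np.sin(theta)
    return c * x + s * y, -s * x + c * y

def line_in_rotated_frame(alpha, beta, theta):
    """Intercept and slope of y = alpha + beta * x as seen in the rotated frame."""
    px = np.array([0.0, 1.0])        # two points on the line
    py = alpha + beta * px
    qx, qy = rotate(px, py, theta)   # the same two points in rotated coordinates
    slope = (qy[1] - qy[0]) / (qx[1] - qx[0])
    intercept = qy[0] - slope * qx[0]
    return intercept, slope

# Illustrative values (assumptions, not taken from the paper).
alpha, beta, theta = 0.4, 0.7, 0.3
rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = alpha + beta * x + rng.normal(scale=0.5, size=200)

a_rot, b_rot = line_in_rotated_frame(alpha, beta, theta)
xr, yr = rotate(x, y, theta)
scale = np.cos(theta) + np.sin(theta) * beta
res = y - alpha - beta * x            # residuals in the nonrotated frame
res_rot = yr - a_rot - b_rot * xr     # residuals in the rotated frame
print(a_rot, alpha / scale)
print(np.sum(res_rot ** 2), np.sum(res ** 2) / scale ** 2)
```

Each printed pair should match, which is the content of Equations (11) and (12) for this choice of parameters.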

3. Maximized Correlation Coefficient

Pearson’s correlation coefficient is derived as follows. First, we construct the covariance and variances, and then the correlation coefficient in the rotated reference frame. We have:

\bar{\tilde{x} \tilde{y}} = \sin ϑ \cos ϑ \cdot (\bar{y^{2}} - \bar{x^{2}}) + (\cos^{2} ϑ - \sin^{2} ϑ) \cdot \bar{x y},

(19)

\bar{\tilde{x}} \bar{\tilde{y}} = \sin ϑ \cos ϑ \cdot ({\bar{y}}^{2} - {\bar{x}}^{2}) + (\cos^{2} ϑ - \sin^{2} ϑ) \cdot \bar{x} \bar{y},

(20)

thus, we obtain:

{\tilde{σ}}_{x y}^{2} = \sin ϑ \cos ϑ \cdot (σ_{y y}^{2} - σ_{x x}^{2}) + (\cos^{2} ϑ - \sin^{2} ϑ) \cdot σ_{x y}^{2} .

(21)

Similarly, we have:

{\tilde{σ}}_{x x}^{2} = \cos^{2} ϑ \cdot σ_{x x}^{2} + \sin^{2} ϑ \cdot σ_{y y}^{2} + 2 \sin ϑ \cos ϑ \cdot σ_{x y}^{2},

(22)

{\tilde{σ}}_{y y}^{2} = \sin^{2} ϑ \cdot σ_{x x}^{2} + \cos^{2} ϑ \cdot σ_{y y}^{2} - 2 \sin ϑ \cos ϑ \cdot σ_{x y}^{2} .

(23)

Hence, the Pearson’s correlation coefficients for the normal and the rotated frames are:

r_{0}^{2} = \frac{σ_{x y}^{4}}{σ_{x x}^{2} σ_{y y}^{2}}, r^{2} (ϑ) = \frac{{\tilde{σ}}_{x y}^{4}}{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}} .

(24)

Note: While the covariance cov(x,y) or the Pearson’s correlation coefficient can be either positive or negative, their algebraic sign has no effect on the method of correlation maximization presented here. Indeed, the examined datasets may be correlated, r > 0, or anti-correlated, r < 0, but the relevant question is how strong the correlation or anticorrelation is; hence, the meaningful quantity to be maximized is the absolute correlation or the square of the correlation, r². For the same reason, the sign of the covariance does not affect the presented analysis, and it is ignored. For this reason, we denote the covariance with the squared symbol

σ_{x y}^{2}

, although it may be negative.
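Equations (21) to (23) are the entries of the rotated covariance matrix, which can be written compactly as a matrix product. The sketch below assumes NumPy and synthetic data; `rotated_covariance` and `r2_from_cov` are illustrative names, not part of the paper.

```python
import numpy as np

def rotated_covariance(cov, theta):
    """Covariance matrix in the rotated frame: R @ cov @ R.T, with R from Equation (6)."""
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, s], [-s, c]])
    return rot @ cov @ rot.T

def r2_from_cov(cov):
    """Squared Pearson correlation coefficient, as in Equation (24)."""
    return cov[0, 1] ** 2 / (cov[0, 0] * cov[1, 1])

# Synthetic example data (assumed for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = 0.7 * x + rng.normal(scale=0.5, size=500)

theta = 0.4
c, s = np.cos(theta), np.sin(theta)
cov = np.cov(x, y, bias=True)
cov_rot = rotated_covariance(cov, theta)
cov_direct = np.cov(c * x + s * y, -s * x + c * y, bias=True)  # covariance of rotated data
print(cov_rot)
print(cov_direct)
print(r2_from_cov(cov), r2_from_cov(cov_rot))
```

The two printed matrices should coincide, and the off-diagonal entry reproduces Equation (21).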

We observe that

\begin{matrix} {\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2} = [\cos^{2} ϑ \cdot σ_{x x}^{2} + \sin^{2} ϑ \cdot σ_{y y}^{2} + 2 \sin ϑ \cos ϑ \cdot σ_{x y}^{2}] \\ \times [\sin^{2} ϑ \cdot σ_{x x}^{2} + \cos^{2} ϑ \cdot σ_{y y}^{2} - 2 \sin ϑ \cos ϑ \cdot σ_{x y}^{2}] \\ = \sin^{2} ϑ \cos^{2} ϑ \cdot (σ_{x x}^{4} + σ_{y y}^{4} - 4 σ_{x y}^{4}) + (1 - 2 \sin^{2} ϑ \cos^{2} ϑ) \cdot σ_{x x}^{2} σ_{y y}^{2} \\ + 2 \sin ϑ \cos ϑ (\cos^{2} ϑ - \sin^{2} ϑ) \cdot σ_{x y}^{2} (σ_{y y}^{2} - σ_{x x}^{2}) . \end{matrix}

(25)

Then, we have

\begin{matrix} {\tilde{σ}}_{x y}^{4} = {[\sin ϑ \cos ϑ \cdot (σ_{y y}^{2} - σ_{x x}^{2}) + (\cos^{2} ϑ - \sin^{2} ϑ) \cdot σ_{x y}^{2}]}^{2} \\ = {(\cos^{2} ϑ - \sin^{2} ϑ)}^{2} \cdot σ_{x y}^{4} + \sin^{2} ϑ \cos^{2} ϑ \cdot (σ_{x x}^{4} + σ_{y y}^{4} - 2 σ_{x x}^{2} σ_{y y}^{2}) \\ + 2 \sin ϑ \cos ϑ (\cos^{2} ϑ - \sin^{2} ϑ) \cdot (σ_{y y}^{2} - σ_{x x}^{2}) σ_{x y}^{2} \\ = σ_{x y}^{4} - σ_{x x}^{2} σ_{y y}^{2} + {\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}, \end{matrix}

(26)

where we used the identity

\sin^{4} ϑ + \cos^{4} ϑ = 1 - 2 \sin^{2} ϑ \cos^{2} ϑ

. Hence, we find the invariant:

{\tilde{σ}}_{x y}^{4} - {\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2} = σ_{x y}^{4} - σ_{x x}^{2} σ_{y y}^{2} .

(27)
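The invariant of Equation (27) can also be read as the invariance of the determinant of the covariance matrix under rotation. The following short derivation is a sketch; the matrix symbols Σ and R are introduced here for illustration and are not used in the original text.

```latex
\tilde{\Sigma} = R \, \Sigma \, R^{T}, \qquad
\Sigma = \begin{pmatrix} \sigma_{xx}^{2} & \sigma_{xy}^{2} \\ \sigma_{xy}^{2} & \sigma_{yy}^{2} \end{pmatrix}, \qquad
R = \begin{pmatrix} \cos\vartheta & \sin\vartheta \\ -\sin\vartheta & \cos\vartheta \end{pmatrix}, \qquad \det R = 1,
\\[4pt]
\det \tilde{\Sigma} = \det R \cdot \det \Sigma \cdot \det R^{T} = \det \Sigma
\;\;\Longrightarrow\;\;
\tilde{\sigma}_{xx}^{2} \tilde{\sigma}_{yy}^{2} - \tilde{\sigma}_{xy}^{4}
= \sigma_{xx}^{2} \sigma_{yy}^{2} - \sigma_{xy}^{4} .
```

The trace is preserved as well, so the sum of the two rotated variances equals the sum of the nonrotated ones; only the product of the variances changes with ϑ, which is why r²(ϑ) in Equation (24) depends on the rotation angle.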

Then, the correlation coefficient at the rotated system becomes:

\begin{matrix} r^{2} (ϑ) - 1 = \frac{{\tilde{σ}}_{x y}^{4} - {\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}}{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}} = \frac{σ_{x y}^{4} - σ_{x x}^{2} σ_{y y}^{2}}{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}} \\ = \frac{σ_{x y}^{4} - σ_{x x}^{2} σ_{y y}^{2}}{σ_{x x}^{2} σ_{y y}^{2}} \cdot \frac{σ_{x x}^{2} σ_{y y}^{2}}{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}} = (r_{0}^{2} - 1) \cdot \frac{σ_{x x}^{2} σ_{y y}^{2}}{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}}, \end{matrix}

(28)

or

\frac{1 - r_{0}^{2}}{1 - r^{2} (ϑ)} = \frac{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}}{σ_{x x}^{2} σ_{y y}^{2}} .

(29)

Hence, we first calculate the right-hand side of Equation (29):

\begin{array}{l} \frac{{\tilde{σ}}_{x x}^{2} {\tilde{σ}}_{y y}^{2}}{σ_{x x}^{2} σ_{y y}^{2}} = 1 + \sin^{2} ϑ \cos^{2} ϑ \cdot (\frac{σ_{x x}^{2}}{σ_{y y}^{2}} + \frac{σ_{y y}^{2}}{σ_{x x}^{2}} - 2 - 4 r_{0}^{2}) \\ + 2 \sin ϑ \cos ϑ (\cos^{2} ϑ - \sin^{2} ϑ) \cdot r_{0} \cdot (\frac{σ_{y y}^{}}{σ_{x x}^{}} - \frac{σ_{x x}^{}}{σ_{y y}^{}}), \end{array}

(30)

thus, Equation (29) becomes:

\begin{matrix} [\frac{1 - r_{0}^{2}}{1 - r^{2} (ϑ)} - 1] \cdot \frac{1}{r_{0}^{2}} = 4 \sin^{2} ϑ \cos^{2} ϑ \cdot {{[\frac{1}{2 r_{0}} (\frac{σ_{y y}^{}}{σ_{x x}^{}} - \frac{σ_{x x}^{}}{σ_{y y}^{}})]}^{2} - 1} \\ + 4 \sin ϑ \cos ϑ (\cos^{2} ϑ - \sin^{2} ϑ) \cdot [\frac{1}{2 r_{0}} (\frac{σ_{y y}^{}}{σ_{x x}^{}} - \frac{σ_{x x}^{}}{σ_{y y}^{}})] . \end{matrix}

(31)

Using trigonometric functions of angle 4ϑ, we easily find:

\begin{matrix} [\frac{1 - r_{0}^{2}}{1 - r^{2} (ϑ)} - 1] \cdot \frac{1}{r_{0}^{2}} = \sin 4 ϑ \cdot [\frac{1}{2 r_{0}} (\frac{σ_{y y}^{}}{σ_{x x}^{}} - \frac{σ_{x x}^{}}{σ_{y y}^{}})] \\ + \frac{1}{2} (1 - \cos 4 ϑ) \cdot {{[\frac{1}{2 r_{0}} (\frac{σ_{y y}^{}}{σ_{x x}^{}} - \frac{σ_{x x}^{}}{σ_{y y}^{}})]}^{2} - 1} . \end{matrix}

(32)

This can be rewritten in a compact way:

[\frac{1 - r_{0}^{2}}{1 - r^{2} (ϑ)} - 1] \cdot \frac{1}{r_{0}^{2}} = \frac{1}{2} (1 - \cos 4 ϑ) \cdot (X^{2} - 1) + \sin 4 ϑ \cdot X,

(33)

where

X \equiv \frac{σ_{y y}^{2} - σ_{x x}^{2}}{2 σ_{x y}^{2}} = \frac{1}{2} (β_{90}^{} - β_{0}^{- 1}) .

(34)

Given

\sin 4 ϑ = 2 \tan 2 ϑ / (1 + \tan^{2} 2 ϑ)

and

\cos 4 ϑ = (1 - \tan^{2} 2 ϑ) / (1 + \tan^{2} 2 ϑ)

, we rewrite the correlation coefficient as:

[\frac{1 - r_{0}^{2}}{1 - r^{2} (ϑ)} - 1] \cdot \frac{1}{r_{0}^{2}} = \frac{{(\tan 2 ϑ \cdot X + 1)}^{2}}{\tan^{2} 2 ϑ + 1} - 1 .

(35)

Note that, as expected, for ϑ = 0 and ϑ = 90°, the correlation coefficient remains the same,

r_{90}^{2} = r_{0}^{2}

.
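Equation (35) can be inverted for r²(ϑ) and compared with the correlation coefficient computed directly from the rotated data. This is a minimal sketch assuming NumPy and synthetic data; the function names are hypothetical.

```python
import numpy as np

def r2_of_theta(x, y, theta):
    """Squared correlation in the rotated frame, obtained by inverting Equation (35)."""
    cov = np.cov(x, y, bias=True)
    s_xx, s_xy, s_yy = cov[0, 0], cov[0, 1], cov[1, 1]
    r0_sq = s_xy ** 2 / (s_xx * s_yy)
    big_x = (s_yy - s_xx) / (2.0 * s_xy)      # Equation (34)
    t2 = np.tan(2.0 * theta)
    rhs = (t2 * big_x + 1.0) ** 2 / (t2 ** 2 + 1.0) - 1.0
    # Solve [(1 - r0^2) / (1 - r^2) - 1] / r0^2 = rhs for r^2.
    return 1.0 - (1.0 - r0_sq) / (1.0 + r0_sq * rhs)

def r2_direct(x, y, theta):
    """Squared Pearson correlation of the data rotated with Equation (6)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.corrcoef(c * x + s * y, -s * x + c * y)[0, 1] ** 2

# Synthetic example data (assumed for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = 0.7 * x + rng.normal(scale=0.5, size=500)
for theta in (0.0, 0.2, -0.4, 1.1):
    print(theta, r2_of_theta(x, y, theta), r2_direct(x, y, theta))
```

The sketch avoids angles where tan 2ϑ diverges (ϑ = ±45°); at those angles the limit of Equation (35) would have to be taken analytically.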

The correlation coefficient is maximized [10] for the optimal angle

ϑ = ϑ_{opt}

:

\frac{\partial}{\partial ϑ} [\frac{1 - r_{0}^{2}}{1 - r^{2} (ϑ)} - 1] \cdot \frac{1}{r_{0}^{2}} = 2 \sin 4 ϑ \cdot (X^{2} - 1) + 4 \cos 4 ϑ \cdot X = 0,

(36)

or

\tan 4 ϑ_{opt} = \frac{2 X}{1 - X^{2}} .

(37)

Given the double-angle formula for the tangent,

\tan 2 ϑ = 2 \tan ϑ / (1 - \tan^{2} ϑ)

, we derive the solutions:

(i) \tan 2 ϑ_{opt} = X and (ii) \tan 2 ϑ_{opt} = - \frac{1}{X}

(38)

or

(i) \tan ϑ_{opt} = (- 1 \pm \sqrt{1 + X^{2}}) / X and (ii) \tan ϑ_{opt} = X \pm \sqrt{1 + X^{2}},

(39)

where the subscript “opt” denotes the optimal rotation angle,

ϑ_{opt}

, for which the correlation is maximized (i.e.,

r_{opt}^{2} \equiv r^{2} (ϑ_{opt})

). Then, from Equation (35), we derive the respective optimal correlation coefficient for each of the two solutions:

- For the (i) solution, we have a maximum of r²:

{(\frac{1 - r_{0}^{2}}{1 - r^{2}} - 1)}_{opt} \cdot \frac{1}{r_{0}^{2}} = X^{2} or {(\frac{1 - r_{0}^{2}}{1 - r^{2}})}_{opt} = 1 + r_{0}^{2} X^{2},

(40)

and solving in terms of

r_{opt}^{2}

:

r_{opt}^{2} = r_{0}^{2} \cdot \frac{1 + X^{2}}{1 + r_{0}^{2} X^{2}} .

(41)

We observe that the maximized correlation coefficient at the rotated system ranges from

r_{opt}^{2} = r_{0}^{2}

for X = 0 to

r_{opt}^{2} = 1

for X→∞. The case

r_{opt}^{2} = r_{0}^{2}

is consistent with ϑ = 0; hence, we expect for X = 0 to have ϑ = 0. The positive root in the (i) solution gives ϑ→0 for X→0, which is consistent with

r_{opt}^{2} = r_{0}^{2}

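. However, the negative root gives ϑ→±90° for X→0, because the two roots of solution (i) have a product equal to −1 and thus correspond to directions that differ by a right angle; thus, the negative sign case must be rejected.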

- For the (ii) solution, we have a minimum of r²:

{(\frac{1 - r_{0}^{2}}{1 - r^{2}} - 1)}_{opt} \cdot \frac{1}{r_{0}^{2}} = - 1 or {(\frac{1 - r_{0}^{2}}{1 - r^{2}})}_{opt} = 1 - r_{0}^{2}, or r_{opt}^{2} = 0 .

(42)

We seek the maximum correlation, so we reject the minimum of the (ii) solution and Equation (42). Therefore, the only acceptable solution is (i):

\tan ϑ_{opt} = (- 1 + \sqrt{1 + X^{2}}) / X .

(43)
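The optimal angle of Equation (43) and the maximum of Equation (41) can be compared with a brute-force scan over the rotation angle. The sketch below assumes NumPy and synthetic data; `optimal_rotation` and `r2_direct` are illustrative names, and the grid scan is only a numerical check, not part of the method.

```python
import numpy as np

def r2_direct(x, y, theta):
    """Squared Pearson correlation of the data rotated with Equation (6)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.corrcoef(c * x + s * y, -s * x + c * y)[0, 1] ** 2

def optimal_rotation(x, y):
    """Return (theta_opt, r2_opt) from Equations (43) and (41)."""
    cov = np.cov(x, y, bias=True)
    s_xx, s_xy, s_yy = cov[0, 0], cov[0, 1], cov[1, 1]
    r0_sq = s_xy ** 2 / (s_xx * s_yy)
    big_x = (s_yy - s_xx) / (2.0 * s_xy)                          # Equation (34)
    if big_x == 0.0:
        tan_opt = 0.0                                              # limit of Equation (43) for X -> 0
    else:
        tan_opt = (-1.0 + np.sqrt(1.0 + big_x ** 2)) / big_x       # Equation (43)
    r2_opt = r0_sq * (1.0 + big_x ** 2) / (1.0 + r0_sq * big_x ** 2)  # Equation (41)
    return np.arctan(tan_opt), r2_opt

# Synthetic example data (assumed for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = 0.7 * x + rng.normal(scale=0.5, size=500)

theta_opt, r2_opt = optimal_rotation(x, y)
thetas = np.linspace(-np.pi / 2, np.pi / 2, 4001)
grid = np.array([r2_direct(x, y, t) for t in thetas])
print(theta_opt, r2_opt, grid.max())
```

The closed-form maximum should agree with the largest value found on the grid, up to the grid resolution.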

Solving Equation (18a) in terms of tanϑ, we obtain:

β_{opt} = β_{0} \cdot \frac{1 + β_{90} \cdot \tan ϑ_{opt}}{1 + β_{0} \cdot \tan ϑ_{opt}} \Leftrightarrow \tan ϑ_{opt} = β_{0}^{- 1} \cdot \frac{β_{0} - β_{opt}}{β_{opt} - β_{90}} .

(44)

Therefore, the optimal value of the slope can be expressed in terms of the slope for ϑ = 0,

β_{0}

, and ϑ = 90°,

β_{90}

. Namely,

β_{0}^{- 1} \cdot \frac{β_{0} - β_{opt}}{β_{opt} - β_{90}} = \tan ϑ_{opt} = (- 1 + \sqrt{1 + X^{2}}) / X, X = \frac{1}{2} (β_{90}^{} - β_{0}^{- 1}) .

(45)

We obtain

β_{opt} = β_{0}

for X→0, leading to

r_{opt}^{2} = r_{0}^{2}

, and

β_{opt} = β_{45} = β_{0} \cdot (1 + β_{90}) / (1 + β_{0})

for X→∞, leading to

r_{opt}^{2} = 1

(see [11]).

In general, the optimal value of the slope is given by:

β_{opt} = β_{90} \cdot \frac{- \frac{1}{2} (1 + β_{90}^{- 1} β_{0}^{- 1}) + \sqrt{1 + \frac{1}{4} {(β_{90}^{} - β_{0}^{- 1})}^{2}}}{\frac{1}{2} β_{0}^{- 1} (β_{90}^{} - β_{0}^{- 1}) - 1 + \sqrt{1 + \frac{1}{4} {(β_{90}^{} - β_{0}^{- 1})}^{2}}} .

(46)
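Equations (44) and (46) give the same optimal slope, which can be confirmed numerically. This is a sketch assuming NumPy; the function names and the sample slope pairs are illustrative, and pairs with X = 0 (that is, β₉₀ = 1/β₀) are deliberately avoided because Equation (43) then has to be taken as a limit.

```python
import numpy as np

def beta_opt_eq44(beta0, beta90):
    """Optimal slope from Equations (43) and (44)."""
    big_x = 0.5 * (beta90 - 1.0 / beta0)
    tan_opt = (-1.0 + np.sqrt(1.0 + big_x ** 2)) / big_x
    return beta0 * (1.0 + beta90 * tan_opt) / (1.0 + beta0 * tan_opt)

def beta_opt_eq46(beta0, beta90):
    """Optimal slope from the closed form of Equation (46)."""
    root = np.sqrt(1.0 + 0.25 * (beta90 - 1.0 / beta0) ** 2)
    num = -0.5 * (1.0 + 1.0 / (beta90 * beta0)) + root
    den = 0.5 * (beta90 - 1.0 / beta0) / beta0 - 1.0 + root
    return beta90 * num / den

# Illustrative slope pairs (assumptions, not taken from the paper).
for beta0, beta90 in ((0.7, 1.05), (-0.9, -1.5), (2.0, 2.5)):
    print(beta0, beta90, beta_opt_eq44(beta0, beta90), beta_opt_eq46(beta0, beta90))
```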

4. Application to the Inner Heliosheath

Classical collisional particle systems residing in thermal equilibrium have their particle velocity/energy distribution function stabilized into a Maxwell-Boltzmann distribution. On the contrary, space and astrophysical plasmas are collisionless particle systems residing in stationary states characterized by kappa distributions [12,13,14,15]. The role of kappa distributions has become increasingly widespread across the physics of space plasma processes, describing particles in the heliosphere, from the solar wind and planetary magnetospheres to the heliosheath and beyond, the interstellar and intergalactic plasmas.

The presented statistical analysis is applied to the measurements of number density n and temperature T of the proton plasma in the inner heliosheath, the outer boundary of our heliosphere. By fitting kappa distributions to the proton energy distributions derived from the IBEX mission [16], Livadiotis et al. [17,18,19] derived the sky maps of the radially averaged values of several thermodynamic quantities in the inner heliosheath, such as the number density n, the temperature T, and the kappa indices κ₀; these thermal observables parameterize the kappa distributions. The kappa index is the parameter that labels the kappa distribution, and its physical meaning is interwoven with the correlation between particles; in fact, the stronger the correlations, the smaller the kappa index [13,15,20].

The measurements of density and temperature of the inner heliosheath are anticorrelated. In particular, the logarithms of these measurements are linearly related with a negative slope, which is determined by the polytropic index. The polytropic behavior stands for the power-law relationship,

T \propto n^{γ - 1}

, where γ is the polytropic index, along a certain streamline of the plasma flow. A polytrope is a certain thermodynamic process characterized by such a relation. Typically, this is a power-law between two thermodynamic variables,

[P (\vec{r}) / P ({\vec{r}}_{*})] = {[n (\vec{r}) / n ({\vec{r}}_{*})]}^{γ}, or [n (\vec{r}) / n ({\vec{r}}_{*})] = {[T (\vec{r}) / T ({\vec{r}}_{*})]}^{ν}, with ν \equiv 1 / (γ - 1),

(47)

where the exponent γ denotes the typical polytropic index; ν is another way of expressing the polytropic index, which corresponds to the effective degrees of freedom. While most space plasmas exhibit positive correlations between density and temperature, there are several plasmas with negative correlations, consistent with constant or quasi-constant thermal pressure, that were found in the heliosheath [10,17,18,19,20,21] and planetary magnetosheaths (e.g., the low-latitude boundary layer at the terrestrial magnetosheath [22]; the terrestrial central plasma sheet [23]; and the Jovian magnetosheath [24]). Other plasmas exhibit more complicated relationships that also involve the kappa index [17,25].

The temperature T and density n values of the proton plasma along the equatorial streamline from the heliospheric nose towards the heliotail were shown to be characterized by a negative correlation [10,26,27]. More precisely, it was shown that the variations of T and n follow a near-isobaric process, that is, a polytropic index

γ \approx 0

(or

ν \approx - 1

).

Here, we examine the linear regression of the data (x,y), where x = log(n) and y = log(1/T). The slope β coincides with the quantity 1−γ; thus, the polytropic index γ can be retrieved once we estimate the slope β. The correlation coefficient r²(ϑ) and the slope β(ϑ) for a rotated reference frame are plotted in panels (a) and (b) of Figure 2, respectively. We also exclude the data points with small kappa index, as these carry larger errors. The plots are depicted for different thresholds, κ_0min, that is, 0.1 (red), 0.25 (blue), and 0.4 (green). We observe that the larger the kappa index threshold, the more the correlation coefficient shifts to the left, to the negative values of tanϑ. This corresponds to smaller values of the slope, and thus larger values of the polytropic index. The optimal value of tanϑ is the one that maximizes the correlation coefficient; this is plotted against the kappa index threshold κ_0min in panel (c) of Figure 2.
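The workflow described above can be sketched as follows. This is only an illustration under stated assumptions: NumPy, a hypothetical function `polytropic_index`, base-10 logarithms (the base does not affect the slope), a "greater than or equal" threshold on the kappa index, and purely synthetic near-isobaric data in arbitrary units. None of the numbers below are IBEX measurements.

```python
import numpy as np

def polytropic_index(n, T, kappa0, kappa0_min):
    """Return (gamma_opt, gamma_0) from the fit of log(1/T) against log(n).

    gamma_opt uses the optimally rotated slope of Equation (44);
    gamma_0 uses the ordinary nonrotated slope beta0.
    """
    keep = kappa0 >= kappa0_min              # drop points below the kappa threshold (assumed rule)
    x = np.log10(n[keep])
    y = np.log10(1.0 / T[keep])
    cov = np.cov(x, y, bias=True)
    s_xx, s_xy, s_yy = cov[0, 0], cov[0, 1], cov[1, 1]
    beta0 = s_xy / s_xx
    beta90 = s_yy / s_xy
    big_x = 0.5 * (beta90 - 1.0 / beta0)
    tan_opt = 0.0 if big_x == 0.0 else (-1.0 + np.sqrt(1.0 + big_x ** 2)) / big_x
    beta_opt = beta0 * (1.0 + beta90 * tan_opt) / (1.0 + beta0 * tan_opt)
    return 1.0 - beta_opt, 1.0 - beta0       # gamma = 1 - beta

# Synthetic near-isobaric data (gamma close to 0), arbitrary units, for illustration only.
rng = np.random.default_rng(1)
n = 10 ** rng.uniform(-3.0, -2.0, size=300)
T = 1.0e3 / n * 10 ** rng.normal(scale=0.05, size=300)
kappa0 = rng.uniform(0.0, 1.0, size=300)
gamma_opt, gamma_0 = polytropic_index(n, T, kappa0, kappa0_min=0.25)
print(gamma_opt, gamma_0)
```

For an exact power law the two estimates coincide, since β₀ = β₉₀ and the rotation leaves the slope unchanged; they differ only when the data scatter about the line.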

5. Conclusions

The paper presented the dependence of the linear regression on the selection of the reference frame. Both the slope of the fitted line and the corresponding Pearson’s correlation coefficient were expressed in terms of the rotation angle. The correlation coefficient was found to be maximized for a certain optimal angle, for which the slope attains a special optimal value. The optimal angle, the value of the optimal slope, and the corresponding maximum correlation coefficient were expressed in terms of the covariance matrix, but also in terms of the slopes β₀ and β₉₀, namely, those derived from the nonrotated and right-angle-rotated fittings.

The presented analysis can be used as a method for improving the fitting and optimizing the derived values of the involved parameters. The analysis can be extended to include the following cases:

- The data points are given with errors, ${x_{i} \pm σ_{x i}}_{i = 1}^{N}$, ${y_{i} \pm σ_{y i}}_{i = 1}^{N}$.
- The statistical model is nonlinear and multiparametrical, for example, a polynomial of order M, that is, $V (x_{i}; {α_{k}}_{k = 1}^{n}) = \sum_{k = 0}^{M} α_{k} x_{i}^{k}$.
- The estimation of the errors of the parameters, ${δ α_{k}}_{k = 1}^{n}$.
- The minimization of the deviations in the L_p norm [10,11,12,28].

The presented analysis was applied to the density and temperature measurements of the proton plasma in the inner heliosheath, the outer boundary of our heliosphere. Those measurements are known to be fitted by a line with a negative slope, which is related to the polytropic index of the plasma. The application of the new statistical method improved the estimated value of this index.

Finally, Table 1 summarizes the critical equations shown in this paper.

Funding

This research was funded by NASA’s HGI Program, grant number NNX17AB74G.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Kenney, J.F.; Keeping, E.S. Linear Regression and Correlation. In Mathematics of Statistics, 3rd ed.; Van Nostrand: Princeton, NJ, USA, 1962; pp. 252–285.
2. McCullagh, P. What is a statistical model? Ann. Stat. 2002, 30, 1225–1310.
3. Adèr, H.J. Modelling. In Advising on Research Methods: A Consultant’s Companion; Adèr, H.J., Mellenbergh, G.J., Eds.; Johannes van Kessel Publishing: Huizen, The Netherlands, 2008; pp. 271–304.
4. Melissinos, A.C. Experiments in Modern Physics; Academic Press Inc.: London, UK, 1966; pp. 438–464.
5. Burden, R.L.; Faires, J.D. Numerical Analysis; PWS Publishing Company: Boston, MA, USA, 1993; pp. 437–438.
6. Livadiotis, G. Approach to general methods for fitting and their sensitivity. Physica A 2007, 375, 518–536.
7. Livadiotis, G. Expectation values and Variance based on L^p norms. Entropy 2012, 14, 2375–2396.
8. Livadiotis, G. Chi-p distribution: Characterization of the goodness of the fitting using L^p norms. J. Stat Distr. Appl. 2014, 1, 4.
9. Livadiotis, G.; Moussas, X. The sunspot as an autonomous dynamical system: A model for the growth and decay phases of sunspots. Physica A 2007, 379, 436–458.
10. Livadiotis, G.; McComas, D.J. Fitting method based on correlation maximization: Applications in Astrophysics. J. Geophys. Res. 2013, 118, 2863–2875.
11. Schmid, J., Jr. The relationship between the coefficient of correlation and the angle included between regression lines. J. Educ. Res. 1947, 41, 311–313.
12. Livadiotis, G. Kappa Distribution: Theory & Applications in Plasmas, 1st ed.; Elsevier: Amsterdam, The Netherlands, 2017.
13. Livadiotis, G.; McComas, D.J. Beyond kappa distributions: Exploiting Tsallis statistical mechanics in space plasmas. J. Geophys. Res. 2009, 114, A11105.
14. Livadiotis, G.; McComas, D.J. Understanding kappa distributions: A toolbox for space science and astrophysics. Space Sci. Rev. 2013, 175, 183–214.
15. Livadiotis, G. Thermodynamic origin of kappa distributions. Europhys. Lett. 2018, 122, 50001.
16. McComas, D.J.; Allegrini, F.; Bochsler, P.; Bzowski, M.; Christian, E.R.; Crew, G.B.; DeMajistre, R.; Fahr, H.; Fichtner, H.; Frisch, P.C.; et al. Global observations of the interstellar interaction from the Interstellar Boundary Explorer (IBEX). Science 2009, 326, 959.
17. Livadiotis, G.; McComas, D.J.; Dayeh, M.A.; Funsten, H.O.; Schwadron, N.A. First sky map of the inner heliosheath temperature using IBEX spectra. Astrophys. J. 2011, 734, 1.
18. Livadiotis, G.; McComas, D.J.; Randol, B.; Möbius, E.; Dayeh, M.A.; Frisch, P.C.; Funsten, H.O.; Schwadron, N.A.; Zank, G.P. Pick-up ion distributions and their influence on ENA spectral curvature. Astrophys. J. 2012, 751, 64.
19. Livadiotis, G.; McComas, D.J.; Schwadron, N.A.; Funsten, H.O.; Fuselier, S.A. Pressure of the proton plasma in the inner heliosheath. Astrophys. J. 2013, 762, 134.
20. Livadiotis, G.; McComas, D.J. Invariant kappa distribution in space plasmas out of equilibrium. Astrophys. J. 2011, 741, 88.
21. Elliott, H.A.; McComas, D.J.; Zirnstein, E.J.; Randol, B.M.; Delamere, P.A.; Livadiotis, G.; Bagenal, F.; Barnes, N.P.; Stern, S.A.; Young, L.A.; et al. Slowing of the solar wind in the outer heliosphere. Astrophys. J. 2019, in press.
22. Sckopke, N.; Paschmann, G.; Haerendel, G.; Sonnerup, B.U.O.; Bame, S.J.; Forbes, T.G.; Hones, E.W., Jr.; Russell, C.T. Structure of the low-latitude boundary layer. J. Geophys. Res. 1981, 86, 2099–2110.
23. Pang, X.X.; Cao, J.B.; Liu, W.; Ma, Y.; Lu, H.; Yang, J.; Li, L.; Liu, X.; Wang, J.; Wang, T.; et al. Polytropic index of central plasma sheet ions based on MHD Bernoulli integral. J. Geophys. Res. 2015, 120, 4736–4747.
24. Nicolaou, G.; McComas, D.J.; Bagenal, F.; Elliott, H.A.; Wilson, R.J. Plasma properties in the deep Jovian magnetotail. Planet. Space Sci. 2015, 119, 222–232.
25. Ogasawara, K.; Angelopoulos, V.; Dayeh, M.A.; Fuselier, S.A.; Livadiotis, G.; McComas, D.J.; McFadden, J.P. Characterizing the dayside magnetosheath using ENAs: IBEX and THEMIS observations. J. Geophys. Res. 2013, 118, 3126–3137.
26. Livadiotis, G.; McComas, D.J. Non-equilibrium thermodynamic processes: Space plasmas and the inner heliosheath. Astrophys. J. 2012, 749, 11.
27. Livadiotis, G. Superposition of polytropes in the inner heliosheath. Astrophys. J. Suppl. Ser. 2016, 223, 13.
28. Livadiotis, G. Non-Euclidean-normed Statistical Mechanics. Physica A 2016, 445, 240–255.

Figure 1. Scheme of the rotated axes and the corresponding symbols. The angle noted by the symbol “))” in the yellow-shaded triangle has its tangent equal to β and is related to Equation (10).

Figure 2. (a) The Pearson’s correlation coefficient (square) r², and (b) the slope β, both at the rotation reference frame, are plotted against the tangent of the rotation angle tanϑ. We exclude data points with small kappa index, and the plots are depicted for different thresholds (i.e., κ_0min: 0.1 (red), 0.25 (blue), and 0.4 (green)). (c) The optimal tanϑ is plotted against the kappa index threshold κ_0min.

Table 1. Formulae of the developed method for finding the optimal rotation of maximum correlation.

Optimal Quantity	Formula
$\tan ϑ_{opt}$	$(- 1 + \sqrt{1 + X^{2}}) / X$
$β_{opt}$	$β_{0} \cdot \frac{1 + β_{90} \cdot \tan ϑ_{opt}}{1 + β_{0} \cdot \tan ϑ_{opt}}$
$r^{2}_{opt}$	$r_{0}^{2} \cdot \frac{1 + X^{2}}{1 + r_{0}^{2} X^{2}}$

Note: We set

β_{90} = σ_{y y}^{2} / σ_{x y}^{2}

,

β_{0}^{} = σ_{x y}^{2} / σ_{x x}^{2}

, and

X = \frac{1}{2} (β_{90} - β_{0}^{- 1})

.

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
