Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry

Chalikias, Miltiadis S.

doi:10.3390/math7040378

Open AccessArticle

Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry

by

Miltiadis S. Chalikias

Department of Accounting and Finance, School of Business, Economics and Social Sciences, University of West Attica, 12244 Egaleo, Greece

Mathematics 2019, 7(4), 378; https://doi.org/10.3390/math7040378

Submission received: 18 February 2019 / Revised: 9 April 2019 / Accepted: 13 April 2019 / Published: 25 April 2019

(This article belongs to the Special Issue Applied and Computational Statistics)

Download Versions Notes

Abstract

:

In this paper, we construct optimal repeated measurement designs of two treatments for estimating direct effects, and we examine the case of compound symmetry dependency. We present the model and the design that minimizes the variance of the estimated difference of the two treatments. The optimal designs with dependent observations in a compound symmetry model are the same as in the case of independent observations.

Keywords:

repeated measurement designs; compound symmetry

1. Introduction

In repeated measurement designs, a sequence of treatments is applied to each experimental unit (e.u.). In particular, one treatment is applied in each period. For example, for two treatments, A and B, and three periods, a possible sequence is ABA, which means that the treatments A, B, and A are respectively applied at the beginning of each of the three periods. The direct effect of a treatment is the effect of the treatment which is applied in the period that is examined. The residual effect is the effect of the treatment which is applied in the period preceding the period that is examined. In the case of two treatments, A and B, the direct

τ_{A}

and

τ_{B}

can be estimated. In every period, a treatment is applied, so either

τ_{A}

or

τ_{B}

is estimable. In this paper, the parameter of interest is the difference of direct effects

τ = τ_{A} - τ_{B}

.

Most researchers who have investigated repeated measurement designs, such as [1,2,3,4,5,6], have been occupied with universally optimal designs where the observations are independent. However, researchers have also shown interest in designs with dependent observations, as in the cases of [7,8,9,10,11].

The model we use in this paper, and which is presented below, was first introduced by Hedayat and Afsarinejad [12,13]. In previous research [14,15] using this model, the author of this article studied two treatment designs under the assumption that consecutive observations were independent. Building on that previous work, in the present article the author examines the case of compound symmetry dependency. The aim is to find a design that corresponds to a minimum variance estimator.

2. The Model

A compound symmetry model has the following characteristics:

(i): For each sequence, the variance matrix is of the form $Σ_{m} = a I_{m} + b J_{m}$ , where $I_{m}$ is the unit m × m matrix, and $J_{m}$ is the m × m matrix where all elements are equal to 1 (m is the number of periods).
(ii): The observations corresponding to different treatment sequences (different e.u.) are independent, and the number of sequences is $2^{m}$ .

The goal is to find the design that corresponds to the minimum variance estimator. I show that, in this case, the optimal design regarding the direct effect is the same as in the model of independent observations, and only the variance of the estimator is different.

The model is [12]:

y_{i j k} = μ + τ + π_{j} + δ_{i, j - 1} + γ_{i} + ζ_{k} + e_{i j k}

(1)

j corresponds to the j-th period, j = 1, 2, …, m;
i corresponds to the i-th sequence, $i = 0, 1, \dots 2^{m} - 1$ ;
k corresponds to the unit k = 1, 2, …, n;
$τ_{A}, τ_{B}$ : are direct effects of treatments A and B;
$π_{j}$ : is the effect of the j-th period;
$δ_{A}, δ_{B}$ : are the residual effects of A and B;
$γ_{i}$ : is the effect of the i-th sequence; and
$ζ_{k}$ : is the effect of the k-th e.u. (subject effect), which is a random variable, independent of the error $e_{i j k}$ .

The errors

e_{i j k}

are assumed to be independent. However, the quantities

ζ_{k} + e_{i j k}

are independent only between sequences and not within sequences.

The overparameterized model vector form of the above model is written as:

Y = τ_{A} τ_{A} + τ_{B} τ_{B} + δ_{A} δ_{A} + δ_{B} δ_{B} + π_{1} π_{1} + \dots + π_{m} π_{m} + + γ_{0} γ_{0} + \dots + γ_{q} γ_{q} + e

(2)

where

q = 2^{m} - 1

and

Y, τ_{A}, τ_{B}, δ_{A}, δ_{B}, π_{1}, \dots π_{m}, γ_{0}, \dots γ_{q}, e

are

1 \times m n

vectors; the direct effect vector is 1 if the treatment is A, and zero if it is B. For example, for the sequence ABB…,

τ_{A} = [\begin{matrix} 1 \\ 0 \\ 0 \\ ⋮ \end{matrix}]

and, in the same way,

τ_{B}, δ_{A}, δ_{B} π_{ι} and γ_{ι}

are defined so that

τ_{A} + τ_{B} = 1_{m n}

,

δ_{A} + δ_{B} + π_{1} = 1_{m n}

, and

π_{1} + π_{2} + \dots + π_{m} = 1_{m n}

. Also, 1 when the ith unit is employed, and 0 elsewhere, so

γ_{0} + γ_{1} + γ_{2} + \dots + γ_{2^{m} - 1} = 1_{m n}

. So, in equation (2) there are linearly dependent vectors.

Keeping only the linear independent vector [16], the model (2) is transformed to

E (Y) = τ (τ_{A} - τ_{B}) + δ (δ_{A} - δ_{B}) + π_{1} π_{1} + \dots + π_{m - 1} π_{m - 1} + + γ_{0} γ_{0} + \dots + γ_{q - 1} γ_{q - 1}

where

q = 2^{m} - 1

. In a vector form:

Y = X b + e \Leftrightarrow Y = (\begin{matrix} X_{1} & X_{2} \end{matrix}) (\begin{matrix} b_{1} \\ b_{2} \end{matrix})

(3)

where Y is (mn) × 1, the design matrix X is (mn) × s, b is s × 1, e is (mn) × 1, and s is the number of unknown parameters. If we are interested only in some and not in all of the parameters, then we write

b^{'} = (\begin{matrix} b_{1}^{'} & b_{2}^{'} \end{matrix})

, where

b_{1}

is the r parameters of interest, and

b_{2}

is the s-r remaining parameters.

We assume only one parameter of interest for the difference of the direct effects,

τ = τ_{A} - τ_{B}

, which can be considered as the direct effect of A in the case of

τ_{B} = 0

. In order to guarantee the estimability of the model, we postulate the restrictions

τ_{B} = 0, π_{m} = 0, γ_{2^{m} - 1} = 0

.

The matrix

X_{1}

corresponds to the coefficients of τ, and the matrix

X_{2}

corresponds to the coefficients of the rest of the non-random variables. Let us assume

V = X_{2} {(X_{2}^{T} Σ^{- 1} X_{2})}^{- 1} X_{2}^{T}

is a

(m n) \times (m n)

matrix, P the projection matrix of

X_{2}

,

P = X_{2} {(X_{2}^{T} X_{2})}^{- 1} X_{2}^{T}

and Σ are the

(m n) \times (m n)

variance matrix of the observations.

From the ordinal least-squares equations, we derive the following relation for the estimation of the main effect τ:

(X_{1}^{T} Σ^{- 1} X_{1} - X_{1}^{T} Σ^{- 1} V Σ^{- 1} X_{1}) \hat{τ} = X_{1}^{T} Σ^{- 1} (I - P Σ^{- 1}) Y

We also have

var (τ) = σ^{2} {(X_{1}^{T} Σ^{- 1} X_{1} - X_{1}^{T} Σ^{- 1} V Σ^{- 1} X_{1})}^{- 1} = σ^{2} Q^{- 1}

(4)

3. The Case of Compound Symmetry

The observations are dependent within sequences with variance matrix

Σ_{m}

. The observations from different sequences are independent, therefore:

Σ = [⋮ \begin{matrix} Σ_{m} & 0 & \dots & 0 \\ 0 & Σ_{m} & \dots & 0 \\ ⋱ & ⋮ \\ 0 & 0 & \dots & Σ_{m} \end{matrix}] and V = [\begin{matrix} V_{m 0} & 0 & \dots & 0 \\ 0 & V_{m 1} & \dots & 0 \\ ⋱ & ⋮ \\ 0 & 0 & \dots & V_{m q} \end{matrix}]

where

q = 2^{m} - 1

and

V_{m j} = X_{2 j} {(X_{2 j}^{T} Σ_{m}^{- 1} X_{2 j})}^{- 1} X_{2 j}^{T}

.

In order to obtain a sequence enumeration, the binary enumeration system was used, with 0 corresponding to A, and 1 to B. Thus, we obtained the enumerations

0, 1, \dots, 2^{m} - 1

. For example, if we have five periods and the sequence BABBA, then this is the 13th sequence, since

B A B B A \leftrightarrow 1 \cdot 2^{0} + 0 \cdot 2^{0} + 1 \cdot 2^{2} + 1 \cdot 2^{3} + 0 \cdot 2^{4} = 13

. For two periods, we have four sequences, that is,

A A \leftrightarrow 0, B A \leftrightarrow 1, A B \leftrightarrow 2, B B \leftrightarrow 3

. For three periods (two treatments) we have eight sequences:

A	B	A	B	A	B	A	B
A	A	B	B	A	A	B	B
A	A	A	A	B	B	B	B
u₀	u₁	u₂	u₃	u₄	u₅	u₆	u₇

where

u_{i}

i = 0, 2, 3, 4, 5, 6, 7 is the number of units that received the i-th sequence of treatments. The sequences that we obtain by substituting A for B and vice versa are called dual or reversal designs. Observe that for these sequences, we obtain the enumeration 7 − i, i = 0, 1, 2, 3.

Proposition 1.

For a repeated measurement design with m periods, n experimental units, and a variance matrix

Σ

that consists of n diagonal block matrices of the form

Σ_{m} = a I_{m} + b J_{m}

,

(X_{1}^{T} Σ^{- 1} X_{1} - X_{1}^{T} Σ^{- 1} V Σ^{- 1} X_{1}) = \frac{1}{a} (X_{1}^{T} X_{1}^{} - X_{1}^{T} P X_{1}^{})

where

P = X_{2} {(X_{2}^{T} X_{2})}^{- 1} X_{2}^{T}

.

Proof.

Let

{\tilde{X}}_{1} = Σ^{- 1 / 2} X_{1}, {\tilde{X}}_{2} = Σ^{- 1 / 2} X_{2}

, and

\tilde{Y} = Σ^{- 1 / 2} Y

. Then

(X_{1}^{T} Σ^{- 1} X_{1} - X_{1}^{T} Σ^{- 1} V Σ^{- 1} X_{1}) = X_{1}^{T} X_{1}^{} - {\tilde{X}}_{1}^{T} P {\tilde{X}}_{1}^{}

where

\tilde{P} = {\tilde{X}}_{2} {({\tilde{X}}_{2}^{T} {\tilde{X}}_{2})}^{- 1} {\tilde{X}}_{2}^{T}

. In other words,

\tilde{P}

is the matrix of the orthogonal projection to

R ({\tilde{X}}_{2})

. □

X_{1 j} j = 0, 1, 2 \dots m

is the m × 1 matrix of τ in the j-th sequence, and

X_{2 j} j = 0, 1, 2 \dots m

is the

m x (m + 2^{m})

matrix of the parameters

μ, π_{1}, π_{2}, \dots π_{m - 1}, δ_{A}, δ_{B}, γ_{1}, γ_{2}, \dots, γ_{q}

, where

q = 2^{m} - 1

.

For example, for three periods (m = 3), we have the matrices:

X_{1} = [\begin{matrix} X_{10}} u_{0} \\ X_{11}} u_{1} \\ ⋮ \\ X_{17}} u_{7} \end{matrix}] and X_{2} = [\begin{matrix} X_{20} \\ X_{21} \\ ⋮ \\ X_{27} \end{matrix}]

For the linear space

R ({\tilde{X}}_{2 j})

and for any sequence (m observations)

R ({\tilde{X}}_{2 j}) = R (X_{2 j})

, we observe the following:

(i) The matrix

(a I_{m} + b J_{m})

is positive definite, so the matrix

{(a I_{m} + b J_{m})}^{- \frac{1}{2}}

is also positive definite, and we conclude that:

{(a I_{m} + b J_{m})}^{- \frac{1}{2}} = \frac{1}{a} (I_{m} - \frac{b}{a + b m} J_{m}) \Leftrightarrow {(a I_{m} + b J_{m})}^{- \frac{1}{2}} = \frac{1}{\sqrt{a}} (I_{m} - δ J_{m})

where

δ = \frac{\sqrt{\frac{a}{a + b m}}}{m}

,

1 - δ \cdot m > 0

, and we have

{\tilde{X}}_{2 j} = \frac{1}{\sqrt{α}} {(I_{m} - δ J_{m})}^{- 1} X_{2 j}

.

(ii) The coefficients of the general mean are 1, so

1_{m} \in R (X_{2 j})

and.

\frac{1}{\sqrt{a}} (I_{m} - δ J_{m}) \cdot 1_{m} = \frac{1}{\sqrt{a + b m}} 1_{m} \Rightarrow 1_{m} \in R ({\tilde{X}}_{2 j})

(iii) If z is another column vector, and

z \in R (X_{2 j})

, then

\frac{1}{\sqrt{a}} (I_{m} - δ J_{m}) z = \frac{1}{\sqrt{a}} (z - δ (1_{m_{}}^{T} z) 1_{m}) \Rightarrow z \in R ({\tilde{X}}_{2 j}) \Leftrightarrow R ({\tilde{X}}_{2 j}) = R (X_{2 j})

(iv) If

{\tilde{P}}_{m}

is the matrix of the orthogonal projection to the linear space

R ({\tilde{X}}_{2 j})

, then

{\tilde{P}}_{m j} = P_{m j}

, where

P_{m j} = X_{2 j} {(X_{2 j}^{T} X_{2 j})}^{- 1} X_{2 j}^{T}

is the matrix of the orthogonal projection to

R (X_{2 j})

and

P_{m j} \cdot 1_{m} = 1_{m} \Rightarrow P_{m j} \cdot J_{m} = J_{m}

. From the above, we conclude that:

({\tilde{X}}_{1 j}^{T} {\tilde{X}}_{1 j} - {\tilde{X}}_{1 j}^{T} {\tilde{P}}_{m j} {\tilde{X}}_{1 j}) = {\tilde{X}}_{1 j}^{T} {\tilde{X}}_{1 j}^{} - {\tilde{X}}_{1 j}^{T} {\tilde{P}}_{m j} {\tilde{X}}_{1 j}^{} = \frac{1}{a} (X_{1 j}^{T} X_{1 j}^{} - X_{1 j}^{T} P_{m j} X_{1 j}^{}) (I_{m} - {\tilde{P}}_{m j}) {\tilde{X}}_{1 j} = \frac{1}{\sqrt{a}} (I_{m} - P_{m j}) (I_{m} - δ J_{m}) X_{1 j} = \frac{1}{\sqrt{a}} (I_{m} - P_{m j}) X_{1 j} ({\tilde{X}}_{1}^{T} {\tilde{X}}_{1} - {\tilde{X}}_{1}^{T} \tilde{P} {\tilde{X}}_{1}) = \sum_{j = 0}^{q} ({\tilde{X}}_{1 j}^{T} {\tilde{X}}_{1 j}^{} - {\tilde{X}}_{1 j}^{T} {\tilde{P}}_{m j} {\tilde{X}}_{1 j}^{}) = \frac{1}{a} (X_{1}^{T} X_{1}^{} - X_{1}^{T} P X_{1}^{})

Corollary 1.

The designs that result in the estimators with the minimum variance, i.e.,

\min var (\hat{τ})

are exactly the optimal designs of the model with independent observations. In this case, the variance

var (\hat{τ})

is multiplied by α:

var (τ) = σ^{2} {(X_{1}^{T} X_{1} - X_{1}^{T} P X_{1})}^{- 1} = σ^{2} a \cdot {(Q^{*})}^{- 1}

σ^{2} {(Q^{*})}^{- 1}

is the variance of the optimal designs in the model with independent observations).

Proof.

From the previous proof, we conclude that the variance of the estimator of the direct effect, which is given by Formula (3), equals to

var (τ) = σ^{2} a \cdot {(Q^{*})}^{- 1}

□

Comments: (1) If we consider that an observation can influence another observation, the e.u are correlated, and the correlation is given by ρ, −1 < ρ < 1. Dependent observations are often considered observations of the same cluster [17]. A simple example of dependency appears when children of the same mother are included in a sample. Due to their common household environment and genes, it is expected that these children have a greater chance of having the same characteristics.

(2) In the case of compound symmetry, the variance matrix of each sequence observations is

Σ_{m} = (1 - ρ) I_{m} + ρ J_{m}

, so α = 1 − ρ, and b = ρ. In order for the matrix to be positive definite, the condition

- \frac{1}{m - 1} < ρ < 1

is necessary. If ρ = 0, then we obtain the model with independent observations and α = 1.

(3) The variance of the estimator of the direct effect,

var (\hat{τ})

, decreases when the correlation coefficient ρ increases and it approaches 0, when ρ approaches 1, since α = 1 − ρ.

(4) For two periods with dependent observations, the 2 × 2 variance matrix of the observations in the compound symmetry model is

Σ_{2} = (1 - ρ) I_{2} + ρ J_{2}

. The optimal design for this model is the same as the optimal design for independent observations for every ρ, −1 < ρ < 1.

For an even n, such an optimal design is obtained when to the sequences AA and AB correspond to n/2 e.u, while for an odd n, the optimal design is obtained when to the sequences AA and AB correspond to (n − 1)/2 and (n + 1)/2 e.u., respectively [11]. The reverse sequences BB, BA also correspond to an optimal design with:

var (τ) = σ^{2} (1 - ρ) {(Q^{*})}^{- 1}

(5) As illustrated, the examined model with dependent observations is also associated with variance matrices Σ for which the optimal designs are the same as the ones of the model with independent observations [14,18].

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

References

Carriere, K.C.; Reinsel, G.C. Optimal two period repeated measurement designs with two or more treatments. Biometrika 1993, 80, 924–929. [Google Scholar] [CrossRef]
Chalikias, M.; Kounias, S. Extension and necessity of Cheng and Wu conditions. J. Stat. Plan. Infer. 2012, 142, 1794–1800. [Google Scholar] [CrossRef]
Cheng, C.S.; Wu, C.F. Balanced repeated measurements designs. Ann. Stat. 1980, 11, 29–50. [Google Scholar] [CrossRef]
Hedayat, A.S.; Yang, M. Universal Optimality of Selected Crossover Designs. J. Am. Stat. Assoc. 2004, 99, 461–466. [Google Scholar] [CrossRef]
Hedayat, A.S.; Zheng, W. Optimal and efficient crossover designs for test-control study when subject effects are random. J. Am. Stat. Assoc. 2010, 105, 1581–1592. [Google Scholar] [CrossRef]
Stufken, J. Some families of optimal and efficient repeated measurements designs. J. Stat. Plan. Infer. 1991, 27, 75–83. [Google Scholar] [CrossRef]
Hedayat, A.S.; Yan, Z. Crossover designs based on type I orthogonal arrays for a self and simple mixed carryover effects model with correlated errors. J. Stat. Plan. Infer. 2008, 138, 2201–2213. [Google Scholar] [CrossRef]
Kounias, S.; Chalikias, M.S. An algorithm applied to designs of repeated measurements. J. Appl. Stat. Sci. 2005, 14, 243–250. [Google Scholar]
Kushner, H.B. Allocation rules for adaptive repeated measurements designs. J. Stat. Plan. Infer. 2003, 113, 293–313. [Google Scholar] [CrossRef]
Laska, E.M.; Meisner, M. A variational approach to optimal two treatment crossover designs: Application to carryover effect models. J. Am. Stat. Assoc. 1985, 80, 704–710. [Google Scholar] [CrossRef]
Matthews, J.N.S. Optimal crossover designs for the comparison of two treatments in the presence of carryover effects and autocorrelated errors. Biometrika 1987, 74, 311–320. [Google Scholar] [CrossRef]
Hedayat, A.; Afsarinejad, K. Repeated measurements designs, I. Survey Stat. Des. Linear Models 1975, 229–242. Available online: http://ani.stat.fsu.edu/techreports/M261.pdf (accessed on 23 April 2019).
Hedayat, A.S.; Afsarinejad, K. Repeated measurements designs II. Ann. Stat. 1978, 18, 1805–1816. [Google Scholar] [CrossRef]
Kounias, S.; Chalikias, M. Optimal and Universally Optimal Two Treatment Repeated Measurement Designs; Vonta, F., Nikulin, M., Eds.; Statistics for industry and technology Birkhauser: Boston, MA, USA; Basel, Switzerland; Berlin, Germany, 2008; pp. 465–477. [Google Scholar]
Kounias, S.; Chalikias, M.S. Optimal two treatment repeated measurement designs with treatment-period interaction in the model. Util. Math. 2015, 96, 243–261. [Google Scholar]
Kounias, S.; Chalikias, M. Estimability of Parameters in a Linear Model. Stat. Probab. Lett. 2008, 28, 2437–2439. [Google Scholar] [CrossRef]
Liang, K.Y.; Zeger, S.L. Regression analysis for correlated data. Annu. Rev. Pub. Health 1993, 14, 43–68. [Google Scholar] [CrossRef] [PubMed]
Chalikias, M.; Kounias, S. Optimal two Treatment Repeated Measurement Designs for three Periods. Commun. Stat. Theory Methods 2017, 46, 200–209. [Google Scholar] [CrossRef]

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chalikias, M.S. Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry. Mathematics 2019, 7, 378. https://doi.org/10.3390/math7040378

AMA Style

Chalikias MS. Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry. Mathematics. 2019; 7(4):378. https://doi.org/10.3390/math7040378

Chicago/Turabian Style

Chalikias, Miltiadis S. 2019. "Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry" Mathematics 7, no. 4: 378. https://doi.org/10.3390/math7040378

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry

Abstract

1. Introduction

2. The Model

3. The Case of Compound Symmetry

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI