Generalized Measure of Departure from No Three-Factor Interaction Model for 2 x 2 x K Contingency Tables

Yamamoto, Kouji; Ban, Yohei; Tomizawa, Sadao

doi:10.3390/e10040776


by Kouji Yamamoto, Yohei Ban and Sadao Tomizawa *

Department of Information Sciences, Faculty of Science and Technology, Tokyo University of Science, Noda City, Chiba, 278-8510, Japan

\* Author to whom correspondence should be addressed.

Entropy 2008, 10(4), 776-785; https://doi.org/10.3390/e10040776

Submission received: 31 October 2008 / Accepted: 16 December 2008 / Published: 22 December 2008

(This article belongs to the Special Issue Information and Entropy)


Abstract: For 2 × 2 × K contingency tables, Tomizawa considered a Shannon entropy type measure to represent the degree of departure from a log-linear model of no three-factor interaction (the NOTFI model). This paper proposes a generalization of Tomizawa’s measure for 2 × 2 × K tables. The proposed measure is expressed using the Patil-Taillie diversity index or the Cressie-Read power-divergence, and it includes Tomizawa’s measure as a special case. The proposed measure would be useful for comparing the degrees of departure from the NOTFI model in several tables.

Keywords: diversity index; odds-ratio; power-divergence

1. Introduction

For the $I \times J \times K$ contingency table, let $p_{ijk}$ denote the probability that an observation will fall in the cell $(i, j, k)$ of the table $(i = 1, \dots, I;\ j = 1, \dots, J;\ k = 1, \dots, K)$. One can express $\log p_{ijk}$ as

$$\log p_{ijk} = u + u_{1(i)} + u_{2(j)} + u_{3(k)} + u_{12(ij)} + u_{13(ik)} + u_{23(jk)} + u_{123(ijk)}, \tag{1}$$

where

$$\begin{aligned} &\sum_{i} u_{s(i)} = 0 \quad (s = 1, 2, 3), \\ &\sum_{i} u_{st(ij)} = \sum_{j} u_{st(ij)} = 0 \quad (1 \leq s < t \leq 3), \\ &\sum_{i} u_{123(ijk)} = \sum_{j} u_{123(ijk)} = \sum_{k} u_{123(ijk)} = 0; \end{aligned}$$

see, e.g., Bishop, Fienberg and Holland [1, Chap. 2]. Let $l_{ijk} = \log p_{ijk}$. The u-terms in (1) are, for example,

$$\begin{aligned} u &= \frac{l_{\cdot\cdot\cdot}}{IJK} && \text{(overall mean)}, \\ u_{1(i)} &= \frac{l_{i\cdot\cdot}}{JK} - \frac{l_{\cdot\cdot\cdot}}{IJK} && \text{(main effect of variable 1)}, \\ u_{12(ij)} &= \frac{l_{ij\cdot}}{K} - \left(\frac{l_{i\cdot\cdot}}{JK} + \frac{l_{\cdot j\cdot}}{IK}\right) + \frac{l_{\cdot\cdot\cdot}}{IJK} && \text{(two-factor effect between variables 1 and 2)}, \end{aligned}$$

and

$$u_{123(ijk)} = l_{ijk} - \left(u + u_{1(i)} + u_{2(j)} + u_{3(k)} + u_{12(ij)} + u_{13(ik)} + u_{23(jk)}\right) \quad \text{(three-factor effect (interaction))},$$

where

$$l_{\cdot\cdot\cdot} = \sum_{i=1}^{I} \sum_{j=1}^{J} \sum_{k=1}^{K} l_{ijk}, \quad l_{i\cdot\cdot} = \sum_{j=1}^{J} \sum_{k=1}^{K} l_{ijk}, \quad l_{ij\cdot} = \sum_{k=1}^{K} l_{ijk}, \quad l_{\cdot j\cdot} = \sum_{i=1}^{I} \sum_{k=1}^{K} l_{ijk};$$

see, e.g., Bishop et al. [1, Chap. 2].

We obtain four well-known models by setting the parameters in (1) as

$$\begin{aligned} &\text{(i)} && u_{12(ij)} = u_{13(ik)} = u_{23(jk)} = u_{123(ijk)} = 0, \\ &\text{(ii)} && u_{13(ik)} = u_{23(jk)} = u_{123(ijk)} = 0, \\ &\text{(iii)} && u_{13(ik)} = u_{123(ijk)} = 0, \\ &\text{(iv)} && u_{123(ijk)} = 0, \end{aligned}$$

for all $i, j, k$. Model (1) under restriction (iv) is usually referred to as the no three-factor interaction (NOTFI) model (or the no second-order interaction model). Model (1) under restrictions (i), (ii), (iii) and (iv) can also be expressed as

$$\begin{aligned} H_{1} &: p_{ijk} = p_{i\cdot\cdot}\, p_{\cdot j\cdot}\, p_{\cdot\cdot k}, \\ H_{2} &: p_{ijk} = p_{ij\cdot}\, p_{\cdot\cdot k}, \\ H_{3} &: p_{ijk} = \frac{p_{ij\cdot}\, p_{\cdot jk}}{p_{\cdot j\cdot}}, \\ H_{4} &: θ_{ij(1)} = \dots = θ_{ij(K)}, \end{aligned}$$

respectively, where

$$\begin{aligned} &p_{i\cdot\cdot} = \sum_{j} \sum_{k} p_{ijk}, \quad p_{\cdot j\cdot} = \sum_{i} \sum_{k} p_{ijk}, \quad p_{\cdot\cdot k} = \sum_{i} \sum_{j} p_{ijk}, \\ &p_{ij\cdot} = \sum_{k} p_{ijk}, \quad p_{\cdot jk} = \sum_{i} p_{ijk}, \\ &θ_{ij(t)} = \frac{p_{ijt}\, p_{i+1,j+1,t}}{p_{i,j+1,t}\, p_{i+1,j,t}}; \end{aligned}$$

see, e.g., Fienberg [2, Chap. 3]. When none of the models $H_{1}$, $H_{2}$, $H_{3}$ and $H_{4}$ holds, namely, when model $H_{4}$ does not hold (each of $H_{1}$, $H_{2}$ and $H_{3}$ implies $H_{4}$, since each of the restrictions (i), (ii) and (iii) includes $u_{123(ijk)} = 0$), we are interested in the degree of departure from model $H_{4}$, i.e., the degree of non-uniformity of the odds-ratios $\{θ_{ij(t)}\}$.

For the $2 \times 2 \times K$ contingency table, Tomizawa [3] considered a measure which represents the degree of departure from the NOTFI model. The measure is expressed by using the Shannon entropy (see Appendix).

Patil and Taillie [4] considered a diversity index that includes the Shannon entropy as a special case. We are interested in a measure of departure from the NOTFI model based on this diversity index.

The purpose of this paper is to propose a generalization of Tomizawa’s measure for the $2 \times 2 \times K$ table. The proposed measure includes Tomizawa’s measure as a special case. The measure would be useful for comparing the degrees of departure from the NOTFI model in several tables.

2. A generalization of measure

Consider the $2 \times 2 \times K$ contingency table. The NOTFI model is expressed as

$$θ_{1} = θ_{2} = \dots = θ_{K},$$

where

$$θ_{t} = \frac{p_{11t}\, p_{22t}}{p_{12t}\, p_{21t}}.$$

That is, under the NOTFI model the K odds-ratios are identical. Let

$$D = \sum_{k=1}^{K} θ_{k}, \qquad θ_{t}^{*} = \frac{θ_{t}}{D},$$

for $t = 1, \dots, K$.

Assuming that the $\{p_{ijk}\}$ are positive, consider a measure to represent the degree of departure from the NOTFI model, defined by

$$φ^{(λ)} = 1 - \frac{H^{(λ)}(θ^{*})}{C^{(λ)}}, \quad \text{for } λ > -1, \tag{2}$$

where

$$\begin{aligned} H^{(λ)}(θ^{*}) &= \frac{1}{λ} \left(1 - \sum_{t=1}^{K} (θ_{t}^{*})^{λ+1}\right), \\ C^{(λ)} &= \frac{1}{λ} \left[1 - \left(\frac{1}{K}\right)^{λ}\right], \end{aligned}$$

λ is a real value chosen by the user, and the value at $λ = 0$ is taken to be the limit as $λ \to 0$. Thus $φ^{(0)}$ in equation (2) is equal to φ in the Appendix, i.e., it is the same as Tomizawa’s measure. Also, note that $H^{(λ)}(θ^{*})$ is Patil and Taillie’s diversity index of degree λ for $\{θ_{t}^{*}\}$, which includes the Shannon entropy as the special case $λ = 0$.
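As an illustration of definition (2), the following Python sketch computes the sample version of the measure from a table of counts. The function names `odds_ratios` and `phi` and the nested-list layout `table[k][i][j]` are assumptions made for this example and do not come from the paper; the λ = 0 branch uses the Shannon-entropy limit stated above.

```python
import math

def odds_ratios(table):
    """Odds ratio of each 2 x 2 layer; table[k][i][j] is a positive count or probability."""
    return [(t[0][0] * t[1][1]) / (t[0][1] * t[1][0]) for t in table]

def phi(table, lam):
    """Measure (2) for lam > -1, with lam = 0 taken as the limiting (Shannon) case."""
    if lam <= -1:
        raise ValueError("lam must be greater than -1")
    theta = odds_ratios(table)
    K = len(theta)
    D = sum(theta)
    star = [t / D for t in theta]              # normalized odds-ratios theta*_t
    if abs(lam) < 1e-12:
        H = -sum(s * math.log(s) for s in star)
        C = math.log(K)
    else:
        H = (1.0 - sum(s ** (lam + 1) for s in star)) / lam
        C = (1.0 - K ** (-lam)) / lam
    return 1.0 - H / C

# Counts of Table 2 (Section 4): one 2 x 2 layer (rows X, columns Y) per level of Z.
table2 = [[[50, 20], [10, 30]],
          [[10, 30], [20, 20]],
          [[20, 20], [30, 40]]]

for lam in (-0.4, 0, 0.6, 1.0, 1.6):
    print(lam, round(phi(table2, lam), 3))
```

The printed values can be compared with the estimated measures reported for Table 2 in Table 3b. Because the function depends on the table only through the odds-ratios, it accepts either counts or cell probabilities.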

The measure $φ^{(λ)}$ may be expressed as

$$φ^{(λ)} = \frac{λ + 1}{K^{λ} C^{(λ)}}\, I^{(λ)}\left(\{θ_{t}^{*}\}; \left\{\frac{1}{K}\right\}\right),$$

where

$$I^{(λ)}\left(\{θ_{t}^{*}\}; \left\{\frac{1}{K}\right\}\right) = \frac{1}{λ(λ + 1)} \sum_{t=1}^{K} θ_{t}^{*} \left[\left(\frac{θ_{t}^{*}}{1/K}\right)^{λ} - 1\right].$$

Note that $I^{(λ)}(\{θ_{t}^{*}\}; \{1/K\})$ is the power-divergence between $\{θ_{t}^{*}\}$ and $\{1/K\}$. For more details of the power-divergence $I^{(λ)}(\cdot\,; \cdot)$, see Cressie and Read [5], and Read and Cressie [6, p. 15].
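The equivalence of the diversity-index form and the power-divergence form can be checked numerically. The sketch below evaluates both expressions for a hypothetical vector of normalized odds-ratios (the values 0.5, 0.3, 0.2 and the function names are assumptions chosen only for illustration) and prints them side by side for several nonzero λ.

```python
def diversity_form(star, lam):
    """1 - H/C, with the Patil-Taillie index H and its maximum C (lam != 0)."""
    K = len(star)
    H = (1.0 - sum(s ** (lam + 1) for s in star)) / lam
    C = (1.0 - K ** (-lam)) / lam
    return 1.0 - H / C

def divergence_form(star, lam):
    """(lam + 1) / (K^lam C) times the power-divergence from the uniform {1/K} (lam != 0)."""
    K = len(star)
    C = (1.0 - K ** (-lam)) / lam
    I = sum(s * ((s * K) ** lam - 1.0) for s in star) / (lam * (lam + 1))
    return (lam + 1) / (K ** lam * C) * I

star = [0.5, 0.3, 0.2]   # hypothetical theta*_t, summing to 1
for lam in (-0.4, 0.6, 1.0, 1.6):
    print(lam, diversity_form(star, lam), divergence_form(star, lam))
```

Algebraically, both expressions reduce to $\bigl(\sum_{t} (θ_{t}^{*})^{λ+1} - K^{-λ}\bigr) / (λ C^{(λ)})$, which is why the two printed columns agree up to rounding error.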

The index $H^{(λ)}(θ^{*})$ must lie between 0 and $C^{(λ)}$, but it cannot attain the lower limit of 0 because the $\{p_{ijk}\}$ are assumed to be positive. Thus the measure $φ^{(λ)}$ must lie between 0 and 1, but it cannot attain the upper limit of 1. It is easily seen that the NOTFI model holds if and only if the measure $φ^{(λ)}$ is equal to zero. In terms of the diversity index or the power-divergence, $φ^{(λ)}$ represents the degree of departure from the NOTFI model, and the degree increases as the value of $φ^{(λ)}$ increases.

3. Approximate confidence interval for measure

Let $n_{ijk}$ denote the observed frequency in the cell $(i, j, k)$ of the $2 \times 2 \times K$ table ($i = 1, 2$; $j = 1, 2$; $k = 1, \dots, K$). Assuming that the $\{n_{ijk}\}$ result from full multinomial sampling, we shall consider an approximate standard error and a large-sample confidence interval for the measure $φ^{(λ)}$, using the delta method, descriptions of which are given by, for example, Bishop et al. [1, Sec. 14.6]. The sample version of the measure $φ^{(λ)}$, i.e., $\hat{φ}^{(λ)}$, is given by $φ^{(λ)}$ with $\{p_{ijk}\}$ replaced by $\{\hat{p}_{ijk}\}$, where $\hat{p}_{ijk} = n_{ijk} / n$ and $n = \sum \sum \sum n_{ijk}$. Using the delta method, $\sqrt{n}\,(\hat{φ}^{(λ)} - φ^{(λ)})$ has asymptotically (as $n \to \infty$) a normal distribution with mean zero and variance

$$σ^{2}[φ^{(λ)}] = \left(\frac{λ + 1}{λ C^{(λ)} D^{λ+2}}\right)^{2} \sum_{t=1}^{K} θ_{t}^{2} \left(D θ_{t}^{λ} - \sum_{k=1}^{K} θ_{k}^{λ+1}\right)^{2} \left(\frac{1}{p_{11t}} + \frac{1}{p_{12t}} + \frac{1}{p_{21t}} + \frac{1}{p_{22t}}\right).$$

Let $\hat{σ}^{2}[φ^{(λ)}]$ denote $σ^{2}[φ^{(λ)}]$ with $\{p_{ijk}\}$ replaced by $\{\hat{p}_{ijk}\}$. Then $\hat{σ}[φ^{(λ)}] / \sqrt{n}$ is an estimated approximate standard error for $\hat{φ}^{(λ)}$, and $\hat{φ}^{(λ)} \pm z_{p/2}\, \hat{σ}[φ^{(λ)}] / \sqrt{n}$ is an approximate $100(1 - p)$ percent confidence interval for $φ^{(λ)}$, where $z_{p/2}$ is the percentage point from the standard normal distribution corresponding to a two-tail probability equal to p.
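The standard error and confidence interval described above can be sketched in Python as follows. The function names and the data layout `table[k][i][j]` are assumptions of this example. The paper states the variance for general λ only; the λ = 0 branch below uses the limits $(D \log θ_{t} - \sum_{k} θ_{k} \log θ_{k}) / (D^{2} \log K)$ of the derivative terms, which were worked out for this sketch and should be read as an assumption rather than as the authors' formula.

```python
import math

def phi_and_se(table, lam):
    """Return (phi_hat, estimated standard error) for counts table[k][i][j]."""
    n = sum(c for layer in table for row in layer for c in row)
    theta = [(t[0][0] * t[1][1]) / (t[0][1] * t[1][0]) for t in table]
    K, D = len(theta), sum(theta)
    star = [t / D for t in theta]
    if abs(lam) < 1e-12:
        # Limiting forms as lam -> 0 (derived for this sketch, not quoted from the paper).
        phi = 1.0 + sum(s * math.log(s) for s in star) / math.log(K)
        s1 = sum(t * math.log(t) for t in theta)
        grad = [(D * math.log(t) - s1) / (math.log(K) * D ** 2) for t in theta]
    else:
        C = (1.0 - K ** (-lam)) / lam
        phi = 1.0 - (1.0 - sum(s ** (lam + 1) for s in star)) / (lam * C)
        s1 = sum(t ** (lam + 1) for t in theta)
        coef = (lam + 1) / (lam * C * D ** (lam + 2))
        grad = [coef * (D * t ** lam - s1) for t in theta]
    # Sum of 1 / p_hat over the four cells of each layer, with p_hat = count / n.
    inv_p = [sum(n / c for row in layer for c in row) for layer in table]
    var = sum((t * g) ** 2 * w for t, g, w in zip(theta, grad, inv_p))
    return phi, math.sqrt(var / n)

def confidence_interval(table, lam, z=1.959964):
    """Approximate interval phi_hat +/- z * SE; the default z gives 95% coverage."""
    phi, se = phi_and_se(table, lam)
    return phi - z * se, phi + z * se

# Counts of Table 2 (Section 4), one 2 x 2 layer per level of Z.
table2 = [[[50, 20], [10, 30]],
          [[10, 30], [20, 20]],
          [[20, 20], [30, 40]]]
print(phi_and_se(table2, 0))
print(confidence_interval(table2, 0))
```

The output for Table 2 can be compared with the standard errors and intervals listed in Table 3b. As the paper notes for Table 3a, such an interval is not constrained to lie inside [0, 1].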

4. Examples

Table 1, taken from Agresti [7, p. 68], refers to the effect of passive smoking on lung cancer. It summarizes results of case-control studies from three countries among nonsmoking women married to smokers. For these data, the estimated odds-ratios between passive smoking and lung cancer in Japan, Great Britain, and the United States are 0.66, 0.63, and 0.76, respectively.

Let X, Y and Z denote the first, second and third variables, respectively. For Table 2, which contains artificial $2 \times 2 \times 3$ data, the estimated odds-ratios between variables X and Y at each level of Z are 7.50, 0.33, and 1.33.

**Table 1.** The results of case-control studies from three countries among nonsmoking women married to smokers; from Agresti [7, p. 68].
Country	Spouse Smoked	Cases	Controls
Japan	No	21	82
Japan	Yes	73	188
Great Britain	No	5	16
Great Britain	Yes	19	38
United States	No	71	249
United States	Yes	137	363

**Table 2.** Artificial data (n is sample size).
n = 300
		Y
Z	X	(1)	(2)
(1)	(1)	50	20
(1)	(2)	10	30
(2)	(1)	10	30
(2)	(2)	20	20
(3)	(1)	20	20
(3)	(2)	30	40

Because the confidence intervals for $φ^{(λ)}$ applied to the data in Table 1 include zero for every λ (see Table 3a), this would indicate that the NOTFI model holds for Table 1 or, if it does not, that the degree of departure from the NOTFI model is slight. In contrast, since the confidence intervals for $φ^{(λ)}$ applied to the data in Table 2 exclude zero for every λ (see Table 3b), this would indicate that the NOTFI model does not hold for Table 2.

When the degrees of departure from the NOTFI model in Table 1 and Table 2 are compared using the confidence intervals for $φ^{(λ)}$, the degree of departure in Table 2 would be greater than that in Table 1. This is because, for any given λ $(> -1)$, the values in the confidence interval for $φ^{(λ)}$ applied to the data in Table 2 are greater than the values in the corresponding confidence interval for $φ^{(λ)}$ applied to the data in Table 1. We note that in Table 3a the confidence intervals for $φ^{(λ)}$ include negative values; this is natural because $\hat{φ}^{(λ)}$ has asymptotically a normal distribution.

Note: Let $W^{(λ)}$ denote the power-divergence statistic for testing goodness-of-fit of the NOTFI model with $K - 1$ degrees of freedom, i.e.,

$$W^{(λ)} = \frac{2}{λ(λ + 1)} \sum_{i=1}^{2} \sum_{j=1}^{2} \sum_{k=1}^{K} n_{ijk} \left[\left(\frac{n_{ijk}}{\hat{m}_{ijk}}\right)^{λ} - 1\right], \quad \text{for } -\infty < λ < \infty,$$

where $\hat{m}_{ijk}$ is the maximum likelihood estimate of the expected frequency $m_{ijk}$ under the NOTFI model, and the values at $λ = -1$ and $λ = 0$ are taken to be the limits as $λ \to -1$ and as $λ \to 0$, respectively. For details of the power-divergence test statistic, see Cressie and Read [5], and Read and Cressie [6, p. 15]. In particular, note that $W^{(0)}$ and $W^{(1)}$ are the likelihood ratio and Pearson chi-squared statistics, respectively. Table 4 gives the values of $W^{(λ)}$ applied to the data in Table 1 and Table 2. Compared with the chi-squared distribution with 2 degrees of freedom (upper 5% point 5.99), these values show that the NOTFI model fits the data in Table 1 well, but it does not fit the data in Table 2 well.
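The statistic $W^{(λ)}$ requires fitted frequencies under the NOTFI model, which have no closed form. The sketch below obtains them by iterative proportional fitting of the three two-way margins, a standard way to compute this maximum likelihood fit; the paper does not say which algorithm the authors used, so the fitting routine, the function names, and the convergence settings are all assumptions of this example.

```python
import math

def fit_notfi(n, max_iter=1000, tol=1e-10):
    """Fitted counts m[k][i][j] under NOTFI via iterative proportional fitting."""
    K = len(n)
    m = [[[1.0, 1.0], [1.0, 1.0]] for _ in range(K)]
    for _ in range(max_iter):
        old = [[row[:] for row in layer] for layer in m]
        for i in range(2):                      # match the X-Y margin
            for j in range(2):
                r = sum(n[k][i][j] for k in range(K)) / sum(m[k][i][j] for k in range(K))
                for k in range(K):
                    m[k][i][j] *= r
        for k in range(K):                      # match the X-Z margin
            for i in range(2):
                r = sum(n[k][i]) / sum(m[k][i])
                for j in range(2):
                    m[k][i][j] *= r
        for k in range(K):                      # match the Y-Z margin
            for j in range(2):
                r = (n[k][0][j] + n[k][1][j]) / (m[k][0][j] + m[k][1][j])
                for i in range(2):
                    m[k][i][j] *= r
        diff = max(abs(m[k][i][j] - old[k][i][j])
                   for k in range(K) for i in range(2) for j in range(2))
        if diff < tol:
            break
    return m

def power_divergence(n, lam):
    """W^(lam) for counts n[k][i][j], with the limits taken at lam = 0 and lam = -1."""
    m = fit_notfi(n)
    pairs = [(n[k][i][j], m[k][i][j])
             for k in range(len(n)) for i in range(2) for j in range(2)]
    if abs(lam) < 1e-12:
        return 2.0 * sum(o * math.log(o / e) for o, e in pairs)
    if abs(lam + 1) < 1e-12:
        return 2.0 * sum(e * math.log(e / o) for o, e in pairs)
    return 2.0 / (lam * (lam + 1)) * sum(o * ((o / e) ** lam - 1.0) for o, e in pairs)

# Counts of Table 2, and the same table with every count multiplied by 5.
table2 = [[[50, 20], [10, 30]],
          [[10, 30], [20, 20]],
          [[20, 20], [30, 40]]]
scaled = [[[5 * c for c in row] for row in layer] for layer in table2]
for lam in (0, 1.0):
    print(lam, power_divergence(table2, lam), power_divergence(scaled, lam))
```

The values for `table2` can be compared with the Table 2 column of Table 4. The second printed column illustrates that multiplying every count by a constant multiplies $W^{(λ)}$ by the same constant, so the statistic reflects sample size as well as departure from the model.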

**Table 3.** Estimates of $φ^{(λ)}$, estimated approximate standard errors for $\hat{φ}^{(λ)}$, and approximate 95% confidence intervals for $φ^{(λ)}$, applied to Table 1 and Table 2.

(a) For Table 1
Values of λ	Estimated measure	Standard error	Confidence interval
-0.4	0.002	0.012	(-0.021, 0.025)
0	0.003	0.016	(-0.028, 0.034)
0.6	0.003	0.018	(-0.031, 0.038)
1.0	0.003	0.017	(-0.031, 0.037)
1.6	0.003	0.015	(-0.027, 0.032)

(b) For Table 2
Values of λ	Estimated measure	Standard error	Confidence interval
-0.4	0.388	0.124	(0.145, 0.630)
0	0.486	0.149	(0.194, 0.777)
0.6	0.536	0.166	(0.211, 0.861)
1.0	0.538	0.172	(0.200, 0.876)
1.6	0.517	0.180	(0.165, 0.869)

**Table 4.** Values of power-divergence statistic $W^{(λ)}$ (with 2 degrees of freedom) for testing goodness-of-fit of the NOTFI model, applied to Table 1 and Table 2.
Values of λ	For Table 1	For Table 2
-0.4	0.240	24.889
0	0.240	24.462
0.6	0.239	24.056
1.0	0.238	23.933
1.6	0.237	23.957

5. Remark

Consider the case of $K = 2$, i.e., the $2 \times 2 \times 2$ contingency table. Then the measure $φ^{(λ)}$ can be simply expressed as

$$φ^{(λ)} = \begin{cases} 1 - \dfrac{1}{λ C^{(λ)}} \left(1 - \dfrac{r^{λ+1} + 1}{(1 + r)^{λ+1}}\right), & \text{for } λ > -1,\ λ \neq 0, \\[2ex] 1 - \dfrac{1}{(\log 2)(1 + r)} \bigl((1 + r) \log (1 + r) - r \log r\bigr), & \text{for } λ = 0, \end{cases}$$

where

$$r = \frac{θ_{1}}{θ_{2}} = \frac{p_{111}\, p_{221}\, p_{122}\, p_{212}}{p_{121}\, p_{211}\, p_{112}\, p_{222}}.$$

In addition, the approximate variance of $\sqrt{n}\,(\hat{φ}^{(λ)} - φ^{(λ)})$, which was given in Section 3, can be simply expressed as

$$σ^{2}[φ^{(λ)}] = \left(\frac{λ + 1}{λ C^{(λ)}}\right)^{2} \left(\frac{r^{λ+1} - r}{(1 + r)^{λ+2}}\right)^{2} \sum_{i=1}^{2} \sum_{j=1}^{2} \sum_{k=1}^{2} \frac{1}{p_{ijk}}.$$

Note that $σ^{2}[φ^{(λ)}] = 0$ when $r = 1$. Now, three kinds of expressions of r are obtained as

$$\begin{aligned} r &= \left(\frac{p_{111}\, p_{221}}{p_{121}\, p_{211}}\right) \Big/ \left(\frac{p_{112}\, p_{222}}{p_{122}\, p_{212}}\right) \\ &= \left(\frac{p_{111}\, p_{212}}{p_{112}\, p_{211}}\right) \Big/ \left(\frac{p_{121}\, p_{222}}{p_{122}\, p_{221}}\right) \\ &= \left(\frac{p_{111}\, p_{122}}{p_{112}\, p_{121}}\right) \Big/ \left(\frac{p_{211}\, p_{222}}{p_{212}\, p_{221}}\right). \end{aligned}$$

Therefore, the measure $φ^{(λ)}$, which represents the degree of departure from the equality of the odds-ratios between variables X and Y at each level of variable Z, also represents the degree of departure from the equality of the odds-ratios between X and Z at each level of Y, and further between Y and Z at each level of X.
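The symmetry noted in this remark can be illustrated numerically. The sketch below uses a hypothetical set of cell probabilities (chosen only for illustration, with zero-based indices `p[i][j][k]`), computes r from the X-Y, X-Z and Y-Z slicings of the table, and evaluates the closed form for K = 2. The helper names are assumptions of this example.

```python
import math

def phi_k2(r, lam):
    """Closed form of the measure for a 2 x 2 x 2 table, as a function of r = theta_1 / theta_2."""
    if abs(lam) < 1e-12:
        return 1.0 - ((1 + r) * math.log(1 + r) - r * math.log(r)) / (math.log(2) * (1 + r))
    lam_c = 1.0 - 2.0 ** (-lam)                # equals lam * C^(lam) when K = 2
    return 1.0 - (1.0 - (r ** (lam + 1) + 1) / (1 + r) ** (lam + 1)) / lam_c

def odds_ratio(t):
    return (t[0][0] * t[1][1]) / (t[0][1] * t[1][0])

# Hypothetical cell probabilities p[i][j][k] (they sum to 1).
p = [[[0.10, 0.05], [0.15, 0.20]],
     [[0.05, 0.20], [0.10, 0.15]]]

def layer(fixed_axis, level):
    """2 x 2 slice of p with one variable held at the given level."""
    idx = (0, 1)
    if fixed_axis == 2:                        # fix Z: rows X, columns Y
        return [[p[i][j][level] for j in idx] for i in idx]
    if fixed_axis == 1:                        # fix Y: rows X, columns Z
        return [[p[i][level][k] for k in idx] for i in idx]
    return [[p[level][j][k] for k in idx] for j in idx]   # fix X: rows Y, columns Z

rs = [odds_ratio(layer(axis, 0)) / odds_ratio(layer(axis, 1)) for axis in (2, 1, 0)]
print(rs)
print([phi_k2(r, 0.6) for r in rs])
```

All three slicings give the same r, and hence the same value of the measure, in line with the three expressions for r above.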

6. Concluding Remarks

The measure $\hat{φ}^{(λ)}$ would be useful for comparing the degrees of departure from the NOTFI model in several tables.

**Table 5.** (a), (b) Artificial data (n is sample size).

(a) n = 315
		Y
Z	X	(1)	(2)
(1)	(1)	25	20
(1)	(2)	25	40
(2)	(1)	45	15
(2)	(2)	30	30
(3)	(1)	30	20
(3)	(2)	20	15

(b) n = 1575
		Y
Z	X	(1)	(2)
(1)	(1)	125	100
(1)	(2)	125	200
(2)	(1)	225	75
(2)	(2)	150	150
(3)	(1)	150	100
(3)	(2)	100	75

**Table 6.** Values of ${\hat{φ}}^{(λ)}$ applied to Table 5a and Table 5b.
Values of λ	For Table 5a	For Table 5b
-0.4	0.050	0.050
0	0.066	0.066
0.6	0.073	0.073
1.0	0.070	0.070
1.6	0.061	0.061

**Table 7.** Values of power-divergence statistic $W^{(λ)}$ (with 2 degrees of freedom) for testing goodness-of-fit of the NOTFI model, applied to Table 5a and Table 5b.
Values of λ	For Table 5a	For Table 5b
-0.4	2.734	13.669
0	2.730	13.648
0.6	2.726	13.630
1.0	2.726	13.628
1.6	2.727	13.637

Consider the artificial data in Table 5a and Table 5b. For Table 5a, the estimated odds-ratios between variables X and Y at each level of Z are 2.00, 3.00, and 1.13. Each observed frequency in Table 5b equals the corresponding frequency in Table 5a multiplied by 5. Thus the estimated odds-ratios between variables X and Y at each level of Z for Table 5b are naturally equal to those for Table 5a, and the value of $\hat{φ}^{(λ)}$ (for every λ) for Table 5a is identical with that for Table 5b (see Table 6). However, the value of $W^{(λ)}$ is greater for Table 5b than for Table 5a (see Table 7). Therefore the measure $\hat{φ}^{(λ)}$, rather than the test statistic $W^{(λ)}$, would be useful for comparing the degrees of departure from the NOTFI model in several tables.

The statistic $W^{(λ)}$ is also an information measure on the cell probability scale, and moreover $W^{(λ)} / n$ seems to be a reasonable measure of departure from the NOTFI model (though it is not a function of the odds-ratios $\{θ_{i}\}$, $i = 1, \dots, K$). However, $\hat{φ}^{(λ)}$ rather than $W^{(λ)} / n$ would be useful for comparing the degrees of departure from the NOTFI model in several tables. This is because $\hat{φ}^{(λ)}$ is always in the range between 0 and 1, but $W^{(λ)} / n$ is not; namely, $\hat{φ}^{(λ)}$ can measure the degree of departure relative to the maximum departure from uniformity of the odds-ratios $\{θ_{i}\}$, $i = 1, \dots, K$, but $W^{(λ)} / n$ cannot.

**Table 8.** (a), (b) Artificial data (n is sample size) and (c) corresponding values of $\hat{φ}^{(λ)}$ applied to Table 8a and Table 8b.

(a) n = 291
		Y
Z	X	(1)	(2)
(1)	(1)	27	9
(1)	(2)	10	16
(2)	(1)	14	35
(2)	(2)	31	45
(3)	(1)	28	18
(3)	(2)	13	45

(b) n = 291
		Y
Z	X	(1)	(2)
(1)	(1)	22	23
(1)	(2)	30	16
(2)	(1)	20	18
(2)	(2)	22	43
(3)	(1)	11	21
(3)	(2)	26	39

(c) Values of ${\hat{φ}}^{(λ)}$
Values of λ	For Table 8a	For Table 8b
-0.4	0.186	0.126
0	0.213	0.170
0.6	0.200	0.197
1.0	0.178*	0.198
1.6	0.140*	0.183

\* indicates that $\hat{φ}^{(λ)}$ is less for Table 8a than for Table 8b.

Readers may be interested in which value of λ is preferred for a given table. However, in comparing tables, it seems difficult to discuss this. For example, consider the artificial data in Table 8a and Table 8b. We see from Table 8c that the value of $\hat{φ}^{(0)}$ is greater for Table 8a than for Table 8b, but the value of $\hat{φ}^{(1)}$ is less for Table 8a than for Table 8b. So, in such cases, it may be impossible to decide (by using $\hat{φ}^{(λ)}$) whether the degree of departure from the NOTFI model is greater for Table 8a or for Table 8b. In general, however, for the comparison between two tables, it would be possible to draw a conclusion if $\hat{φ}^{(λ)}$ is greater for one table than for the other for every λ (or less for every λ). Thus, whichever value of λ might be preferred for a given table, it seems important that the analyst calculate the value of $\hat{φ}^{(λ)}$ for various values of λ and discuss the degree of departure from the NOTFI model in terms of these values. It may seem to readers that, when the odds-ratios of Table 8a vary more widely (relatively, in ratio) than those of Table 8b, the $\hat{φ}^{(λ)}$ values in Table 8c vary with a pattern; namely, they are larger for Table 8a for smaller values of λ, but the other way round when λ is greater than a certain value less than 1. However, we cannot prove that this holds in general. It may be dangerous to compare the degrees of departure from the NOTFI model in several tables in terms of only Tomizawa’s [3] measure, i.e., $\hat{φ}^{(0)}$, because it may happen that for two tables (say, table A and table B) $\hat{φ}^{(0)}$ is greater for table A than for table B, whereas $\hat{φ}^{(λ_{1})}$ with some $λ_{1}$ $(\neq 0)$ is less for table A than for table B.
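The advice to evaluate the measure over several values of λ can be sketched as follows. The code computes $\hat{φ}^{(λ)}$ for Tables 8a and 8b on a grid of λ values and reports whether one table dominates the other over the whole grid. The grid from -0.9 to 2.0 and the function name `phi_hat` are assumptions of this example; the paper itself reports only the five values of λ shown in Table 8c.

```python
import math

def phi_hat(table, lam):
    """Sample measure for counts table[k][i][j]; lam = 0 is the Shannon-entropy limit."""
    theta = [(t[0][0] * t[1][1]) / (t[0][1] * t[1][0]) for t in table]
    K, D = len(theta), sum(theta)
    star = [t / D for t in theta]
    if abs(lam) < 1e-12:
        return 1.0 + sum(s * math.log(s) for s in star) / math.log(K)
    H = (1.0 - sum(s ** (lam + 1) for s in star)) / lam
    C = (1.0 - K ** (-lam)) / lam
    return 1.0 - H / C

# Counts of Table 8a and Table 8b, one 2 x 2 layer (rows X, columns Y) per level of Z.
table_8a = [[[27, 9], [10, 16]], [[14, 35], [31, 45]], [[28, 18], [13, 45]]]
table_8b = [[[22, 23], [30, 16]], [[20, 18], [22, 43]], [[11, 21], [26, 39]]]

lams = [round(-0.9 + 0.1 * s, 1) for s in range(30)]     # grid -0.9, -0.8, ..., 2.0
profile = [(lam, phi_hat(table_8a, lam), phi_hat(table_8b, lam)) for lam in lams]

# A clear ranking requires the same ordering at every lambda on the grid.
consistent = (all(a > b for _, a, b in profile)
              or all(a < b for _, a, b in profile))
print("same ordering for every lambda on the grid:", consistent)
for lam, a, b in profile:
    print(lam, round(a, 3), round(b, 3))
```

For these two tables the ordering changes along the grid, which is the situation described above in which no overall ranking of the two tables can be drawn from $\hat{φ}^{(λ)}$.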

The measure $\hat{φ}^{(λ)}$ would be useful when one wants to measure directly how far the odds-ratios $\{θ_{t}\}$ are from uniformity, whereas $W^{(λ)} / n$ may be useful when one wants to measure how far the estimated cell probability distribution with the structure of NOTFI is from the sample cell probability distribution.

Readers may be interested in extending the measure $φ^{(λ)}$ to a $2 \times 3 \times K$ table or an $I \times J \times K$ table; however, it may be difficult to construct a single-valued measure to represent the degree of departure from no three-factor interaction in such tables.

Appendix

For the $2 \times 2 \times K$ contingency table, a measure of departure from the NOTFI model by Tomizawa [3] is given as follows:

$$φ = 1 - \frac{H(θ^{*})}{\log K},$$

where

$$H(θ^{*}) = - \sum_{t=1}^{K} θ_{t}^{*} \log θ_{t}^{*}$$

and $\{θ_{t}^{*}\}$ are defined in Section 2.

Acknowledgements

The authors would like to thank the two referees for their helpful comments.

References and Notes

1. Bishop, Y.M.M.; Fienberg, S.E.; Holland, P.W. Discrete Multivariate Analysis: Theory and Practice; The MIT Press: Cambridge, Massachusetts, 1975.
2. Fienberg, S.E. The Analysis of Cross-Classified Categorical Data, 2nd ed.; The MIT Press: Cambridge, Massachusetts, 1980.
3. Tomizawa, S. A measure of departure from no three-factor interaction model in a 2 × 2 × k contingency table. J. Stat. Res. 1993, 27, 1–8.
4. Patil, G.P.; Taillie, C. Diversity as a concept and its measurement. J. Am. Stat. Assoc. 1982, 77, 548–561.
5. Cressie, N.A.C.; Read, T.R.C. Multinomial goodness-of-fit tests. J. Royal Stat. Soc., Series B 1984, 46, 440–464.
6. Read, T.R.C.; Cressie, N.A.C. Goodness-of-Fit Statistics for Discrete Multivariate Data; Springer: New York, NY, 1988.
7. Agresti, A. An Introduction to Categorical Data Analysis; Wiley: New York, NY, 1996.

© 2008 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
