Testing for Stochastic Dominance up to a Common Relative Poverty Line

Mehdi, Tahsin

doi:10.3390/econometrics8010005

Open AccessArticle

Testing for Stochastic Dominance up to a Common Relative Poverty Line

by

Tahsin Mehdi

Department of Economics, Ryerson University, Toronto, ON M5B 2K3, Canada

Econometrics 2020, 8(1), 5; https://doi.org/10.3390/econometrics8010005

Submission received: 18 October 2019 / Revised: 7 January 2020 / Accepted: 5 February 2020 / Published: 11 February 2020

Download Versions Notes

Abstract

:

Although a wide array of stochastic dominance tests exist for poverty measurement and identification, they assume the income distributions have independent poverty lines or a common absolute (fixed) poverty line. We propose a stochastic dominance test for comparing income distributions up to a common relative poverty line (i.e., some fraction of the pooled median). A Monte Carlo study demonstrates its superior performance over existing methods in terms of power. The test is then applied to some Canadian household survey data for illustration.

Keywords:

stochastic dominance; common poverty line; relative poverty line; pooled quantile; bootstrap inference

JEL Classification:

C01; C12; I32

1. Introduction

The seminal works of Sen (1976), Foster et al. (1984), Atkinson (1987) have propagated a growing body of literature surrounding poverty measurement. While earlier works emphasize the identification of poverty, later works highlight the need for accurate and reliable statistical inference (e.g., Kakwani 1993; Davidson and Duclos 2000; Zheng 2001; Thompson 2012; Mehdi 2017). From a research as well as a policy standpoint, there is always interest in comparing poverty outcomes between different income distributions. However, as pointed out by Garcia-Gomez et al. (2019), analysts often face the multiplicity of poverty indices problem. Since poverty measures depend on a poverty line, which is an income threshold dividing the poor and non-poor, distributional orderings are sensitive to the choice of poverty lines (i.e., rank reversals could occur when switching poverty lines).

To overcome the multiplicity of poverty indices problem, researchers appeal to stochastic dominance which is a robust distributional ranking method with wide-ranging applications in multiple fields. Rather than relying on estimates from singular points in distributions, stochastic dominance examines the entire support by considering the cumulative distribution function (CDF). In the context of poverty comparisons, stochastic dominance permits partial orderings of income distributions by considering all permissible income thresholds within a pre-specified range of poverty lines. If a dominance relation can be established between distributions, debates surrounding the choice of poverty lines can effectively be avoided since one distribution would always exhibit lower poverty regardless of the poverty line (see Atkinson 1987).

Several statistical tests of stochastic dominance have been put forth in the literature (e.g., McFadden 1989; Kaur et al. 1994; Anderson 1996; Davidson and Duclos 2000; Barrett and Donald 2003; Barrett et al. 2016; Thompson and Stengos 2012), but the ones geared towards poverty measurement assume either separate poverty lines for each distribution or a common absolute poverty line between distributions. As Thompson (2012) points out, if there is a situation where we may be interested in comparing poverty outcomes between subgroups of the same population (e.g., males and females), setting two separate poverty lines may lead to rather incongruous findings.1 With increasing usage of relative poverty measures by international organizations (e.g., OECD 2016), we develop the asymptotic framework for testing for stochastic dominance up to a common relative poverty line (i.e., some fraction of the pooled median income level of two distributions).

Barrett and Donald (2003) propose a test for stochastic dominance based on the one-sided Kolmogorov–Smirnov type test statistic which considers the supremum of the distances between the CDFs. Our test, on the other hand, is similar to that of Davidson and Duclos (2000) in the sense that it relies on evaluating a finite number of distances between the distributions throughout the support. Our proposed test, much like those of Anderson (1996), Davidson and Duclos (2000), and Thompson and Stengos (2012), suffers from the issue of inconsistency due to the fact that such tests rely on examining the CDFs at a finite number of points as opposed to those put forth by (Barrett and Donald (2003), McFadden (1989), or Kaur et al. (1994)) which consider either the supremum or infimum of the distances (see, e.g., Davidson and Duclos 2000; Thompson and Stengos 2012). However, the advantage offered by the former type of tests is that they make use of the covariances between estimates at the different points of the CDFs, which in theory leads to increased statistical power.

The remainder of this article is organized as follows. Section 2 briefly discusses the notion of stochastic dominance and its relation to poverty measures. Section 3 derives the asymptotic framework of our proposed test for stochastic dominance when the poverty line is some fraction of the pooled median income level. Section 4 presents a Monte Carlo study to assess the size and power of the test. Section 5 illustrates the proposed test using Canadian household survey data. Section 6 provides the conclusions.

2. Stochastic Dominance and Poverty Measurement

Consider two income distributions (or some other measure of individual welfare), characterized by CDFs,

F_{A}

and

F_{B}

, with support contained in the non-negative real number line. Following similar notation as Davidson and Duclos (2000), let

D_{A}^{s} (x) = \frac{1}{(s - 1)!} \int {(x - y)}_{+}^{s - 1} d F_{A} (y),

(1)

where

s \geq 1

, and

{(x - y)}_{+}^{s - 1} = {(x - y)}^{s - 1} I (y \leq x)

, where

I (\cdot)

is an indicator function that equals 1 if its argument is true, and 0 otherwise. It is straightforward to check that

D_{A}^{1} (x) = F_{A} (x)

. If a poverty line z is established, an individual with income y is said to be poor if

y \leq z

(this follows from the so-called “focus axiom”; see, e.g., Foster 1984). Thus,

F_{A} (z)

measures the proportion of individuals in subgroup A below the poverty line (also known as the headcount ratio). Let

D_{B}^{s} (x)

be defined analogously. The

D_{A}^{1}

curve is typically referred to as the poverty incidence curve,

D_{A}^{2}

is the poverty deficit curve, and

D_{A}^{3}

is the poverty severity curve (see, e.g., Ravallion 1994). Distribution A is said to stochastically dominate B at order s up to poverty line z if

D_{A}^{s} (x) \leq D_{B}^{s} (x) \forall x \leq z

. First-order stochastic dominance (i.e.,

s = 1

) guarantees dominance at higher orders (i.e., if

D_{A}^{1} (x) \leq D_{B}^{1} (x) \forall x \leq z

, then

D_{A}^{s} (x) \leq D_{B}^{s} (x) \forall x \leq z, s > 1

; see, e.g., Lemma 1, Davidson and Duclos 2000).

The notion of stochastic dominance has broader implications for popular classes of poverty indices such as those proposed by Foster et al. (1984):

P_{γ} = \int {[(z - y) / z]}^{γ} I (y \leq z) d F (y)

, where

γ \geq 0

is a poverty “aversion” parameter. Thus, the class of indices is based on the normalized poverty gap,

(z - y) / z

, or income shortfall as a share of the poverty line, of the poor. It is easy to see that, when

γ = 0

, the index simply becomes the headcount ratio (i.e.,

P_{0} = F (z)

), which measures the proportion of the population below the poverty line. Thus, if indeed

D_{A}^{1} (x) \leq D_{B}^{1} (x) \forall x \leq z

, then the implication is that

P_{γ}

will not only show lower poverty incidence for distribution A for all poverty lines up to z, but lower poverty deficit, lower poverty severity, etc.

Consider a population of size

N = N_{A} + N_{B}

composed of

N_{A}

individuals from subgroup A, and

N_{B}

individuals from subgroup B. Let

F (x) = w_{A} F_{A} (x) + w_{B} F_{B} (x)

be the pooled income distribution where

w_{A} = N_{A} / N

and

w_{B} = N_{B} / N

. Assume a pooled quantile-based poverty line set at

z = c ξ_{q}

for some fraction

c \in [0, 1]

, where

ξ_{q}

is a quantile of order q (i.e.,

F (ξ_{q}) = q

).

Checking for restricted stochastic dominance is tantamount to examining the differences between the distributions at all points leading up to z. We follow Davidson and Duclos (2000) and construct test statistics at equidistant grid points that lie below z. Consider J grid points

{α_{i} ξ_{q}}_{i = 1}^{J}

where

α_{i} \in [0, c]

.2 Let the vector of differences between the two distributions at the J grid points be given by

Δ^{s} = (Δ_{1}^{s}, \dots, Δ_{J}^{s}) = (D_{B, 1}^{s} - D_{A, 1}^{s}, \dots, D_{B, J}^{s} - D_{A, J}^{s})

.

3. Estimation and Inference

Let

{y_{A, i}}_{i = 1}^{n_{A}}

and

{y_{B, i}}_{i = 1}^{n_{B}}

be random iid draws from

F_{A}

and

F_{B}

, respectively, and let

n = n_{A} + n_{B}

be the pooled sample size. Assume that

n \to \infty

implies

n_{A} \to \infty

and

n_{B} \to \infty

, and

n_{A} / N_{A}

and

n_{B} / N_{B}

are sufficiently small so that no finite population adjustment is necessary. At the jth grid point,

D_{A}^{s}

can be consistently estimated by

{\hat{D}}_{A, j}^{s} = \frac{1}{n_{A} (s - 1)!} \sum_{i = 1}^{n_{A}} {({\hat{z}}_{j} - y_{A, i})}_{+}^{s - 1},

(2)

where

{\hat{z}}_{j} = α_{j} y_{(r)}

,

y_{(r)}

is the rth order statistic of the pooled sample

y = {(y^{A}, y^{B})}^{'}

with

r = [n q]

, and

[n q]

is the integer part of

n q

. For subgroup B,

D_{B}^{s}

can be estimated in a similar manner.

If

F_{A}

and

F_{B}

are differentiable and have finite first two moments, then

Δ^{s}

can be consistently estimated by

{\hat{Δ}}^{s} = ({\hat{Δ}}_{1}^{s}, \dots, {\hat{Δ}}_{J}^{s}) = ({\hat{D}}_{B, 1}^{s} - {\hat{D}}_{A, 1}^{s}, \dots, {\hat{D}}_{B, J}^{s} - {\hat{D}}_{A, J}^{s})

. Using similar arguments as (Zheng 2001, Section 4.2), an asymptotic expression for the jth difference is given by

Δ_{j}^{s} = \frac{1}{n_{B} (s - 1)!} \sum_{i = 1}^{n_{B}} {(z_{j} - y_{B, i})}_{+}^{s - 1} - \frac{1}{n_{A} (s - 1)!} \sum_{i = 1}^{n_{A}} {(z_{j} - y_{A, i})}_{+}^{s - 1} + α_{j} g_{j} (y_{(r)} - ξ_{q}) + o_{p} (n^{- 1 / 2}),

(3)

where

g_{j} = a_{j}^{B} - a_{j}^{A} + I (s = 1) [f_{B} (z_{j}) - f_{A} (z_{j})]

,

a_{j}^{A} = \partial D_{A, j}^{s} / \partial z_{j}

, and

f_{A}

is the underlying density function of distribution A.

Using the Bahadur representation (see, e.g., Zheng 2001, p. 351), we can express the difference between the sample quantile and population quantile as

y_{(r)} - ξ_{q} = \frac{q - \hat{F} (ξ_{q})}{f (ξ_{q})} + o_{p} (n^{- 1 / 2}),

(4)

where

\hat{F} (ξ_{q}) = n^{- 1} \sum_{i = 1}^{n} I (y_{i} \leq ξ_{q})

, and

f (ξ_{q}) = w_{A} f_{A} (ξ_{q}) + w_{B} f_{B} (ξ_{q})

is the underlying population density function. Thus, (3) becomes

Δ_{j}^{s} = \frac{1}{n_{B} (s - 1)!} \sum_{i = 1}^{n_{B}} {(z_{j} - y_{B, i})}_{+}^{s - 1} - \frac{1}{n_{A} (s - 1)!} \sum_{i = 1}^{n_{A}} {(z_{j} - y_{A, i})}_{+}^{s - 1} - \frac{α_{j} g_{j}}{f (ξ_{q})} \hat{F} (ξ_{q}) + \frac{α_{j} g_{j} q}{f (ξ_{q})} + o_{p} (n^{- 1 / 2}) .

(5)

Let the joint population moments of order

2 s - 2

of

y_{A}

and

y_{B}

be finite and suppose that

F_{A}

and

F_{B}

are differentiable. Then,

\sqrt{n} ({\hat{Δ}}^{s} - Δ^{s})

will converge in distribution to a normal random vector with mean vector zero and covariance matrix Σ with typical element

\begin{matrix} Cov ({\hat{Δ}}_{j}^{s}, {\hat{Δ}}_{k}^{s}) & = [\frac{1}{{[(s - 1)!]}^{2}} E_{A} [{(z_{j} - y)}_{+}^{s - 1} {(z_{k} - y)}_{+}^{s - 1}] - D_{A, j}^{s} D_{A, k}^{s}] / w_{A} \\ + [\frac{1}{{[(s - 1)!]}^{2}} E_{B} [{(z_{j} - y)}_{+}^{s - 1} {(z_{k} - y)}_{+}^{s - 1}] - D_{B, j}^{s} D_{B, k}^{s}] / w_{B} \\ - α_{k} g_{k} [D_{B, j}^{s} [1 - F_{B} (ξ_{q})] - D_{A, j}^{s} [1 - F_{A} (ξ_{q})]] / f (ξ_{q}) \\ - α_{j} g_{j} [D_{B, k}^{s} [1 - F_{B} (ξ_{q})] - D_{A, k}^{s} [1 - F_{A} (ξ_{q})]] / f (ξ_{q}) \\ + α_{j} α_{k} g_{j} g_{k} [w_{A} F_{A} (ξ_{q}) [1 - F_{A} (ξ_{q})] \\ + w_{B} F_{B} (ξ_{q}) [1 - F_{B} (ξ_{q})]] / f^{2} (ξ_{q}), \forall j, k . \end{matrix}

In practice, Σ can be consistently estimated by

\hat{Σ}

which will have typical element

\begin{matrix} \hat{Cov} ({\hat{Δ}}_{j}^{s}, {\hat{Δ}}_{k}^{s}) & = [\frac{1}{n_{A} {[(s - 1)!]}^{2}} \sum_{i = 1}^{n_{A}} {(z_{j} - y_{A, i})}_{+}^{s - 1} {(z_{k} - y_{A, i})}_{+}^{s - 1} - {\hat{D}}_{A, j}^{s} {\hat{D}}_{A, k}^{s}] \frac{n}{n_{A}} \\ + [\frac{1}{n_{B} {[(s - 1)!]}^{2}} \sum_{i = 1}^{n_{B}} {(z_{j} - y_{B, i})}_{+}^{s - 1} {(z_{k} - y_{B, i})}_{+}^{s - 1} - {\hat{D}}_{B, j}^{s} {\hat{D}}_{B, k}^{s}] \frac{n}{n_{B}} \\ - α_{k} {\hat{g}}_{k} [{\hat{D}}_{B, j}^{s} [1 - {\hat{F}}_{B} (y_{(r)})] - {\hat{D}}_{A, j}^{s} [1 - {\hat{F}}_{A} (y_{(r)})]] / \hat{f} (y_{(r)}) \\ - α_{j} {\hat{g}}_{j} [{\hat{D}}_{B, k}^{s} [1 - {\hat{F}}_{B} (y_{(r)})] - {\hat{D}}_{A, k}^{s} [1 - {\hat{F}}_{A} (y_{(r)})]] / \hat{f} (y_{(r)}) \\ + α_{j} α_{k} {\hat{g}}_{j} {\hat{g}}_{k} [n_{A} {\hat{F}}_{A} (y_{(r)}) [1 - {\hat{F}}_{A} (y_{(r)})] \\ + n_{B} {\hat{F}}_{B} (y_{(r)}) [1 - {\hat{F}}_{B} (y_{(r)})]] / [n {\hat{f}}^{2} (y_{(r)})], \forall j, k, \end{matrix}

where

{\hat{F}}_{A} (y_{(r)}) = n_{A}^{- 1} \sum_{i = 1}^{n_{A}} I (y_{A, i} \leq y_{(r)})

,

{\hat{g}}_{j} = {\hat{a}}_{j}^{B} - {\hat{a}}_{j}^{A} + I (s = 1) [{\hat{f}}_{B} ({\hat{z}}_{j}) - {\hat{f}}_{A} ({\hat{z}}_{j})]

,

{\hat{a}}_{j}^{A} = \frac{\partial {\hat{D}}_{A, j}^{s}}{\partial {\hat{z}}_{j}} = \frac{s - 1}{n_{A} (s - 1)!} \sum_{i = 1}^{n_{A}} {({\hat{z}}_{j} - y_{A, i})}_{+}^{s - 2},

and

{\hat{f}}_{A}

is the estimated underlying density function of

F_{A}

.3 The estimates of distribution B are just the analogues of A.

To test the null hypothesis that distribution A stochastically dominates B at order s,

H_{0} : Δ^{s} \geq 0 .

The alternate hypothesis is simply the negation of

H_{0}

.

Since we are testing multiple inequality restrictions, the relevant statistical inference methods can be found in Kodde and Palm (1986). Davidson and Duclos (2000) also uses this framework for their hypothesis tests. First, we compute the Wald-type test statistic

W = \min_{Δ^{s} \geq 0} n {({\hat{Δ}}^{s} - Δ^{s})}^{'} {\hat{Σ}}^{- 1} ({\hat{Δ}}^{s} - Δ^{s})

(6)

where the right-hand side is a quadratic programming problem. Under the null, W will converge in distribution to a mixture of

χ^{2}

distributions.

Obtaining critical values is not a straightforward process, so we follow Davidson and Duclos (2000) and advocate the use of the bootstrap. The procedure can be explained as follows. Given samples

y_{A}

and

y_{B}

, we pool them and obtain

y = {(y^{A}, y^{B})}^{'}

. Then, the bootstrap samples

y_{A}^{*}

and

y_{B}^{*}

are generated by resampling

n_{A}

and

n_{B}

observations (with replacement) from y. Next, using the bootstrap samples, we compute the bootstrap test statistic

W^{*}

in a similar manner to W. After repeating this process, a large number of times, the bootstrap p-value is the proportion of times that

W^{*}

exceeds W. A value less than the nominal size of the test should lead to the rejection of

H_{0}

.

Failure to infer dominance at order s by either distribution may imply that there exists some critical poverty line,

z_{s} < z

where the distributions cross and thus a rank reversal occurs. If such a threshold exists, it can be implicitly characterized by

D_{A}^{s} (z_{s}) = D_{B}^{s} (z_{s})

. A natural estimator of

z_{s}

is

{\hat{z}}_{s}

which solves

{\hat{D}}_{A}^{s} ({\hat{z}}_{s}) = {\hat{D}}_{B}^{s} ({\hat{z}}_{s})

. Let

D^{0} (x) = f (x)

and suppose

D_{A}^{s} (x) < D_{B}^{s} (x) \forall x < z_{s}

. Davidson and Duclos (2000) showed in Theorem 3, in that case,

\sqrt{n_{B}} ({\hat{z}}_{s} - z_{s})

will be asymptotically normal with mean zero and asymptotic variance

\frac{var ({(z_{s} - y_{A})}_{+}^{s - 1}) + n_{B} / n_{A} var ({(z_{s} - y_{B})}_{+}^{s - 1})}{{[(s - 1)! (D_{B}^{s - 1} (z_{s}) - D_{A}^{s - 1} (z_{s}))]}^{2}},

which, of course, can be estimated by simply replacing the terms with their respective empirical analogues.

4. Simulation Evidence

We now assess the size and power of our proposed test using a series of Monte Carlo experiments. The experiments were carried out using 10,000 independent trials, sample sizes of

n_{A} = n_{B} = n / 2

, and nominal size set equal to 5%. We consider tests of first-order stochastic dominance (i.e.,

s = 1

) for which

a_{j}^{A} = 0

. The poverty line is set equal to 50% of the pooled median (i.e.,

c = q = 0.5

).

Five different parametric distributions are considered in assessing the size of the test: gamma, Singh–Maddala, log-normal, unit exponential, and uniform. The CDF of the gamma distribution is given by

F (y) = γ (a_{2}, y / a_{1}) / Γ (a_{2})

, where

a_{1}

is a scale parameter,

a_{2}

is a shape parameter,

γ (\cdot)

is the gamma function, and

Γ (\cdot)

is the incomplete gamma function. The CDF of the Singh–Maddala distribution is given by

F (y) = 1 - {(1 + b_{1} y^{b_{2}})}^{- b_{3}}

, where

b_{1}

is a scale parameter, and

b_{2}

and

b_{3}

are shape parameters. Following McDonald (1984), we set

a_{2} = 2.1557

for the gamma distribution, and

b_{2} = 1.697

and

b_{3} = 8.368

for the Singh–Maddala distribution, which were used to simulate 1980 U.S. income distribution. The scale parameters for both distributions are set to unity. For the log-normal distribution, the mean and standard deviation are set to

2.9372

and

0.7797

, respectively, which were also used by McDonald (1984) to simulate 1980 U.S. income distribution. For the uniform distribution, we follow Zheng (2001) and specify the support as the unit interval

[0, 1]

. We consider five grid points set to 10%, 20%, 30%, 40%, and 50% of the pooled median.4

To assess the size of the test, we generate observations for subgroup A and B from the same distribution. We test the null hypothesis that

Δ^{1} \geq 0

, which is (weakly) true in this case. We consider pooled sample sizes varying from

n = 100

to

n = 1000

and utilize 199 bootstrap replications. Table 1 reports the rejection frequencies. Overall, we can conclude that a combined sample size of 1000 observations should be sufficient for achieving asymptotic normality. This is not a very demanding requirement at all as typical household survey datasets tend to have thousands of observations.

We focus exclusively on the gamma distribution in assessing the power of the test and consider the tests of Davidson and Duclos (2000) and Barrett and Donald (2003) as benchmarks.5 The shape and scale parameters for subgroup A remain set to their original levels from the size simulation. For subgroup B, we vary the parameters such that the CDF of B lies below the CDF of A for all points up to the poverty line. Rejection frequencies based on 199 bootstrap replications are reported in Table 2. The test exhibits excellent power properties as evidenced by the fact that it outperforms the test of Barrett and Donald (2003) regardless of sample size. A direct comparison of our test with Davidson and Duclos (2000) cannot really be made since their test is based on the assumption of either an absolute poverty line or poverty lines relative to each distribution (not pooled). Nonetheless, there is similarity between the two tests in terms of power. However, note that, in order to enable a somewhat fair comparison, the shape and scale parameters of the two distributions were chosen such that the medians remain the same for both distributions (thus, the pooled median is just the median of either subgroup A or subgroup B). This reduces sampling variability for the test of Davidson and Duclos (2000) and permits a more valid comparison. The advantage offered by our proposed test is that it accounts for the sampling variance of the common poverty line that depends on both distributions while the other two tests do not.

5. Illustration

In this section, we provide a simple example of how our proposed test can be applied in a real-world scenario. Using data from the 2017 Canadian Income Survey, we compare poverty outcomes among men and women. One of the ways Canada’s national statistical agency, Statistics Canada, measures “low income” is through its low income measure (LIM), which sets the low income line equal to 50% of the median adjusted household income (household income is divided by the square root of the household size).6 The pooled median adjusted household income is determined to be $46,461 implying a poverty line of $23,230.50.

The benefit of using scalar poverty indices such as the low income measure is that it allows policy makers to monitor and assess trends across socio-economic groups, regions, and time. The downside is that robust comparisons are not assured since distributional comparisons are not made at points below the low income line. Since our proposed test assumes a common pooled relative poverty line such as Statistics Canada’s low income line, we can make robust poverty comparisons between any subgroups of the population while still maintaining that common relative poverty line.

To illustrate our test, we compare poverty outcomes among Canadian men and women by testing for first-order stochastic dominance up to the low income line from the 2017 Canadian Income Survey. The sample consists of 47,800 men and 49,388 women. We consider five grid points set at 10%, 20%, 30%, 40%, and 50% of the median adjusted household income. The headcount ratios at the different points along with the median and standard deviation (SD) of adjusted household income for men and women are reported in Table 3. In testing the null that the male income distribution stochastically dominates the female income distribution up to the low income line, we obtain a bootstrap p-value of exactly zero leading to the rejection of the null. We also obtain a p-value of zero in testing the reverse hypothesis. This suggests that the CDFs of the two distributions cross at least once at some point below the low income line, which leads to our ambiguous conclusion that no dominance could be detected leading up to the low income line.

From Table 3, observe that, according to the low income line, the poverty rate is 11.3% for men and 13.2% for women (note that our estimates vary slightly from the official estimates because we treat negative incomes as zero and ignore the complex sampling scheme of the survey to simplify this exposition).7 Notice that, at the first three grid points, the poverty rates are lower for women. The trend reverses at the higher grid points which suggests that the CDFs cross, negating the possibility of first-order dominance by either subgroup.

However, can we make any inference regarding the critical poverty line where the rank reversal occurs? In other words, we want to determine the threshold level of income up to which the womens’ distribution dominate the mens’. We determine first-order stochastic dominance of womens’ income distribution over men up to an adjusted household income level of

$ 15, 058

, which is

32 %

of the median. The

95 %

confidence interval for the critical threshold in our case is

$ 12, 731.56

to

$ 17, 384.44

, or

27 %

to

37 %

of the median.

6. Conclusions

In this article, we proposed a test for stochastic dominance up to a common relative poverty line. Much of the existing tests of stochastic dominance, in the context of poverty measures, assume either separate poverty lines for each distribution or a common absolute poverty line. It is increasingly the case that relative poverty lines (i.e., 50% of the median income level) are being used in cross-country and group comparison studies.

A series of Monte Carlo experiments validates our asymptotic framework. The proposed test exhibits good size and power properties, under varying conditions, and outperforms existing methods due to the fact that this test utilizes the underlying covariances between estimates at the different points of the distributions. A sample size of 1,000 observations appears to be sufficient for achieving asymptotic normality which is not a very demanding requirement as household surveys tend to have thousands of observations. For illustration purposes, household income data from the 2017 Canadian Income Survey were used to rank poverty outcomes among men and women using the Canadian national statistical agency’s low income line.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

References

Anderson, Gordon. 1996. Nonparametric tests of stochastic dominance in income distributions. Econometrica 64: 1183–93. [Google Scholar] [CrossRef] [Green Version]
Atkinson, Anthony Barnes. 1987. On the measurement of poverty. Econometrica 55: 749–64. [Google Scholar] [CrossRef]
Barrett, Garry F., and Stephen G. Donald. 2003. Consistent tests for stochastic dominance. Econometrica 71: 71–104. [Google Scholar] [CrossRef]
Barrett, Garry F., Stephen G. Donald, and Yu-Chin Hsu. 2016. Consistent tests for poverty dominance relations. Journal of Econometrics 191: 360–73. [Google Scholar] [CrossRef]
Chen, Wen-Hao, and Jean-Yves Duclos. 2011. Testing for poverty dominance: An application to Canada. Canadian Journal of Economics 44: 781–803. [Google Scholar] [CrossRef] [Green Version]
Davidson, Russell, and Jean-Yves Duclos. 2000. Statistical inference for stochastic dominance and the measurement of poverty and inequality. Econometrica 68: 1435–64. [Google Scholar] [CrossRef] [Green Version]
Foster, James, Joel Greer, and Erik Thorbecke. 1984. A class of decomposable poverty measures. Econometrica 52: 761–66. [Google Scholar] [CrossRef]
Foster, James. 1984. On economic poverty: A survey of aggregate measures. Advances in Econometrics 3: 215–51. [Google Scholar]
Garcia-Gomez, Cesar, Ana Perez, and Mercedes Prieto-Alaiz. 2019. A review of stochastic dominance methods for poverty analysis. Journal of Economic Surveys 33: 1437–62. [Google Scholar] [CrossRef]
Kakwani, Nanak. 1993. Statistical inference in the measurement of poverty. The Review of Economics and Statistics 75: 632–39. [Google Scholar] [CrossRef]
Kaur, Amarjot, Bhagavatula Lakshmi Surya Prakasa Rao, and Harshinder Singh. 1994. Testing for second-order stochastic dominance of two distributions. Econometric Theory 10: 849–66. [Google Scholar] [CrossRef]
Kodde, David A., and Franz C. Palm. 1986. Wald criteria for jointly testing equality and inequality restrictions. Econometrica 54: 1243–48. [Google Scholar] [CrossRef]
Li, Qi, and Jeffrey Scott Racine. 2007. Nonparametric Econometrics: Theory and Practice. Princeton: Princeton University Press. [Google Scholar]
McDonald, James B. 1984. Some generalized functions for the size distribution of income. Econometrica 52: 647–63. [Google Scholar] [CrossRef]
McFadden, Daniel. 1989. Testing for stochastic dominance. In Studies in the Economics of Uncertainty. Edited by T. B. Fomby and T. K. Seo. New York: Springer. [Google Scholar]
Mehdi, Tahsin. 2017. Poverty comparisons with common relative poverty lines. Communications in Statistics—Theory and Methods 46: 2029–36. [Google Scholar] [CrossRef]
OECD. 2016. Poverty rates and gaps. In OECD Factbook 2015-2016: Economic, Environmental and Social Statistics. Paris: OECD Publishing. [Google Scholar] [CrossRef]
Ravallion, Martin. 1994. Poverty Comparisons, Fundamentals of Pure and Applied Economics. Chur: Harwood Academic Publishers. [Google Scholar]
Sen, Amartya. 1976. Poverty: An ordinal approach to measurement. Econometrica 44: 219–31. [Google Scholar] [CrossRef]
Statistics Canada. 2016. Low Income Lines: What They Are and How They Are Created; Ottawa: Statistics Canada.
Thompson, Brennan Scott, and Thanasis Stengos. 2012. Testing for bivariate stochastic dominance using inequality restrictions. Economics Letters 115: 60–62. [Google Scholar]
Thompson, Brennan Scott. 2012. Empirical likelihood-based inference for poverty measures with relative poverty lines. Econometric Reviews 32: 513–23. [Google Scholar] [CrossRef]
Zheng, Buhong. 2001. Statistical inference for poverty measures with relative poverty lines. Journal of Econometrics 101: 337–56. [Google Scholar] [CrossRef]

1	Thompson (2012) proposes an empirical likelihood-based test for comparing poverty measures between two distributions using a poverty line set to some fraction of the pooled median of the combined distribution, but the method cannot detect stochastic dominance since it permits only equality restrictions on the hypotheses.
2	For instance, if the poverty line is set to 50% ( $c = 0.5$ ) of the pooled median ( $q = 0.5$ ), then some possible grid points could be 10%, 20%, 30%, 40%, and 50% of the pooled median.
3	A well-known method for estimating densities is kernel estimation.The consistency of such estimators has been rigorously established in the literature (see, e.g., Li and Racine 2007).
4	Since quantile-based poverty lines require density estimation in calculating the covariance structure, we use kernel estimation with a Gaussian kernel and a “rule-of-thumb” bandwidth (see, e.g., Li and Racine 2007, Ch. 1).
5	For the case of a common relative poverty line, the test statistic of Barrett and Donald (2003) is based on the supremum of the distances between the censored CDFs which sets all income values above the poverty line equal to the poverty line.
6	The low-income cut-offs (LICOs) and market basket measures (MBMs) are two of the other complementary ways Statistics Canada measures and monitors low income and poverty. Unlike the LIM, the LICO and MBM are absolute poverty lines that vary by region and attempt to account for cost-of-living. As of 2018, the MBM became Canada’s official poverty line. However, the LIM continues to be the most commonly used measure for international poverty comparisons. For detailed information regarding the different measures, see Statistics Canada (2016).
7	Chen and Duclos (2011) provide an illustration of stochastic dominance tests using Canadian data that take into account sampling weights.

Table 1. Rejection frequencies for size simulation.

Distribution	n
Distribution	100	300	500	1000
Gamma	6.32	5.07	5.02	4.68
Singh–Maddala	5.72	5.42	4.98	4.87
Log-normal	6.27	5.32	4.87	5.28
Unit Exponential	5.33	5.57	5.09	4.89
Uniform	5.20	5.28	4.62	4.88

Note: The nominal size of the test is 5%. The grid points are set to 10%, 20%, 30%, 40%, and

50 %

of the pooled median.

Table 2. Rejection frequencies for power simulation.

$Δ_{1}^{1}$	$Δ_{2}^{1}$	$Δ_{3}^{1}$	$Δ_{4}^{1}$	$Δ_{5}^{1}$	n
$Δ_{1}^{1}$	$Δ_{2}^{1}$	$Δ_{3}^{1}$	$Δ_{4}^{1}$	$Δ_{5}^{1}$	100	300	500	1000
					W
−0.01	−0.02	−0.04	−0.05	−0.05	23.22	44.47	61.84	86.92
−0.01	−0.04	−0.07	−0.09	−0.10	51.37	92.28	98.91	100.00
					$D D$
−0.01	−0.02	−0.04	−0.05	−0.05	20.90	44.78	61.77	86.75
−0.01	−0.04	−0.07	−0.09	−0.10	52.07	92.32	98.83	100.00
					$B D$
−0.01	−0.02	−0.04	−0.05	−0.05	13.63	24.10	33.24	51.09
−0.01	−0.04	−0.07	−0.09	−0.10	26.75	56.86	74.81	94.92

Note: The nominal size of the test is 5% and

Δ_{j}^{1}

is the jth difference between the CDFs of subgroup B and A with grid points set to

10 % (j = 1)

,

20 % (j = 2)

,

30 % (j = 3)

,

40 % (j = 4)

, and

50 % (j = 5)

of the pooled median. Both distributions are generated from the gamma distribution. The scale parameter of distribution A is set to unity and its shape parameter is set to 2.1557. For distribution B, the scale is 0.651 and shape is 3.143 for the first row, while, for the second row, scale is 0.421 and shape is 4.688. DD denotes the test of Davidson and Duclos (2000), and BD denotes the bootstrap test of Barrett and Donald (2003).

Table 3. Descriptive statistics and estimated headcount ratios from the 2017 Canadian Income Survey.

	Median	SD	${\hat{D}}_{1}^{1}$	${\hat{D}}_{2}^{1}$	${\hat{D}}_{3}^{1}$	${\hat{D}}_{4}^{1}$	${\hat{D}}_{5}^{1}$
Men	$47,581	$52,766	0.008	0.016	0.034	0.062	0.113
Women	$45,401	$35,179	0.006	0.014	0.033	0.066	0.132

Note:

D_{j}^{1}

denotes the headcount ratios at grid points set to 10%

(j = 1)

, 20%

(j = 2)

, 30%

(j = 3)

, 40%

(j = 4)

, and 50%

(j = 5)

of the pooled sample median ($46,461).

© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mehdi, T. Testing for Stochastic Dominance up to a Common Relative Poverty Line. Econometrics 2020, 8, 5. https://doi.org/10.3390/econometrics8010005

AMA Style

Mehdi T. Testing for Stochastic Dominance up to a Common Relative Poverty Line. Econometrics. 2020; 8(1):5. https://doi.org/10.3390/econometrics8010005

Chicago/Turabian Style

Mehdi, Tahsin. 2020. "Testing for Stochastic Dominance up to a Common Relative Poverty Line" Econometrics 8, no. 1: 5. https://doi.org/10.3390/econometrics8010005

APA Style

Mehdi, T. (2020). Testing for Stochastic Dominance up to a Common Relative Poverty Line. Econometrics, 8(1), 5. https://doi.org/10.3390/econometrics8010005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Testing for Stochastic Dominance up to a Common Relative Poverty Line

Abstract

1. Introduction

2. Stochastic Dominance and Poverty Measurement

3. Estimation and Inference

4. Simulation Evidence

5. Illustration

6. Conclusions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI