In practical applications, experimental regions are often subject to multiple constraints. Linear constraints involve linear relationships between variables, while quadratic constraints involve quadratic terms. Geometrically, a linear constraint defines a hyperplane in a higher-dimensional space, while a quadratic constraint defines a curved surface, making the feasible region more complex. Linear constraints are generally simpler to handle computationally, and in practice they often describe limitations on costs, time, or resources. Quadratic constraints are used for more complex problems, such as when variables have non-linear relationships. For example, in portfolio optimization, the objective is to maximize returns, and uniform designs can be used in simulation to identify investment portfolios that yield higher returns. Suppose that the total investment amount is b, and there are s types of securities available for investment. Based on prior information, the effect of each security is represented by , where for , and the corresponding weights are denoted by . A linear constraint can be formulated as . Geometrically, this constrained region corresponds to the intersection of and the given hyperplane. When additional prior information is available, multiple linear constraints can be introduced, expressed as , for . In this case, when and , the constrained region simplifies to a line segment within the unit cube . In physical engineering systems, experimental factors may exhibit quadratic relationships. In such cases, the constraint can be expressed as , which represents the intersection of a hypersphere and . When dealing with high-dimensional variables and limited resources, a uniform design is an effective approach for extracting a small number of samples to identify patterns within a specified region. A practical strategy is to first generate points that are uniformly distributed in .
These points are then mapped to the target region through an appropriate transformation, ensuring that the resulting points maintain a uniform distribution within the target region.
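As a concrete sketch of the first step, low-discrepancy points in the unit cube can be produced with the good lattice point (GLP) method mentioned later in this section. The run size n = 21 and generating vector (1, 13) below are illustrative choices, not values from the paper:

```python
import numpy as np

def good_lattice_points(n, h):
    """Good lattice point set in [0,1]^s: x_ij = ((i * h_j - 0.5) mod n) / n,
    where each h_j is coprime to n."""
    i = np.arange(1, n + 1).reshape(-1, 1)
    h = np.asarray(h).reshape(1, -1)
    return ((i * h - 0.5) % n) / n

# 21-run GLP set in [0,1]^2 with illustrative generating vector (1, 13)
u = good_lattice_points(21, [1, 13])
```

Each column of `u` runs through 21 equally spaced values in (0, 1), so the set is evenly spread before any transformation is applied.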
3.1. One Arbitrary Linear Constraint
We aim to identify experimental points that follow a uniform distribution using the distribution function of uniformly distributed random vectors. A key challenge in this process is determining the measure of the experimental region, which involves computing high-dimensional integrals. Based on the work of Lasserre [23], which used the Laplace transform method to compute the measure of the intersection between a simplex in and , we derive the following results.
Lemma 1. In the non-empty bounded space , where for ; then, the measure of is where Here, denotes a vector in whose entries at the 1st, -th, …, and -th positions are equal to 1, with all other positions equal to 0.
Proof. We begin by proving that Lemma 1 holds when for and . Let , where and By the existence theorem for Laplace transforms, the Laplace transform of exists when the parameter s, or the real part of s, is positive. The Laplace transform of is defined as , where s or the real part of s is positive. According to the definition of the Laplace transform, we have By applying Fubini’s theorem, we obtain By the properties of the Laplace transform, is the Laplace transform of the function . Therefore, we have When , the set is non-empty. Taking , we have
Next, we will prove that Lemma 1 holds when for , for , and . Let where and . When , it is clear that . When the parameter s, or the real part of s, is positive, the Laplace transforms of and of both exist. Define the Laplace transforms of and as Similarly, let ; then, we have When , the set is non-empty and the formula is well defined. If , the terms with can be excluded first, and then Lemma 1 can be applied to obtain the measure of the corresponding experimental region. □
For convenience in subsequent calculations, we can derive the following result for the integral computation based on Lemma 1.
Lemma 2. Let ; then, we have where , and is a vector in with 1s at the positions and 0s at all other positions. Proof. Representing the measure in Lemma 1 as a high-dimensional integral and treating some of the variables in the constraint conditions as constants, Lemma 2 follows by a coordinate transformation. □
Lemmas 1 and 2 provide the foundation for determining the transformation that maps the uniformly distributed points from to a uniform distribution within the target region.
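Although the closed-form measures are not reproduced here, such formulas can be sanity-checked numerically. A minimal Monte Carlo sketch for the slack-variable (inequality) form of a single linear constraint, with illustrative coefficients a = (1, 2) and intercept b = 1 that are not taken from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative slack-variable region {x in [0,1]^2 : x1 + 2*x2 <= 1},
# the inequality relaxation obtained by treating one variable as slack.
a = np.array([1.0, 2.0])
b = 1.0

# Volume estimate: fraction of uniform points in the cube satisfying the constraint.
samples = rng.random((1_000_000, 2))
vol_mc = np.mean(samples @ a <= b)

# The exact volume of this triangle is 0.5 * 1 * 0.5 = 0.25,
# so vol_mc should agree with the closed-form measure to MC accuracy.
```

With a million samples the Monte Carlo standard error is below 0.001, so a closed-form measure from Lemma 1 or 2 can be checked to two or three decimal places this way.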
First, we consider the case where the experimental region is subject to an arbitrary single linear constraint. Define , where . Through a straightforward analysis, experimental regions with a single arbitrary equality constraint can be classified into the following cases:
(i) The intercept term b is zero, and the coefficients consist of positive, negative, and zero values: .
(ii) The intercept term is non-zero, with some coefficients being positive and the rest zero. Let us now examine the case where . If all coefficients are positive, we arrange them in ascending order and denote the smallest coefficient as . If , , the corresponding variable is treated as a slack variable, allowing the experimental region to be relaxed into However, if , , the corresponding variable is treated as a slack variable, and the experimental region fully coincides with its intersection with the :
(iii) The intercept term is non-zero, and not all coefficients are positive: , where and , with }.
For case (i), let be a random vector uniformly distributed on with a density function where represents the measure of . By Lemma 1, we can obtain Following the approach outlined in Section 2, we assume that the marginal distribution function of is where Additionally, the conditional distribution can be expressed as Building on the concept from Section 2, let each component of the uniformly distributed random vector in the unit hypercube correspond to the distribution function and conditional distribution function of ; we can then derive the transformation formula by combining (1) and (2): where are the test points uniformly distributed on . In Equation (3), we first assume that is uniformly distributed in the experimental region. The vector composed of its distribution function and conditional distribution functions then follows a uniform distribution in the unit hypercube of the same dimension, so each component of the uniformly distributed random vector in the unit hypercube corresponds to a distribution function or conditional distribution function of . The components that are not subject to the linear constraint can be represented directly by the corresponding components of , and the slack variable can be expressed through the linear constraint, which yields the last relation in the equation.
We now turn our attention to the uniformity of the distribution of experimental points obtained through the transformation (3). Based on the above results, we can derive the following result.
Theorem 1. In the non-empty bounded space , let be a random vector uniformly distributed over . Then, the random vector obtained through the transformation in (3) is uniformly distributed over . Proof. Let and represent random vectors uniformly distributed over and , respectively, satisfying (3). The distribution function of is given as Let represent a random vector uniformly distributed over . We know that is a random vector over the space . Define as an invertible transformation from to such that , where is the inverse of the marginal distribution function of , and are the inverses of the conditional distribution functions of , respectively. The joint distribution functions of and are given by Thus, we have From the above discussion, we can conclude that Therefore, , obtained through the transformation (3), is uniformly distributed over . □
From Theorem 1, we can conclude that the transformed test points are uniformly distributed over .
For case (ii), consider the case where and . Treat as a slack variable, and the new experimental region can be represented as . Let be a uniformly distributed random vector in . We can derive the density function and the measure of the region Then, by Lemma 2, the corresponding marginal distribution function can be expressed as where and the conditional distribution functions where Building on the concept from Section 2, let each component of the uniformly distributed random vector in the unit hypercube correspond to the distribution function and conditional distribution function of ; the third equation in the expression follows from the constraint condition. We can then derive the transformation formula by combining (4) and (5): where are the test points uniformly distributed on . We now turn our attention to the uniformity of the distribution of the experimental points obtained by the transformation (6). Based on the previous results, we have the following conclusion.
Theorem 2. In the non-empty bounded space with , let be a uniformly distributed random vector over . Then, the random vector obtained through the transformation (6) is also uniformly distributed over . Thus, by Theorem 2, the transformed test points are also uniformly distributed over .
When , we have . In this scenario, we can treat as a slack variable, and the new experimental region can be represented as . Let be a uniformly distributed random vector in . The density function can then be expressed as , where . By Lemma 2, the corresponding marginal distribution function can be expressed as Additionally, the conditional distribution functions for can be expressed as Similarly, we can derive the design using the transformation where are the test points uniformly distributed on . We now turn our attention to the uniformity of the distribution of the experimental points obtained by the transformation (9).
Theorem 3. In the non-empty bounded space , let be a random vector uniformly distributed over , and suppose that . Then, the random vector obtained through the transformation (9) is also uniformly distributed over . From Theorem 3, we can conclude that the transformed test points are uniformly distributed over . When all coefficients are equal to 1, this conclusion degenerates into the result on uniform designs over the standard simplex presented in Fang and Wang [7].
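For that degenerate case, the inverse transformation has a well-known closed form: under the uniform distribution on the standard simplex, the marginal CDF of the first coordinate is F(x1) = 1 − (1 − x1)^(s−1), and each later coordinate is obtained the same way from the remaining mass. A minimal sketch, assuming the region {x ≥ 0, Σ x_i = 1}:

```python
import numpy as np

def simplex_irt(u):
    """Map u in [0,1]^(s-1) to the standard simplex {x >= 0, sum(x) = 1}
    via inverse conditional CDFs: x_1 = 1 - (1 - u_1)^(1/(s-1)), and each
    later coordinate is drawn the same way from the remaining mass."""
    u = np.atleast_2d(u)
    n, d = u.shape          # d = s - 1 free coordinates
    x = np.zeros((n, d + 1))
    remaining = np.ones(n)
    for i in range(d):
        x[:, i] = (1.0 - (1.0 - u[:, i]) ** (1.0 / (d - i))) * remaining
        remaining = remaining - x[:, i]
    x[:, -1] = remaining    # the slack variable is fixed by the constraint
    return x

u = np.random.default_rng(1).random((5, 2))
x = simplex_irt(u)
```

Applying this map to a low-discrepancy set in the unit square yields points spread over the triangle {x1 + x2 + x3 = 1, x ≥ 0}, which is the Fang–Wang construction this case reduces to.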
For the third case, consider the condition . Treat as a slack variable, and the new experimental region can be represented as . Let be a uniformly distributed random vector in . The density function can then be obtained, and the measure of the region is given by By Lemma 2, the corresponding marginal distribution function can be expressed as and the conditional distribution functions where Similarly, we can derive the design using the transformation Based on the previous results, we can derive the following result.
Theorem 4. In the non-empty bounded space , let be a random vector uniformly distributed over . Then, the random vector obtained through the transformation (12) is also uniformly distributed over . The proofs of Theorems 2–4 can be established in a manner similar to that of Theorem 1 and are omitted. Thus, by Theorem 4, the transformed test points are also uniformly distributed over .
Based on the aforementioned results, the steps for identifying a uniform design over the experimental region , subject to an arbitrary linear constraint, can be summarized in Algorithm 1.
Algorithm 1 General construction algorithm for one arbitrary linear constraint
1: Input: The experimental domain ;
2: Step 1: Compare the magnitude of b with 0. If , proceed to Step 2. If , divide both sides of the constraint equation by b to obtain a new constraint: , where , . Then, proceed to Step 3;
3: Step 2: If all (or ) for , then is empty. Otherwise, reorder such that and . Then, transform into ;
4: Step 3: If all for , arrange them in ascending order. If , transform into ; if , transform into ; if a rearrangement exists such that and , with , transform into ; otherwise, is empty;
5: Step 4: Calculate the corresponding marginal distribution function (1) (or (4), (7), (10)) and the conditional distribution functions (2) (or (5), (8), (11)) on (or , , );
6: Step 5: Given a set of uniformly distributed test points in generated using existing methods (such as the good lattice point method), randomly permute each point and denote it as . Then, for each point, we can obtain distinct design configurations uniformly distributed over using the transformation (3) (or (6), (9), (12));
7: Output: Compare the CCD values of the designs and select the one with the smallest CCD as the final design points, denoted by .
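The permute-and-select loop of Step 5 can be sketched as follows. Since the CCD formula is not reproduced here, this sketch scores candidates with the minimum pairwise distance (maximin) as an illustrative stand-in for the uniformity criterion, and uses the standard-simplex transformation as a stand-in for the case-specific transformations (3), (6), (9), and (12); the run size, generating vector, and number of candidate permutations are all illustrative:

```python
import numpy as np
from itertools import combinations

rng = np.random.default_rng(2)

def to_simplex(u):
    # Stick-breaking inverse transform onto {x >= 0, sum(x) = 1};
    # an illustrative stand-in for the transformations (3)/(6)/(9)/(12).
    n, d = u.shape
    x = np.zeros((n, d + 1))
    rem = np.ones(n)
    for i in range(d):
        x[:, i] = (1 - (1 - u[:, i]) ** (1 / (d - i))) * rem
        rem = rem - x[:, i]
    x[:, -1] = rem
    return x

def maximin(x):
    # Minimum pairwise distance: an illustrative stand-in for the CCD
    # criterion (here, larger means more spread out).
    return min(np.linalg.norm(p - q) for p, q in combinations(x, 2))

# Base low-discrepancy points in [0,1]^2 (good lattice points, n = 13).
n = 13
u0 = ((np.arange(1, n + 1)[:, None] * np.array([1, 5]) - 0.5) % n) / n

best, best_score = None, -np.inf
for _ in range(20):
    u = u0[:, rng.permutation(2)]  # random coordinate permutation
    x = to_simplex(u)
    score = maximin(x)
    if score > best_score:
        best, best_score = x, score
```

In the actual algorithm the stand-in score would be replaced by the CCD of each transformed design, and the design with the smallest CCD would be kept.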
In Step 2, based on the idea of slack-variable models, as discussed by Schneider et al. [18] and Schneider et al. [19], we can choose as a slack variable in a linear programming problem. Using the existing constraints, the new experimental domain can be expressed as . Similarly, in Step 3, the new experimental domains can be denoted as , and . After identifying the uniformly distributed test points within these experimental domains, it remains to prove that these design points are also uniformly distributed in the original experimental domain.
In the following, we provide an example to facilitate a better understanding of Algorithm 1.
Example 1. By Algorithm 1, we obtain and We can solve Equations (13) and (14) to obtain and Thus, based on constraint , we have Using the modified good lattice point method to generate design points in , the corresponding design points on are obtained from Equations (15)–(17). The construction results are visualized in Figure 1, which shows the results of the IRT, AR, and SR methods for constructing 10-, 50-, and 100-run uniform designs on . In Figure 1, the plus sign marks the design points obtained using the IRT-based method, the circle those obtained using the AR method, and the asterisk those obtained using the SR method. From Figure 1, we can directly observe that the design points obtained using the IRT-based method proposed in this paper are more widely and evenly distributed in the experimental space. In practical experiments, the nearest design points can be used to arrange the experiments. Moreover, we run experiments for to compare the IRT, AR, and SR methods, with numerical results presented in Table 1. We observe that the IRT method consistently outperforms the other two methods. Additionally, the AR method takes the longest time; both the method proposed in this paper and the SR method consume relatively little computational time. However, the designs generated using the proposed method have smaller CCD values in the experimental region, indicating better uniformity.
3.2. Multiple Arbitrary Linear Constraints
Regarding the case where the number of distinct linear constraints t (where ) exceeds one, the feasible region for selecting experimental points is the intersection of multiple -dimensional hyperplane segments. In this case, algebraic methods can be used to compute the normal vector of the intersecting hyperplanes, thereby representing the general solution of the intersection. The problem is thus reduced to the case of a single linear constraint.
The experimental region is considered, where . Given the number of non-redundant linear constraints in this experimental region, we can directly determine the experimental points within the region , assuming that is non-empty. Alternatively, we can express the components of that are unconstrained by the linear conditions and then use the IRT method to construct a uniform design on . However, due to the arbitrary values of and t, it is challenging to intuitively determine the number of non-redundant linear constraints within the experimental region, making it difficult to establish the dimension of and identify which components of are constrained. A natural approach is to view the constraints in the experimental region as a system of linear equations over the real number field, denoted as (I): , where A is the coefficient matrix of the system, expressed as , is the unknown vector in the linear system (I), and is the constant vector. The augmented matrix of the system is denoted by , and the dimension of the solution space of system (I) corresponds to the dimension of the experimental space . The dimension d of the experimental space can be determined from the rank (where ) of the coefficient matrix A, the rank (where ) of the augmented matrix , the number of linear constraints t, and the total dimensionality s of the space. Specifically, if the rank of the coefficient matrix A is smaller than the dimension s of , the dimension of the experimental region is the difference between s and ; that is, .
When , system (I) becomes a homogeneous system of linear equations, where . Based on the solvability of homogeneous linear systems, if or , the system will have non-zero solutions. If , the system will have only the trivial zero solution. When , system (I) is a non-homogeneous system of linear equations. Based on the solvability of non-homogeneous systems, if , the system will have non-zero solutions. If , the system will have a unique solution. If , the system will not have a solution.
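The rank conditions above translate directly into code. A minimal sketch with an illustrative system (the matrix entries are examples, not from the paper), using the rank of A versus the rank of the augmented matrix to decide solvability and the dimension d = s − r_A:

```python
import numpy as np

# Illustrative system (I): A x = beta with s = 4 variables and t = 2 constraints.
A = np.array([[1.0, 1.0, 1.0, 1.0],
              [1.0, 2.0, 0.0, 1.0]])
beta = np.array([1.0, 1.0])

s = A.shape[1]
r_A = np.linalg.matrix_rank(A)                             # rank of the coefficient matrix
r_aug = np.linalg.matrix_rank(np.column_stack([A, beta]))  # rank of the augmented matrix

if r_A < r_aug:
    d = None     # inconsistent system: the experimental region is empty
elif r_A == s:
    d = 0        # unique solution: the region is a single point
else:
    d = s - r_A  # dimension of the experimental region
```

For this example the two constraints are non-redundant and consistent, so the experimental region is a 2-dimensional set inside the 4-dimensional cube.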
This construction process can be summarized in Algorithm 2.
Algorithm 2 General construction algorithm for multiple arbitrary linear constraints
1: Input: The experimental domain }, where ;
2: Step 1: Express the constraints as a system of linear equations and determine whether solutions exist for the system;
3: Step 2: Find the general solution to the system and use it to determine the normal vector of the hyperplane segment;
4: Step 3: Use the obtained normal vector to represent the experimental region with a single linear equality constraint, then apply Algorithm 1 to obtain the design;
5: Output: The final design points .
In Step 1, it is essential to first analyze the magnitude of under conditions and , as well as to derive the expressions for the experimental points in the space. Subsequently, the uniformly distributed experimental points can be determined for each case accordingly.
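Steps 2 and 3 of Algorithm 2 amount to computing a particular solution plus a basis of the null space of A: every point of the intersection is then x0 plus a combination of null-space directions, which reduces the problem to the lower-dimensional setting handled by Algorithm 1. A hedged numerical sketch (constraint values are illustrative):

```python
import numpy as np

# Illustrative system: two linear constraints on [0,1]^4.
A = np.array([[1.0, 1.0, 1.0, 1.0],
              [1.0, 2.0, 0.0, 1.0]])
beta = np.array([1.0, 1.0])

# Particular (minimum-norm) solution and an orthonormal null-space basis.
x0 = np.linalg.lstsq(A, beta, rcond=None)[0]
vt = np.linalg.svd(A)[2]
null_basis = vt[np.linalg.matrix_rank(A):]  # rows spanning ker(A)

# Every point of the intersection has the form x0 + sum_k c_k * n_k;
# intersecting this affine set with the unit cube gives the reduced
# region on which Algorithm 1 is applied.
c = np.array([0.1, -0.2])
x = x0 + c @ null_basis
```

Any choice of the reduced coordinates c stays on the intersection of the constraint hyperplanes, so the construction only needs to handle the cube restriction in the reduced space.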