Next Article in Journal
The Second Law of Infodynamics: A Thermocontextual Reformulation
Next Article in Special Issue
On the Convergence Rate for the Longest at Most T-Contaminated Runs of Heads
Previous Article in Journal
Lattice Boltzmann Modeling of Additive Manufacturing of Functionally Graded Materials
Previous Article in Special Issue
Some Open Questions About the Anisotropic Random Walks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Simple Wide Range Approximation of Symmetric Binomial Distribution

Department of Stochastics, Institute of Mathematics, Budapest University of Technology and Economics, Muegyetem rkp. 3, H ep, 5 em, 1521 Budapest, Hungary
Entropy 2025, 27(1), 21; https://doi.org/10.3390/e27010021
Submission received: 1 December 2024 / Revised: 13 December 2024 / Accepted: 15 December 2024 / Published: 30 December 2024
(This article belongs to the Special Issue The Random Walk Path of Pál Révész in Probability)

Abstract

:
The paper gives a wide range, uniform, local approximation of symmetric binomial distribution. The result clearly shows how one has to modify the classical de Moivre–Laplace normal approximation in order to give an estimate at the tail as well as to minimize the relative error.
MSC:
60E15; 60F05

The topic of this paper is a wide range, uniform, local approximation of symmetric binomial distribution, an extension of the classical de Moivre–Laplace theorem in the symmetric case [1] (Ch. VII) [2]. In other words, I would like to approximate individual binomial probabilities not only in a classical neighborhood of the center, but at the tail as well. The result clearly shows how one has to modify the normal approximation in order to give a wide range estimate to minimize the relative error. The method will be somewhat similar to the ones applied by [3,4,5] or in the proof of Tusnády’s lemma, see, e.g., [6]; however, my task is much simpler than those. This simplification makes the proof short, transparent and natural. Moreover, the result is non-asymptotic, that is, it gives explicit, nearly optimal, upper and lower bounds for the relative error with a finite n. Thus, I hope that it can be used in both applications and teaching.
Let ( X r ) r 1 be a sequence of independent, identically distributed random steps with P ( X r = ± 1 ) = 1 2 and S = r = 1 X r ( 1 ) , S 0 = 0 , be the corresponding simple, symmetric random walk. Then
P ( S = j ) = + j 2 2 ( | j | ) .
Here, we use the convention that the above binomial coefficient is zero whenever + j is not divisible by 2. Since P ( S = j ) = P ( S = j ) for any j, it is enough to consider only the case j 0 whenever it is convenient.
First we consider the case when = 2 n , even. Let us introduce the notation
a k , n : = P ( S 2 n = 2 k ) = 2 n n + k 2 2 n ( | k | n ) .
Also, introduce the notation
b k , n : = n 1 + k + 1 2 n log 1 + k n + 1 k 1 2 n log 1 k n
when n 1 and | k | < n , and
b ± n , n : = 2 n + 1 2 log 2 1 2 log ( 2 π n ) ( n 1 ) .
Theorem 1.
(a) For any n 1 and | k | n , we have
a k , n 1 π n e b k , n .
(b) For any n 1 and | k | r n , r ( 0 , 1 ) , we have
a k , n > 1 π n e b k , n exp 1 7 n r 4 3 ( 1 r 2 ) 2 n .
(c) In accordance with the classical de Moivre–Laplace normal approximation, for n 3 and | k | n 2 3 one has
1 2 n 1 3 < a k , n 1 π n e k 2 n < 1 + 2 n 1 3 .
Proof. 
As usual, the first step is to estimate the central term
a 0 , n = 2 n n 2 2 n = ( 2 n ) ! ( n ! ) 2 2 2 n .
By Stirling’s formula, see, e.g., [1] (p. 54), we have:
2 π n n e n e 1 12 n + 1 < n ! < 2 π n n e n e 1 12 n ( n 1 ) .
Thus, after simplification we obtain
1 π n e 1 7 n < a 0 , n < 1 π n e 1 9 n ( n 1 ) .
Second, also by the standard way, for 1 k n ,
a k , n = a 0 , n n ( n 1 ) ( n k + 1 ) ( n + 1 ) ( n + 2 ) ( n + k ) = a 0 , n 1 1 n 1 2 n 1 k 1 n 1 + 1 n 1 + 2 n 1 + k n .
So it follows that
log a k , n = log a 0 , n log 1 + k n 2 j = 1 k 1 1 2 log 1 + j n 1 j n = log a 0 , n log 1 + k n 2 j = 1 k 1 tanh 1 j n .
To approximate the sum here, let us introduce the integral
I ( x ) : = 0 x tanh 1 ( t ) d t = 1 2 ( 1 + x ) log ( 1 + x ) + 1 2 ( 1 x ) log ( 1 x )
for | x | < 1 , and its approximation by a trapezoidal sum
T k , n : = 1 n 1 2 tanh 1 ( 0 ) + j = 1 k 1 tanh 1 j n + 1 2 tanh 1 k n ,
where 0 k < n . It is well-known that the error of the trapezoidal formula for a function f C 2 ( [ a , b ] ) is
T n ( f ) a b f ( t ) d t = ( b a ) 3 12 n 2 f ( x ) , x [ a , b ] .
Since ( tanh 1 ) ( x ) = 2 x ( 1 x 2 ) 2 , we obtain that
0 T k , n I k n r 4 6 ( 1 r 2 ) 2 n 2 , when 0 k r n , r ( 0 , 1 ) .
Let us combine Formulas (8)–(12):
log a k , n = log a 0 , n 2 n T k , n 1 2 log 1 + k n 1 2 log 1 k n ,
thus
log 1 π n b k , n 1 7 n r 4 3 ( 1 r 2 ) 2 n < log a k , n < log 1 π n b k , n 1 9 n ,
where 0 k r n and
b k , n = 2 n I k n + 1 2 log 1 k 2 n 2 .
Clearly, (14) is the same as (2). Thus, (13) proves (a) and (b) of the theorem.
Let us see now, using Taylor expansions, a series expansion of b k , n when | k | < n . First, for | x | < 1 ,
I ( x ) = x 2 1 · 2 + x 4 3 · 4 + x 6 5 · 6 + x 8 7 · 8 + .
Second, also for | x | < 1 ,
log ( 1 x 2 ) = x 2 x 4 2 x 6 3 x 8 4 .
So by (14), for | k | < n we have a convergent series for b k , n :
b k , n = k 2 n 1 1 2 n + k 4 2 n 3 1 3 1 2 n + k 6 3 n 5 1 5 1 2 n + = j = 1 k n 2 j 1 j n 2 j 1 1 2 .
By (15), for n 3 and | k | n 2 3 we obtain that
b k , n k 2 n n 2 3 2 + j = 2 n 2 3 j 1 j n 2 j 1 + 1 2 n 2 3 2 + n 2 1 3 + 1 6 j = 2 n 2 3 j n 1 3 .
Thus, by (4), for n 3 and | k | n 2 3 ,
a k , n 1 π n e k 2 n e n 1 / 3 < 1 π n e k 2 n 1 + 2 n 1 3 .
Similarly, by (5), for n 3 and | k | n 2 3 ,
a k , n > 1 π n e k 2 n exp 1 7 n n 7 3 3 ( 1 n 2 3 ) 2 n 1 3 1 π n e k 2 n e 1.21 n 1 / 3 > 1 π n e k 2 n 1 2 n 1 / 3 .
(16) and (17) together prove (c) of the theorem. □
The main moral of Theorem 1 is that the exponent k 2 / n in the exponent of the normal approximation is only a first approximation of the series (15). In (c), the bound | k | n 2 3 was chosen somewhat arbitrarily. It is clear from the series (15) that the bound should be o ( n 3 4 ) . The above given bound was picked because it seemed to be satisfactory for usual applications with large deviation and gave a nice relative error bound 2 n 1 3 .
It is not difficult to extend the previous results to the odd-valued case of the symmetric binomial probabilities
a k , n * : = P ( S 2 n 1 = 2 k 1 ) = 2 n 1 n + k 1 2 2 n + 1 ( n + 1 k n ) .
Define
b k , n * : = n 1 + k 1 2 n log 1 + k n + 1 k 1 2 n log 1 k n
for n 1 and | k | < n , and
b n , n * : = 2 n 1 2 log 2 1 2 log ( 2 π n ) ( n 1 ) .
Theorem 2.
(a) For any n 1 and n + 1 k n , we have
a k , n * < 1 π n e b k , n * exp 2 3 n .
(b) For any n 1 and | k | r n , r ( 0 , 1 ) , we have
a k , n * > 1 π n e b k , n * exp 1 n r 4 3 ( 1 r 2 ) 2 n .
(c) In accordance with the classical de Moivre–Laplace normal approximation, for n 4 and | k | n 2 3 one has
1 3 n 1 3 < a k , n * 1 π n e k 2 n < 1 + 6 n 1 3 .
Proof. 
Since this proof is very similar to the previous one, several details are omitted. First, by Stirling’s Formula (7), after simplifications we obtain that
1 π n e 1 n < a 0 , n * < 1 π n e 2 3 n ( n 1 ) .
Second, similarly to (13), for n 1 and | k | r n , r ( 0 , 1 ) , we obtain that
log 1 π n b k , n * 1 n r 4 3 ( 1 r 2 ) 2 n < log a k , n * < log 1 π n b k , n * + 2 3 n ,
where
b k , n * : = 2 n I k n tanh 1 k n = j = 1 k n 2 j 1 1 2 j 1 k j 1 .
(25) clearly agrees with (18). (24) proves (a) and (b).
By the series in (25), for n 4 and | k | n 2 3 we obtain that
b k , n * k 2 n < 2 n 1 3 .
This and (24) imply (c). □

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Feller, W. An Introduction to Probability Theory and Its Applications, 3rd ed.; Wiley: New York, NY, USA, 1968; Volume I. [Google Scholar]
  2. Révész, P. Random Walk in Random and Non-Random Environments, 2nd ed.; World Scientific: Singapore, 2005. [Google Scholar]
  3. Feller, W. On the normal approximation to the binomial distribution. Ann. Math. Statist. 1945, 16, 319–329. [Google Scholar] [CrossRef]
  4. Littlewood, J.E. On the probability in the tail of a binomial distribution. Adv. Appl. Prob. 1969, 1, 43–72. [Google Scholar] [CrossRef]
  5. McKay, B.D. On Littlewood’s estimate for the binomial distribution. Adv. Appl. Prob. 1989, 21, 475–478. [Google Scholar] [CrossRef]
  6. Massart, P. Tusnady’s lemma, 24 years later. Ann. I. H. Poincaré—PR 2002, 38, 991–1007. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Szabados, T. A Simple Wide Range Approximation of Symmetric Binomial Distribution. Entropy 2025, 27, 21. https://doi.org/10.3390/e27010021

AMA Style

Szabados T. A Simple Wide Range Approximation of Symmetric Binomial Distribution. Entropy. 2025; 27(1):21. https://doi.org/10.3390/e27010021

Chicago/Turabian Style

Szabados, Tamás. 2025. "A Simple Wide Range Approximation of Symmetric Binomial Distribution" Entropy 27, no. 1: 21. https://doi.org/10.3390/e27010021

APA Style

Szabados, T. (2025). A Simple Wide Range Approximation of Symmetric Binomial Distribution. Entropy, 27(1), 21. https://doi.org/10.3390/e27010021

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop