<< Chapter < Page Chapter >> Page >
This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by EwaPaszek and Marek Kimmel. The development of this course has been supported by NSF 0203396 grant.

Poisson distribution

Some experiments results in counting the number of times particular events occur in given times of on given physical objects. For example, we would count the number of phone calls arriving at a switch board between 9 and 10 am, the number of flaws in 100 feet of wire, the number of customers that arrive at a ticket window between 12 noon and 2 pm, or the number of defects in a 100-foot roll of aluminum screen that is 2 feet wide. Each count can be looked upon as a random variable associated with an approximate Poisson process provided the conditions in the definition below are satisfied.

POISSON PROCCESS
Let the number of changes that occur in a given continuous interval be counted. We have an approximate Poisson process with parameter λ > 0 if the following are satisfied:
  • The number of changes occurring in nonoverlapping intervals are independent.
  • The probability of exactly one change in a sufficiently short interval of length h is approximately λ h .
  • The probability of two or more changes in a sufficiently short interval is essentially zero.

Suppose that an experiment satisfies the three points of an approximate Poisson process. Let X denote the number of changes in an interval of "length 1" (where "length 1" represents one unit of the quantity under consideration). We would like to find an approximation for P ( X = x ) , where x is a nonnegative integer. To achieve this, we partition the unit interval into n subintervals of equal length 1/ n . If N is sufficiently large (i.e., much larger than x ), one shall approximate the probability that x changes occur in this unit interval by finding the probability that one change occurs exactly in each of exactly x of these n subintervals. The probability of one change occurring in any one subinterval of length 1/ n is approximately λ ( 1 / n ) by condition (2). The probability of two or more changes in any one subinterval is essentially zero by condition (3). So for each subinterval, exactly one change occurs with a probability of approximately λ ( 1 / n ) . Consider the occurrence or nonoccurrence of a change in each subinterval as a Bernoulli trial. By condition (1) we have a sequence of n Bernoulli trials with probability p approximately equal to λ ( 1 / n ) . Thus an approximation for P ( X = x ) is given by the binomial probability n ! x ! ( n x ) ! ( λ n ) x ( 1 λ n ) n x .

In order to obtain a better approximation, choose a large value for n . If n increases without bound, we have that

lim n n ! x ! ( n x ) ! ( λ n ) x ( 1 λ n ) n x = lim n n ( n 1 ) ... ( n x + 1 ) n x λ x x ! ( 1 λ n ) n ( 1 λ n ) x .

Now, for fixed x , we have

lim n n ( n 1 ) ... ( n x + 1 ) n x = lim n [ 1 ( 1 1 n ) ... ( 1 x 1 n ) ] = 1, lim n ( 1 λ n ) n = e λ ,

and lim n ( 1 λ n ) x = 1.

Thus, lim n n ! x ! ( n x ) ! ( λ n ) x ( 1 λ n ) n x = λ x e λ x ! = P ( X = x ) , approximately. The distribution of probability associated with this process has a special name.

POISSON DISTRIBUTION
We say that the random variable X has a Poisson distribution if its p.d.f. is of the form f ( x ) = λ x e λ x ! , x = 0,1,2,..., where λ > 0 .

It is easy to see that f ( x ) enjoys the properties pf a p.d.f. because clearly f ( x ) 0 and, from the Maclaurin’s series expansion of e λ , we have

x = 0 λ x e λ x ! = e λ x = 0 λ x x ! = e λ e λ = 1.

To discover the exact role of the parameter λ > 0 , let us find some of the characteristics of the Poisson distribution . The mean for the Poisson distribution is given by

E ( X ) = x = 0 x λ x e λ x ! = e λ x = 1 x λ x ( x 1 ) ! , because ( 0 ) f ( 0 ) = 0 and x / x ! = 1 / ( x 1 ) ! , when x > 0 .

If we let k = x 1 , then E ( X ) = e λ k = 0 λ k + 1 k ! = λ e λ k = 0 λ k k ! = λ e λ e λ = λ .

That is, the parameter λ is the mean of the Poisson distribution . On the Figure 1 is shown the p.d.f. and c.d.f. of the Poisson Distribution for λ = 1 , λ = 4 , λ = 10.

Poisson distribution

The p.d.f. function.
The c.d.f. function.
The p.d.f. and c.d.f. of the Poisson Distribution for λ = 1 , λ = 4 , λ = 10.

To find the variance, we first determine the second factorial moment E [ X ( X 1 ) ] . We have,

E [ X ( X 1 ) ] = x = 0 x ( x 1 ) λ x e λ x ! = e λ x = 2 λ x ( x 2 ) ! , because ( 0 ) ( 0 1 ) f ( 0 ) = 0, ( 1 ) ( 1 1 ) f ( 1 ) = 0 , and x ( x 1 ) / x ! = 1 / ( x 2 ) ! , when x > 1 .

If we let k = x 2 , then E [ X ( X 1 ) ] = e λ k = 0 λ k + 2 k ! = λ 2 e λ k = 0 λ x k ! = λ 2 e λ e λ = λ 2 .

Thus, V a r ( X ) = E ( X 2 ) [ E ( X ) ] 2 = E [ X ( X 1 ) ] + E ( X ) [ E ( X ) ] 2 = λ 2 + λ λ 2 = λ .

That is, for the Poisson distribution, μ = σ 2 = λ .

Let X have a Poisson distribution with a mean of λ = 5 , (it is possible to use the tabularized Poisson distribution).

P ( X 6 ) = x = 0 6 5 x e 5 x ! = 0.762 , P ( X > 5 ) = 1 P ( X 5 ) = 1 0.616 = 0.384,

and P ( X = 6 ) = P ( X 6 ) P ( X 5 ) = 0.762 0.616 = 0.146.

Got questions? Get instant answers now!

Telephone calls enter a college switchboard on the average of two every 3 minutes. If one assumes an approximate Poisson process, what is the probability of five or more calls arriving in a 9-minute period? Let X denotes the number of calls in a 9-minute period. We see that E ( X ) = 6 ; that is, on the average, sic calls will arrive during a 9-minute period. Thus using tabularized data,

P ( X 5 ) = 1 P ( X 4 ) = 1 x = 0 4 6 x e 6 x ! = 1 0.285 = 0.715.

Got questions? Get instant answers now!
Not only is the Poisson distribution important in its own right, but it can also be used to approximate probabilities for a binomial distribution.

If X has a Poisson distribution with parameter λ , we saw that with n large,

P ( X = x ) ( n x ) ( λ n ) x ( 1 λ n ) n x , where, p = λ / n so that λ = n p in the above binomial probability. That is, if X has the binomial distribution b ( n , p ) with large n , then

( n p ) x e n p x ! = ( n x ) p x ( 1 p ) n x .

This approximation is reasonably good if n is large. But since λ was fixed constant in that earlier argument, p should be small since n p = λ . In particular, the approximation is quite accurate if n 20 and p 0.05 , and it is very good if n 100 and n p 10 .

A manufacturer of Christmas tree bulbs knows that 2% of its bulbs are defective. Approximate the probability that a box of 100 of these bulbs contains at most three defective bulbs. Assuming independence, we have binomial distribution with parameters p =0.02 and n =100. The Poisson distribution with λ = 100 ( 0.02 ) = 2 gives

x = 0 3 2 x e 2 x ! = 0.857,

using the binomial distribution, we obtain, after some tedious calculations,

x = 0 3 ( 100 x ) ( 0.02 ) x ( 0.98 ) 100 x = 0.859 .

Hence, in this case, the Poisson approximation is extremely close to the true value, but much easier to find.

Got questions? Get instant answers now!

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Introduction to statistics. OpenStax CNX. Oct 09, 2007 Download for free at http://cnx.org/content/col10343/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introduction to statistics' conversation and receive update notifications?

Ask