<< Chapter < Page Chapter >> Page >
This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by EwaPaszek and Marek Kimmel. The development of this course has been supported by NSF 0203396 grant.

Continuous distribution

Random variables of the continuous type

Random variables whose spaces are not composed of a countable number of points but are intervals or a union of intervals are said to be of the continuous type . Recall that the relative frequency histogram h ( x ) associated with n observations of a random variable of that type is a nonnegative function defined so that the total area between its graph and the x axis equals one. In addition, h ( x ) is constructed so that the integral

a b h ( x ) d x

is an estimate of the probability P ( a < X < b ) , where the interval ( a , b ) is a subset of the space R of the random variable X .

Let now consider what happens to the function h ( x ) in the limit, as n increases without bound and as the lengths of the class intervals decrease to zero. It is to be hoped that h ( x ) will become closer and closer to some function, say f ( x ) , that gives the true probabilities , such as P ( a < X < b ) , through the integral

P ( a < X < b ) = a b f ( x ) d x .
PROBABILITY DENSITY FUNCTION
We say that the probability density function (p.d.f.) of the random variable X of the continuous type, with space R that is an interval or union of intervals, is an integrable function f ( x ) satisfying the following conditions:
  • f ( x ) > 0 , x belongs to R ,
  • R f ( x ) d x = 1,
  • The probability of the event A belongs to R is P ( X ) A A f ( x ) d x .

Let the random variable X be the distance in feet between bad records on a used computer tape. Suppose that a reasonable probability model for X is given by the p.d.f.

f ( x ) 1 40 e x / 40 ,0 x < .

R = ( x : 0 x < ) and f ( x ) for x belonging to R , R f ( x ) d x = 0 1 40 e x / 40 d x = lim b [ e x / 40 ] 0 b = 1 lim b e b / 40 = 1.

The probability that the distance between bad records is greater than 40 feet is

P ( X > 40 ) = 40 1 40 e x / 40 d x = e 1 = 0.368.

The p.d.f. and the probability of interest are depicted in FIG.1 .

Got questions? Get instant answers now!

The p.d.f. and the probability of interest.

We can avoid repeated references to the space R of the random variable X , one shall adopt the same convention when describing probability density function of the continuous type as was in the discrete case.

Let extend the definition of the p.d.f. f ( x ) to the entire set of real numbers by letting it equal zero when, x belongs to R . For example,

f ( x ) = { 1 40 e x / 40 0, e l s e w h e r e , ,0 x < ,

has the properties of a p.d.f. of a continuous-type random variable x having support ( x : 0 x < ) . It will always be understood that f ( x ) = 0 , when x belongs to R , even when this is not explicitly written out.

PROBABILITY DENSITY FUNCTION
For the fundamental theorem of calculus we have, for x values for which the derivative F ' ( x ) exists, that F ’( x )= f ( x ).

continuing with Example 1

If the p.d.f. of X is

f ( x ) = { 0, < x < 0, 1 40 e x / 40 ,0 x < ,

The distribution function of X is F ( x ) = 0 for x 0

F ( x ) = x f ( t ) d t = 0 x 1 40 e t / 40 d t = e t / 40 | 0 x = 1 e x / 40 .

F ( x ) = { 0, < x < 0, 1 40 e x / 40 ,0 < x < .
Got questions? Get instant answers now!

Also F ' ( 0 ) does not exist. Since there are no steps or jumps in a distribution function F ( x ) , of the continuous type, it must be true that P ( X = b ) = 0 for all real values of b . This agrees with the fact that the integral

a b f ( x ) d x is taken to be zero in calculus. Thus we see that P ( a X b ) = P ( a < X < b ) = P ( a X < b ) = P ( a < X b ) = F ( b ) F ( a ) , provided that X is a random variable of the continuous type. Moreover, we can change the definition of a p.d.f. of a random variable of the continuous type at a finite (actually countable) number of points without alerting the distribution of probability.

For illustration, f ( x ) = { 0, < x < 0, 1 40 e x / 40 ,0 x < , and f ( x ) = { 0, < x 0, 1 40 e x / 40 ,0 < x < ,

are equivalent in the computation of probabilities involving this random variable.

Let Y be a continuous random variable with the p.d.f. g ( y ) = 2 y , 0 < y < 1 . The distribution function of Y is defined by

G ( y ) = ( 0, y < 0, 1, y 1, 0 y 2 t d t = y 2 ,0 y < 1.

Figure 2 gives the graph of the p.d.f. g ( y ) and the graph of the distribution function G ( y ) .

The p.d.f. and the probability of interest.
Got questions? Get instant answers now!

For illustration of computations of probabilities, consider

P ( 1 2 < Y 3 4 ) = G ( 3 4 ) G ( 1 2 ) = ( 3 4 ) 2 ( 1 2 ) 2 = 5 16

and P ( 1 4 Y < 2 ) = G ( 2 ) G ( 1 4 ) = 1 ( 1 4 ) 2 = 15 16 .

The p.d.f. f ( x ) of a random variable of the discrete type is bounded by one because f ( x ) gives a probability, namely f ( x ) = P ( X = x ) .

For random variables of the continuous type, the p.d.f. does not have to be bounded. The restriction is that the area between the p.d.f. and the x axis must equal one. Furthermore, it should be noted that the p.d.f. of a random variable X of the continuous type does not need to be a continuous function.

For example,

f ( x ) = { 1 2, 0 < x < 1 o r 2 < x < 3, 0, e l s e w h e r e ,

enjoys the properties of a p.d.f. of a distribution of the continuous type, and yet f ( x ) had discontinuities at x = 0,1,2, and 3. However, the distribution function associates with a distribution of the continuous type is always a continuous function. For continuous type random variables, the definitions associated with mathematical expectation are the same as those in the discrete case except that integrals replace summations.

FOR ILLUSTRATION , let X be a random variable with a p.d.f. f ( x ) . The expected value of X or mean of X is

μ = E ( X ) = x f ( x ) d x . The variance of X is σ 2 = V a r ( X ) = ( x μ ) 2 f ( x ) d x .

The standard deviation of X is σ = V a r ( X ) .

For the random variable Y in the Example 3 .

μ = E ( Y ) = 0 1 y ( 2 y ) d y = [ ( 2 3 y 3 ) ] 0 1 = 2 3 and

σ 2 = V a r ( Y ) = E ( Y 2 ) μ 2 = 0 1 y 2 ( 2 y ) d y ( 2 3 ) 2 = [ ( 1 2 y 4 ) ] 0 1 4 9 = 1 18 .

Got questions? Get instant answers now!

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Introduction to statistics. OpenStax CNX. Oct 09, 2007 Download for free at http://cnx.org/content/col10343/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introduction to statistics' conversation and receive update notifications?

Ask