12.1 Covariance and the correlation coefficient


The mean value and the variance give important information about the distribution for a real random variable X. We consider the expectation of an appropriate function of a pair (X, Y) which gives useful information about their joint distribution. This is the covariance function.

Covariance and the correlation coefficient

The mean value $μ_{X} = E [X]$ and the variance $σ_{X}^{2} = E [{(X - μ_{X})}^{2}]$ give important information about the distribution for a real random variable X. Can the expectation of an appropriate function of $(X, Y)$ give useful information about the joint distribution? A clue to one possibility is given in the expression

Var [X \pm Y] = Var [X] + Var [Y] \pm 2 (E [X Y] - E [X] E [Y])
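The identity can be checked numerically on a small example. The sketch below uses a hypothetical joint distribution on four points and exact rational arithmetic; the names `pmf`, `expect`, and `var` are illustrative helpers, not notation from the text.

```python
from fractions import Fraction as F

# Hypothetical joint pmf for illustration: {(x, y): P(X = x, Y = y)}
pmf = {(0, 0): F(1, 4), (0, 1): F(1, 8), (1, 0): F(1, 8), (1, 1): F(1, 2)}

def expect(g, pmf):
    """E[g(X, Y)] for a finite joint pmf."""
    return sum(p * g(x, y) for (x, y), p in pmf.items())

def var(g, pmf):
    """Var[g(X, Y)] = E[g^2] - (E[g])^2."""
    m = expect(g, pmf)
    return expect(lambda x, y: g(x, y) ** 2, pmf) - m ** 2

# The cross term E[XY] - E[X]E[Y]
cross = (expect(lambda x, y: x * y, pmf)
         - expect(lambda x, y: x, pmf) * expect(lambda x, y: y, pmf))
var_x = var(lambda x, y: x, pmf)
var_y = var(lambda x, y: y, pmf)

# Left and right sides of Var[X +/- Y] = Var[X] + Var[Y] +/- 2 * cross
lhs_plus = var(lambda x, y: x + y, pmf)
lhs_minus = var(lambda x, y: x - y, pmf)
rhs_plus = var_x + var_y + 2 * cross
rhs_minus = var_x + var_y - 2 * cross

print(lhs_plus == rhs_plus, lhs_minus == rhs_minus)
```

Because the arithmetic is exact, the two sides agree exactly rather than up to rounding.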

The expression $E [X Y] - E [X] E [Y]$ vanishes if the pair is independent (and in some other cases). We note also that for $μ_{X} = E [X]$ and $μ_{Y} = E [Y]$

E [(X - μ_{X}) (Y - μ_{Y})] = E [X Y] - μ_{X} μ_{Y}

To see this, expand the expression $(X - μ_{X}) (Y - μ_{Y})$ and use linearity to get

E [(X - μ_{X}) (Y - μ_{Y})] = E [X Y - μ_{Y} X - μ_{X} Y + μ_{X} μ_{Y}] = E [X Y] - μ_{Y} E [X] - μ_{X} E [Y] + μ_{X} μ_{Y}

which reduces directly to the desired expression. Now for a given ω, $X (ω) - μ_{X}$ is the variation of X from its mean and $Y (ω) - μ_{Y}$ is the variation of Y from its mean. For this reason, the following terminology is used.

Definition. The quantity $Cov [X, Y] = E [(X - μ_{X}) (Y - μ_{Y})]$ is called the covariance of X and Y.

If we let $X^{'} = X - μ_{X}$ and $Y^{'} = Y - μ_{Y}$ be the centered random variables, then

Cov [X, Y] = E [X^{'} Y^{'}]

Note that the variance of X is the covariance of X with itself.
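As a rough illustration, the sketch below computes the covariance both from the centered definition and from the shortcut $E [X Y] - μ_{X} μ_{Y}$. The example distributions are hypothetical: X uniform on {-1, 0, 1} paired with Y = X squared (a dependent pair whose covariance still vanishes), and X paired with itself. The function names are illustrative only.

```python
from fractions import Fraction as F

def expect(g, pmf):
    """E[g(X, Y)] for a finite joint pmf {(x, y): probability}."""
    return sum(p * g(x, y) for (x, y), p in pmf.items())

def cov_centered(pmf):
    """Cov[X, Y] = E[(X - mu_X)(Y - mu_Y)]."""
    mx = expect(lambda x, y: x, pmf)
    my = expect(lambda x, y: y, pmf)
    return expect(lambda x, y: (x - mx) * (y - my), pmf)

def cov_shortcut(pmf):
    """Cov[X, Y] = E[XY] - mu_X * mu_Y."""
    mx = expect(lambda x, y: x, pmf)
    my = expect(lambda x, y: y, pmf)
    return expect(lambda x, y: x * y, pmf) - mx * my

# Hypothetical example: X uniform on {-1, 0, 1} and Y = X^2 (dependent pair)
dependent = {(-1, 1): F(1, 3), (0, 0): F(1, 3), (1, 1): F(1, 3)}
# The pair (X, X): its covariance is the variance of X
diagonal = {(-1, -1): F(1, 3), (0, 0): F(1, 3), (1, 1): F(1, 3)}

cov_dep = cov_centered(dependent)
cov_diag = cov_centered(diagonal)

# Dependence check: P(X = 0, Y = 0) differs from P(X = 0) * P(Y = 0)
p_joint = dependent[(0, 0)]
p_product = F(1, 3) * F(1, 3)

print(cov_dep, cov_diag, p_joint != p_product)
```

The first pair shows one of the "other cases" in which $E [X Y] - E [X] E [Y]$ vanishes without independence.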

If we standardize, with $X^{*} = (X - μ_{X}) / σ_{X}$ and $Y^{*} = (Y - μ_{Y}) / σ_{Y}$ , we have

Definition. The correlation coefficient $ρ = ρ [X, Y]$ is the quantity

ρ [X, Y] = E [X^{*} Y^{*}] = \frac{E [(X - μ_{X}) (Y - μ_{Y})]}{σ_{X} σ_{Y}}

Thus $ρ = Cov [X, Y] / (σ_{X} σ_{Y})$. We examine these concepts for information on the joint distribution. By Schwarz' inequality (E15), we have

ρ^{2} = E^{2} [X^{*} Y^{*}] \leq E [{(X^{*})}^{2}] E [{(Y^{*})}^{2}] = 1 \quad \text{with equality iff} \quad Y^{*} = c X^{*}

Now if $Y^{*} = c X^{*}$, then $ρ = E [X^{*} Y^{*}] = c E [{(X^{*})}^{2}] = c$, so equality holds iff

1 = ρ^{2} = c^{2} E^{2} [{(X^{*})}^{2}] = c^{2} \quad \text{which implies} \quad c = \pm 1 \text{ and } ρ = \pm 1

We conclude $- 1 \leq ρ \leq 1$, with $ρ = \pm 1$ iff $Y^{*} = \pm X^{*}$.
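The bounds on ρ can be illustrated with a short computation. This is a minimal sketch on hypothetical discrete distributions: two pairs in which Y is an exact linear function of X (increasing and decreasing), and one pair that is not linearly related. The helper names `expect` and `rho` are illustrative.

```python
import math
from fractions import Fraction as F

def expect(g, pmf):
    """E[g(X, Y)] for a finite joint pmf {(x, y): probability}."""
    return sum(p * g(x, y) for (x, y), p in pmf.items())

def rho(pmf):
    """Correlation coefficient Cov[X, Y] / (sigma_X * sigma_Y)."""
    mx = expect(lambda x, y: x, pmf)
    my = expect(lambda x, y: y, pmf)
    cov = expect(lambda x, y: (x - mx) * (y - my), pmf)
    var_x = expect(lambda x, y: (x - mx) ** 2, pmf)
    var_y = expect(lambda x, y: (y - my) ** 2, pmf)
    return float(cov) / math.sqrt(float(var_x) * float(var_y))

# Hypothetical marginal distribution for X
marginal = {0: F(1, 5), 1: F(3, 10), 3: F(1, 2)}

# Y = 2X + 1 (increasing line) and Y = -3X + 4 (decreasing line)
line_up = {(x, 2 * x + 1): p for x, p in marginal.items()}
line_down = {(x, -3 * x + 4): p for x, p in marginal.items()}
# A pair whose mass is not concentrated on a single line
mixed = {(0, 0): F(1, 4), (0, 1): F(1, 8), (1, 0): F(1, 8), (1, 1): F(1, 2)}

rho_up = rho(line_up)
rho_down = rho(line_down)
rho_mixed = rho(mixed)

print(rho_up, rho_down, rho_mixed)
```

In the two linear cases the standardized variables satisfy $Y^{*} = \pm X^{*}$, so ρ comes out as ±1 up to floating-point rounding; the third case gives a value strictly inside the interval.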

Relationship between ρ and the joint distribution

We consider first the distribution for the standardized pair $(X^{*}, Y^{*})$. Since

P (X^{*} \leq r, Y^{*} \leq s) = P (\frac{X - μ_{X}}{σ_{X}} \leq r, \frac{Y - μ_{Y}}{σ_{Y}} \leq s) = P (X \leq t = σ_{X} r + μ_{X}, Y \leq u = σ_{Y} s + μ_{Y})

we obtain the results for the distribution for $(X, Y)$ by the mapping

t = σ_{X} r + μ_{X} \qquad u = σ_{Y} s + μ_{Y}

Joint distribution for the standardized variables $(X^{*}, Y^{*})$ , $(r, s) = (X^{*}, Y^{*}) (ω)$

$ρ = 1$ iff $X^{*} = Y^{*}$ iff all probability mass is on the line $s = r$ .
$ρ = - 1$ iff $X^{*} = - Y^{*}$ iff all probability mass is on the line $s = - r$ .

If $- 1 < ρ < 1$ , then at least some of the mass must fail to be on these lines.

Figure 1. Distance from the point $(r, s)$ to the line $s = r$. The segment from $(r, r)$ to $(r, s)$ has length $|s - r|$, and the perpendicular distance from $(r, s)$ to the line is $|s - r| / \sqrt{2}$.

The $ρ = \pm 1$ lines for the $(X, Y)$ distribution are:

\frac{u - μ_{Y}}{σ_{Y}} = \pm \frac{t - μ_{X}}{σ_{X}} \quad \text{or} \quad u = \pm \frac{σ_{Y}}{σ_{X}} (t - μ_{X}) + μ_{Y}

Consider $Z = Y^{*} - X^{*}$. Then $E [\frac{1}{2} Z^{2}] = \frac{1}{2} E [{(Y^{*} - X^{*})}^{2}]$. The figure showing the distance from $(r, s)$ to the line $s = r$ makes clear that this is the average of the squares of the distances of the points $(r, s) = (X^{*}, Y^{*}) (ω)$ from the line $s = r$ (i.e., the variance about the line $s = r$), since each squared distance is ${(s - r)}^{2} / 2$. Similarly for $W = Y^{*} + X^{*}$, $E [W^{2} / 2]$ is the variance about $s = - r$. Now

\frac{1}{2} E [{(Y^{*} \pm X^{*})}^{2}] = \frac{1}{2} \{E [{(Y^{*})}^{2}] + E [{(X^{*})}^{2}] \pm 2 E [X^{*} Y^{*}]\} = 1 \pm ρ

Thus

$1 - ρ$ is the variance about $s = r$ (the $ρ = 1$ line)
$1 + ρ$ is the variance about $s = - r$ (the $ρ = - 1$ line)

Now since

E [{(Y^{*} - X^{*})}^{2}] = E [{(Y^{*} + X^{*})}^{2}] \quad \text{iff} \quad ρ = E [X^{*} Y^{*}] = 0

the condition $ρ = 0$ is the condition for equality of the two variances.
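The relation between ρ and the variances about the two lines can be checked numerically. The sketch below standardizes a hypothetical joint distribution and compares $\frac{1}{2} E [{(Y^{*} \mp X^{*})}^{2}]$ with $1 \mp ρ$; the distribution and all names in the code are assumptions made for illustration.

```python
import math

# Hypothetical joint pmf for illustration: {(x, y): P(X = x, Y = y)}
pmf = {(0, 0): 0.25, (0, 1): 0.125, (1, 0): 0.125, (1, 1): 0.5}

def expect(g):
    """E[g(X, Y)] under the joint pmf above."""
    return sum(p * g(x, y) for (x, y), p in pmf.items())

mu_x = expect(lambda x, y: x)
mu_y = expect(lambda x, y: y)
sigma_x = math.sqrt(expect(lambda x, y: (x - mu_x) ** 2))
sigma_y = math.sqrt(expect(lambda x, y: (y - mu_y) ** 2))

def xs(x):
    """Standardized value X* = (X - mu_X) / sigma_X."""
    return (x - mu_x) / sigma_x

def ys(y):
    """Standardized value Y* = (Y - mu_Y) / sigma_Y."""
    return (y - mu_y) / sigma_y

rho = expect(lambda x, y: xs(x) * ys(y))

# Mean squared distance of the standardized points from each line
var_about_s_eq_r = 0.5 * expect(lambda x, y: (ys(y) - xs(x)) ** 2)
var_about_s_eq_minus_r = 0.5 * expect(lambda x, y: (ys(y) + xs(x)) ** 2)

print(rho, var_about_s_eq_r, var_about_s_eq_minus_r)
```

For this example ρ is positive, so the mass lies closer on average to the line $s = r$ than to the line $s = - r$.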

Transformation to the $(X, Y)$ plane

t = σ_{X} r + μ_{X} \qquad u = σ_{Y} s + μ_{Y} \qquad r = \frac{t - μ_{X}}{σ_{X}} \qquad s = \frac{u - μ_{Y}}{σ_{Y}}

The $ρ = 1$ line is:

\frac{u - μ_{Y}}{σ_{Y}} = \frac{t - μ_{X}}{σ_{X}} \quad \text{or} \quad u = \frac{σ_{Y}}{σ_{X}} (t - μ_{X}) + μ_{Y}
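A brief sketch of the transformation follows, under the assumption of a hypothetical pair with Y = 2X + 1, so that ρ = 1. It checks that every mass point lies on the line $u = \frac{σ_{Y}}{σ_{X}} (t - μ_{X}) + μ_{Y}$ in the $(t, u)$ plane and on $s = r$ after standardization. The function names are illustrative only.

```python
import math

# Hypothetical example: X takes values 0, 1, 3 and Y = 2X + 1, so rho = 1
pmf = {(0, 1): 0.2, (1, 3): 0.3, (3, 7): 0.5}

def expect(g):
    """E[g(X, Y)] under the joint pmf above."""
    return sum(p * g(x, y) for (x, y), p in pmf.items())

mu_x = expect(lambda x, y: x)
mu_y = expect(lambda x, y: y)
sigma_x = math.sqrt(expect(lambda x, y: (x - mu_x) ** 2))
sigma_y = math.sqrt(expect(lambda x, y: (y - mu_y) ** 2))

def to_standard(t, u):
    """Map (t, u) in the (X, Y) plane to (r, s) in the standardized plane."""
    return (t - mu_x) / sigma_x, (u - mu_y) / sigma_y

def to_original(r, s):
    """Map (r, s) in the standardized plane back to (t, u)."""
    return sigma_x * r + mu_x, sigma_y * s + mu_y

def rho_one_line(t):
    """The rho = 1 line in the (X, Y) plane."""
    return (sigma_y / sigma_x) * (t - mu_x) + mu_y

# Every mass point should sit on the rho = 1 line in both coordinate systems
on_line = all(math.isclose(rho_one_line(x), y) for (x, y) in pmf)
on_diagonal = all(math.isclose(*to_standard(x, y)) for (x, y) in pmf)
round_trip = to_original(*to_standard(3, 7))

print(on_line, on_diagonal, round_trip)
```

Here the slope $σ_{Y} / σ_{X}$ equals 2 and the line reduces to u = 2t + 1, as expected for this example.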


Source: OpenStax, Applied probability. OpenStax CNX. Aug 31, 2009 Download for free at http://cnx.org/content/col10708/1.6
