This module introduces the contingency table as a way of determining conditional probabilities.

A contingency table provides a way of portraying data that can facilitate calculating probabilities. The table helps in determining conditional probabilities quite easily. The table displays sample values in relation to two different variables that may be dependent or contingent on one another. Later on, we will use contingency tables again, but in another manner.

Suppose a study of speeding violations and drivers who use car phones produced the following fictional data:

| | Speeding violation in the last year | No speeding violation in the last year | Total |
| --- | --- | --- | --- |
| Car phone user | 25 | 280 | 305 |
| Not a car phone user | 45 | 405 | 450 |
| Total | 70 | 685 | 755 |

The total number of people in the sample is 755. The row totals are 305 and 450. The column totals are 70 and 685. Notice that 305 + 450 = 755 and 70 + 685 = 755.
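The totals can be checked with a short script. This is a minimal sketch, assuming the table is stored as a nested dictionary; the names `counts`, `row_totals`, `col_totals`, and `grand_total` are illustrative and not part of the original module.

```python
# Cell counts from the fictional speeding / car phone study.
# Outer keys are the rows (phone use); inner keys are the columns (violation status).
counts = {
    "phone user": {"violation": 25, "no violation": 280},
    "not phone user": {"violation": 45, "no violation": 405},
}

# Row totals: add across each row.
row_totals = {row: sum(cells.values()) for row, cells in counts.items()}

# Column totals: add down each column.
col_totals = {
    col: sum(cells[col] for cells in counts.values())
    for col in ("violation", "no violation")
}

# The grand total can be reached from either set of totals.
grand_total = sum(row_totals.values())

print(row_totals)
print(col_totals)
print(grand_total)
```

Both the row totals and the column totals should add to the same grand total; if they do not, a cell has been entered incorrectly.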

Calculate the following probabilities using the table:

P(person is a car phone user) = (number of car phone users) / (total number in study) = 305/755

P(person had no violation in the last year) = (number that had no violation) / (total number in study) = 685/755

P(person had no violation in the last year AND was a car phone user) = 280/755

P(person is a car phone user OR person had no violation in the last year) = (305/755 + 685/755) - 280/755 = 710/755

P(person is a car phone user GIVEN person had a violation in the last year) = 25/70 (The sample space is reduced to the 70 persons who had a violation.)

P(person had no violation last year GIVEN person was not a car phone user) = 405/450 (The sample space is reduced to the 450 persons who were not car phone users.)
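The six probabilities above can be reproduced with exact fractions. The following is a sketch only, using Python's standard `fractions` module; all variable names are assumptions chosen for illustration.

```python
from fractions import Fraction

# Cell counts from the table (fictional data).
user_viol, user_no_viol = 25, 280
nonuser_viol, nonuser_no_viol = 45, 405

# Marginal totals.
users = user_viol + user_no_viol              # 305
nonusers = nonuser_viol + nonuser_no_viol     # 450
viol = user_viol + nonuser_viol               # 70
no_viol = user_no_viol + nonuser_no_viol      # 685
total = users + nonusers                      # 755

# Simple and joint probabilities use the whole sample as the denominator.
p_user = Fraction(users, total)
p_no_viol = Fraction(no_viol, total)
p_user_and_no_viol = Fraction(user_no_viol, total)

# Addition rule: P(A OR B) = P(A) + P(B) - P(A AND B).
p_user_or_no_viol = p_user + p_no_viol - p_user_and_no_viol

# Conditional probabilities use a reduced sample space as the denominator.
p_user_given_viol = Fraction(user_viol, viol)
p_no_viol_given_nonuser = Fraction(nonuser_no_viol, nonusers)

print(p_user, p_no_viol, p_user_and_no_viol)
print(p_user_or_no_viol, p_user_given_viol, p_no_viol_given_nonuser)
```

Note that `Fraction` reduces to lowest terms, so 305/755 is displayed as 61/151 and 25/70 as 5/14; the values are the same.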

The following table shows a random sample of 100 hikers and the areas of hiking preferred:

Hiking area preference

| Sex | The Coastline | Near Lakes and Streams | On Mountain Peaks | Total |
| --- | --- | --- | --- | --- |
| Female | 18 | 16 | ___ | 45 |
| Male | ___ | ___ | 14 | 55 |
| Total | ___ | 41 | ___ | ___ |

Complete the table.

Hiking area preference

| Sex | The Coastline | Near Lakes and Streams | On Mountain Peaks | Total |
| --- | --- | --- | --- | --- |
| Female | 18 | 16 | 11 | 45 |
| Male | 16 | 25 | 14 | 55 |
| Total | 34 | 41 | 25 | 100 |
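Each blank follows from the fact that every row and column must add to its total. One possible order of steps is sketched below; the variable names are illustrative assumptions, and other orders of filling in the blanks work equally well.

```python
# Values given in the incomplete table.
female_total, male_total = 45, 55
female_coast, female_lakes = 18, 16
male_peaks = 14
lakes_total = 41

# Fill each blank from a row or column that has only one unknown.
female_peaks = female_total - female_coast - female_lakes   # Female row
male_lakes = lakes_total - female_lakes                     # Lakes column
male_coast = male_total - male_lakes - male_peaks           # Male row

# Column totals and the grand total.
coast_total = female_coast + male_coast
peaks_total = female_peaks + male_peaks
grand_total = female_total + male_total

print(female_peaks, male_coast, male_lakes)
print(coast_total, peaks_total, grand_total)
```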

Are the events "being female" and "preferring the coastline" independent events?

Let F = being female and let C = preferring the coastline.

  • P(F AND C) =
  • P(F) P(C) =

Are these two numbers the same? If they are, then F and C are independent. If they are not, then F and C are not independent.

  • P(F AND C) = 18/100 = 0.18
  • P(F) P(C) = (45/100)(34/100) = (0.45)(0.34) = 0.153

P(F AND C) ≠ P(F) P(C), so the events F and C are not independent.
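The comparison can be carried out with exact fractions so that rounding does not affect the result. This is a minimal sketch using the counts from the completed hiker table; the variable names are assumptions for illustration.

```python
from fractions import Fraction

n = 100  # total number of hikers in the sample

p_f = Fraction(45, n)         # P(F): female
p_c = Fraction(34, n)         # P(C): prefers the coastline
p_f_and_c = Fraction(18, n)   # P(F AND C): female and prefers the coastline

# F and C are independent only if P(F AND C) equals P(F) * P(C).
product = p_f * p_c
independent = (p_f_and_c == product)

print(p_f_and_c, product, independent)
```

Here `product` equals 153/1000 (0.153) while `p_f_and_c` equals 18/100 (0.18), so `independent` is `False`.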

Find the probability that a person is male given that the person prefers hiking near lakes and streams. Let M = being male and let L = prefers hiking near lakes and streams.

  • What word tells you this is a conditional?
  • Fill in the blanks and calculate the probability: P(___|___) = ___ .
  • Is the sample space for this problem all 100 hikers? If not, what is it?
  • The word 'given' tells you that this is a conditional.
  • P(M|L) = 25/41
  • No, the sample space for this problem is the 41 hikers who prefer hiking near lakes and streams.
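The same answer can be reached two ways: by counting within the reduced sample space, or by dividing P(M AND L) by P(L). The sketch below, with illustrative variable names, shows that both routes agree.

```python
from fractions import Fraction

male_lakes = 25    # males who prefer lakes and streams
lakes_total = 41   # all hikers who prefer lakes and streams
n = 100            # all hikers in the sample

# Route 1: count within the reduced sample space of 41 hikers.
p_m_given_l = Fraction(male_lakes, lakes_total)

# Route 2: P(M|L) = P(M AND L) / P(L), using the full sample of 100.
p_m_and_l = Fraction(male_lakes, n)
p_l = Fraction(lakes_total, n)
p_m_given_l_by_formula = p_m_and_l / p_l

print(p_m_given_l, p_m_given_l_by_formula)
```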

Find the probability that a person is female or prefers hiking on mountain peaks. Let F = being female and let P = prefers mountain peaks.

  • P(F) =
  • P(P) =
  • P(F AND P) =
  • Therefore, P(F OR P) =
  • P(F) = 45/100
  • P(P) = 25/100
  • P(F AND P) = 11/100
  • P(F OR P) = 45/100 + 25/100 - 11/100 = 59/100
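The addition rule can be checked against a direct count of the hikers who are female, prefer mountain peaks, or both. This is a sketch under the counts in the completed table; the variable names are illustrative.

```python
from fractions import Fraction

n = 100
female_total = 45   # all females
peaks_total = 25    # all hikers who prefer mountain peaks
female_peaks = 11   # females who prefer mountain peaks
male_peaks = 14     # males who prefer mountain peaks

# Addition rule: subtract the overlap so it is not counted twice.
p_f_or_p = (Fraction(female_total, n)
            + Fraction(peaks_total, n)
            - Fraction(female_peaks, n))

# Direct count: every female, plus the males who prefer peaks.
p_direct = Fraction(female_total + male_peaks, n)

print(p_f_or_p, p_direct)
```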

Muddy Mouse lives in a cage with 3 doors. If Muddy goes out the first door, the probability that he gets caught by Alissa the cat is 1/5 and the probability he is not caught is 4/5. If he goes out the second door, the probability he gets caught by Alissa is 1/4 and the probability he is not caught is 3/4. The probability that Alissa catches Muddy coming out of the third door is 1/2 and the probability she does not catch Muddy is 1/2. It is equally likely that Muddy will choose any of the three doors, so the probability of choosing each door is 1/3.

Door choice

| Caught or Not | Door One | Door Two | Door Three | Total |
| --- | --- | --- | --- | --- |
| Caught | 1/15 | 1/12 | 1/6 | ____ |
| Not Caught | 4/15 | 3/12 | 1/6 | ____ |
| Total | ____ | ____ | ____ | 1 |

  • The first entry 1/15 = (1/5)(1/3) is P(Door One AND Caught).
  • The entry 4/15 = (4/5)(1/3) is P(Door One AND Not Caught).

Verify the remaining entries.

Complete the probability contingency table. Calculate the entries for the totals. Verify that the lower-right corner entry is 1.

Door choice

| Caught or Not | Door One | Door Two | Door Three | Total |
| --- | --- | --- | --- | --- |
| Caught | 1/15 | 1/12 | 1/6 | 19/60 |
| Not Caught | 4/15 | 3/12 | 1/6 | 41/60 |
| Total | 5/15 | 4/12 | 2/6 | 1 |
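Every cell of the probability table is the product of the door probability and the conditional probability of the outcome at that door. The sketch below rebuilds the table with exact fractions; the dictionary layout and names are assumptions made for illustration.

```python
from fractions import Fraction

p_door = Fraction(1, 3)  # each door is equally likely

# P(Caught | door) as stated in the problem.
p_caught_given_door = {
    "one": Fraction(1, 5),
    "two": Fraction(1, 4),
    "three": Fraction(1, 2),
}

# Joint probabilities: P(door AND outcome) = P(door) * P(outcome | door).
caught = {d: p_door * p for d, p in p_caught_given_door.items()}
not_caught = {d: p_door * (1 - p) for d, p in p_caught_given_door.items()}

# Row totals, column totals, and the lower-right corner.
p_caught = sum(caught.values())
p_not_caught = sum(not_caught.values())
col_totals = {d: caught[d] + not_caught[d] for d in caught}
corner = p_caught + p_not_caught

print(p_caught, p_not_caught, corner)
```

Each column total reduces to 1/3, which matches 5/15, 4/12, and 2/6 in the table and is the probability of choosing that door.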

What is the probability that Alissa does not catch Muddy?

41/60

What is the probability that Muddy chooses Door One OR Door Two given that Muddy is caught by Alissa?

(1/15 + 1/12) / (19/60) = (9/60) / (19/60) = 9/19
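Because Door One and Door Two cannot both be chosen, their joint probabilities with "Caught" can be added and then divided by P(Caught). A minimal sketch of that calculation, with illustrative variable names, is:

```python
from fractions import Fraction

p_door = Fraction(1, 3)  # each door is equally likely

# Joint probabilities from the "Caught" row of the table.
p_one_and_caught = p_door * Fraction(1, 5)     # 1/15
p_two_and_caught = p_door * Fraction(1, 4)     # 1/12
p_three_and_caught = p_door * Fraction(1, 2)   # 1/6

# P(Caught) is the total of the "Caught" row.
p_caught = p_one_and_caught + p_two_and_caught + p_three_and_caught

# The sample space is reduced to the outcomes where Muddy is caught.
p_one_or_two_given_caught = (p_one_and_caught + p_two_and_caught) / p_caught

print(p_one_or_two_given_caught)
```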

You could also do this problem by using a probability tree. See the Tree Diagrams (Optional) section of this chapter for examples.

Source:  OpenStax, Collaborative statistics using spreadsheets. OpenStax CNX. Jan 05, 2016 Download for free at http://legacy.cnx.org/content/col11521/1.23