# 11.1 Comparing two independent population means with unknown

 Page 1 / 4
This module provides an overview of Comparing Two Independent Population Means with Unknown Population Standard Deviations as a part of Collaborative Statistics collection (col10522) by Barbara Illowsky and Susan Dean.
1. The two independent samples are simple random samples from two distinct populations.
2. Both populations are normally distributed with the population means and standard deviations unknown unless the sample sizes are greater than 30. In that case, the populations need not be normally distributed.
The test comparing two independent population means with unknown and possibly unequal population standard deviations is called the Aspin-Welch t-test. The degrees of freedom formula was developed by Aspin-Welch.

The comparison of two population means is very common. A difference between the two samples depends on both the means and the standard deviations. Verydifferent means can occur by chance if there is great variation among the individual samples. In order to account for the variation, we take the difference of the samplemeans, $\overline{{X}_{1}}$ - $\overline{{X}_{2}}$ , and divide by the standard error (shown below) in order to standardize the difference. The result is a t-score test statistic (shown below).

Because we do not know the population standard deviations, we estimate them using the two sample standard deviations from our independent samples. For thehypothesis test, we calculate the estimated standard deviation, or standard error , of the difference in sample means , $\overline{{X}_{1}}$ - $\overline{{X}_{2}}$ .

## The standard error is:

$\sqrt{\frac{\left({S}_{1}{\right)}^{2}}{{n}_{1}}+\frac{\left({S}_{2}{\right)}^{2}}{{n}_{2}}}$

The test statistic (t-score) is calculated as follows:

## T-score

$\phantom{\rule{12pt}{0ex}}\frac{\left(\overline{{x}_{1}}-\overline{{x}_{2}}\right)-\left({\mu }_{1}-{\mu }_{2}\right)}{\sqrt{\frac{\left({S}_{1}{\right)}^{2}}{{n}_{1}}+\frac{\left({S}_{2}{\right)}^{2}}{{n}_{2}}}}$

## Where:

• ${s}_{1}$ and ${s}_{2}$ , the sample standard deviations, are estimates of ${\sigma }_{1}$ and ${\sigma }_{2}$ , respectively.
• ${\sigma }_{1}$ and ${\sigma }_{2}$ are the unknown population standard deviations.
• $\overline{{x}_{1}}$ and $\overline{{x}_{2}}$ are the sample means. ${\mu }_{1}$ and ${\mu }_{2}$ are the population means.

The degrees of freedom (df) is a somewhat complicated calculation. However, a computer or calculator calculates it easily. The dfs are not always a whole number. The test statisticcalculated above is approximated by the student's-t distribution with dfs as follows:

## Degrees of freedom

$\mathrm{df}=\frac{{\left[\frac{{\left({s}_{1}\right)}^{2}}{{n}_{1}}+\frac{{\left({s}_{2}\right)}^{2}}{{n}_{2}}\right]}^{2}}{\frac{1}{{n}_{1}-1}·{\left[\frac{{\left({s}_{1}\right)}^{2}}{{n}_{1}}\right]}^{2}+\frac{1}{{n}_{2}-1}·{\left[\frac{{\left({s}_{2}\right)}^{2}}{{n}_{2}}\right]}^{2}}$

When both sample sizes ${n}_{1}$ and ${n}_{2}$ are five or larger, the student's-t approximation is very good. Notice that the sample variances ${{s}_{1}}^{2}$ and ${{s}_{2}}^{2}$ are not pooled. (If the question comes up, do not pool the variances.)

It is not necessary tocompute this by hand. A calculatoror computer easily computes it.

## Independent groups

The average amount of time boys and girls ages 7 through 11 spend playing sports each day is believed to be the same. Anexperiment is done, data is collected, resulting in the table below. Both populations have a normal distribution.

Sample Size Average Number of Hours Playing Sports Per Day Sample Standard Deviation
Girls 9 2 hours $\sqrt{0.75}$
Boys 16 3.2 hours 1.00

Is there a difference in the mean amount of time boys and girls ages 7 through 11 play sports each day? Test at the 5% level of significance.

The population standard deviations are not known. Let $g$ be the subscript for girls and $b$ be the subscript for boys. Then, ${\mu }_{g}$ is the population mean for girls and ${\mu }_{b}$ is the population mean for boys. This is a test of two independent groups , two population means .

Random variable : $\overline{{X}_{g}}-\overline{{X}_{b}}$ = difference in the sample mean amount of time girls and boys play sports each day.

${H}_{o}$ : ${\mu }_{g}={\mu }_{b}\phantom{\rule{50pt}{0ex}}{\mu }_{g}-{\mu }_{b}=0$

${H}_{a}$ : ${\mu }_{g}\ne {\mu }_{b}\phantom{\rule{50pt}{0ex}}{\mu }_{g}-{\mu }_{b}\ne 0$

The words "the same" tell you ${H}_{o}$ has an "=". Since there are no other words to indicate ${H}_{a}$ , then assume "is different." This is a two-tailed test.

Distribution for the test: Use ${t}_{\mathrm{df}}$ where $\mathrm{df}$ is calculated using the $\mathrm{df}$ formula for independent groups, two population means. Using a calculator, $\mathrm{df}$ is approximately 18.8462. Do not pool the variances.

Calculate the p-value using a student's-t distribution: p-value = 0.0054

Graph:

${s}_{g}=\sqrt{0.75}$

${s}_{b}=1$

So, $\overline{{x}_{g}}-\overline{{x}_{b}}=2-3.2=-1.2$

Half the p-value is below -1.2 andhalf is above 1.2.

Make a decision: Since $\alpha >$ p-value, reject ${H}_{o}$ .

This means you reject ${\mu }_{g}={\mu }_{b}$ . The means are different.

Conclusion: At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the mean number of hours that girls and boys aged 7through 11 play sports per day is different (mean number of hours boys aged 7 through 11 play sports per day is greater than the mean number of hours played by girls OR the mean number of hours girls aged 7 through 11 play sports per day is greater than the mean number of hours played by boys).

TI-83+ and TI-84: Press STAT . Arrow over to TESTS and press 4:2-SampTTest . Arrow over to Stats and press ENTER . Arrow down and enter 2 for the first sample mean,  $\sqrt{0.75}$  for Sx1, 9 for n1, 3.2 for the second sample mean, 1 for Sx2, and 16 for n2. Arrow down to μ1: and arrow to does not equal μ2. Press ENTER . Arrow down to Pooled: and No . Press ENTER . Arrow down to Calculate and press ENTER . The p-value is p = 0.0054, the dfs are approximately 18.8462, and the teststatistic is -3.14. Do the procedure again but instead of Calculate do Draw.

A study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples11 graduates. Their average is 4 math classes with a standard deviation of 1.5 math classes. College B samples 9 graduates. Their average is 3.5 math classes with astandard deviation of 1 math class. The community group believes that a student who graduates from college A has taken more math classes, on the average. Both populations have a normal distribution. Test at a 1% significance level.Answer the following questions.

Is this a test of two means or two proportions?

two means

Are the populations standard deviations known or unknown?

unknown

Which distribution do you use to perform the test?

student's-t

What is the random variable?

$\overline{{X}_{A}}-\overline{{X}_{B}}$

What are the null and alternate hypothesis?

• ${H}_{o}:{\mu }_{A}\le {\mu }_{B}$
• ${H}_{a}:{\mu }_{A}>{\mu }_{B}$

Is this test right, left, or two tailed?

right

What is the p-value?

0.1928

Do you reject or not reject the null hypothesis?

Do not reject.

## Conclusion:

At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from college A hastaken more math classes, on the average, than a student who graduates from college B.

what is Nano technology ?
write examples of Nano molecule?
Bob
The nanotechnology is as new science, to scale nanometric
brayan
nanotechnology is the study, desing, synthesis, manipulation and application of materials and functional systems through control of matter at nanoscale
Damian
Is there any normative that regulates the use of silver nanoparticles?
what king of growth are you checking .?
Renato
What fields keep nano created devices from performing or assimulating ? Magnetic fields ? Are do they assimilate ?
why we need to study biomolecules, molecular biology in nanotechnology?
?
Kyle
yes I'm doing my masters in nanotechnology, we are being studying all these domains as well..
why?
what school?
Kyle
biomolecules are e building blocks of every organics and inorganic materials.
Joe
anyone know any internet site where one can find nanotechnology papers?
research.net
kanaga
sciencedirect big data base
Ernesto
Introduction about quantum dots in nanotechnology
what does nano mean?
nano basically means 10^(-9). nanometer is a unit to measure length.
Bharti
do you think it's worthwhile in the long term to study the effects and possibilities of nanotechnology on viral treatment?
absolutely yes
Daniel
how to know photocatalytic properties of tio2 nanoparticles...what to do now
it is a goid question and i want to know the answer as well
Maciej
Abigail
for teaching engĺish at school how nano technology help us
Anassong
Do somebody tell me a best nano engineering book for beginners?
there is no specific books for beginners but there is book called principle of nanotechnology
NANO
what is fullerene does it is used to make bukky balls
are you nano engineer ?
s.
fullerene is a bucky ball aka Carbon 60 molecule. It was name by the architect Fuller. He design the geodesic dome. it resembles a soccer ball.
Tarell
what is the actual application of fullerenes nowadays?
Damian
That is a great question Damian. best way to answer that question is to Google it. there are hundreds of applications for buck minister fullerenes, from medical to aerospace. you can also find plenty of research papers that will give you great detail on the potential applications of fullerenes.
Tarell
what is the Synthesis, properties,and applications of carbon nano chemistry
Mostly, they use nano carbon for electronics and for materials to be strengthened.
Virgil
is Bucky paper clear?
CYNTHIA
carbon nanotubes has various application in fuel cells membrane, current research on cancer drug,and in electronics MEMS and NEMS etc
NANO
so some one know about replacing silicon atom with phosphorous in semiconductors device?
Yeah, it is a pain to say the least. You basically have to heat the substarte up to around 1000 degrees celcius then pass phosphene gas over top of it, which is explosive and toxic by the way, under very low pressure.
Harper
Do you know which machine is used to that process?
s.
how to fabricate graphene ink ?
for screen printed electrodes ?
SUYASH
What is lattice structure?
of graphene you mean?
Ebrahim
or in general
Ebrahim
in general
s.
Graphene has a hexagonal structure
tahir
On having this app for quite a bit time, Haven't realised there's a chat room in it.
Cied
how did you get the value of 2000N.What calculations are needed to arrive at it
Privacy Information Security Software Version 1.1a
Good
Got questions? Join the online conversation and get instant answers!