<< Chapter < Page Chapter >> Page >

The number line may help you understand standard deviation. If we were to put five and seven on a number line, seven is to the right of five. We say, then, that seven is one standard deviation to the right of five because 5 + (1)(2) = 7.

If one were also part of the data set, then one is two standard deviations to the left of five because 5 + (–2)(2) = 1.

This shows a number line in intervals of 1 from 0 to 7.
  • In general, a value = mean + (#ofSTDEV)(standard deviation)
  • where #ofSTDEVs = the number of standard deviations
  • #ofSTDEV does not need to be an integer
  • One is two standard deviations less than the mean of five because: 1 = 5 + (–2)(2).

The equation value = mean + (#ofSTDEVs)(standard deviation) can be expressed for a sample and for a population.

  • sample: x  =  x ¯  +  ( # o f S T D E V ) ( s )
  • Population: x = μ + ( # o f S T D E V ) ( σ )
The lower case letter s represents the sample standard deviation and the Greek letter σ (sigma, lower case) represents the population standard deviation.

The symbol x is the sample mean and the Greek symbol μ is the population mean.

Calculating the standard deviation

If x is a number, then the difference " x – mean" is called its deviation . In a data set, there are as many deviations as there are items in the data set. The deviations are used to calculate the standard deviation. If the numbers belong to a population, in symbols a deviation is x μ . For sample data, in symbols a deviation is x x ¯ .

The procedure to calculate the standard deviation depends on whether the numbers are the entire population or are data from a sample. The calculations are similar, but not identical. Therefore the symbol used to represent the standard deviation depends on whether it is calculated from a population or a sample. The lower case letter s represents the sample standard deviation and the Greek letter σ (sigma, lower case) represents the population standard deviation. If the sample has the same characteristics as the population, then s should be a good estimate of σ .

To calculate the standard deviation, we need to calculate the variance first. The variance is the average of the squares of the deviations (the x x ¯ values for a sample, or the x μ values for a population). The symbol σ 2 represents the population variance; the population standard deviation σ is the square root of the population variance. The symbol s 2 represents the sample variance; the sample standard deviation s is the square root of the sample variance. You can think of the standard deviation as a special average of the deviations.

If the numbers come from a census of the entire population and not a sample, when we calculate the average of the squared deviations to find the variance, we divide by N , the number of items in the population. If the data are from a sample rather than a population, when we calculate the average of the squared deviations, we divide by n – 1 , one less than the number of items in the sample.

Formulas for the sample standard deviation

  • s = Σ ( x x ¯ ) 2 n 1 or s = Σ f ( x x ¯ ) 2 n 1
  • For the sample standard deviation, the denominator is n - 1 , that is the sample size MINUS 1.

Formulas for the population standard deviation

  • σ   =   Σ ( x μ ) 2 N or σ   =   Σ f ( x μ ) 2 N
  • For the population standard deviation, the denominator is N , the number of items in the population.

Questions & Answers

If X is a Uniform random variable in [ -2, 2 ], find the pdf of Y X  and E Y[ ].
Kezang Reply
I want to know statistics
Okosa Reply
why is data so important in statistics
Trixie Reply
want summary statistic on gender, age group, weight, and weight loss
Trixie
are you asking question or looking for Solution
Arun
solution pls
Trixie
1st convert gender and group to factor than use summary function It will give mean median and mode with other details
Arun
its a bit complicated could u bring it to my level of under standing
Trixie
u know the question was put in a tabular form where we were to find the variable type, summary statistics and graph type of the given variables that's the gender,age group, weight and weight loss
Trixie
if you see, gender and group are not numerical due to which they will not give you correct statistics
Arun
how you denote gender m or f
Arun
or t
Arun
ok tnk u
Trixie
these are not numerical so you have to convert they as f=1; m=2; t=3 same thing you have to do with group or any variable which is character else you should drop them
Arun
from your calculation
Arun
oh OK tnk u
Trixie
so pls why is data important in statistics
Trixie
not only data correct data is imp
Arun
statistics works on data only
Arun
without data you can not summarize, can not predict future, can not establish relationship between two and more variables, can not prepare reports and make decisions on it
Arun
so I'll give example. suppose you want to open a restaurant and you have to choose one best location out of 5. then how you will decide which location is best for you
Arun
awww thank you pls
Trixie
pls I want a brief note on observation, survey and experimentation way of obtaining data
Trixie
hi
Abdiwahab
please Tell me difference parameters and non parameter
Abdiwahab
can you tell me about the scopes of statistics?
Minhal
plzzz answer me anyone .
Minhal
Methods of Collecting Data Observation Observational studies allow researchers to document behavior in a natural setting and witness events that could not be produced in a lab.
Arun
Key Points Observation differs from most other forms of data collection in that the researcher does not manipulate variables or directly question participants. The advantages of observation include observing natural behavior, refining hypotheses, and allowing for observation of behavior that canno
Arun
 be produced in an artificial environment for ethical or practical reasons. The disadvantages of observation are that these studies do not produce quantitative data, do not allow for cause and effect statements, may be very time consuming, and can be prone to researcher bias.
Arun
Key Terms observational research: Research focusing on the observation of behavior outside of a laboratory setting. external validity: In research, whether or not study findings can be generalized to real world scenarios.
Arun
Surveys and Interviews Surveys are a low-cost option for gathering a large amount of data, but they are also susceptible to reporting bias.
Arun
Key Points The survey method of data collection is likely the most common of the four major research methods. The benefits of this method include low cost, large sample size, and efficiency.
Arun
The major problem with this method is accuracy: since surveys depend on subjects’ motivation, honesty, memory, and ability to respond, they are very susceptible to bias. A researcher must have a strong understanding of how to properly frame survey questions in order to gather reliable and relevant
Arun
Key Terms reliability: The degree to which a measure is likely to yield consistent results each time it is used. validity: The degree to which a measure is actually assessing the concept it was designed to measure. survey: A method for collecting qualitative and quantitative information about ind
Arun
individuals in a population.
Arun
Interviews Interviews are a type of qualitative data in which the researcher asks questions to elicit facts or statements from the interviewee. Interviews used for research can take several forms:
Arun
Informal Interview: A more conversational type of interview, no questions are asked and the interviewee is allowed to talk freely. General interview guide approach: Ensures that the same general areas of information are collected from each interviewee. Provides more focus than the conversational ap
Arun
approach, but still allows a degree of freedom and adaptability in getting the information from the interviewee. Standardized, open-ended interview: The same open-ended questions are asked to all interviewees. This approach facilitates faster interviews that can be more easily analyzed and compared
Arun
Closed, fixed-response interview (Structured): All interviewees are asked the same questions and asked to choose answers from among the same set of alternatives.
Arun
experiments An experiment involves the creation of a contrived situation in order that the researcher can manipulate one or more variables whilst controlling all of the others and measuring the resultant effects.
Arun
Boyd and Westfall1 have defined experimentation as: "...that research process in which one or more variables are manipulated under conditions which permit the collection of data which show the effects, if any, in unconfused fashion."
Arun
Experiments can be conducted either in the field or in a laboratory setting. When operating within a laboratory environment, the researcher has direct control over most, if not all, of the variables that could impact upon the outcome of the experiment
Arun
When experiments are conducted within a natural setting then they are termed field experiments. The variety test carried out by United Fruits on their Gros Michel and Valery bananas is an example of a field experiment.
Arun
parameter Parameters are factors or limits which affect the way that something can be done or made
Arun
Minhal didny get your question, can you please elaborate more
Arun
pls can u use mean n mode at the statistical summary pls
Trixie
yes, statistical summary itself gives all value
Arun
but u didn't tell me the advantage and disadvantage of the experimental method
Trixie
but you didn't tell me the advantage and disadvantage of experimental method
Trixie
by using a sampling distribution? how to estimate the population mean using a ramdom variable n?
Jade Reply
The “average increase” for all NASDAQ stocks is the:
da Reply
any video any proof...what is point of estimation in statistics
Younis Reply
Define the meaning of statistics
Robert Reply
sampli
Hidayat
roductory Statistics is intended for the one-semester introduction to statistics course for students who are not mathematics or engineering majors. It focuses on the interpretation of statistical results, especially in real world settings, and assumes that students have an understanding of intermedi
Hidayat
statistics is science collection of method planning experiment then organizing summarizing presenting analyzing and drawing conclusion.
MOVIES
uses and miss uses of statistics
Pure
Identify the population, sample, parameter, statistic, variable, and data for this example. population sample parameter statistic variable data
Woyo
kinds of probability samples and there advantage
Hajira Reply
are you going to explain it.
Anil
🙂
Meera
hy
Muhammad
what....?
Muhammad
Sampling takes on two forms in statistics: probability sampling and non-probability sampling: Probability sampling uses random sampling techniques to create a sample. Non-probability samplingtechniques use non-random processes like researcher judgment or convenience sampling.
Muhammad
Advantages Cluster sampling: convenience and ease of use. Simple random sampling: creates samples that are highly representative of the population. Stratified random sampling: creates strata or layers that are highly representative of strata or layers in the population. Systematic sampling: creates
Muhammad
any example plz?
Anil
Plz write the uses and miss uses of statistical theory
Pure
what is the difference between weighted simple price index (WSPI ) & Laspeyre's Price Index ( LPI )
Basil Reply
What are the 5 steps of hypothesis testing?
Sixolisiwe Reply
5 steps of hypothesis testing
Sixolisiwe
Make guesses (e.g., customers will leave if we raise our rates) State the null H0 and alternative H1 hypotheses (e.g., H0: there is no correlation) and alpha Select the sampling distribution and specify the test statistic Compute the test statistic Make a decision and interpret the results
Ara
.Five Steps in Hypothesis Testing: 1_Specify the Null Hypothesis. 2_Specify the Alternative Hypothesis. 3_Set the Significance Level (a) 4_Calculate the Test Statistic and Corresponding P-Value. 5_Drawing a Conclusion.
Rachel
.Econometric Results uses Multiple Regression for the basis of looking at number of casual factors (independent χ Variables) such as Employment, being Female etc., to test for any relationship with the dependent γ Variable Wages, in order to find any evidence to support the Alternative Hypothesis(Ha
Rachel
.Alternative Hypothesis (H1 or Ha) of Wage Differentials or in the extreme case, if the strength of relationship is strong enough between the dependent γ Variable, and multiple χ Variables, suggesting evidence for the Null Hypothesis ( Ho) that Wage Discrimination may exist.
Rachel
.The Significance Level which is also the Critical Value gives the maximum allowable probability of making a Type I error – the Significance Level value of which is decided upon before the data sample is collected and analysed, as a guide to avoid or control making a Type I error.
Rachel
Type I Error occurs when the Null Hypothesis (Ho) is not accepted when in reality the Null Hypothesis is true. A Type II Error however, occurs when one fails to reject the Null Hypothesis when in reality, the Null Hypothesis (Ho) is not true.
Rachel
.The #P-Value measures the likelihood of getting the sample results if the Null Hypothesis were true, and could be defined as the smallest level of significance (observed level of significance) at which the Null Hypothesis will be rejected, assuming the Null Hypothesis (Ho) is true.
Rachel
.In most cases, the research attempt is to find support for the Alternative Hypothesis (Ha or H1). Thus, the smaller the P-Value, the more the (the father out the #Test-Statistics is on the Standard Normal Distribution Diagram, and the more confident the researcher can be about rejecting the Null H
Rachel
.#Test-Statistics is on the Standard Normal Distribution Diagram, and the more confident the researcher can be about rejecting the Null Hypothesis (Ho) in support for the Alternative Hypothesis (H1 / Ha).
Rachel
.The #P-Value is less than the Critical Values (Significance Level) of 1% (0.01), 5% (0.05), and 10% (0.10) given in Table (1) in the Appendix, means the Null Hypothesis (Ho) that there is Wage Discrimination is not reflective of the population or equal to the Mean of the Population
Rachel
.Mean of the Population(data sample of Sample Mean distribution of the Population ) which confirms that the Researcher Rejects the Null Hypothesis (Ho) and Accepts the (Alternative Hypothesis).
Rachel
.See ISBN 1537512757 ; link : https://smile.amazon.co.uk/Winston-Chellie-Economics-TheBachelor-questions/dp/1537512757/ref=mp_s_a_1_1?keywords=Rachel+Adeniji&qid=1572318698&sr=8-1
Rachel
see publication ' Winston and Chellie by Rachel Adeniji '
Rachel
correction, dependent x variables such as Employment, being Female; dependent y variable Wages
Rachel
correction, Wage Differentials such as Employment, Region affecting Wages; Wage Discrimination such as being Female or Ethnicity affecting Wages
Rachel
correction_, linear regression/equation is computed as y=mx + c or y=m • x1+x2+x3+c where independent x variables eg Employment x1, Female x2 , Ethnicity x3, and dependent y variable Wages
Rachel
how do you draw a line of best fit?
Josh Reply
***youtu.be/l2BOZDosuIk
William
informal explanation:lets suppose you have 10 points and you want a line to best fit on all of them. all you need to keep in mind that the distance and error should be minimum and you will get the best fit line.
umair
how was the data collected to draw the graph
Nji
draw a straight line through the points on the graph that are most clustered with other data / points
Rachel
suppose that 30% of the employees in a large factory of smokers what is the probability that there will be exactly two smokers in a randomly-chosen five-person workgroup
rayhaanah Reply
binomialPdf(5, .3, 2) .3087
Ara
are the fraction integers
Amir Reply
The ratio of male to female nurses is 2:3 or 2/3. There are 40 nurses in the ward. For every 5 nurses, how many male and female nurses are there? How many groups can be divided into shifts. Pls show the solution and explain.
DokBads Reply
in a group of 5, the probability tbat exactly 3 of the nurses are male is .6630 or 66% calculation P(X=0)+(...)+P(X=3)=.6630
Ara
i dont think u got a correct answer. you are computing for the probability not the ratio and proportion
DokBads
(2+5)/40*2 = male , (2+5)/40*3 = female
Rachel
40/(2+3)*2 = male , 40/(2+3)*3=female, ....sorry correction
Rachel
thank you so much for the help
DokBads
x
Rachel
Iyhoo
Sixolisiwe
if x is a continuous random variable and` c` is a constant then p(x=c)
Neha Reply
the length of human pregnancies from conception to birth approximates a normal distribution with a mean of 266days and a standard deviation of 16days.(i) what length of time marks the shortest 10%of all pregnancies ?
Neha
27.6390625 days
festus
steps?
Neha
how can I solve a Hypothetic problem that provide sample data such as 45,3_,45,28,17 ect...what is the first step
Leticia
find the mean and standard deviation first
Kwadwo
how can I get line of best fit?
Josh

Get the best Introductory statistics course in your pocket!





Source:  OpenStax, Introductory statistics. OpenStax CNX. May 06, 2016 Download for free at http://legacy.cnx.org/content/col11562/1.18
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introductory statistics' conversation and receive update notifications?

Ask