<< Chapter < Page Chapter >> Page >

Try it

Construct a frequency polygon of U.S. Presidents’ ages at inauguration shown in [link] .

Age at Inauguration Frequency
41.5–46.5 4
46.5–51.5 11
51.5–56.5 14
56.5–61.5 9
61.5–66.5 4
66.5–71.5 2

The first label on the x -axis is 39. This represents an interval extending from 36.5 to 41.5. Since there are no ages less than 41.5, this interval is used only to allow the graph to touch the x -axis. The point labeled 44 represents the next interval, or the first “real” interval from the table, and contains four scores. This reasoning is followed for each of the remaining intervals with the point 74 representing the interval from 71.5 to 76.5. Again, this interval contains no data and is only used so that the graph will touch the x -axis. Looking at the graph, we say that this distribution is skewed because one side of the graph does not mirror the other side.

This figure shows a graph entitled, 'President's Age at Inauguration.' The x-axis is labeled 'Ages' and is marked off at 39, 44, 49, 54, 59, 64, 69 and 74. The y-axis is labeled, 'Frequency,' and is marked off in intervals of 1 from 0 to 15. The following points are plotted and a line connects one to the other to create the frequency polygon: (39, 0), (44, 4), (49, 11), (54, 14), (59, 9), (64, 4), (69, 2), (74, 0).
Got questions? Get instant answers now!

Frequency polygons are useful for comparing distributions. This is achieved by overlaying the frequency polygons drawn for different data sets.

We will construct an overlay frequency polygon comparing the scores from [link] with the students’ final numeric grade.

Frequency Distribution for Calculus Final Test Scores
Lower Bound Upper Bound Frequency Cumulative Frequency
49.5 59.5 5 5
59.5 69.5 10 15
69.5 79.5 30 45
79.5 89.5 40 85
89.5 99.5 15 100
Frequency Distribution for Calculus Final Grades
Lower Bound Upper Bound Frequency Cumulative Frequency
49.5 59.5 10 10
59.5 69.5 10 20
69.5 79.5 30 50
79.5 89.5 45 95
89.5 99.5 5 100
This is an overlay frequency polygon that matches the supplied data. The x-axis shows the grades, and the y-axis shows the frequency.
Got questions? Get instant answers now!

Suppose that we want to study the temperature range of a region for an entire month. Every day at noon we note the temperature and write this down in a log. A variety of statistical studies could be done with this data. We could find the mean or the median temperature for the month. We could construct a histogram displaying the number of days that temperatures reach a certain range of values. However, all of these methods ignore a portion of the data that we have collected.

One feature of the data that we may want to consider is that of time. Since each date is paired with the temperature reading for the day, we don‘t have to think of the data as being random. We can instead use the times given to impose a chronological order on the data. A graph that recognizes this ordering and displays the changing temperature as the month progresses is called a time series graph.

Constructing a time series graph

To construct a time series graph, we must look at both pieces of our paired data set . We start with a standard Cartesian coordinate system. The horizontal axis is used to plot the date or time increments, and the vertical axis is used to plot the values of the variable that we are measuring. By doing this, we make each point on the graph correspond to a date and a measured quantity. The points on the graph are typically connected by straight lines in the order in which they occur.

The following data shows the Annual Consumer Price Index, each month, for ten years. Construct a time series graph for the Annual Consumer Price Index data only.

Year Jan Feb Mar Apr May Jun Jul
2003 181.7 183.1 184.2 183.8 183.5 183.7 183.9
2004 185.2 186.2 187.4 188.0 189.1 189.7 189.4
2005 190.7 191.8 193.3 194.6 194.4 194.5 195.4
2006 198.3 198.7 199.8 201.5 202.5 202.9 203.5
2007 202.416 203.499 205.352 206.686 207.949 208.352 208.299
2008 211.080 211.693 213.528 214.823 216.632 218.815 219.964
2009 211.143 212.193 212.709 213.240 213.856 215.693 215.351
2010 216.687 216.741 217.631 218.009 218.178 217.965 218.011
2011 220.223 221.309 223.467 224.906 225.964 225.722 225.922
2012 226.665 227.663 229.392 230.085 229.815 229.478 229.104
Year Aug Sep Oct Nov Dec Annual
2003 184.6 185.2 185.0 184.5 184.3 184.0
2004 189.5 189.9 190.9 191.0 190.3 188.9
2005 196.4 198.8 199.2 197.6 196.8 195.3
2006 203.9 202.9 201.8 201.5 201.8 201.6
2007 207.917 208.490 208.936 210.177 210.036 207.342
2008 219.086 218.783 216.573 212.425 210.228 215.303
2009 215.834 215.969 216.177 216.330 215.949 214.537
2010 218.312 218.439 218.711 218.803 219.179 218.056
2011 226.545 226.889 226.421 226.230 225.672 224.939
2012 230.379 231.407 231.317 230.221 229.601 229.594
This is a times series graph that matches the supplied data. The x-axis shows years from 2003 to 2012, and the y-axis shows the annual CPI.
Got questions? Get instant answers now!
Got questions? Get instant answers now!

Questions & Answers

If X is a Uniform random variable in [ -2, 2 ], find the pdf of Y X  and E Y[ ].
Kezang Reply
I want to know statistics
Okosa Reply
why is data so important in statistics
Trixie Reply
want summary statistic on gender, age group, weight, and weight loss
Trixie
are you asking question or looking for Solution
Arun
solution pls
Trixie
1st convert gender and group to factor than use summary function It will give mean median and mode with other details
Arun
its a bit complicated could u bring it to my level of under standing
Trixie
u know the question was put in a tabular form where we were to find the variable type, summary statistics and graph type of the given variables that's the gender,age group, weight and weight loss
Trixie
if you see, gender and group are not numerical due to which they will not give you correct statistics
Arun
how you denote gender m or f
Arun
or t
Arun
ok tnk u
Trixie
these are not numerical so you have to convert they as f=1; m=2; t=3 same thing you have to do with group or any variable which is character else you should drop them
Arun
from your calculation
Arun
oh OK tnk u
Trixie
so pls why is data important in statistics
Trixie
not only data correct data is imp
Arun
statistics works on data only
Arun
without data you can not summarize, can not predict future, can not establish relationship between two and more variables, can not prepare reports and make decisions on it
Arun
so I'll give example. suppose you want to open a restaurant and you have to choose one best location out of 5. then how you will decide which location is best for you
Arun
awww thank you pls
Trixie
pls I want a brief note on observation, survey and experimentation way of obtaining data
Trixie
hi
Abdiwahab
please Tell me difference parameters and non parameter
Abdiwahab
can you tell me about the scopes of statistics?
Minhal
plzzz answer me anyone .
Minhal
Methods of Collecting Data Observation Observational studies allow researchers to document behavior in a natural setting and witness events that could not be produced in a lab.
Arun
Key Points Observation differs from most other forms of data collection in that the researcher does not manipulate variables or directly question participants. The advantages of observation include observing natural behavior, refining hypotheses, and allowing for observation of behavior that canno
Arun
 be produced in an artificial environment for ethical or practical reasons. The disadvantages of observation are that these studies do not produce quantitative data, do not allow for cause and effect statements, may be very time consuming, and can be prone to researcher bias.
Arun
Key Terms observational research: Research focusing on the observation of behavior outside of a laboratory setting. external validity: In research, whether or not study findings can be generalized to real world scenarios.
Arun
Surveys and Interviews Surveys are a low-cost option for gathering a large amount of data, but they are also susceptible to reporting bias.
Arun
Key Points The survey method of data collection is likely the most common of the four major research methods. The benefits of this method include low cost, large sample size, and efficiency.
Arun
The major problem with this method is accuracy: since surveys depend on subjects’ motivation, honesty, memory, and ability to respond, they are very susceptible to bias. A researcher must have a strong understanding of how to properly frame survey questions in order to gather reliable and relevant
Arun
Key Terms reliability: The degree to which a measure is likely to yield consistent results each time it is used. validity: The degree to which a measure is actually assessing the concept it was designed to measure. survey: A method for collecting qualitative and quantitative information about ind
Arun
individuals in a population.
Arun
Interviews Interviews are a type of qualitative data in which the researcher asks questions to elicit facts or statements from the interviewee. Interviews used for research can take several forms:
Arun
Informal Interview: A more conversational type of interview, no questions are asked and the interviewee is allowed to talk freely. General interview guide approach: Ensures that the same general areas of information are collected from each interviewee. Provides more focus than the conversational ap
Arun
approach, but still allows a degree of freedom and adaptability in getting the information from the interviewee. Standardized, open-ended interview: The same open-ended questions are asked to all interviewees. This approach facilitates faster interviews that can be more easily analyzed and compared
Arun
Closed, fixed-response interview (Structured): All interviewees are asked the same questions and asked to choose answers from among the same set of alternatives.
Arun
experiments An experiment involves the creation of a contrived situation in order that the researcher can manipulate one or more variables whilst controlling all of the others and measuring the resultant effects.
Arun
Boyd and Westfall1 have defined experimentation as: "...that research process in which one or more variables are manipulated under conditions which permit the collection of data which show the effects, if any, in unconfused fashion."
Arun
Experiments can be conducted either in the field or in a laboratory setting. When operating within a laboratory environment, the researcher has direct control over most, if not all, of the variables that could impact upon the outcome of the experiment
Arun
When experiments are conducted within a natural setting then they are termed field experiments. The variety test carried out by United Fruits on their Gros Michel and Valery bananas is an example of a field experiment.
Arun
parameter Parameters are factors or limits which affect the way that something can be done or made
Arun
Minhal didny get your question, can you please elaborate more
Arun
pls can u use mean n mode at the statistical summary pls
Trixie
yes, statistical summary itself gives all value
Arun
but u didn't tell me the advantage and disadvantage of the experimental method
Trixie
but you didn't tell me the advantage and disadvantage of experimental method
Trixie
by using a sampling distribution? how to estimate the population mean using a ramdom variable n?
Jade Reply
The “average increase” for all NASDAQ stocks is the:
da Reply
any video any proof...what is point of estimation in statistics
Younis Reply
Define the meaning of statistics
Robert Reply
sampli
Hidayat
roductory Statistics is intended for the one-semester introduction to statistics course for students who are not mathematics or engineering majors. It focuses on the interpretation of statistical results, especially in real world settings, and assumes that students have an understanding of intermedi
Hidayat
statistics is science collection of method planning experiment then organizing summarizing presenting analyzing and drawing conclusion.
MOVIES
uses and miss uses of statistics
Pure
Identify the population, sample, parameter, statistic, variable, and data for this example. population sample parameter statistic variable data
Woyo
kinds of probability samples and there advantage
Hajira Reply
are you going to explain it.
Anil
🙂
Meera
hy
Muhammad
what....?
Muhammad
Sampling takes on two forms in statistics: probability sampling and non-probability sampling: Probability sampling uses random sampling techniques to create a sample. Non-probability samplingtechniques use non-random processes like researcher judgment or convenience sampling.
Muhammad
Advantages Cluster sampling: convenience and ease of use. Simple random sampling: creates samples that are highly representative of the population. Stratified random sampling: creates strata or layers that are highly representative of strata or layers in the population. Systematic sampling: creates
Muhammad
any example plz?
Anil
Plz write the uses and miss uses of statistical theory
Pure
what is the difference between weighted simple price index (WSPI ) & Laspeyre's Price Index ( LPI )
Basil Reply
What are the 5 steps of hypothesis testing?
Sixolisiwe Reply
5 steps of hypothesis testing
Sixolisiwe
Make guesses (e.g., customers will leave if we raise our rates) State the null H0 and alternative H1 hypotheses (e.g., H0: there is no correlation) and alpha Select the sampling distribution and specify the test statistic Compute the test statistic Make a decision and interpret the results
Ara
.Five Steps in Hypothesis Testing: 1_Specify the Null Hypothesis. 2_Specify the Alternative Hypothesis. 3_Set the Significance Level (a) 4_Calculate the Test Statistic and Corresponding P-Value. 5_Drawing a Conclusion.
Rachel
.Econometric Results uses Multiple Regression for the basis of looking at number of casual factors (independent χ Variables) such as Employment, being Female etc., to test for any relationship with the dependent γ Variable Wages, in order to find any evidence to support the Alternative Hypothesis(Ha
Rachel
.Alternative Hypothesis (H1 or Ha) of Wage Differentials or in the extreme case, if the strength of relationship is strong enough between the dependent γ Variable, and multiple χ Variables, suggesting evidence for the Null Hypothesis ( Ho) that Wage Discrimination may exist.
Rachel
.The Significance Level which is also the Critical Value gives the maximum allowable probability of making a Type I error – the Significance Level value of which is decided upon before the data sample is collected and analysed, as a guide to avoid or control making a Type I error.
Rachel
Type I Error occurs when the Null Hypothesis (Ho) is not accepted when in reality the Null Hypothesis is true. A Type II Error however, occurs when one fails to reject the Null Hypothesis when in reality, the Null Hypothesis (Ho) is not true.
Rachel
.The #P-Value measures the likelihood of getting the sample results if the Null Hypothesis were true, and could be defined as the smallest level of significance (observed level of significance) at which the Null Hypothesis will be rejected, assuming the Null Hypothesis (Ho) is true.
Rachel
.In most cases, the research attempt is to find support for the Alternative Hypothesis (Ha or H1). Thus, the smaller the P-Value, the more the (the father out the #Test-Statistics is on the Standard Normal Distribution Diagram, and the more confident the researcher can be about rejecting the Null H
Rachel
.#Test-Statistics is on the Standard Normal Distribution Diagram, and the more confident the researcher can be about rejecting the Null Hypothesis (Ho) in support for the Alternative Hypothesis (H1 / Ha).
Rachel
.The #P-Value is less than the Critical Values (Significance Level) of 1% (0.01), 5% (0.05), and 10% (0.10) given in Table (1) in the Appendix, means the Null Hypothesis (Ho) that there is Wage Discrimination is not reflective of the population or equal to the Mean of the Population
Rachel
.Mean of the Population(data sample of Sample Mean distribution of the Population ) which confirms that the Researcher Rejects the Null Hypothesis (Ho) and Accepts the (Alternative Hypothesis).
Rachel
.See ISBN 1537512757 ; link : https://smile.amazon.co.uk/Winston-Chellie-Economics-TheBachelor-questions/dp/1537512757/ref=mp_s_a_1_1?keywords=Rachel+Adeniji&qid=1572318698&sr=8-1
Rachel
see publication ' Winston and Chellie by Rachel Adeniji '
Rachel
correction, dependent x variables such as Employment, being Female; dependent y variable Wages
Rachel
correction, Wage Differentials such as Employment, Region affecting Wages; Wage Discrimination such as being Female or Ethnicity affecting Wages
Rachel
correction_, linear regression/equation is computed as y=mx + c or y=m • x1+x2+x3+c where independent x variables eg Employment x1, Female x2 , Ethnicity x3, and dependent y variable Wages
Rachel
how do you draw a line of best fit?
Josh Reply
***youtu.be/l2BOZDosuIk
William
informal explanation:lets suppose you have 10 points and you want a line to best fit on all of them. all you need to keep in mind that the distance and error should be minimum and you will get the best fit line.
umair
how was the data collected to draw the graph
Nji
draw a straight line through the points on the graph that are most clustered with other data / points
Rachel
suppose that 30% of the employees in a large factory of smokers what is the probability that there will be exactly two smokers in a randomly-chosen five-person workgroup
rayhaanah Reply
binomialPdf(5, .3, 2) .3087
Ara
are the fraction integers
Amir Reply
The ratio of male to female nurses is 2:3 or 2/3. There are 40 nurses in the ward. For every 5 nurses, how many male and female nurses are there? How many groups can be divided into shifts. Pls show the solution and explain.
DokBads Reply
in a group of 5, the probability tbat exactly 3 of the nurses are male is .6630 or 66% calculation P(X=0)+(...)+P(X=3)=.6630
Ara
i dont think u got a correct answer. you are computing for the probability not the ratio and proportion
DokBads
(2+5)/40*2 = male , (2+5)/40*3 = female
Rachel
40/(2+3)*2 = male , 40/(2+3)*3=female, ....sorry correction
Rachel
thank you so much for the help
DokBads
x
Rachel
Iyhoo
Sixolisiwe
if x is a continuous random variable and` c` is a constant then p(x=c)
Neha Reply
the length of human pregnancies from conception to birth approximates a normal distribution with a mean of 266days and a standard deviation of 16days.(i) what length of time marks the shortest 10%of all pregnancies ?
Neha
27.6390625 days
festus
steps?
Neha
how can I solve a Hypothetic problem that provide sample data such as 45,3_,45,28,17 ect...what is the first step
Leticia
find the mean and standard deviation first
Kwadwo
how can I get line of best fit?
Josh

Get the best Introductory statistics course in your pocket!





Source:  OpenStax, Introductory statistics. OpenStax CNX. May 06, 2016 Download for free at http://legacy.cnx.org/content/col11562/1.18
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introductory statistics' conversation and receive update notifications?

Ask