# 2.2 Histograms, frequency polygons, and time series graphs

The smallest data value is 60. Because the data values with the most decimal places have one decimal place (for instance, 61.5), the starting point should have two decimal places. The numbers 0.5, 0.05, 0.005, and so on are convenient offsets, so subtract 0.05 from 60, the smallest value, to get a convenient starting point.

60 – 0.05 = 59.95 which is more precise than, say, 61.5 by one decimal place. The starting point is, then, 59.95.

The largest value is 74, so 74 + 0.05 = 74.05 is the ending value.

Next, calculate the width of each bar or class interval. To calculate this width, subtract the starting point from the ending value and divide by the number of bars (you must choose the number of bars you desire). Suppose you choose eight bars.

$\frac{74.05-59.95}{8}=1.7625\approx 1.76$
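The arithmetic above can be reproduced with a short Python sketch. The helper name `convenient_range` is made up for illustration, and `Decimal` is used only so that values such as 59.95 stay exact.

```python
from decimal import Decimal

def convenient_range(smallest, largest, offset):
    """Return (start, end) pushed just outside the data by `offset`."""
    return smallest - offset, largest + offset

# Values from the heights example; Decimal keeps the decimals exact.
start, end = convenient_range(Decimal("60"), Decimal("74"), Decimal("0.05"))
n_bars = 8                          # the number of bars is the reader's choice
raw_width = (end - start) / n_bars  # (74.05 - 59.95) / 8

print(start, end, raw_width)  # 59.95 74.05 1.7625
```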

## Note

We will round up to two and make each bar or class interval two units wide. Rounding up to two is one way to prevent a value from falling on a boundary. Rounding to the next number is often necessary even if it goes against the standard rules of rounding. For this example, using 1.76 as the width would also work. A guideline that some follow for the number of bars or class intervals is to take the square root of the number of data values and then round to the nearest whole number, if necessary. For example, if there are 150 data values, take the square root of 150 (about 12.25) and round to 12 bars or intervals.
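As a rough illustration of the two rounding ideas in this note, the sketch below rounds a raw width up to the next whole number and applies the square-root guideline to a count of data values. Both function names are hypothetical.

```python
import math

def round_width_up(raw_width):
    """Round a raw class width up to the next whole number."""
    return math.ceil(raw_width)

def bars_by_sqrt_rule(n_values):
    """Square-root guideline: about sqrt(n) bars, rounded to a whole number."""
    return round(math.sqrt(n_values))

print(round_width_up(1.7625))  # 2
print(bars_by_sqrt_rule(150))  # 12
```

The guideline is only a starting point; the number of bars remains a choice.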

The boundaries are:

• 59.95
• 59.95 + 2 = 61.95
• 61.95 + 2 = 63.95
• 63.95 + 2 = 65.95
• 65.95 + 2 = 67.95
• 67.95 + 2 = 69.95
• 69.95 + 2 = 71.95
• 71.95 + 2 = 73.95
• 73.95 + 2 = 75.95
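The list of boundaries can be generated by repeatedly adding the width to the starting point. A minimal sketch, with `class_boundaries` as an assumed helper name:

```python
from decimal import Decimal

def class_boundaries(start, width, n_bars):
    """Return the n_bars + 1 boundaries, starting at `start`."""
    return [start + i * width for i in range(n_bars + 1)]

boundaries = class_boundaries(Decimal("59.95"), Decimal("2"), 8)
for b in boundaries:
    print(b)  # first line is 59.95, last line is 75.95
```

Eight bars need nine boundaries, which is why the list runs past the ending value 74.05 once the width is rounded up to two.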

Each height falls in exactly one interval:

• The heights 60 through 61.5 inches are in the interval 59.95–61.95.
• The heights that are 63.5 are in the interval 61.95–63.95.
• The heights that are 64 through 64.5 are in the interval 63.95–65.95.
• The heights 66 through 67.5 are in the interval 65.95–67.95.
• The heights 68 through 69.5 are in the interval 67.95–69.95.
• The heights 70 through 71 are in the interval 69.95–71.95.
• The heights 72 through 73.5 are in the interval 71.95–73.95.
• The height 74 is in the interval 73.95–75.95.
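Finding the interval for a given height amounts to locating it between two consecutive boundaries. The sketch below does this with a binary search; `interval_for` is a hypothetical helper, and only heights mentioned in the text are used.

```python
from bisect import bisect_right
from decimal import Decimal

# Boundaries 59.95, 61.95, ..., 75.95 (width 2, eight bars).
boundaries = [Decimal("59.95") + 2 * i for i in range(9)]

def interval_for(value, boundaries):
    """Return the (lower, upper) boundaries that enclose `value`."""
    i = bisect_right(boundaries, value)
    if i == 0 or i == len(boundaries):
        raise ValueError(f"{value} lies outside the histogram range")
    return boundaries[i - 1], boundaries[i]

for height in ("60", "63.5", "74"):
    lower, upper = interval_for(Decimal(height), boundaries)
    print(height, "is in", lower, "to", upper)
```

Because every boundary has two decimal places and every height has at most one, no height can land exactly on a boundary.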

The following histogram displays the heights on the x-axis and relative frequency on the y-axis.

## Try it

The following data are the shoe sizes of 50 male students. The sizes are continuous data since shoe size is measured. Construct a histogram and calculate the width of each bar or class interval. Suppose you choose six bars.
9; 9; 9.5; 9.5; 10; 10; 10; 10; 10; 10; 10.5; 10.5; 10.5; 10.5; 10.5; 10.5; 10.5; 10.5
11; 11; 11; 11; 11; 11; 11; 11; 11; 11; 11; 11; 11; 11.5; 11.5; 11.5; 11.5; 11.5; 11.5; 11.5
12; 12; 12; 12; 12; 12; 12; 12.5; 12.5; 12.5; 12.5; 14

Smallest value: 9

Largest value: 14

Convenient starting value: 9 – 0.05 = 8.95

Convenient ending value: 14 + 0.05 = 14.05

$\frac{14.05-8.95}{6}=0.85$

The calculation suggests using 0.85 as the width of each bar or class interval. You can also use an interval with a width equal to one.
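The two width choices can be compared with a short sketch. It tallies the 50 shoe sizes listed above, checks whether any data value coincides with a boundary when the width is 0.85, and counts the values per bar when the width is one. The helper names and the half-open convention [lower, upper) are assumptions made for illustration.

```python
from decimal import Decimal

# Shoe size -> number of students, tallied from the 50 values listed above.
shoe_counts = {"9": 2, "9.5": 2, "10": 6, "10.5": 8, "11": 13,
               "11.5": 7, "12": 7, "12.5": 4, "14": 1}
sizes = {Decimal(s): n for s, n in shoe_counts.items()}

start = Decimal("9") - Decimal("0.05")   # 8.95
end = Decimal("14") + Decimal("0.05")    # 14.05
raw_width = (end - start) / 6            # 0.85

def boundaries(start, width, n_bars):
    """Return the n_bars + 1 boundaries, starting at `start`."""
    return [start + i * width for i in range(n_bars + 1)]

def frequencies(sizes, bounds):
    """Count values in each half-open interval [lower, upper)."""
    freqs = [0] * (len(bounds) - 1)
    for value, n in sizes.items():
        for i in range(len(freqs)):
            if bounds[i] <= value < bounds[i + 1]:
                freqs[i] += n
    return freqs

# Data values that coincide with a boundary when the width is 0.85.
on_boundary = sorted(set(boundaries(start, raw_width, 6)) & set(sizes))
# Bar heights when the width is one instead.
freq_width_one = frequencies(sizes, boundaries(start, Decimal("1"), 6))

print(raw_width, on_boundary, freq_width_one)
```

With a width of 0.85 the fourth boundary is 8.95 + 3(0.85) = 11.50, which coincides with the data value 11.5. A width of one avoids this and gives bar frequencies of 4, 14, 20, 11, 0, and 1.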

The following data are the number of books bought by 50 part-time college students at ABC College. The number of books is discrete data, since books are counted.
1; 1; 1; 1; 1; 1; 1; 1; 1; 1; 1
2; 2; 2; 2; 2; 2; 2; 2; 2; 2
3; 3; 3; 3; 3; 3; 3; 3; 3; 3; 3; 3; 3; 3; 3; 3
4; 4; 4; 4; 4; 4
5; 5; 5; 5; 5
6; 6

Because the data are integers, subtract 0.5 from 1, the smallest data value, and add 0.5 to 6, the largest data value. The starting point is then 0.5 and the ending value is 6.5.

Next, calculate the width of each bar or class interval. If the data are discrete and there are not too many different values, a width that places the data values in the middle of the bar or class interval is the most convenient. Since the data consist of the numbers 1, 2, 3, 4, 5, 6, and the starting point is 0.5, a width of one places the 1 in the middle of the interval from 0.5 to 1.5, the 2 in the middle of the interval from 1.5 to 2.5, the 3 in the middle of the interval from 2.5 to 3.5, the 4 in the middle of the interval from _______ to _______, the 5 in the middle of the interval from _______ to _______, and the _______ in the middle of the interval from _______ to _______ .

• 3.5 to 4.5
• 4.5 to 5.5
• 6
• 5.5 to 6.5
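The centering idea can be checked with a small sketch: for a width of one, each integer value sits halfway between value − 0.5 and value + 0.5. The function name `centered_intervals` is hypothetical.

```python
def centered_intervals(values, width=1):
    """Map each value to the interval of the given width centered on it."""
    half = width / 2
    return {v: (v - half, v + half) for v in values}

intervals = centered_intervals(range(1, 7))
for value, (lower, upper) in intervals.items():
    print(value, "is the midpoint of", lower, "to", upper)
```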

Calculate the number of bars as follows:

$\frac{6.5-0.5}{\text{number of bars}}=1$

where 1 is the width of a bar. Therefore, the number of bars is 6.
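Solving the equation for the number of bars is a single division. A minimal sketch, with `number_of_bars` as an assumed helper name:

```python
def number_of_bars(start, end, width):
    """Solve (end - start) / bars = width for the number of bars."""
    return round((end - start) / width)

bars = number_of_bars(0.5, 6.5, 1)
print(bars)  # 6
```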

The following histogram displays the number of books on the x-axis and the frequency on the y-axis.
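The bar heights for this histogram are the frequencies of each value. One way to tally them, using the counts from the data listed above, is sketched here.

```python
from collections import Counter

# The 50 values listed above: eleven 1s, ten 2s, sixteen 3s, and so on.
books = [1] * 11 + [2] * 10 + [3] * 16 + [4] * 6 + [5] * 5 + [6] * 2

frequency = Counter(books)
heights = [frequency[value] for value in range(1, 7)]

for value, height in zip(range(1, 7), heights):
    # Each bar spans value - 0.5 to value + 0.5.
    print(f"{value - 0.5} to {value + 0.5}: {height}")
```

Dividing each height by 50 would give the relative frequency instead.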
