This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by Ewa Paszek and Marek Kimmel. The development of this course has been supported by NSF grant 0203396.

Estimation

Once a model is specified with its parameters and data have been collected, one is in a position to evaluate the model's goodness of fit, that is, how well the model fits the observed pattern of data. Finding the parameter values of a model that best fit the data is a procedure called parameter estimation; the quality of the resulting fit is then assessed by measures of goodness of fit.

There are two generally accepted methods of parameter estimation: least squares estimation (LSE) and maximum likelihood estimation (MLE). The former is well known from linear regression, where the sum of squared errors and the root mean squared deviation are tied to the method. MLE, on the other hand, is not widely recognized among modelers in psychology, though it is by far the most commonly used method of parameter estimation in the statistics community. LSE may be useful for obtaining a descriptive measure that summarizes observed data, but MLE is more suitable for statistical inference such as model comparison. LSE provides no basis for constructing confidence intervals or testing hypotheses, whereas both are naturally built into MLE.
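As a small illustration (an addition to these notes, not part of the original), the sketch below estimates the mean of a normal sample both by least squares, minimizing the sum of squared deviations, and by maximum likelihood, minimizing the negative normal log-likelihood with the scale held fixed; for this model both criteria lead to the same estimate, the sample mean. The data and variable names are hypothetical.

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Hypothetical data: a sample assumed to come from a normal distribution.
rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=2.0, size=100)

# Least squares: choose mu to minimize the sum of squared deviations.
lse = minimize_scalar(lambda mu: np.sum((x - mu) ** 2))

# Maximum likelihood: choose mu to minimize the negative normal log-likelihood
# (constants and the fixed variance do not affect where the minimum in mu lies).
mle = minimize_scalar(lambda mu: -np.sum(-0.5 * (x - mu) ** 2))

print(lse.x, mle.x, x.mean())   # all three agree: the sample mean
```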

Properties of estimators

UNBIASED AND BIASED ESTIMATORS

Consider random variables for which the functional form of the p.d.f. is known, but whose distribution depends on an unknown parameter $\theta$ that may have any value in a set $\Omega$, called the parameter space. In estimation, a random sample from the distribution is taken to elicit some information about the unknown parameter $\theta$. The experiment is repeated $n$ independent times, the sample $X_1, X_2, \ldots, X_n$ is observed, and one tries to guess the value of $\theta$ using the observations $x_1, x_2, \ldots, x_n$.

The function of $X_1, X_2, \ldots, X_n$ used to guess $\theta$ is called an estimator of $\theta$. We want it to be such that the computed estimate $u(x_1, x_2, \ldots, x_n)$ is usually close to $\theta$. Let $Y = u(X_1, X_2, \ldots, X_n)$ be an estimator of $\theta$. For $Y$ to be a good estimator of $\theta$, a very desirable property is that its mean be equal to $\theta$, namely $E(Y) = \theta$.

If $E[u(X_1, X_2, \ldots, X_n)] = \theta$, then the statistic $Y = u(X_1, X_2, \ldots, X_n)$ is called an unbiased estimator of $\theta$. Otherwise, it is said to be biased.

It is required not only that an estimator have expectation equal to $\theta$, but also that its variance be as small as possible. If there are two unbiased estimators of $\theta$, one would choose the one with the smaller variance. In general, with a random sample $X_1, X_2, \ldots, X_n$ of a fixed sample size $n$, a statistician would like to find the estimator $Y = u(X_1, X_2, \ldots, X_n)$ of an unknown parameter $\theta$ which minimizes the mean (expected) value of the squared error (difference) $Y - \theta$, that is, minimizes $E[(Y - \theta)^2] = E\{[u(X_1, X_2, \ldots, X_n) - \theta]^2\}$.

The statistic $Y$ that minimizes $E[(Y - \theta)^2]$ is the one with minimum mean square error. If we restrict our attention to unbiased estimators only, then $\mathrm{Var}(Y) = E[(Y - \theta)^2]$, and the unbiased statistic $Y$ that minimizes this expression is said to be the unbiased minimum variance estimator of $\theta$.
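As an informal illustration (not in the original notes), the short simulation below approximates $E(\bar{X})$ and $\mathrm{Var}(\bar{X})$ for samples from a normal distribution and checks that the average of the estimates is close to the true mean, which is what unbiasedness of the sample mean asserts. The sample size and parameter values are arbitrary choices.

```python
import numpy as np

# Monte Carlo check that the sample mean is an unbiased estimator of mu.
rng = np.random.default_rng(1)
mu, sigma, n, reps = 3.0, 2.0, 10, 100_000

samples = rng.normal(mu, sigma, size=(reps, n))
xbar = samples.mean(axis=1)          # one estimate of mu per simulated sample

print(xbar.mean())   # close to mu = 3.0, consistent with E(X-bar) = mu
print(xbar.var())    # close to sigma^2 / n = 0.4, the estimator's variance
```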

Method of moments

One of the oldest procedures for estimating parameters is the method of moments . Another method for finding an estimator of an unknown parameter is called the method of maximum likelihood . In general, in the method of moments, if there are k parameters that have to be estimated, the first k sample moments are set equal to the first k population moments that are given in terms of the unknown parameters.

Let the distribution of $X$ be $N(\mu, \sigma^2)$. Then $E(X) = \mu$ and $E(X^2) = \sigma^2 + \mu^2$. Given a random sample of size $n$, the first two sample moments are given by
$$m_1 = \frac{1}{n}\sum_{i=1}^{n} x_i \quad\text{and}\quad m_2 = \frac{1}{n}\sum_{i=1}^{n} x_i^2.$$

We set $m_1 = E(X)$ and $m_2 = E(X^2)$ and solve for $\mu$ and $\sigma^2$:
$$\frac{1}{n}\sum_{i=1}^{n} x_i = \mu \quad\text{and}\quad \frac{1}{n}\sum_{i=1}^{n} x_i^2 = \sigma^2 + \mu^2.$$

The first equation yields $\bar{x}$ as the estimate of $\mu$. Replacing $\mu^2$ with $\bar{x}^2$ in the second equation and solving for $\sigma^2$, we obtain
$$\frac{1}{n}\sum_{i=1}^{n} x_i^2 - \bar{x}^2 = \frac{1}{n}\sum_{i=1}^{n} (x_i - \bar{x})^2 = v$$
for the solution of $\sigma^2$.


Thus the method of moments estimators for $\mu$ and $\sigma^2$ are $\tilde{\mu} = \bar{X}$ and $\tilde{\sigma}^2 = V$. Of course, $\tilde{\mu} = \bar{X}$ is unbiased, whereas $\tilde{\sigma}^2 = V$ is biased.
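Here is a minimal sketch (an addition, not from the original notes) of these method-of-moments estimates computed from a simulated normal sample; the parameter values and variable names are hypothetical.

```python
import numpy as np

# Method-of-moments estimates for a N(mu, sigma^2) sample:
#   mu~ = sample mean,  sigma~^2 = m2 - m1^2 = (1/n) * sum (x_i - xbar)^2.
rng = np.random.default_rng(2)
x = rng.normal(loc=10.0, scale=3.0, size=500)   # simulated data

m1 = x.mean()                 # first sample moment
m2 = np.mean(x ** 2)          # second sample moment

mu_tilde = m1
sigma2_tilde = m2 - m1 ** 2   # equals the variance computed with the 1/n divisor

print(mu_tilde, sigma2_tilde)
print(np.var(x, ddof=0))      # same value as sigma2_tilde, computed directly
```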

At this stage the question arises which of two different estimators, $\hat{\theta}$ and $\tilde{\theta}$, of a parameter $\theta$ one should use. Most statisticians select the one with the smaller mean square error; that is, if $E[(\hat{\theta} - \theta)^2] < E[(\tilde{\theta} - \theta)^2]$, then $\hat{\theta}$ seems to be preferred. This means that if $E(\hat{\theta}) = E(\tilde{\theta}) = \theta$, then one would select the one with the smaller variance.
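To make this concrete, here is a small simulation (an addition, not part of the original notes) comparing two estimators of $\sigma^2$ for normal samples, the biased estimator $V$ (divisor $n$) and the usual unbiased estimator $S^2$ (divisor $n-1$), in terms of bias and mean square error; the parameter values are arbitrary.

```python
import numpy as np

# Compare two estimators of sigma^2 for normal samples:
#   V  = (1/n)     * sum (x_i - xbar)^2   (method of moments, biased)
#   S2 = (1/(n-1)) * sum (x_i - xbar)^2   (unbiased)
rng = np.random.default_rng(3)
mu, sigma2, n, reps = 0.0, 4.0, 10, 200_000

samples = rng.normal(mu, np.sqrt(sigma2), size=(reps, n))
V = samples.var(axis=1, ddof=0)
S2 = samples.var(axis=1, ddof=1)

print(V.mean(), S2.mean())            # V underestimates sigma^2 on average; S2 does not
print(((V - sigma2) ** 2).mean(),     # mean square error of V
      ((S2 - sigma2) ** 2).mean())    # mean square error of S2
```

In this setting $V$ turns out to have the smaller mean square error even though it is biased, which shows that unbiasedness and minimum mean square error are distinct criteria.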

Next, other questions should be considered. Namely, given an estimate for a parameter, how accurate is the estimate? How confident can one be about the closeness of the estimate to the unknown parameter?

These questions are addressed in CONFIDENCE INTERVALS I and CONFIDENCE INTERVALS II.





Source:  OpenStax, Introduction to statistics. OpenStax CNX. Oct 09, 2007 Download for free at http://cnx.org/content/col10343/1.3
