<< Chapter < Page Chapter >> Page >

Suppose you measure a collection of scalars x 1 , , x N . You believe the data is distributed in one of two ways. Your first model, call it H 0 , postulates the data to be governed by the density f 0 x (some fixed density). Your second model, H 1 , postulates a different density f 1 x . These models, termed hypotheses , are denoted as follows: H 0 : x n f 0 x , n = 1 N H 1 : x n f 1 x , n = 1 N A hypothesis test is a rule that, given a measurement x , makes a decision as to which hypothesis best "explains" the data.

Suppose you are confident that your data is normally distributed with variance 1, but you are uncertain aboutthe sign of the mean. You might postulate H 0 : x n -1 1 H 1 : x n 1 1 These densities are depicted in .

Assuming each hypothesis is a priori equally likely, an intuitively appealing hypothesis test is to compute the sample mean x 1 N n 1 N x n , and choose H 0 if x 0 , and H 1 if x 0 . As we will see later, this test is in fact optimal under certain assumptions.

Generalizations and nomenclature

The concepts introduced above can be extended inseveral ways. In what follows we provide more rigorous definitions, describe different kinds of hypothesis testing, andintroduce terminology.

Data

In the most general setup, the observation is a collection x 1 , , x N of random vectors. A common assumption, which facilitates analysis, is that the data are independent and identicallydistributed (IID). The random vectors may be continuous, discrete, or in some cases mixed. It is generally assumedthat all of the data is available at once, although for some applications, such as Sequential Hypothesis Testing , the data is a never ending stream.

Binary versus m-ary tests

When there are two competing hypotheses, we refer to a binary hypothesis test. When the number of hypotheses is M 2 , we refer to an M-ary hypothesis test. Clearly, binary is a special case of M -ary, but binary tests are accorded a special status for certain reasons. These includetheir simplicity, their prevalence in applications, and theoretical results that do not carry over to the M -ary case.

Phase-shift keying

Suppose we wish to transmit a binary string of length r over a noisy communication channel. We assign each of the M 2 r possible bit sequences to a signal s k , k 1 M where s n k 2 f 0 n 2 k 1 M This symboling scheme is known as phase-shift keying (PSK). After transmitting a signal across the noisy channel, the receiver faces an M -ary hypothesis testing problem: H 0 : x s 1 w H M : x s M w where w 0 2 I .

In many binary hypothesis tests, one hypothesis represents the absence of a ceratinfeature. In such cases, the hypothesis is usually labelled H 0 and called the null hypothesis. The other hypothesis is labelled H 1 and called the alternative hypothesis.

Waveform detection

Consider the problem of detecting a known signal s s 1 s N in additive white Gaussian noise (AWGN). This scenario is common in sonar and radar systems. Denotingthe data as x x 1 x N , our hypothesis testing problem is H 0 : x w H 1 : x s w where w 0 2 I . H 0 is the null hypothesis, corresponding to the absence of a signal.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Statistical signal processing. OpenStax CNX. Jun 14, 2004 Download for free at http://cnx.org/content/col10232/1.1
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Statistical signal processing' conversation and receive update notifications?

Ask