
Independent components analysis

Our next topic is Independent Components Analysis (ICA). Similar to PCA, this will find a new basis in which to represent our data. However, the goal is very different.

As a motivating example, consider the “cocktail party problem.” Here, n speakers are speaking simultaneously at a party, and any microphone placed in the room records only an overlapping combination of the n speakers' voices. But let's say we have n different microphones placed in the room, and because each microphone is a different distance from each of the speakers, it records a different combination of the speakers' voices. Using these microphone recordings, can we separate out the original n speakers' speech signals?

To formalize this problem, we imagine that there is some data s ∈ R^n that is generated via n independent sources. What we observe is

x = As,

where A is an unknown square matrix called the mixing matrix. Repeated observations give us a dataset {x^(i); i = 1, ..., m}, and our goal is to recover the sources s^(i) that generated our data (x^(i) = A s^(i)).

In our cocktail party problem, s^(i) is an n-dimensional vector, and s_j^(i) is the sound that speaker j was uttering at time i. Also, x^(i) is an n-dimensional vector, and x_j^(i) is the acoustic reading recorded by microphone j at time i.

Let W = A^{-1} be the unmixing matrix. Our goal is to find W, so that given our microphone recordings x^(i), we can recover the sources by computing s^(i) = W x^(i). For notational convenience, we also let w_i^T denote the i-th row of W, so that

W = ⎡ — w_1^T — ⎤
    ⎢     ⋮     ⎥
    ⎣ — w_n^T — ⎦ .

Thus, w_i ∈ R^n, and the j-th source can be recovered by computing s_j^(i) = w_j^T x^(i).
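In the idealized case where A happens to be known, this recovery step is just a matrix inverse. The following is a minimal NumPy sketch of the mixing model; the Laplace source distribution, the dimensions, and the random seed are illustrative assumptions, and real ICA must of course estimate W from the x^(i)'s alone:

```python
import numpy as np

rng = np.random.default_rng(0)

n, m = 3, 5                    # n sources/microphones, m time samples
S = rng.laplace(size=(n, m))   # rows are independent (non-Gaussian) sources s_j
A = rng.normal(size=(n, n))    # square mixing matrix (unknown in practice)

X = A @ S                      # each column is one observation x^(i) = A s^(i)

W = np.linalg.inv(A)           # unmixing matrix, known here only for illustration
S_rec = W @ X                  # recover s^(i) = W x^(i)

print(np.allclose(S_rec, S))   # exact recovery when A is known
```

With A known, recovery is exact up to floating-point error; the whole difficulty of ICA lies in finding W without access to A.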

ICA ambiguities

To what degree can W = A^{-1} be recovered? If we have no prior knowledge about the sources and the mixing matrix, it is not hard to see that there are some inherent ambiguities in A that are impossible to recover, given only the x^(i)'s.

Specifically, let P be any n-by-n permutation matrix. This means that each row and each column of P has exactly one “1.” Here are some examples of permutation matrices:

P = ⎡ 0 1 0 ⎤ ;   P = ⎡ 0 1 ⎤ ;   P = ⎡ 1 0 ⎤ .
    ⎢ 1 0 0 ⎥        ⎣ 1 0 ⎦        ⎣ 0 1 ⎦
    ⎣ 0 0 1 ⎦

If z is a vector, then Pz is another vector containing a permuted version of z's coordinates. Given only the x^(i)'s, there will be no way to distinguish between W and PW. Specifically, the permutation of the original sources is ambiguous, which should be no surprise. Fortunately, this does not matter for most applications.
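The permutation ambiguity is easy to check numerically. In this NumPy sketch (with an assumed known A purely for illustration), PW unmixes the data just as well as W, returning the same sources with their rows permuted:

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 3, 4
S = rng.laplace(size=(n, m))   # independent sources
A = rng.normal(size=(n, n))    # mixing matrix
X = A @ S                      # observations

W = np.linalg.inv(A)
P = np.array([[0, 1, 0],
              [1, 0, 0],
              [0, 0, 1]])      # a permutation matrix

S_perm = (P @ W) @ X           # unmix with PW instead of W

# PW is just as valid an unmixing matrix: it yields the rows of S
# in permuted order, indistinguishable from W's output without
# prior knowledge of which source was which.
print(np.allclose(S_perm, P @ S))
```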

Further, there is no way to recover the correct scaling of the w_i's. For instance, if A were replaced with 2A, and every s^(i) were replaced with (0.5)s^(i), then our observed x^(i) = 2A · (0.5)s^(i) would still be the same. More broadly, if a single column of A were scaled by a factor of α, and the corresponding source were scaled by a factor of 1/α, then there is again no way, given only the x^(i)'s, to determine that this had happened. Thus, we cannot recover the “correct” scaling of the sources. However, for the applications that we are concerned with—including the cocktail party problem—this ambiguity also does not matter. Specifically, scaling a speaker's speech signal s_j^(i) by some positive factor α affects only the volume of that speaker's speech. Also, sign changes do not matter: s_j^(i) and -s_j^(i) sound identical when played on a speaker. Thus, if the w_i found by an algorithm is scaled by any non-zero real number, the corresponding recovered source s_i = w_i^T x will be scaled by the same factor; but this usually does not matter. (These comments also apply to ICA for the brain/MEG data that we talked about in class.)
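The scaling ambiguity can be verified the same way. In this sketch (illustrative dimensions and distributions, as before), scaling one column of A by α and the matching source row by 1/α leaves every observation x^(i) unchanged, so no algorithm seeing only the x^(i)'s can detect it:

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 3, 4
S = rng.laplace(size=(n, m))   # independent sources
A = rng.normal(size=(n, n))    # mixing matrix

# Scale one column of A by alpha and the matching source by 1/alpha:
alpha = 2.0
A2 = A.copy(); A2[:, 0] *= alpha
S2 = S.copy(); S2[0, :] /= alpha

# The observations are identical, so the scaling is unrecoverable.
print(np.allclose(A @ S, A2 @ S2))
```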





Source:  OpenStax, Machine learning. OpenStax CNX. Oct 14, 2013 Download for free at http://cnx.org/content/col11500/1.4
