<< Chapter < Page Chapter >> Page >
Using the Cooley-Tukey algorithm to derive fast transforms.

To derive the FFT, we assume that the signal's duration is a power of two: N 2 l . Consider what happens to the even-numbered and odd-numbered elements of the sequence in the DFT calculation.

S k s 0 s 2 2 2 k N s N 2 2 N 2 k N s 1 2 k N s 3 2 2 1 k N s N 1 2 N 2 1 k N s 0 s 2 2 k N 2 s N 2 2 N 2 1 k N 2 s 1 s 3 2 k N 2 s N 1 2 N 2 1 k N 2 2 k N

Each term in square brackets has the form of a N 2 -length DFT. The first one is a DFT of the even-numbered elements, and the second of the odd-numbered elements. Thefirst DFT is combined with the second multiplied by the complex exponential 2 k N . The half-length transforms are each evaluated at frequency indices k 0 N 1 . Normally, the number of frequency indices in a DFT calculation range between zero and the transform length minusone. The computational advantage of the FFT comes from recognizing the periodic nature of the discrete Fouriertransform. The FFT simply reuses the computations made in the half-length transforms and combines them through additions andthe multiplication by 2 k N , which is not periodic over N 2 , to rewrite the length-N DFT. [link] illustrates this decomposition. As it stands, we now compute two length- N 2 transforms (complexity 2 O N 2 4 ), multiply one of them by the complex exponential (complexity O N ), and add the results (complexity O N ). At this point, the total complexity is still dominated by the half-length DFT calculations, but the proportionalitycoefficient has been reduced.

Now for the fun. Because N 2 l , each of the half-length transforms can be reduced to two quarter-length transforms, each of these to two eighth-lengthones, etc. This decomposition continues until we are left with length-2 transforms. This transform is quite simple, involvingonly additions. Thus, the first stage of the FFT has N 2 length-2 transforms (see the bottom part of [link] ). Pairs of these transforms are combined by adding one to the other multiplied by a complexexponential. Each pair requires 4 additions and 4 multiplications, giving a total number of computations equaling 8 N 4 N 2 . This number of computations does not change from stage to stage. Because the number of stages, the number of times thelength can be divided by two, equals 2 logbase --> N , the complexity of the FFT is O N N .

Length-8 dft decomposition

The initial decomposition of a length-8 DFT into the terms using even- and odd-indexed inputs marks the first phase ofdeveloping the FFT algorithm. When these half-length transforms are successively decomposed, we are left with the diagram shownin the bottom panel that depicts the length-8 FFT computation.

Doing an example will make computational savings more obvious. Let's look at the details of a length-8 DFT. As shown on [link] , we first decompose the DFT into two length-4 DFTs, with the outputs added and subtractedtogether in pairs. Considering [link] as the frequency index goes from 0 through 7, we recycle values from the length-4 DFTs into the final calculationbecause of the periodicity of the DFT output. Examining how pairs of outputs are collected together, we create the basiccomputational element known as a butterfly ( [link] ).

Butterfly

The basic computational element of the fast Fourier transform is the butterfly. It takes two complex numbers, representedby a and b , and forms the quantities shown. Each butterfly requires onecomplex multiplication and two complex additions.

By considering together the computations involving common output frequencies from the two half-length DFTs, we see that the twocomplex multiplies are related to each other, and we can reduce our computational work even further. By further decomposing thelength-4 DFTs into two length-2 DFTs and combining their outputs, we arrive at the diagram summarizing the length-8 fastFourier transform ( [link] ). Although most of the complex multiplies are quite simple(multiplying by means negating real and imaginary parts), let's count those for purposes of evaluating the complexity as full complexmultiplies. We have N 2 4 complex multiplies and 2 N 16 additions for each stage and 2 logbase --> N 3 stages, making the number of basic computations 3 N 2 2 logbase --> N as predicted.

Note that the ordering of the input sequence in the two parts of [link] aren't quite the same. Why not? How is the ordering determined?

The upper panel has not used the FFT algorithm to compute the length-4 DFTs while the lower one has. The ordering isdetermined by the algorithm.

Got questions? Get instant answers now!

Other "fast" algorithms were discovered, all of which make use of how many common factors the transform length N has. Innumber theory, the number of prime factors a given integer has measures how composite it is. The numbers 16 and 81 are highly composite (equaling 2 4 and 3 4 respectively), the number 18 is less so ( 2 1 3 2 ), and 17 not at all (it's prime). In over thirty years of Fourier transform algorithm development, the originalCooley-Tukey algorithm is far and away the most frequently used. It is so computationally efficient that power-of-twotransform lengths are frequently used regardless of what the actual length of the data.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Signals and systems. OpenStax CNX. Aug 14, 2014 Download for free at http://legacy.cnx.org/content/col10064/1.15
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Signals and systems' conversation and receive update notifications?

Ask