This page is optimized for mobile devices, if you would prefer the desktop version just click here

0.2 3 - voice recognition: chosen methods of investigation

Chosen Methods of Investigation

The limited timeframe of our project meant both DTW and HMM-based approaches were impractical, requiring many hundreds more man-hours than was available. We chose to focus on achieving solid results from a more primitive algorithm, the LPC, and work on making it more robust thereafter.

We collected the several hundreds of data samples used to train the library from ourselves.

We featured-matched input and stored data using the Yule-Walker autocorrelation method, minimizing the forward prediction error in the least squares sense. This was done using Matlab’s Yule-Walker AR Estimator.

Testing the algorithm resulted in an abysmal 20-30% accuracy.

We thought to produce better base accuracy with an algorithm of our own making. Our final results are based upon the following algorithm outlined:

  1. Convolution-based segmentation
  2. Feature extraction of formants via nonlinear power filter
  3. Display filtered spectrum on a discrete, weighted scatter plot
  4. Trace out contours of the maximum-likelihood Gaussian Mixture Model (GMM) using a maximum-likelihood GMM estimator
  5. Construct a standardized GMM parameter library for each number
  6. Find the GMM matching the input with a maximum-likelihood fit

<< Chapter < Page Page > Chapter >>

Read also:

OpenStax, Elec 301 project: voice recognition. OpenStax CNX. Dec 19, 2011 Download for free at http://cnx.org/content/col11396/1.3
Google Play and the Google Play logo are trademarks of Google Inc.
Jobilize.com uses cookies to ensure that you get the best experience. By continuing to use Jobilize.com web-site, you agree to the Terms of Use and Privacy Policy.