Speech recognition using MFCC and DTW classifier

Mukrimah, Nawir

View/Open

Abstract,Acknowledgement.pdf (123.0Kb)

Introduction.pdf (117.4Kb)

Literature Review.pdf (340.3Kb)

Methodology.pdf (545.8Kb)

Results and Discussion.pdf (489.8Kb)

Conclusion and Recommendation.pdf (100.8Kb)

Refference and Appendics.pdf (145.4Kb)

Date

2015-06

Author

Mukrimah, Nawir

Metadata

Show full item record

Abstract

Speech recognition had been used broadly in many applications such as security systems, healthcare, and equipment designed for handicapped. This project is about design speech recognition by encoding and modeling the systems in the Digital Signal Processing Toolbox, using two algorithms Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) adapted for feature extraction and classification. First, record the words to accomplish the simulations of the programmed system. An experimental database is obtained by speaking 10 numbers (0-9) during the training phase. Second, that training word has been tested to be matched in order to recognize it. The analysis of coding was modified according to the four elements. They are a number of sample frequency (Fs), types of window used, number of triangles (windowing) and size of the window. From these changes elements we can get the result and determine the best performances of speech recognition. The best performance of this speech recognition using MFCC and DTW algorithms are 90% recognition rate. Thus, the designed systems actually work well in the speech recognition.

URI

http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41832

Collections

School of Computer and Communication Engineering (FYP) [310]