Gaussian Mixture Model for speech recognition
1 次查看(过去 30 天)
显示 更早的评论
Hi all! I'm implementing a tool for speech recognition (command based).
My training data are 21 commands (7 different commands with 3 utterances for each). I did:
- the pre-processing phase (silence removal and end-point detection)
- the features extraction phase (with MFCC calculation).
So, for every utterance in my training set, i have a MFCC matrix with 12 columns (12=number of MFCC) and as much rows as the number of frames i divided the signal.
For the recognition phase, i was wondering to use the gmdistribution tool.
I read this article:
http://www.mathworks.it/company/newsletters/digest/2010/jan/word-recognition-system-matlab.html but i didn't understand this code line:
% model = gmdistribution.fit(MFCCtraindata,M);
What is the MFCCtraindata parameter?
Is it the MFCC matrix associated with every utterance?
For each command i have 3 utterances, so i have 3 different MFCC matrixes.
How can i do to create a unique gmm if, for every command, i will got 3 different gmm?
Any kind of help will be appreciated.
Thank you!!
0 个评论
回答(5 个)
Rania Ziedan
2015-10-22
i really need help in the same issue if you handled it could you help me thanks in advance
0 个评论
MUZITIANXINJIE
2016-6-26
Yes,I want,but no one help me! I really need to use the deep learning tu classfy the voice recognition . thanks for your help.
0 个评论
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Speech Recognition 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!