Gaussian Mixture Model for speech recognition

Hi all! I'm implementing a command-based speech recognition tool.
My training data consist of 21 recordings (7 different commands, 3 utterances each). So far I have done:
  • the pre-processing phase (silence removal and end-point detection)
  • the feature extraction phase (MFCC calculation).
So, for every utterance in my training set, I have an MFCC matrix with 12 columns (one per MFCC coefficient) and as many rows as the number of frames the signal was divided into.
For the recognition phase, I was thinking of using the gmdistribution tool.
I read this article:
% model = gmdistribution.fit(MFCCtraindata,M);
What is the MFCCtraindata parameter?
Is it the MFCC matrix associated with a single utterance?
For each command I have 3 utterances, so I have 3 different MFCC matrices.
How can I create a single GMM per command, if fitting each utterance separately gives me 3 different GMMs for every command?
Any kind of help will be appreciated.
Thank you!!
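
One common way to handle several utterances per command is to stack their MFCC matrices row-wise, so that every frame becomes one observation, and fit a single GMM per command on the stacked matrix. The following is only a minimal sketch under stated assumptions: the MFCC matrices are replaced by random stand-in data so the code runs on its own, and the number of components M = 2, the diagonal covariance type and the regularization value are illustrative choices, not values from the original post. It uses the gmdistribution.fit call quoted in the question; newer releases of the Statistics and Machine Learning Toolbox provide fitgmdist as its replacement.

```matlab
% Stand-in data (assumption): 7 commands, 3 utterances each, 12 MFCCs per frame.
% Real MFCC matrices from the feature extraction step would replace randn.
rng(0);                       % reproducible random numbers
nCommands = 7;
nUtt      = 3;
nCoef     = 12;
M         = 2;                % number of mixture components (assumed)

models = cell(nCommands, 1);
for c = 1:nCommands
    utt = cell(nUtt, 1);
    for u = 1:nUtt
        nFrames = 40 + randi(20);                    % utterances differ in length
        utt{u}  = randn(nFrames, nCoef) + 0.5 * c;   % fake MFCCs, mean depends on c
    end

    % Stack all frames of this command: rows are frames, columns are MFCCs.
    MFCCtraindata = vertcat(utt{:});

    % One GMM per command, trained on the frames of all its utterances.
    % Diagonal covariances and a small regularization term are assumptions
    % that often help when training data are scarce.
    models{c} = gmdistribution.fit(MFCCtraindata, M, ...
        'CovType', 'diagonal', 'Regularize', 1e-3);
end

% Recognition: score an unknown utterance against every command model by
% summing the per-frame log-densities, then pick the best-scoring model.
testMfcc = randn(50, nCoef) + 0.5 * 4;   % fake utterance resembling command 4
logLik   = zeros(nCommands, 1);
for c = 1:nCommands
    logLik(c) = sum(log(pdf(models{c}, testMfcc)));
end
[~, recognized] = max(logLik);
```

With this layout, MFCCtraindata is the matrix of all frames belonging to one command, not the matrix of a single utterance, so each command ends up with exactly one model. With only three utterances per command, a small M is usually the safer starting point.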

Answers (5)

Castalia 2013-3-8
Could anybody give me some advice, please?

Rania Ziedan 2015-10-22
I really need help with the same issue. If you solved it, could you help me? Thanks in advance.

MUZITIANXINJIE 2016-6-26
Yes, I want this too, but no one has helped me! I really need to use deep learning to classify voice recordings for recognition. Thanks for your help.

yasir riaz 2016-12-21
Please help.

hanieh rafiee 2017-2-19
Hi. Did you receive an answer to your question? Could you help me, please?
