extracting speech from audio

Question

0 个投票

i want to extract 10 sections of a speech signal having spelled 1 to 10.in section i just get the 10th value i need all the values of all the spoken words. please help..

file = wavread( 'C:\Users\Desktop\samples\A');
%sound(file,11025);
totaltime = linspace(0,8,length(file));
i=0;
x=0;
y=0.8;
a=1;
b=8820;
section=(0);
while i<=9
time = linspace(x,y,8820);
%fourier = fft(file);
section1 = file(a:b,:);
section(i)=section1;
sound(section(i), 11025);
plot(time,section(i));
x=x+0.8;
y=y+0.8;
a = a + 8820;
b=b+8820;
i=i+1;
end

this is my project task.im new here. help needed.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Follow Question

Answer 1

Walter Roberson 2012-4-12

编辑：Walter Roberson 2017-10-4

0 个投票

That code should not run at all.

section1 is set to a slice of 8820 samples (in each channel). You then try to store that entire array into a single entry of a numeric array, section(i) . You cannot store 8820 (or 8820 by 2) numeric values into a single numeric location. Your code should exit.

Also, on the first iteration of the while, i is 0, and you are trying to store into section(i) which would be section(0) and that would crash because there is no element #0 in MATLAB arrays.

6 个评论
显示 4更早的评论隐藏 4更早的评论

Walter Roberson 2012-4-13

For any one file, permute() and reshape() and permute() again, and store the result into a 4 dimensional array indexed by segment number, sample within segment, channel, and file number.

faiza khan 2012-4-13

by any one file u mean the file having 10 spoken words from one person only.?

请先登录，再进行评论。

Answer 2

Image Analyst 2012-4-12

0 个投票

That (if done right) only crops out chunks of the wavefile. It does not extract speech from an audio file -- like you asked for in your subject -- that has speech plus other unwanted sounds. I'm no audio expert but if you want to do that, you might try ICA to do blind source separation, as discussed in these web sites:

http://cnl.salk.edu/~tony/ica.html

http://research.ics.tkk.fi/ica/icademo/

http://research.ics.tkk.fi/ica/fastica/

Even if you did Fourier analysis, like you hinted at in your comment, this would simply do frequency filtering and wouldn't necessarily extract out speech from other sounds occupying that frequency range.

2 个评论
显示无隐藏无

faiza khan 2012-4-13

im loading 10 speech wav files into matlab..the first one loads correctly..in the other files the data duplicates.

Image Analyst 2012-4-13

You mean the extracted output files, right - they are all the same? Try this:

Delete this line:

section=(0);

and change these lines from this:

section1 = file(a:b,:);

section(i)=section1;

sound(section(i), 11025);

plot(time,section(i));

to this:

thisSection = file(a:b,:);

sound(thisSection , 11025);

plot(time, thisSection);

请先登录，再进行评论。

extracting speech from audio

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（2 个）

6 个评论
显示 4更早的评论隐藏 4更早的评论

2 个评论
显示无隐藏无

类别

标签

Community Treasure Hunt

extracting speech from audio

0 个评论 显示 -2更早的评论 隐藏 -2更早的评论

回答（2 个）

6 个评论 显示 4更早的评论 隐藏 4更早的评论

2 个评论 显示 无 隐藏 无

类别

标签

另请参阅

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

6 个评论
显示 4更早的评论隐藏 4更早的评论

2 个评论
显示无隐藏无