input shape to the LSTM net when doing inference for VAD tasks

Question

YUKAI SHEN 2023-3-7

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/1924515-input-shape-to-the-lstm-net-when-doing-inference-for-vad-tasks

Answered: Brian Hemmat 2023-3-7

Hi, I am following this article to train an LSTM network for VAD tasks: https://www.mathworks.com/help/deeplearning/ug/voice-activity-detection-in-noise-using-deep-learning.html

My question is about testing the trained LSTM network. In the article, the test-time input is not shaped like the training input, which is (#frames, #time_steps, #features). Does this mean that, during inference, the trained LSTM network takes each frame as an independent input and classifies it as noise or voice, so that basically no hidden state is used during inference? Am I right?

Thank you in advance!


Answer 1

Brian Hemmat 2023-3-7

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/1924515-input-shape-to-the-lstm-net-when-doing-inference-for-vad-tasks#answer_1187540

I did not look at the dimensions you're discussing, but I can say that you are correct that the "streaming" code in the example classifies chunks independently. Note that it is calling classify and not classifyAndUpdateState.

Stay tuned for the R2023a release, where we have updated the example to maintain state (should be coming in the next few weeks).
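To make the difference between the two streaming styles concrete, here is a minimal, framework-independent sketch in plain Python. It is not the code from the MathWorks example. The toy one-unit LSTM, its weights, and the helper names (lstm_step, run_chunk, stateless_stream, stateful_stream) are all hypothetical and chosen only for illustration. The stateless loop restarts from a zero state for every chunk, which is roughly analogous to calling classify per chunk. The stateful loop carries the hidden and cell state forward, which is roughly analogous to classifyAndUpdateState. Note that even in the stateless case the recurrence still runs across the time steps inside a chunk; only the state between chunks is discarded.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


# Hypothetical fixed weights for a toy LSTM with one input feature and one
# hidden unit. Each gate maps to (input weight, recurrent weight, bias).
WEIGHTS = {
    "i": (0.5, 0.4, 0.0),   # input gate
    "f": (0.3, -0.2, 1.0),  # forget gate
    "g": (0.8, 0.6, 0.0),   # candidate cell value
    "o": (0.7, 0.1, 0.0),   # output gate
}


def lstm_step(x, state):
    """Advance the toy LSTM by one time step; return (output, new_state)."""
    h, c = state

    def pre(name):
        w, u, b = WEIGHTS[name]
        return w * x + u * h + b

    i = sigmoid(pre("i"))
    f = sigmoid(pre("f"))
    g = math.tanh(pre("g"))
    o = sigmoid(pre("o"))
    c_new = f * c + i * g
    h_new = o * math.tanh(c_new)
    return h_new, (h_new, c_new)


def run_chunk(chunk, state=(0.0, 0.0)):
    """Run the LSTM over one chunk of samples, starting from the given state."""
    outputs = []
    for x in chunk:
        h, state = lstm_step(x, state)
        outputs.append(h)
    return outputs, state


def stateless_stream(chunks):
    """Start every chunk from a zero state; state between chunks is discarded."""
    out = []
    for chunk in chunks:
        ys, _ = run_chunk(chunk)
        out.extend(ys)
    return out


def stateful_stream(chunks):
    """Carry hidden and cell state from one chunk to the next."""
    out = []
    state = (0.0, 0.0)
    for chunk in chunks:
        ys, state = run_chunk(chunk, state)
        out.extend(ys)
    return out


signal = [0.1, 0.9, -0.4, 0.7, 0.2, -0.8]
chunks = [signal[0:2], signal[2:4], signal[4:6]]

full, _ = run_chunk(signal)           # whole signal processed in one pass
stateless = stateless_stream(chunks)  # state reset at each chunk boundary
stateful = stateful_stream(chunks)    # state maintained across chunks
```

In this sketch the stateful outputs match the single-pass outputs exactly, while the stateless outputs agree only for the first chunk and diverge after the first chunk boundary. This is the behavior the updated example is meant to avoid by maintaining state.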
