This is machine translation

Translated by Microsoft
Mouseover text to see original. Click the button below to return to the English version of the page.

Note: This page has been translated by MathWorks. Click here to see
To view all translated materials including this page, select Country from the country navigator on the bottom of this page.

Feature Extraction and Deep Learning

Audio labeling, datastore, voice activity detection, MFCC, pitch, loudness

Audio Toolbox™ enables you to extract auditory features common to machine-learning and deep-learning tasks. Use Audio Labeler to interactively define and visualize ground-truth for audio datasets. Use audioDatastore to handle large collections of audio recordings for batch processing or machine and deep learning applications.

Apps

Audio LabelerDefine and visualize ground-truth labels

Functions

expand all

audioDatastoreDatastore for collection of audio files
erb2hzConvert from equivalent rectangular bandwidth (ERB) scale to hertz
bark2hzConvert from Bark scale to hertz
mel2hzConvert from mel scale to hertz
hz2erbConvert from hertz to equivalent rectangular bandwidth (ERB) scale
hz2barkConvert from hertz to Bark scale
hz2melConvert from hertz to mel scale
integratedLoudnessMeasure integrated loudness and loudness range
loudnessMeterStandard-compliant loudness measurements
harmonicRatioHarmonic ratio
pitchEstimate fundamental frequency of audio signal
voiceActivityDetectorDetect presence of speech in audio signal
mfccExtract mfcc, log energy, delta, and delta-delta of audio signal
gtccExtract gammatone cepstral coefficients, log-energy, delta, and delta-delta
cepstralFeatureExtractorExtract cepstral features from audio segment
spectralCentroidSpectral centroid for audio signals and auditory spectrograms
spectralCrestSpectral crest for audio signals and auditory spectrograms
spectralDecreaseSpectral decrease for audio signals and auditory spectrograms
spectralEntropySpectral entropy for audio signals and auditory spectrograms
spectralFlatnessSpectral flatness for audio signals and auditory spectrograms
spectralFluxSpectral flux for audio signals and auditory spectrograms
spectralKurtosisSpectral kurtosis for audio signals and auditory spectrograms
spectralRolloffPointSpectral rolloff point for audio signals and auditory spectrograms
spectralSkewnessSpectral skewness for audio signals and auditory spectrograms
spectralSlopeSpectral slope for audio signals and auditory spectrograms
spectralSpreadSpectral spread for audio signals and auditory spectrograms
melSpectrogramMel spectrogram
kbdwinKaiser-Bessel-derived window
mdctModified discrete cosine transform
imdctInverse modified discrete cosine transform

Blocks

Voice Activity DetectorDetect presence of speech in audio signal
Cepstral Feature ExtractorExtract cepstral features from audio segment
Loudness MeterStandard-compliant loudness measurements

Topics

Label Audio Using Audio Labeler

Interactively define and visualize ground-truth labels for audio datasets

Speech-to-Text Transcription

Perform speech-to-text transcription in MATLAB® using third-party cloud-based APIs.

Spectral Descriptors

Overview and applications of spectral descriptors

Featured Examples