为深度神经网络预处理数据

管理和预处理深度学习数据

预处理数据以确保它采用网络可接受的格式是深度学习工作流中常见的第一步。例如，您可以调整图像输入的大小以匹配图像输入层的大小。您还可以对数据进行预处理，以增强所需的特征或减少可能导致网络偏差的伪影。例如，您可以对输入数据进行归一化或去噪。

您可以使用 MATLAB^® 和 Deep Learning Toolbox™ 中提供的数据存储和函数通过调整大小等操作来预处理图像输入。其他 MATLAB 工具箱提供了用于标注、处理和增强深度学习数据的函数、数据存储和 App。您可以使用其他 MATLAB 工具箱中的专用工具，针对图像处理、目标检测、语义分割、信号处理、音频处理和文本分析等领域处理数据。

App

图像标注器	为计算机视觉应用标注图像
视频标注器	Label video for computer vision applications
真实值标注器	Label ground truth data for automated driving applications
激光雷达标注器	Label ground truth data in lidar point clouds
信号标注器	Label signal attributes, regions, and points of interest

函数

`imageDatastore`	图像数据的数据存储
`augmentedImageDatastore`	变换批量以增强图像数据
`imageDataAugmenter`	Configure image data augmentation
`transform`	变换数据存储
`combine`	合并来自多个数据存储的数据
`augment`	对多个图像应用相同的随机变换
`minibatchqueue`	Create mini-batches for deep learning
`TransformedDatastore`	用于变换基础数据存储的数据存储
`CombinedDatastore`	数据存储会合并从多个基础数据存储读取的数据
`padsequences`	Pad or truncate sequence data to same length

主题

预处理深度学习数据

Data Sets for Deep Learning
Discover data sets for various deep learning tasks.
预处理图像以进行深度学习
了解如何调整图像大小以进行训练、预测和分类，以及如何使用数据增强、变换和专用数据存储对图像进行预处理。
Preprocess Volumes for Deep Learning (Image Processing Toolbox)
Read and preprocess volumetric image and label data for 3-D deep learning.
在 MATLAB 中进行深度学习
通过使用卷积神经网络进行分类和回归来探索 MATLAB 的深度学习能力，包括预训练网络和迁移学习，以及在 GPU、CPU、集群和云上进行训练。
Deep Learning Tips and Tricks
Learn how to improve the accuracy of deep learning networks.
Preprocess Data for Domain-Specific Deep Learning Applications

Perform deterministic or randomized data processing for domains such as image processing, object detection, semantic segmentation, signal and audio processing, and text analytics.
- Augment Images for Deep Learning (Image Processing Toolbox)
- Augment Pixel Labels for Semantic Segmentation (Computer Vision Toolbox)
- Augment Bounding Boxes for Object Detection (Computer Vision Toolbox)
- 使用深度学习训练语音命令识别模型
- 使用深度学习对文本数据进行分类

自定义数据存储

Datastores for Deep Learning
Learn how to use datastores in deep learning applications.
使用无法放入内存的序列数据训练网络
此示例说明如何通过变换和合并数据存储基于无法放入内存的序列数据来训练深度学习网络。
使用卷积神经网络对文本数据进行分类
此示例说明如何使用卷积神经网络对文本数据进行分类。
Optimize Datastores for Deep Learning Performance
Explore methods for speeding up deep learning workflows that use datastores.
Develop Custom Mini-Batch Datastore
Create a fully customized mini-batch datastore that contains training and test data sets for network training, prediction, and classification.
使用序列数据的自定义小批量数据存储来训练网络
此示例说明如何使用自定义小批量数据存储基于无法放入内存的序列数据来训练深度学习网络。

标注真实值训练数据

Choose an App to Label Ground Truth Data
Decide which app to use to label ground truth data: Image Labeler, Video Labeler, Ground Truth Labeler, Lidar Labeler, Signal Labeler, or Medical Image Labeler.
Get Started with Ground Truth Labeling (Automated Driving Toolbox)
Interactively label multiple lidar and video signals simultaneously.
Custom Labeling Functions (Signal Processing Toolbox)
Create and manage custom labeling functions.
Label Spoken Words in Audio Signals (Signal Processing Toolbox)
Use Signal Labeler to label spoken words in an audio signal.
Label Pixels for Semantic Segmentation (Computer Vision Toolbox)
Label pixels for training a semantic segmentation network by using a labeling app.

精选示例

Create and Explore Datastore for Image Classification

Create, read, and augment an image datastore for use in training a deep learning network. In particular, this example shows how to create an ImageDatastore object from a collection of images, read and extract the properties of the datastore, and create an augmentedImageDatastore for use during training.