Main Content

计算机视觉

利用计算机视觉应用扩展深度学习工作流

通过将 Computer Vision Toolbox™ 与 Deep Learning Toolbox™ 结合使用,将深度学习应用于计算机视觉应用。

App

图像标注器Label images for computer vision applications
视频标注器Label video for computer vision applications

函数

全部展开

boxLabelDatastoreDatastore for bounding box label data (自 R2019b 起)
pixelLabelDatastoreDatastore for pixel label data
visionTransformerPretrained vision transformer (ViT) neural network (自 R2023b 起)
patchEmbeddingLayerPatch embedding layer (自 R2023b 起)
unetCreate U-Net convolutional neural network for semantic segmentation (自 R2024a 起)
unet3dCreate 3-D U-Net convolutional neural network for semantic segmentation of volumetric images (自 R2024a 起)
deeplabv3plusCreate DeepLab v3+ convolutional neural network for semantic image segmentation (自 R2024a 起)
rcnnObjectDetectorDetect objects using R-CNN deep learning detector
fastRCNNObjectDetectorDetect objects using Fast R-CNN deep learning detector
fasterRCNNObjectDetectorDetect objects using Faster R-CNN deep learning detector
ssdObjectDetectorDetect objects using SSD deep learning detector (自 R2020a 起)
yolov2ObjectDetectorDetect objects using YOLO v2 object detector
yolov3ObjectDetectorDetect objects using YOLO v3 object detector (自 R2021a 起)
yolov4ObjectDetectorDetect objects using YOLO v4 object detector (自 R2022a 起)
solov2Segment objects using SOLOv2 instance segmentation network (自 R2023b 起)
maskrcnnDetect objects using Mask R-CNN instance segmentation (自 R2021b 起)
posemaskrcnnPredict object pose using Pose Mask R-CNN pose estimation (自 R2024a 起)
reidentificationNetworkRe-identification deep learning network for re-identifying and tracking objects (自 R2024a 起)
fcddAnomalyDetectorDetect anomalies using fully convolutional data description (FCDD) network for anomaly detection (自 R2022b 起)
fastFlowAnomalyDetectorDetect anomalies using FastFlow network (自 R2023a 起)
patchCoreAnomalyDetectorDetect anomalies using PatchCore network (自 R2023a 起)
yoloxObjectDetectorDetect objects using YOLOX object detector (自 R2023b 起)
detectTextCRAFTDetect texts in images by using CRAFT deep learning model (自 R2022a 起)

主题

图像分类

目标检测和实例分割

自动化视觉检查

语义分割

视频分类