计算机视觉

利用计算机视觉应用扩展深度学习工作流

通过将 Computer Vision Toolbox™ 与 Deep Learning Toolbox™ 结合使用，将深度学习应用于计算机视觉应用。

App

图像标注器	为计算机视觉应用标注图像
视频标注器	Label video for computer vision applications

函数

全部展开

训练数据的数据存储

`boxLabelDatastore`	Datastore for bounding box label data
`pixelLabelDatastore`	Datastore for pixel label data

ViT（视觉变换器）

`visionTransformer`	Pretrained vision transformer (ViT) neural network (自 R2023b 起)
`patchEmbeddingLayer`	Patch embedding layer (自 R2023b 起)

语义分割

`unet`	Create U-Net convolutional neural network for semantic segmentation (自 R2024a 起)
`unet3d`	Create 3-D U-Net convolutional neural network for semantic segmentation of volumetric images (自 R2024a 起)
`deeplabv3plus`	Create DeepLab v3+ convolutional neural network for semantic image segmentation (自 R2024a 起)

目标检测

`rtmdetObjectDetector`	Detect objects using RTMDet object detector (自 R2024b 起)
`yolov4ObjectDetector`	Detect objects using YOLO v4 object detector (自 R2022a 起)
`yolov2ObjectDetector`	Detect objects using YOLO v2 object detector
`yolov3ObjectDetector`	Detect objects using YOLO v3 object detector (自 R2021a 起)
`ssdObjectDetector`	Detect objects using SSD deep learning detector

实例分割

`solov2`	Segment objects using SOLOv2 instance segmentation network (自 R2023b 起)
`maskrcnn`	Detect objects using Mask R-CNN instance segmentation (自 R2021b 起)

位姿估计

posemaskrcnn Predict object pose using Pose Mask R-CNN pose estimation (自 R2024a 起)

目标跟踪和重新识别

reidentificationNetwork Re-identification deep learning network for re-identifying and tracking objects (自 R2024a 起)

自动化视觉检查

`yoloxObjectDetector`	Detect objects using YOLOX object detector (自 R2023b 起)
`efficientADAnomalyDetector`	Detect anomalies using EfficientAD network (自 R2024b 起)
`patchCoreAnomalyDetector`	Detect anomalies using PatchCore network (自 R2023a 起)
`fcddAnomalyDetector`	Detect anomalies using fully convolutional data description (FCDD) network for anomaly detection (自 R2022b 起)
`fastFlowAnomalyDetector`	Detect anomalies using FastFlow network (自 R2023a 起)

文本检测和识别

detectTextCRAFT Detect texts in images by using CRAFT deep learning model (自 R2022a 起)

主题

图像分类

Train Vision Transformer Network for Image Classification
This example shows how to fine-tune a pretrained vision transformer (ViT) neural network to perform classification on a new collection of images.

目标检测和实例分割

Get Started with Object Detection Using Deep Learning (Computer Vision Toolbox)
Perform object detection using deep learning neural networks such as YOLOX, YOLO v4, and SSD.
Get Started with Instance Segmentation Using Deep Learning (Computer Vision Toolbox)
Segment objects using an instance segmentation model such as SOLOv2 or Mask R-CNN.
Choose an Object Detector (Computer Vision Toolbox)
Compare object detection deep learning models, such as YOLOX, YOLO v4, RTMDet, and SSD.
Augment Bounding Boxes for Object Detection
This example shows how to perform common kinds of image and bounding box augmentation as part of object detection workflows.
Import Pretrained ONNX YOLO v2 Object Detector
This example shows how to import a pretrained ONNX™ (Open Neural Network Exchange) you only look once (YOLO) v2 [1] object detection network and use the network to detect objects.
Export YOLO v2 Object Detector to ONNX
This example shows how to export a YOLO v2 object detection network to ONNX™ (Open Neural Network Exchange) model format.
将目标检测模型部署为微服务 (MATLAB Compiler SDK)
使用微服务检测图像中的目标。

自动化视觉检查

Getting Started with Anomaly Detection Using Deep Learning (Computer Vision Toolbox)
Anomaly detection using deep learning is an increasingly popular approach to automating visual inspection tasks.
Detect Image Anomalies Using Explainable FCDD Network (Computer Vision Toolbox)
Use an anomaly detector to distinguish between normal pills and pills with anomalous chips or contamination.
Classify Defects on Wafer Maps Using Deep Learning (Computer Vision Toolbox)
Classify manufacturing defects on wafer maps using a simple convolutional neural network (CNN).
Detect Image Anomalies Using Pretrained ResNet-18 Feature Embeddings (Computer Vision Toolbox)
Train a similarity-based anomaly detector using one-class learning of feature embeddings extracted from a pretrained ResNet-18 convolutional neural network.
Localize Industrial Defects Using PatchCore Anomaly Detector (Computer Vision Toolbox)
Perform localization of anomalous defects in printed circuit boards (PCBs) using anomaly heat maps generated with the PatchCore anomaly detector.

语义分割

Get Started with Semantic Segmentation Using Deep Learning (Computer Vision Toolbox)
Segment objects by class using deep learning networks such as U-Net and DeepLab v3+.
Augment Pixel Labels for Semantic Segmentation
This example shows how to perform common kinds of image and pixel label augmentation as part of semantic segmentation workflows.
使用扩张卷积进行语义分割
此示例说明如何使用扩张卷积训练语义分割网络。
使用深度学习对多光谱图像进行语义分割 (Computer Vision Toolbox)
此示例说明如何使用 U-Net 对包含七个通道的多光谱图像执行语义分割。
Explore Semantic Segmentation Network Using Grad-CAM
This example shows how to explore the predictions of a pretrained semantic segmentation network using Grad-CAM.
Generate Adversarial Examples for Semantic Segmentation (Computer Vision Toolbox)
Generate adversarial examples for a semantic segmentation network using the basic iterative method (BIM).
Prune and Quantize Semantic Segmentation Network
Reduce the memory footprint of a semantic segmentation network and speed-up inference by compressing the network using pruning and quantization.

视频分类

Activity Recognition from Video and Optical Flow Data Using Deep Learning
This example first shows how to perform activity recognition using a pretrained Inflated 3-D (I3D) two-stream convolutional neural network based video classifier and then shows how to use transfer learning to train such a video classifier using RGB and optical flow data from videos [1].
Gesture Recognition using Videos and Deep Learning
Perform gesture recognition using a pretrained SlowFast video classifier.

精选示例

Identify Defects in Air Compressors Using Spectrogram Images

Detect and localize defects in acoustic recordings of air compressors using Mel spectrogram images and an EfficientAD anomaly detector.

(Computer Vision Toolbox)

自 R2025a 起

Detect Small Objects Using Tiled Training of YOLOX Network

Detect small objects in full-resolution images using tiled training of a you only look once version X (YOLOX) deep learning network.

(Computer Vision Toolbox)

自 R2024b 起

Automatically Label Ground Truth Using Segment Anything Model

Produce pixel labels for semantic segmentation using the Segment Anything Model (SAM) in the 图像标注器 (Computer Vision Toolbox) app. The SAM is an automatic segmentation technique that you can use to segment object regions to label with just a few clicks, or automatically segment the entire image and instantaneously create labels for selected regions. In this example, you interactively label pixels for semantic segmentation in two ways.

(Computer Vision Toolbox)

自 R2024b 起

Detect Defects Using Tiled Training of EfficientAD Anomaly Detector

Detect and localize defects on anomalous chewing gum images by training an EfficientAD anomaly detection network on tiled normal images.

(Computer Vision Toolbox)

自 R2024b 起

Localize Industrial Defects Using PatchCore Anomaly Detector

Perform localization of anomalous defects in printed circuit boards (PCBs) using anomaly heat maps generated with the PatchCore anomaly detector.

(Computer Vision Toolbox)

Detect Defects on Printed Circuit Boards Using YOLOX Network

Detect, localize, and classify defects in printed circuit boards (PCBs) using a you only look once version X (YOLOX) deep learning network.

(Computer Vision Toolbox)

Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning

Perform six degrees-of-freedom (6-DoF) pose estimation by estimating the 3-D position and orientation of machine parts in a bin using RGB-D images and a deep learning network.

打开实时脚本

Reidentify People Throughout a Video Sequence Using ReID Network

Track people throughout a video sequence using re-identification with a residual network.

打开实时脚本

Perform Instance Segmentation Using SOLOv2

Segment object instances of randomly rotated machine parts in a bin using a deep learning SOLOv2 network.

(Computer Vision Toolbox)

使用 YOLO v2 深度学习进行目标检测

此示例说明如何训练 you only look once (YOLO) v2 目标检测器。

打开实时脚本

Object Detection Using SSD Deep Learning

Train a Single Shot Detector (SSD).

打开实时脚本

Object Detection Using YOLO v4 Deep Learning

Detect objects in images using you only look once version 4 (YOLO v4) deep learning network. In this example, you will

打开实时脚本

Perform Instance Segmentation Using Mask R-CNN

Segment individual instances of people and cars using a multiclass mask region-based convolutional neural network (R-CNN).

打开实时脚本

使用深度学习进行语义分割

此示例说明如何使用语义分割网络来分割图像。

打开实时脚本

Generate Image from Segmentation Map Using Deep Learning

Generate a synthetic image of a scene from a semantic segmentation map.

打开实时脚本

Estimate Body Pose Using Deep Learning

Estimate the body pose of one or more people using the OpenPose algorithm.

打开实时脚本

Activity Recognition from Video and Optical Flow Data Using Deep Learning

First shows how to perform activity recognition using a pretrained Inflated 3-D (I3D) two-stream convolutional neural network based video classifier and then shows how to use transfer learning to train such a video classifier using RGB and optical flow data from videos [1].

打开实时脚本

Gesture Recognition using Videos and Deep Learning

Perform gesture recognition using a pretrained SlowFast video classifier.

打开实时脚本