Computer Vision Toolbox 提供多种用于设计和测试计算机视觉系统的算法和 App。您可以执行视觉检查、目标检测和跟踪,以及特征检测、提取和匹配。您可以自动化单目相机、鱼眼相机、立体相机和多相机配置的标定工作流。对于三维视觉,该工具箱支持立体视觉、点云处理、基于运动进行构建以及实时视觉和点云 SLAM。计算机视觉 App 提供对自动化的团队真值标注,以及相机标定的支持。
该工具箱提供了多种 AI 技术,包括预训练的卷积神经网络 (CNN)、视觉 Transformer 以及视觉-语言模型。您可以使用这些开箱即用的模型来执行图像分类、目标检测、图像分割、姿态估计、字幕生成和光学字符识别 (OCR) 等任务,也可以通过迁移学习对其进行进一步定制。
您可以生成 C、C++、用于 GPU 执行的、以及硬件描述语言 (HDL) 代码。
深度学习和机器学习
训练机器学习模型和深度学习网络,或使用预训练网络,进行目标检测和分割。评估这些网络的性能,并通过生成 C/C++ 或 CUDA® 代码部署这些网络。
代码生成和第三方支持
基于您的计算机视觉算法生成代码,用于快速原型构建、部署和验证。将基于 OpenCV 的工程和函数集成到 MATLAB 和 Simulink 中。
产品资源:
Computer Vision Toolbox
Computer Vision Toolbox provides algorithms and apps for designing and testing computer vision systems, including visual inspection, object detection and tracking, feature detection, extraction, and matching.
The toolbox provides pretrained convolutional neural networks (CNNs), vision transformers, and vision-language models for tasks like image classification, object detection, segmentation, pose estimation, captioning, and optical character recognition (OCR), as well as zero-shot models for vision tasks such as optical flow and 3D depth estimation.
Yes, the Video Labeler and Image Labeler apps enable team-based ground truth labeling with automation workflows for object detection, semantic segmentation, instance segmentation, and scene classification.
Computer Vision Toolbox automates calibration workflows for single, fisheye, stereo, and multi-camera configurations using the Camera Calibrator app, Stereo Camera Calibrator app, or built-in functions.
Yes, the toolbox supports stereo vision, structure from motion, neural radiance fields (NeRF), and real-time visual and point cloud SLAM for 3D vision applications.
You can generate code in C, C++, CUDA for GPU execution, and in hardware description languages (HDL) for rapid prototyping, deployment, and verification.
Yes, the Automated Visual Inspection Library enables you to automatically identify anomalies or defects as part of a manufacturing quality assurance process.
Yes, you can segment, cluster, downsample, denoise, register, and fit geometrical shapes with lidar or 3D point cloud data.