
Videos and Webinars


Quantizing a Deep Learning Network in MATLAB

Ram Cherukuri, MathWorks

In this video, we demonstrate the deep learning quantization workflow in MATLAB. Using the Model Quantization Library Support Package, we illustrate how you can calibrate, quantize, and validate a deep learning network such as Resnet50. We also highlight the impact of quantization on reducing the memory footprint of standard networks such as Resnet101 and InceptionV3.


Deep Learning quantization is a key optimization strategy for efficient deployment of deep learning networks, particularly on embedded platforms.

I am Ram Cherukuri, senior product manager at MathWorks, and in this video I will give you an overview of the deep learning quantization workflow in MATLAB.

Quantizing the weights, biases, and activations to lower precision data types like INT8 or FP16 significantly reduces the memory footprint of the AI algorithm and can result in improved inference performance on the embedded hardware.
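To make the footprint reduction concrete, here is a back-of-the-envelope estimate (illustrative numbers, not from the video): going from 32-bit single precision to INT8 shrinks each stored weight from 4 bytes to 1 byte, a 4x reduction for the quantized layers.

```matlab
% Illustrative memory estimate for one convolution layer
% (hypothetical layer shape: 64 filters of size 3x3x64).
numWeights = 3*3*64*64;          % 36,864 weights
fp32KB = numWeights * 4 / 1024;  % single precision: 4 bytes per weight
int8KB = numWeights * 1 / 1024;  % INT8: 1 byte per weight
fprintf('FP32: %g KB, INT8: %g KB (4x smaller)\n', fp32KB, int8KB);
```

Whole-network savings are smaller than 4x when only some layers (for example, the conv layers) are quantized, which matches the 67-72% compression figures reported later in this video.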

You can use the Model Quantization Library Support Package for quantizing your deep learning network in MATLAB. You can download it from the Add-On Explorer as shown here.
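For reference, the same workflow can also be run programmatically. The sketch below uses the `dlquantizer` API that ships with the support package; the function names follow the documented interface but may vary by release, and it assumes `calDS` and `valDS` are existing calibration and validation datastores and that the ResNet-50 model support package is installed.

```matlab
% Programmatic quantization sketch (assumes calDS and valDS exist).
net      = resnet50;                    % pretrained ResNet-50
quantObj = dlquantizer(net);            % create the quantizer object
calStats = calibrate(quantObj, calDS);  % instrument and collect ranges
valRes   = validate(quantObj, valDS);   % quantize and check accuracy
disp(valRes.MetricResults)              % compare accuracy before/after
```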

The quantization workflow uses instrumentation based on a calibration datastore to compute the statistics that are then used to quantize the weights, biases, and activations of the layers of the network.

Finally, the validation step computes accuracy metrics to analyze and understand the impact of quantization on the accuracy of the network. Let’s take Resnet50 as an example network to go through this workflow.

Here is the Deep Network Quantizer app, where you first import the network from the MATLAB workspace; the network structure is displayed in the left pane.

Next, you select the datastore you would like to use for calibration, and the app displays the computed statistics, such as the min and max values of the weights, biases, and activations of each layer. You can also choose which layers to quantize and then validate the impact of quantization using a validation datastore.
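One plausible way to prepare those datastores (the folder name and split ratio here are hypothetical) is to take a representative set of labeled images and resize them to the network's input size, 224-by-224 for ResNet-50:

```matlab
% Build calibration and validation datastores from labeled image folders.
imds = imageDatastore('calibrationImages', ...
    'IncludeSubfolders', true, 'LabelSource', 'foldernames');
[calImds, valImds] = splitEachLabel(imds, 0.7, 'randomized');
calDS = augmentedImageDatastore([224 224], calImds);  % resize on the fly
valDS = augmentedImageDatastore([224 224], valImds);
```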

In this example, we used the default top-1 accuracy metric, and you can see that there is a 67% reduction in memory with no drop in accuracy. You can then proceed to generate code from the quantized network for deployment.
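For the deployment step, a hedged sketch with GPU Coder targeting cuDNN is shown below. The entry-point function `predict_int8` and the saved calibration file `quantObj.mat` are hypothetical names, and INT8 inference requires a supported GPU; check the GPU Coder documentation for your release.

```matlab
% Generate INT8 CUDA code from the quantized network (sketch).
cfg = coder.gpuConfig('mex');
cfg.TargetLang = 'C++';
cfg.DeepLearningConfig = coder.DeepLearningConfig('cudnn');
cfg.DeepLearningConfig.DataType = 'int8';
cfg.DeepLearningConfig.CalibrationResultFile = 'quantObj.mat';
codegen -config cfg predict_int8 -args {ones(224,224,3,'single')}
```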

We repeated this workflow with a few networks, quantizing only the compute-intensive conv layers to INT8.

You can see the impact of quantization in the chart here. For instance, the largest network here, Resnet101 at 180 MB, sees 72% compression with a 2% drop in accuracy. InceptionV3, on the other hand, has the largest accuracy drop, 4%, with 67% compression, going from 100 MB to 33 MB.

This highlights the significant impact of quantization for efficient deployment of deep learning networks.

Please refer to the resources below the video to learn how to get started and explore these new capabilities in MATLAB.

Related Products

  • Deep Learning Toolbox
  • GPU Coder

Learn More

  • Download the support package
  • What Is int8 Quantization and Why Is It Popular for Deep Neural Networks?
  • INT8 Quantization with Deep Network Quantizer
  • Quantization of Deep Neural Networks

Related Information

  • Try out the Deep Network Quantizer app


© 1994-2021 The MathWorks, Inc.
