主要内容

剪枝

使用一阶泰勒逼近对网络滤波器进行剪枝;减少可学习参数的数量

使用一阶泰勒逼近从卷积层中对滤波器进行剪枝。然后,您可以从剪枝过的网络生成 C/C++ 或 CUDA® 代码。

有关 Deep Learning Toolbox™ Model Compression Library 中提供的压缩技术的详细概述,请参阅Reduce Memory Footprint of Deep Neural Networks

Simplified illustration of pruning. On the left is a sketch of a neural network with three layers that consist of four, three, and four neurons, respectively. All neurons are connected to all other neurons. An arrow points to a second sketch on the right that shows the same network, but one neuron has been cut out from the middle layer, and two neurons have been cut out from the final layer.

函数

taylorPrunableNetworkNeural network suitable for compression using Taylor pruning (自 R2022a 起)
forwardCompute deep learning network output for training
predict计算深度学习网络输出以进行推断
updatePrunablesRemove filters from prunable layers based on importance scores (自 R2022a 起)
updateScoreCompute and accumulate Taylor-based importance scores for pruning (自 R2022a 起)

主题

精选示例