Main Content
Network Compression Applications
Compress a deep neural network by performing quantization, learnables
compression, or pruning
Generate code for deep learning networks with reduced memory footprint and computational requirements.
Topics
- Generate INT8 Code for Deep Learning Networks
Quantize and generate code for a pretrained convolutional neural network.