Deep Learning Network Quantization for Deployment to Embedded Targets
Overview
Quantization enables deploying semantic segmentation algorithms for Deep Learning Networks in limited resource targets. The deployment into Arm, FPGA, and GPU targets will be shown. The challenges of maintaining the accuracy of the network while reducing both the size of the network and the size of the memory needed will be explored.
Highlights
- Deploying Deep Learning Networks on resource constrained targets
- Semantic segmentation example of trained network compression while preserving accuracy
- Generate code for deploying Deep Networks to Arm devices
About the Presenters
Greg Coppenrath
Greg is the product marketing manager for Fixed-Point Designer and Deep Learning Toolbox Model Quantization Library. He has experience in the development of embedded systems and product development in the semiconductor industry. He received an MBA from Worcester Polytechnic Institute, an M.S. in Electrical Engineering from the University of Massachusetts Lowell, and received a B.S. in Electrical Engineering from WPI.
Brenda Zhuang
Brenda Zhuang is a software engineering manager and leads a team that develops software tools for automatic deployment of embedded applications in microprocessors and FPGAs. Brenda has contributed to the development and evolution of many new features in the MATLAB and Simulink product families. She received her Ph.D. in Systems Engineering from Boston University and M.S. in Electrical and Electronics Engineering from Hong Kong University of Science and Technology.
Recorded: 27 Apr 2021