Quantized Neural Networks for FPGA Inference
Low-precision quantization helps neural networks meet AI application requirements by delivering greater throughput in the same footprint, or the same throughput with fewer resources. Block floating point (BFP) is particularly useful in this scenario: because a single exponent is shared across a block of values, it preserves a wide dynamic range while the per-value mantissa precision is reduced, which keeps accuracy high at low bit widths. Any remaining drop in accuracy can be recovered by retraining with our open source software.
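To make the idea concrete, here is a minimal sketch of BFP quantization in Python with NumPy. The block size, mantissa width, and function names are illustrative assumptions for this sketch, not the specific format or tooling described above; it simply shows how a shared power-of-two exponent is chosen per block and how each value is rounded to a small integer mantissa.

```python
# Sketch of block floating point (BFP) quantization (one common variant).
# Assumed parameters: block_size=16, mantissa_bits=4 (including sign).
import numpy as np

def bfp_quantize(x: np.ndarray, block_size: int = 16, mantissa_bits: int = 4) -> np.ndarray:
    """Quantize a 1-D array to BFP and return the dequantized values,
    so the rounding error introduced by the format can be inspected."""
    out = np.empty_like(x, dtype=np.float32)
    q_max = 2 ** (mantissa_bits - 1) - 1  # e.g. 7 for 4-bit signed mantissas

    for start in range(0, len(x), block_size):
        block = x[start:start + block_size].astype(np.float32)
        max_abs = float(np.max(np.abs(block)))
        if max_abs == 0.0:
            out[start:start + block_size] = 0.0
            continue

        # Shared exponent: a power-of-two scale chosen so the largest
        # magnitude in the block fits in the signed mantissa range.
        shared_exp = np.ceil(np.log2(max_abs)) - (mantissa_bits - 1)
        scale = 2.0 ** shared_exp

        # Each value becomes a small integer mantissa times the shared scale.
        mantissas = np.clip(np.round(block / scale), -q_max - 1, q_max)
        out[start:start + block_size] = mantissas * scale

    return out

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    weights = rng.standard_normal(64).astype(np.float32)
    q = bfp_quantize(weights)
    print("max abs error:", float(np.max(np.abs(weights - q))))
```

In hardware, only the integer mantissas and one exponent per block need to be stored and multiplied, which is what yields the footprint and throughput gains mentioned above; the small rounding error visible in the sketch is the accuracy cost that retraining can recover.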