Speeding up Deep Learning with Quantization

SoonYau
5 min readNov 18, 2018

In last week, Facebook has just open sourced their matrix multiplication library which you can read it here .Readers may quickly find the word “quantized” or “quantization” appear a lot in that article and wonder what is magical about this new hype word that help giving 2.4x performance boost on CPU. I’m going to give some beginner’s introduction to quantization, I may use some simple maths a long the way but don’t worry, I promise it is very gentle.

Recent advancement in AI or more specifically a technique called deep learning (DL) brought a lot of…

--

--

SoonYau

Independent AI Consultant | Book author of “Hands-on Image Generation with TensorFlow” http://linkedin.com/in/soonyau