Concept: quantization (deep learning)

This is an excerpt from Manning's book Deep Learning with PyTorch.
In chapter 15, we will briefly touch on quantization. When a model is quantized, stateless bits like activations suddenly become stateful, because information about the quantization needs to be captured. This means that if we aim to quantize our model, it might be worthwhile to stick with the modular API if we go for non-JITed quantization. One style matter will help you avoid surprises with (originally unforeseen) uses: if you need several applications of stateless modules (such as nn.HardTanh or nn.ReLU), it is probably a good idea to have a separate instance for each. Reusing the same module may look clever and will give correct results with standard Python usage as shown here, but tools analyzing your model may trip over it.
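To make the style point concrete, here is a minimal sketch (the module name and layer sizes are invented for illustration) that gives each activation its own instance, so tools that walk the module hierarchy, such as tracing or quantization preparation, see one distinct node per application:

```python
import torch
import torch.nn as nn

class TwoLayerNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(16, 32)
        self.act1 = nn.ReLU()   # one instance for the first application
        self.fc2 = nn.Linear(32, 8)
        self.act2 = nn.ReLU()   # ...and a separate instance for the second
    def forward(self, x):
        return self.act2(self.fc2(self.act1(self.fc1(x))))

# Reusing a single self.act for both calls would still compute the correct
# result in plain Python, but model-analysis tools may conflate the two uses.
model = TwoLayerNet()
print(model(torch.randn(1, 16)).shape)
```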
Another approach is to reduce the footprint of each parameter and operation: instead of spending the usual 32 bits per parameter in the form of a float, we convert our model to work with integers (a typical choice is 8 bits). This is quantization.
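As a rough sketch of what this looks like in practice (the model and layer sizes here are illustrative, not from the book), PyTorch's post-training dynamic quantization replaces the float32 weights of selected layer types with 8-bit integers plus a scale and zero point, shrinking the stored weights roughly fourfold:

```python
import torch
import torch.nn as nn

float_model = nn.Sequential(
    nn.Linear(16, 32),
    nn.ReLU(),
    nn.Linear(32, 8),
).eval()

# Convert the Linear layers to dynamically quantized (int8-weight) versions.
quantized_model = torch.quantization.quantize_dynamic(
    float_model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 16)
print(quantized_model(x).shape)           # same call signature as the float model
print(quantized_model[0].weight().dtype)  # torch.qint8 instead of torch.float32
```

Dynamic quantization is only one of several schemes; static and quantization-aware approaches also exist, and they require capturing the quantization parameters mentioned above.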