Intel® Distribution of OpenVINO™ Toolkit
Community assistance about the Intel® Distribution of OpenVINO™ toolkit, OpenCV, and all aspects of computer vision on Intel® platforms.

How to get an INT8-precision ResNet-50 model with OpenVINO 2021.1

guanglil
Employee

Hi all,

How can I get an INT8-precision ResNet-50 model with OpenVINO 2021.1?

5 Replies
Vladimir_Dudnik
Employee

Hello,

It is possible to quantize a model using the Post-Training Optimization Tool (POT). For convenience, Open Model Zoo provides a script that simplifies the task (it calls POT with the appropriate parameters) and quantizes the model; see the Model Quantizer Usage section of the Open Model Zoo documentation.

Note that, as a prerequisite for quantization, you may need to download a suitable dataset.
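As a rough sketch, the workflow with the Open Model Zoo tools shipped in OpenVINO 2021.1 looks like the following (the model name and the dataset path are examples/placeholders, and the exact flags should be checked against each script's --help; quantizer.py invokes POT under the hood):

```shell
# Run from <openvino_dir>/deployment_tools/open_model_zoo/tools/downloader

# 1. Download the original ResNet-50 model from Open Model Zoo
python3 downloader.py --name resnet-50-tf -o models

# 2. Convert it to OpenVINO IR with Model Optimizer
python3 converter.py --name resnet-50-tf -d models -o ir

# 3. Quantize the FP32 IR to INT8 with POT, using a calibration
#    dataset (e.g. an ImageNet validation subset in <dataset_dir>)
python3 quantizer.py --name resnet-50-tf --model_dir ir --dataset_dir <dataset_dir> --precisions FP16-INT8
```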

Regards,
  Vladimir

guanglil
Employee
1,286 Views

Hi Vladimir,

Thanks for your info. I used this method to convert ResNet50 to INT8 and FP32 successfully.

I then ran benchmark_app with the ResNet50 FP32 and INT8 models, but I got different FP32 results between the model downloaded from https://download.01.org/opencv/2021/openvinotoolkit/2021.1/open_model_zoo/models_bin/1/resnet50-binary-0001/ and the one I converted myself. For example, at batch_size=8 my converted model only reaches 38.66 fps, while the model downloaded from 01.org reaches 173.37 fps.
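For reference, I ran benchmark_app roughly like this (the model paths are placeholders, and the device and iteration count here are assumptions):

```shell
# FP32 IR, batch size 8, on CPU
./benchmark_app -m resnet50_fp32.xml -b 8 -d CPU -niter 100

# Same command against the other IR for comparison
./benchmark_app -m resnet50-binary-0001.xml -b 8 -d CPU -niter 100
```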

So, could you please give me a hint why the results are so different?

 

Guangli

Vladimir_Dudnik
Employee

As you may see from the model description, resnet50-binary-0001 is an FP32-INT1 model:

This is a classical classification network for 1000 classes trained on ImageNet. The difference is that most convolutional layers were replaced by binary ones that can be implemented as XNOR+POPCOUNT operations. Only the input, final, and shortcut layers were kept as FP32; all the rest of the convolutional layers are replaced by BinaryConvolution layers.

So it is different from the FP32-INT8 model you get through quantization with the POT tool.

Vladimir

Munesh_Intel
Moderator

Hi Guangli,


We would like to confirm your question. You are comparing two FP32 models here (your converted ResNet50 FP32 model against the resnet50-binary-0001 FP32 model downloaded from 01.org), aren’t you?


If that’s the case, please share more information about your model, such as the layers in use, command given to Model Optimizer to convert the trained model to Intermediate Representation (IR), and environment details (versions of OS, Python, CMake, TensorFlow, etc.).
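For example, a Model Optimizer invocation for a TensorFlow ResNet-50 typically looks like the following (the input file, shape, and output directory are placeholders to adjust to your model):

```shell
python3 mo.py --input_model resnet50.pb \
    --input_shape [1,224,224,3] \
    --data_type FP32 \
    --output_dir ir_fp32
```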


Regards,

Munesh



Munesh_Intel
Moderator

Hi Guangli,

 

This thread will no longer be monitored since we have provided a solution and explanation. If you need any additional information from Intel, please submit a new question.

 

Regards,

Munesh

 
