Intel® Distribution of OpenVINO™ Toolkit
Community assistance about the Intel® Distribution of OpenVINO™ toolkit, OpenCV, and all aspects of computer vision on Intel® platforms.

Inferencing BERT model for Neural Machine Translation

Kannan_K_Intel

Hi,

We were able to optimize the BERT model for Neural Machine Translation successfully, but there is no sample or reference available for running inference on the model with OpenVINO.

Can anyone point me to documentation or a reference on how to write an inference script for this model?

SuryaPSC_Intel

Hi Kannan,

Try running benchmark_app with your IR files and use it as a reference for writing the inference script. Kindly refer to the thread quoted below; a minimal inference sketch follows the quoted output.

Hemanth Kumar G. (Intel) wrote:

Hi,

Try using benchmark_app to experiment with core utilization. I ran the following command on a machine with 18 cores per socket and 2 sockets, which gives 72 logical processors, all of which were utilized at 100% during the peak of loading the model.

numactl -l ~/inference_engine_samples_build/intel64/Release/benchmark_app -i bert_input.bin -m bert_model.ckpt.xml -niter 100 -nthreads 72 -nstreams 72 -nireq 72

Output:

Count:      144 iterations
Duration:   1921.27 ms
Latency:    833.773 ms
Throughput: 74.9504 FPS
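
For reference, here is a minimal synchronous inference sketch against the IR files named in the command above, using the IECore Python API from the 2020-era Inference Engine (net.input_info requires a fairly recent release). The input names, shapes, and dtypes depend on how your model was converted with the Model Optimizer, so the zero-filled int32 tensors below are placeholders, not real token IDs:

from openvino.inference_engine import IECore
import numpy as np

ie = IECore()
# Read the IR produced by the Model Optimizer
net = ie.read_network(model="bert_model.ckpt.xml",
                      weights="bert_model.ckpt.bin")
exec_net = ie.load_network(network=net, device_name="CPU")

# Build placeholder inputs matching each declared input shape;
# for BERT these are typically int32 token/segment/mask tensors
inputs = {name: np.zeros(info.input_data.shape, dtype=np.int32)
          for name, info in net.input_info.items()}

results = exec_net.infer(inputs=inputs)
for name, blob in results.items():
    print(name, blob.shape)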
 
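
The -nireq option in the command above corresponds to keeping several infer requests in flight at once. A rough sketch of the same idea with the asynchronous API, reusing the net and inputs objects from the snippet above (the request count of 4 is arbitrary):

exec_net = ie.load_network(network=net, device_name="CPU", num_requests=4)

for request in exec_net.requests:
    request.async_infer(inputs)   # start all requests without blocking
for request in exec_net.requests:
    request.wait()                # block until this request completes

For actual throughput measurements, benchmark_app remains the more rigorous tool, since it also manages streams and thread configuration for you.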

Best Regards,

Surya
