Intel® Distribution of OpenVINO™ Toolkit
Community assistance for the Intel® Distribution of OpenVINO™ toolkit, OpenCV, and all aspects of computer vision on Intel® platforms.
6503 Discussions

Are ISA (x86_64/AVX/AMX) and precision (fp32/fp16/bf16/i8) controllable?

CFR
New Contributor II
508 Views

I'm looking to do some studies on running transformers/LLMs on CPU resources. I'd like to take a single Sapphire Rapids box and, in a controlled way, vary the ISA and the precision. I see the ability to give OpenVINO a "hint" about precision, but it's unclear how binding that "hint" is on the underlying software (oneDNN?). I don't see any OpenVINO API that controls the ISA, nor do I see any diagnostics that would tell me what an actual run ended up using.

Does OpenVINO have some way to control these things so that I can do controlled studies? Is there a better choice (like maybe extensions for PyTorch)?
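
For anyone attempting the same kind of sweep, here is a minimal sketch of how the runs could be organized. It rests on assumptions that should be verified on the target install: that the oneDNN bundled with the OpenVINO CPU plugin honors the `ONEDNN_MAX_CPU_ISA` and `ONEDNN_VERBOSE` environment variables, that the ISA names listed match the oneDNN version in use (newer releases rename some of them), and that reading `INFERENCE_PRECISION_HINT` back from the compiled model reflects what the plugin settled on. The helper names (`build_run_plan`, `child_env`, `compile_and_report`) are made up for illustration. The hint only covers floating-point precisions; i8 would normally come from a quantized model rather than from a hint.

```python
import itertools
import os

# Candidate ISA caps for ONEDNN_MAX_CPU_ISA (assumption: names vary by oneDNN version).
ISA_CAPS = ["AVX2", "AVX512_CORE", "AVX512_CORE_BF16", "AVX512_CORE_AMX"]
# Candidate values for OpenVINO's INFERENCE_PRECISION_HINT property.
PRECISION_HINTS = ["f32", "bf16"]


def build_run_plan(isa_caps, precisions):
    """Return one entry per (ISA cap, precision hint) combination."""
    plan = []
    for isa, precision in itertools.product(isa_caps, precisions):
        plan.append({
            # Environment for the child process that will run this case.
            "env": {"ONEDNN_MAX_CPU_ISA": isa, "ONEDNN_VERBOSE": "1"},
            # Config dict passed to core.compile_model().
            "config": {"INFERENCE_PRECISION_HINT": precision},
        })
    return plan


def child_env(overrides, base=None):
    """Merge the per-run variables into a copy of the base environment."""
    env = dict(os.environ if base is None else base)
    env.update(overrides)
    return env


def compile_and_report(model_path, config):
    """Compile on CPU and return the precision hint the compiled model reports.

    OpenVINO is imported here, not at module level, so that the planning
    helpers above can be used on a machine without OpenVINO installed.
    """
    import openvino as ov

    core = ov.Core()
    compiled = core.compile_model(model_path, "CPU", config)
    return str(compiled.get_property("INFERENCE_PRECISION_HINT"))
```

Each plan entry is meant to run in its own fresh process (for example via `subprocess.run(..., env=child_env(entry["env"]))`), because as far as I know oneDNN reads the ISA cap once at initialization, so changing the variable inside an already running process would have no effect. If the verbose output is available, the implementation name printed for each primitive should indicate which ISA and data types were actually used.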

0 Kudos
3 Replies
Aznie_Intel
Moderator
446 Views

Hi CFR,

Thanks for reaching out. I listed the documentation that might be helpful for your reference.

Hope this helps.

Regards,

Aznie


0 Kudos
CFR
New Contributor II
350 Views

Thanks.  Looks like that gets me going in the right direction.

0 Kudos
Aznie_Intel
Moderator
334 Views

Hi CFR,

Glad to hear that. This thread will no longer be monitored since this issue has been resolved. If you need any additional information from Intel, please submit a new question.

Regards,

Aznie


0 Kudos