NEW RELEASE: OpenVINO 2024.3 Available Now!

Luis_at_Intel · ‎08-01-2024

We're excited to announce the latest release of the OpenVINO™ toolkit, 2024.3. This update brings continued improvements in LLM performance, empowering your generative AI workloads with OpenVINO.

Top 3 Feature Highlights for 2024.3

MODEL: OpenVINO pre-optimized models are now available in Hugging Face making it easier for developers to get started with these models.
OPTIMIZE: Significant improvement in LLM performance on discrete Intel® GPUs with the addition of Multi-Head Attention (MHA) and OneDNN enhancements.
DEPLOY: Improved CPU performance when serving LLMs with the inclusion of vLLM and continuous batching in the OpenVINO Model Server (OVMS). vLLM is an easy-to-use open-source library that supports efficient LLM inferencing and model serving.

Download the 2024.3 Release
Download Latest Release Now

Get all the details
See 2024.3 release notes

NNCF RELEASE

Check out the new NNCF release

Helpful Links

NOTE: Links open in a new window.