- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
We're excited to announce the latest release of the OpenVINO™ toolkit, 2024.3. This update brings continued improvements in LLM performance, empowering your generative AI workloads with OpenVINO.
Top 3 Feature Highlights for 2024.3
- MODEL: OpenVINO pre-optimized models are now available in Hugging Face making it easier for developers to get started with these models.
- OPTIMIZE: Significant improvement in LLM performance on discrete Intel® GPUs with the addition of Multi-Head Attention (MHA) and OneDNN enhancements.
- DEPLOY: Improved CPU performance when serving LLMs with the inclusion of vLLM and continuous batching in the OpenVINO Model Server (OVMS). vLLM is an easy-to-use open-source library that supports efficient LLM inferencing and model serving.
Download the 2024.3 Release
Download Latest Release Now
Get all the details
See 2024.3 release notes
NNCF RELEASE
Check out the new NNCF release
Helpful Links
NOTE: Links open in a new window.
Link Copied
0 Replies

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page