Blogs

cancel
Showing results for 
Search instead for 
Did you mean: 

Blogs

Posts

Lowering Multimodal Inference Cost with Heterogeneous E/PD Disaggregation

This technical brief evaluates whether separating the vision encode stage from the prefill/decode path can improve latency and normalized cost efficiency using NVIDIA Dynamo.
0 Kudos
0 Replies

Does DeepSeek* Solve the Small Scale Model Performance Puzzle?

Learn how the DeepSeek-R1 distilled reasoning model performs and see how it works on Intel hardware.
12 Kudos
2 Replies

From Production Lines to Foundry Leadership: A Career Across Malaysia and Taiwan

What does it take to grow from the factory floor to a leadership role?Bob Thor’s journey across Mala...
0 Kudos
0 Replies

From Models to Systems: Enabling Heterogeneous AI Inference with Open Orchestration

The future of AI inference isn’t about any single chip—it’s about how effectively systems work toget...
2 Kudos
2 Replies

Enhancing long-context, high-concurrency LLM serving on a 32 GB workstation GPU: TurboQuant KV Cache

Long context LLM serving is running into a memory wall. The problem is not always raw compute. It is...
1 Kudos
3 Replies

VLSI 2026: Intel 18A Platform Momentum from Devices to Routed Designs

At VLSI 2026, Intel Foundry highlighted how CPU cores built with backside power delivery show dramat...
1 Kudos
0 Replies

Lablup adds Intel Arc Pro B70 support to Backend.AI

Building on its existing support for Intel Gaudi 2/3 AI accelerators, Backend.AI now extends its Int...
1 Kudos
0 Replies

Edge Devices From Sensors to Servers and the Silicon Inside

Edge devices are physical hardware that processes data at or near its source rather than in a centra...
0 Kudos
0 Replies

Remote Edge Management Beyond Deployment Day

Remote management at the edge is the ability to monitor, update, recover, and reconfigure distribute...
0 Kudos
0 Replies

Local AI and the Compute Architecture That Makes It Work

Local AI runs models directly on the device rather than sending data to the cloud. The definition is...
0 Kudos
0 Replies

Edge Computer Vision Beyond Pattern Matching

Edge computer vision is AI processing visual data on local devices rather than sending images and vi...
0 Kudos
0 Replies

Edge Computing vs Cloud Computing Beyond the Binary

Edge computing processes data at or near its source. Cloud computing processes data in centralized d...
0 Kudos
0 Replies

Data Sovereignty Starts Where Your Data Stays

Data sovereignty is the principle that data is subject to the laws of the country or region where it...
0 Kudos
0 Replies

Edge AI for Real-Time Analytics Beyond Low Latency

Edge AI for real-time analytics is processing data at or near its source to generate actionable inte...
0 Kudos
0 Replies

Edge Computing Latency Beyond Network Proximity

Edge computing latency is the time between a device generating data and receiving a processed respon...
0 Kudos
0 Replies