Is there any official documentation on the DLA runtime or inference engine for managing the DLA from the ARM side? I need to develop a custom application for running inference, but so far I've only found the dla_benchmark (main.cpp) and streaming_inference_app.cpp example files. There should be some documentation covering the SDK. The only related documentation I have found is the Intel FPGA AI Suite PCIe-based design example: https://www.intel.com/content/www/us/en/docs/programmable/768977/2024-3/fpga-runtime-plugin.html
From what I understand, the general inference workflow involves the following steps:
- Identify the hardware architecture
- Deploy the model
- Prepare the input data
- Send inference requests to the DLA
- Retrieve the output data
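For context, here is a minimal sketch of those steps using the OpenVINO 2.0 C++ API, which the dla_benchmark and streaming_inference_app.cpp examples are built on. The model path and the device string ("HETERO:FPGA,CPU") are assumptions; the exact device/plugin name depends on how the FPGA AI Suite runtime plugin is registered on the ARM host.

```cpp
#include <openvino/openvino.hpp>
#include <cstring>

int main() {
    ov::Core core;

    // 1. Identify the hardware: device name is an assumption and depends on
    //    how the FPGA AI Suite runtime plugin is registered (e.g. plugins.xml).
    const std::string device = "HETERO:FPGA,CPU";

    // 2. Deploy the model: read the OpenVINO IR and compile it for the device.
    auto model = core.read_model("resnet50.xml");          // hypothetical path
    ov::CompiledModel compiled = core.compile_model(model, device);

    // 3. Prepare the input data: fill the request's input tensor with
    //    preprocessed data (zeroed here as a placeholder).
    ov::InferRequest request = compiled.create_infer_request();
    ov::Tensor input = request.get_input_tensor();
    std::memset(input.data<float>(), 0, input.get_byte_size());

    // 4. Send the inference request to the DLA.
    request.infer();

    // 5. Retrieve the output data.
    ov::Tensor output = request.get_output_tensor();
    const float* results = output.data<float>();
    (void)results;  // post-process as needed
    return 0;
}
```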
Hello @JohnT_Intel,
I mean that the original example you suggested is intended for CPU/GPU.
The real problem is how inferences are managed. The examples collect multiple input images into a batch and request inference for the entire batch. I need to request an inference every time new data is available. That's where the DLA instantiation problem arises.
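To make the question concrete, something along these lines is what I'm trying to achieve: one infer request reused each time new data arrives, rather than a batched call. This is an OpenVINO 2.0 sketch, not FPGA AI Suite specific; wait_for_next_frame() and process() are hypothetical placeholders for the application's data source and result consumer.

```cpp
#include <openvino/openvino.hpp>
#include <cstring>

// Hypothetical placeholders for the application's data source and consumer.
struct Frame { const float* data; size_t size; };
Frame* wait_for_next_frame();
void process(const ov::Tensor& out);

void run_streaming(ov::CompiledModel& compiled) {
    // One infer request, created once and reused for every new sample,
    // instead of accumulating samples into a batch.
    ov::InferRequest request = compiled.create_infer_request();

    while (Frame* frame = wait_for_next_frame()) {
        ov::Tensor input = request.get_input_tensor();
        std::memcpy(input.data<float>(), frame->data, input.get_byte_size());

        request.start_async();   // queue the single-sample job on the device
        request.wait();          // block until this inference completes
        process(request.get_output_tensor());
    }
}
```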
