Programmable Devices
CPLDs, FPGAs, SoC FPGAs, Configuration, and Transceivers
21589 Discussions

Agilex 7 FP32/FP16 TFlops performance calculation

davidhorowitz
Novice
3,625 Views

Hello, 

 

I would like to calculate the following metrics for the Agilex 7 series:

 

-Peak FP32/FP16 FMA TFlops 

-Peak FP32/FP16 TFlops 

-Peak FP32/FP16 TFlops / mm^2 of fabric

 

How would I calculate these? 

 

I am also looking for the Local Memory Bandwidth, Memory Bandwidth, and die area of the part.

 

Can someone assist me? 

 

Thank you for your time!

0 Kudos
10 Replies
ShengN_Intel
Employee
3,521 Views

Hi,


Seem like a gpu problem. Could you post here https://community.intel.com/t5/Graphics/bd-p/graphics to get help?


Thanks,

Regards,

Sheng


0 Kudos
ShengN_Intel
Employee
3,490 Views

Hi,


Please ignore previous post.


Peak FP32/FP16 FMA TFlops:

Peak FMA TFlops = (Number of FP Units × Clock Frequency in GHz × 2)GFlops / 10^12


Peak FP32/FP16 TFlops:

Peak TFlops = (Number of FP Units × Clock Frequency in GHz)GFlops / 10^12


Peak FP32/FP16 TFlops / mm^2 of fabric:

TFlops/mm² = Peak TFlops / Die Area in mm²


Shows example for AGF027:

From here, https://www.intel.com/content/www/us/en/content-details/690880/agilex-7-fpgas-and-socs-f-series-product-table.html

can get total DSP is 8528, FP32/FP16 fmax is 750mhz/0.75ghz (refer here https://www.intel.com/content/www/us/en/docs/programmable/683301/current/dsp-block-specifications.html )

FP32 Peak FMA TFlops = (8528*0.75Ghz*2)GFlops/10^12 = 12.8TFlops

FP16 Peak FMA TFlops = 12.8*2 = 25.6TFlops


I'll research and post later the rest formula.


Thanks,

Regards,

Sheng


0 Kudos
davidhorowitz
Novice
3,380 Views

Thank you for your help! I will wait for the rest. 

0 Kudos
davidhorowitz
Novice
2,787 Views

Are peak TFLOPs the same for FP32 and FP16 ? These equations seem to suggest that. 

 

Disregard, I see that the equations are different. I mis-read. 

0 Kudos
AdzimZM_Intel
Employee
2,880 Views

Hi David,


For your question about local memory bandwidth, memory bandwidth and die area of the part, are you looking for M20K, MLAB or EMIF? Or another memory?




Regards,

Adzim


0 Kudos
davidhorowitz
Novice
2,800 Views

For die area, I'm asking for die area of the FPGA itself.

 

For my memory question, I would like the maximum bandwidth for the EMIF as well as M20K and MLAB. I understand that these numbers are dependent on multiple factors, but I'm looking for a highest performance type of metric. 

0 Kudos
davidhorowitz
Novice
2,757 Views

For DDR4 memory, i have this equation:

 

DDR4 Memory Bandwidth = (Data Rate (MT/s) * Bus Width (bits) * #channels ) /8

 

Based on the F-Series product table I calculated the DDR4 memory bandwidth as follows:

 

DDR4 Memory Bandwidth = (3.2) * 64 * 4) / 8  = 102.4 GB/s

 

Does this make sense? 

 

For the M20K and MLAB, I don't know what formula would be use to calculate bandwidth.

0 Kudos
AdzimZM_Intel
Employee
2,528 Views
0 Kudos
AdzimZM_Intel
Employee
2,218 Views

Hi David,


Did my previous reply have answered your questions?

Or you still got questions regarding that?


Regards,

Adzim


0 Kudos
AdzimZM_Intel
Employee
1,399 Views

As we do not receive any response from you on the previous reply that we have provided, I now transition this thread to community support. Please login to ‘https://supporttickets.intel.com’, view details of the desire request, and post a feed/response within the next 15 days to allow me to continue to support you. After 15 days, this thread will be transitioned to community support. The community users will be able to help you on your follow-up questions.


0 Kudos
Reply