Intel® Fortran Compiler
Build applications that can scale for the future with optimized code designed for Intel® Xeon® and compatible processors.

ifx performance issues

Umar__Sait
Novice
274 Views

There is a significany drop in performance between ifort and ifx. For Intel CPUs:

Intel(R) Xeon(R) Gold 6246R CPU @ 3.40GHz (16+16=32 core)

Intel ifort 2021.13.1
CFLAGS= -free -warn all -diag-disable=10448 -nogen-interfaces -no-prec-div -O3 -fp-model=fast=2 -xHost
real 124m20.650s
user 3049m40.783s
sys 10m40.430s

Intel ifx 2025.3.0
CFLAGS= -free -warn all -nogen-interfaces -O3 -xHost -qopenmp
real 162m36.896s
user 3966m5.817s
sys 12m34.195s

 

albeit faster than gfortran at 216m. The problem is double precision complex algebra. The code

is also using openmp.

 

In addition, ifort runs much faster on AMD CPU:

AMD Ryzen Threadripper PRO 7975WX @ 4.0GHz 32-Cores

Intel ifort 2021.13.1
CFLAGS= -free -warn all -nogen-interfaces -diag-disable=10448 -Ofast -march=SSE4.2,CORE-AVX2,znver4 -qopenmp

real 67m37.749s
user 1744m37.811s
sys 13m21.562s

 

Intel ifx 2025.3.0
CFLAGS= -free -warn all -nogen-interfaces -O3 -Ofast -march=znver4 -qopenmp

real 113m5.164s
user 2487m13.638s
sys 13m46.812s

 

Intel seems to have removed some of the optimization features for AMD processorts from

ifx....can't use SSE4.2,CORE-AVX2 anymore.....

0 Kudos
5 Replies
Igor_V_Intel
Moderator
246 Views

Could you please share a code showing this performance drop?

Note that LLVM IR doesn't have complex data types and thus code with complex type algebra is a known issue vs ifort (there Intel had native complex type support on the proprietary IR level). It should be improved in the next major release of ifx.

0 Kudos
Umar__Sait
Novice
219 Views

This is a very large nuclear reactions code written in Fortran 95, so it is hard to share. Normally, in some cases the code has to be run for many days, even a week to get the answers. These were shorter runs for timing purposes. But one can see that all llvm based compilers are running slower than ifort. I am looking forward to the improvements you mention and hopefully we can switch to ifx at some point, but as you can see running for 5 days vs 10 days makes a big difference so we will stick with ifort until then.

0 Kudos
mecej4O
New Contributor I
119 Views

The attached program conj11.f90 runs for about 1 second and produces a counterexample for the Euler Conjecture on the sum of fifth powers.

The EXEs generated using IFort consistently run faster than those produced by Ifx. I hope that this example code will help you to make Ifx produce EXEs that are not slower than those produced by Ifort.

Thanks.

0 Kudos
JohnNichols
Valued Contributor III
85 Views

conj11 performance

Core I7, VS 2022, latest Oneapi, Windows Preview

debug 32 bit == 2.781 seconds 4.5 times slower

debug 64 bit == 1.781 seconds 2.98 times slower

release 32 bit == 1.875 seconds 3.125 times slower

release 64 bit == 0.6 seconds 1 times slower (humour ok)

There is no evidence using a stock standard anything that IFX is slower, for this program it is not.  

0 Kudos
andrew_4619
Honored Contributor III
66 Views

??? how could you test 32bit on latest Oneapi????

0 Kudos
Reply