Intel® Fortran Compiler
Build applications that can scale for the future with optimized code designed for Intel® Xeon® and compatible processors.

ifx performance issues

Umar__Sait
Novice
501 Views

There is a significany drop in performance between ifort and ifx. For Intel CPUs:

Intel(R) Xeon(R) Gold 6246R CPU @ 3.40GHz (16+16=32 core)

Intel ifort 2021.13.1
CFLAGS= -free -warn all -diag-disable=10448 -nogen-interfaces -no-prec-div -O3 -fp-model=fast=2 -xHost
real 124m20.650s
user 3049m40.783s
sys 10m40.430s

Intel ifx 2025.3.0
CFLAGS= -free -warn all -nogen-interfaces -O3 -xHost -qopenmp
real 162m36.896s
user 3966m5.817s
sys 12m34.195s

 

albeit faster than gfortran at 216m. The problem is double precision complex algebra. The code

is also using openmp.

 

In addition, ifort runs much faster on AMD CPU:

AMD Ryzen Threadripper PRO 7975WX @ 4.0GHz 32-Cores

Intel ifort 2021.13.1
CFLAGS= -free -warn all -nogen-interfaces -diag-disable=10448 -Ofast -march=SSE4.2,CORE-AVX2,znver4 -qopenmp

real 67m37.749s
user 1744m37.811s
sys 13m21.562s

 

Intel ifx 2025.3.0
CFLAGS= -free -warn all -nogen-interfaces -O3 -Ofast -march=znver4 -qopenmp

real 113m5.164s
user 2487m13.638s
sys 13m46.812s

 

Intel seems to have removed some of the optimization features for AMD processorts from

ifx....can't use SSE4.2,CORE-AVX2 anymore.....

0 Kudos
10 Replies
Igor_V_Intel
Moderator
473 Views

Could you please share a code showing this performance drop?

Note that LLVM IR doesn't have complex data types and thus code with complex type algebra is a known issue vs ifort (there Intel had native complex type support on the proprietary IR level). It should be improved in the next major release of ifx.

0 Kudos
Umar__Sait
Novice
446 Views

This is a very large nuclear reactions code written in Fortran 95, so it is hard to share. Normally, in some cases the code has to be run for many days, even a week to get the answers. These were shorter runs for timing purposes. But one can see that all llvm based compilers are running slower than ifort. I am looking forward to the improvements you mention and hopefully we can switch to ifx at some point, but as you can see running for 5 days vs 10 days makes a big difference so we will stick with ifort until then.

0 Kudos
mecej4O
New Contributor I
346 Views

The attached program conj11.f90 runs for about 1 second and produces a counterexample for the Euler Conjecture on the sum of fifth powers.

The EXEs generated using IFort consistently run faster than those produced by Ifx. I hope that this example code will help you to make Ifx produce EXEs that are not slower than those produced by Ifort.

Thanks.

0 Kudos
JohnNichols
Valued Contributor III
312 Views

conj11 performance

Core I7, VS 2022, latest Oneapi, Windows Preview

debug 32 bit == 2.781 seconds 4.5 times slower

debug 64 bit == 1.781 seconds 2.98 times slower

release 32 bit == 1.875 seconds 3.125 times slower

release 64 bit == 0.6 seconds 1 times slower (humour ok)

There is no evidence using a stock standard anything that IFX is slower, for this program it is not.  

0 Kudos
andrew_4619
Honored Contributor III
293 Views

??? how could you test 32bit on latest Oneapi????

0 Kudos
JohnNichols
Valued Contributor III
192 Views

If you update using the Control Panel it keeps the latest versions of the program you require available, who ever invented it, is neat person.  IFORT is not updated now, but with this method it is available and is the last one issued.  Intel do not update all programs with each oneapi, it is merely a name, I can test IFORT and IFX in 32 and 64 bit, but IFORT is the dodo bird, but I reported what I had. 

I have not seen the latest ONEAPI noted as released, maybe I missed it.  

But the timings are interesting, most of the effort is the excellent code.  

 

0 Kudos
JohnNichols
Valued Contributor III
172 Views

If we can go back to the Euler Conjecture, the result shown when you consider the infinite number of potential solutions to these problems, I suggest means, that the fact that one of them had a zero component for the fifth factor is just a simple bizarre finding of no significance,   merely proving that the Math Gods have a sense of humour. 

The fact that we can write code to find the numbers in 0.6 seconds is a tribute to coding skill. 

Jim:

The transverse and longitudinal accelerations on one bridge is negatively correlated nicely, I was surprised how nice it was.  

 

------------------------------------------------------------------------------------

Ron:

I want to complain, the blasted IFX is so fast with my structural analysis program, program is courtesy of the mecej4 fixes, that I cannot even get the coke bottle to my mouth before it has run.  I used to be able to make a cup of tea. Could you pass my complaint to the makers, ask them to slow it up!

Thanks

John

 

0 Kudos
andrew_4619
Honored Contributor III
154 Views

I at least now understand what it said. It would have made more sense  to add the words  IFORT and IFX in the description otherwise the comparisons made don't say much.  Why even make benchmark comparisons in debug, that is for debugging the speed is always slow and not very relevant if we are debugging and have break points. 

0 Kudos
mecej4O
New Contributor I
53 Views

It is actually reasonable that IFx, with debug options specified, produces a slower EXE, because the compiler inserts more machine code to do all the checking (array bounds, arrays allocated before use, etc.), than the older compiler did.

0 Kudos
JohnNichols
Valued Contributor III
28 Views

1. If I had left out debug someone would have asked why, as I almost always code in debug and rarely use release, it seemed normal

2. One API is generic name, if you want to know what version is associated with 2025.3 you have look at the different elements and some have not been updated for a long time.  If you update using the Control Panel you lose nothing. 

3. IFX release with the Harrison 1973 Structural Program with Eigen solver is blindingly fast, it is actually annoying as I cannot read the blasted write statements as it flashes by.   Every time I run this program I thank the math gods for mecej4. 

4. I have been looking at the FHA Post Tensioned Box Girder Design Manual - if you want to see the extension of the 1849 Royal Commissioner's UK report on bridge safety to the maximum extent before you start thinking about frequency this is it.  

 

0 Kudos
Reply