<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: HPL compiled with OneAPI HPC toolkit failed to converge in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1683103#M37074</link>
    <description>&lt;P&gt;Additional notes: compilation works with GCC (mpicc) and linked with MKL. And it appears to be the way many compilation is done. And my question is why OneAPI C compiler doesn't work?&lt;/P&gt;</description>
    <pubDate>Tue, 15 Apr 2025 13:11:00 GMT</pubDate>
    <dc:creator>PingLin</dc:creator>
    <dc:date>2025-04-15T13:11:00Z</dc:date>
    <item>
      <title>HPL compiled with OneAPI HPC toolkit failed to converge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1682797#M37073</link>
      <description>&lt;P&gt;I have tried to compile HPL using the new 2025 intel OneAPI HPC toolkit to run benchmark test on RedHat Linux Enterprise 8 and 9. I downloaded the Netlib HPL source code from &lt;A href="http://www.netlib.org/benchmark/hpl/" target="_blank"&gt;http://www.netlib.org/benchmark/hpl/&lt;/A&gt; .&lt;/P&gt;&lt;P&gt;And following the steps suggested by the manual:&lt;BR /&gt;1. Download and extract the source code.&lt;BR /&gt;2. Copy the makefile:&lt;BR /&gt;$&amp;gt; cp setup/Make.Linux_Intel64 .&lt;BR /&gt;3. Edit Make.Linux_Intel64 as appropriate&lt;BR /&gt;4. Build the HPL binary:&lt;BR /&gt;$&amp;gt; make arch=Linux_Intel64&lt;BR /&gt;5. Check that the built binary is available in the bin/Linux_Intel64 directory.&lt;/P&gt;&lt;P&gt;I have played with different changes on&amp;nbsp;Make.Linux_Intel64 file. While they all compiled without complains. During the test run, all&amp;nbsp;864 tests completed and failed residual checks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is one of the changes I made to the&amp;nbsp;Make.Linux_Intel64 file (parts that changed shown).&lt;/P&gt;&lt;PRE&gt;# - Message Passing library (MPI) --------------------------------------&lt;BR /&gt;MPdir = /opt/intel/oneapi/mpi/2021.15/&lt;BR /&gt;MPinc = -I$(MPdir)/include&lt;BR /&gt;MPlib = $(MPdir)/lib/libmpi.a&lt;BR /&gt;&lt;BR /&gt;# - Linear Algebra library (BLAS or VSIPL) -----------------------------&lt;BR /&gt;LAdir = $(MKLROOT)&lt;BR /&gt;ifndef LAinc&lt;BR /&gt;LAinc = $(LAdir)/include&lt;BR /&gt;endif&lt;BR /&gt;ifndef LAlib&lt;BR /&gt;LAlib = -L$(LAdir)/lib \&lt;BR /&gt;        -Wl,--start-group \&lt;BR /&gt;        $(LAdir)/lib/libmkl_intel_lp64.a \&lt;BR /&gt;        $(LAdir)/lib/libmkl_intel_thread.a \&lt;BR /&gt;        $(LAdir)/lib/libmkl_core.a \&lt;BR /&gt;        -Wl,--end-group -lpthread -ldl&lt;BR /&gt;endif&lt;BR /&gt;# - Compilers / linkers - Optimization flags ---------------------------&lt;BR /&gt;CC = mpiicx&lt;BR /&gt;CCNOOPT = $(HPL_DEFS)&lt;BR /&gt;OMP_DEFS = -qopenmp&lt;BR /&gt;CCFLAGS = $(HPL_DEFS) -O3 -w -ansi-alias -z noexecstack -z relro -z now -Wall&lt;/PRE&gt;&lt;P&gt;I have changes the settings and none is working.&amp;nbsp;&lt;/P&gt;&lt;P&gt;My question is, what is the&amp;nbsp;appropriate change for the&amp;nbsp;Make.Linux_Intel64 file?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;The Intel® Distribution for LINPACK Benchmark may contain additional optimizations compared to the reference Netlib HPL implementation. Is there anyway to find out the additional optimizations?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The output from the failed test is attached.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Apr 2025 15:12:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1682797#M37073</guid>
      <dc:creator>PingLin</dc:creator>
      <dc:date>2025-04-14T15:12:46Z</dc:date>
    </item>
    <item>
      <title>Re: HPL compiled with OneAPI HPC toolkit failed to converge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1683103#M37074</link>
      <description>&lt;P&gt;Additional notes: compilation works with GCC (mpicc) and linked with MKL. And it appears to be the way many compilation is done. And my question is why OneAPI C compiler doesn't work?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Apr 2025 13:11:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1683103#M37074</guid>
      <dc:creator>PingLin</dc:creator>
      <dc:date>2025-04-15T13:11:00Z</dc:date>
    </item>
    <item>
      <title>Re: HPL compiled with OneAPI HPC toolkit failed to converge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1685764#M37093</link>
      <description>&lt;P&gt;As it turns out, OneAPI C compiler includes certain level of optimization even if not specify -O0, which cause the subroutine HPL_dlamch.c to fail. On line 274 of the file, `{ a *= HPL_rtwo; c = HPL_dlamc3( a, one ); c = HPL_dlamc3( c, -a ); } ` should be executed without optimization, so it can use limited precision of the float data type to determine the value of `a` where `float( a + 1.0 ) == a`. With optimization it returns `a = inf`. That results TEST-&amp;gt;epsil = 0 instead of a value of about ~ 1e-16.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 27 Apr 2025 23:49:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1685764#M37093</guid>
      <dc:creator>PingLin</dc:creator>
      <dc:date>2025-04-27T23:49:25Z</dc:date>
    </item>
    <item>
      <title>Re: HPL compiled with OneAPI HPC toolkit failed to converge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1686044#M37096</link>
      <description>&lt;P&gt;Can you add -O0 option into CCNOOPT?&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;CCNOOPT = $(HPL_DEFS) -O0&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Apr 2025 16:23:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1686044#M37096</guid>
      <dc:creator>Kazushige_G_Intel</dc:creator>
      <dc:date>2025-04-28T16:23:36Z</dc:date>
    </item>
    <item>
      <title>Re: HPL compiled with OneAPI HPC toolkit failed to converge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1686106#M37098</link>
      <description>&lt;P&gt;&lt;EM&gt;As it turns out, OneAPI C compiler includes certain level of optimization even if not specify -O0, ...&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;gt; If -O[n] is not specify, the default is -O2.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Apr 2025 20:03:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1686106#M37098</guid>
      <dc:creator>Fengrui</dc:creator>
      <dc:date>2025-04-28T20:03:05Z</dc:date>
    </item>
    <item>
      <title>Re: HPL compiled with OneAPI HPC toolkit failed to converge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1686676#M37104</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Changing intel default with `-fp-model=precise` while leaving -O2 option on also enable HPL to be compiled correctly and passed the tests. But the combination of both is too aggressive.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Apr 2025 12:20:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/HPL-compiled-with-OneAPI-HPC-toolkit-failed-to-converge/m-p/1686676#M37104</guid>
      <dc:creator>PingLin</dc:creator>
      <dc:date>2025-04-30T12:20:24Z</dc:date>
    </item>
  </channel>
</rss>

