<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic possible perfomance regression on 2019 MKL (solved in 2021.3?) in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1309955#M31929</link>
    <description>&lt;P&gt;We observe significant (4x..5x), reproducible raise of run time of one of our projects, when the binary of interest is (dynamically) linked to Intel MKL shipped with Intel compiler 19.1.3.304.&lt;/P&gt;
&lt;P&gt;When the same binady is used and is linked against another version of MKL (by loading just another compiler), the run time is OK. Not affected are MKLs shipped with Intel Compilers 18.0.3.222, 19.0.1.144, 19.0.5.281, 2021.3.0 (so 19.1.3.304 is the only known version with that performance regression).&lt;/P&gt;
&lt;P&gt;The application is programmed in Fortran, run w/48 threads on Intel(R) Xeon(R) Platinum 8160 w/48 cores (HT disabled) and known to call zgetrf, zgetrs, zcopy (and maybe more).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is such an performance regression (likely fixed in 2021?) known?&lt;/P&gt;
&lt;P&gt;Are intel developers interested in a reproducer?&lt;/P&gt;
&lt;P&gt;Best&lt;/P&gt;
&lt;P&gt;Paul Kapinos&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;===== Processor composition =====&lt;BR /&gt;Processor name : Intel(R) Xeon(R) Platinum 8160 &lt;BR /&gt;Packages(sockets) : 2&lt;BR /&gt;Cores : 48&lt;BR /&gt;Processors(CPUs) : 48&lt;BR /&gt;Cores per package : 24&lt;BR /&gt;Threads per core : 1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;$ for i in 18.0 19.0 19.0.5.281 19.1 2021.3.0 2021.3.0ng; do echo $i "\t\t" &lt;BR /&gt;$R_DELIMITER ; module switch intel intel/$i 2&amp;gt;/dev/null; time env &lt;BR /&gt;OMP_NUM_THREADS=48 ./test_kloop.x | grep test_kloop; echo; done&lt;BR /&gt;18.0 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.66s user 2.66s system 4463% cpu 1.015 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.015 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;19.0 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 44.60s user 1.45s system 4488% cpu 1.026 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.026 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;19.0.5.281 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.91s user 2.43s system 4344% cpu 1.043 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.043 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;19.1 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 233.22s user 2.92s system 4698% cpu 5.026 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 5.026 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2021.3.0 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.15s user 2.34s system 4380% cpu 1.016 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.015 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2021.3.0ng ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.89s user 2.81s system 4376% cpu 1.044 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.044 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 26 Aug 2021 08:08:29 GMT</pubDate>
    <dc:creator>RWTHAachen</dc:creator>
    <dc:date>2021-08-26T08:08:29Z</dc:date>
    <item>
      <title>possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1309955#M31929</link>
      <description>&lt;P&gt;We observe significant (4x..5x), reproducible raise of run time of one of our projects, when the binary of interest is (dynamically) linked to Intel MKL shipped with Intel compiler 19.1.3.304.&lt;/P&gt;
&lt;P&gt;When the same binady is used and is linked against another version of MKL (by loading just another compiler), the run time is OK. Not affected are MKLs shipped with Intel Compilers 18.0.3.222, 19.0.1.144, 19.0.5.281, 2021.3.0 (so 19.1.3.304 is the only known version with that performance regression).&lt;/P&gt;
&lt;P&gt;The application is programmed in Fortran, run w/48 threads on Intel(R) Xeon(R) Platinum 8160 w/48 cores (HT disabled) and known to call zgetrf, zgetrs, zcopy (and maybe more).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is such an performance regression (likely fixed in 2021?) known?&lt;/P&gt;
&lt;P&gt;Are intel developers interested in a reproducer?&lt;/P&gt;
&lt;P&gt;Best&lt;/P&gt;
&lt;P&gt;Paul Kapinos&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;===== Processor composition =====&lt;BR /&gt;Processor name : Intel(R) Xeon(R) Platinum 8160 &lt;BR /&gt;Packages(sockets) : 2&lt;BR /&gt;Cores : 48&lt;BR /&gt;Processors(CPUs) : 48&lt;BR /&gt;Cores per package : 24&lt;BR /&gt;Threads per core : 1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;$ for i in 18.0 19.0 19.0.5.281 19.1 2021.3.0 2021.3.0ng; do echo $i "\t\t" &lt;BR /&gt;$R_DELIMITER ; module switch intel intel/$i 2&amp;gt;/dev/null; time env &lt;BR /&gt;OMP_NUM_THREADS=48 ./test_kloop.x | grep test_kloop; echo; done&lt;BR /&gt;18.0 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.66s user 2.66s system 4463% cpu 1.015 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.015 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;19.0 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 44.60s user 1.45s system 4488% cpu 1.026 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.026 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;19.0.5.281 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.91s user 2.43s system 4344% cpu 1.043 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.043 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;19.1 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 233.22s user 2.92s system 4698% cpu 5.026 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 5.026 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2021.3.0 ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.15s user 2.34s system 4380% cpu 1.016 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.015 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2021.3.0ng ------------------------------------------------------------------------------&lt;BR /&gt;env OMP_NUM_THREADS=48 ./test_kloop.x 42.89s user 2.81s system 4376% cpu 1.044 &lt;BR /&gt;total&lt;BR /&gt;grep test_kloop 0.00s user 0.00s system 0% cpu 1.044 total&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Aug 2021 08:08:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1309955#M31929</guid>
      <dc:creator>RWTHAachen</dc:creator>
      <dc:date>2021-08-26T08:08:29Z</dc:date>
    </item>
    <item>
      <title>Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1310277#M31936</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We would like to request you to share the reproducer, so that we could look into the issue further.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Shanmukh.SS&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 27 Aug 2021 09:01:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1310277#M31936</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-08-27T09:01:32Z</dc:date>
    </item>
    <item>
      <title>Re: Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1311450#M31961</link>
      <description>&lt;P&gt;Dear Shanmukh.SS,&lt;/P&gt;
&lt;P&gt;I've added a reproducer to &lt;SPAN class="label"&gt;Request &lt;/SPAN&gt;&lt;SPAN class="value"&gt;05188832&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="value"&gt;The software is a modified version of&lt;/SPAN&gt;&lt;/P&gt;
&lt;PRE class="moz-quote-pre"&gt;(hier: &lt;A class="moz-txt-link-freetext" href="https://co-design.pop-coe.eu/programs/jukkr-kloop/index.html" target="_blank"&gt;https://co-design.pop-coe.eu/programs/jukkr-kloop/index.html&lt;/A&gt;).&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Sep 2021 14:14:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1311450#M31961</guid>
      <dc:creator>RWTHAachen</dc:creator>
      <dc:date>2021-09-01T14:14:18Z</dc:date>
    </item>
    <item>
      <title>Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1311716#M31968</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 12px;"&gt;Hi,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 12px;"&gt;It seems there is no reproducer as mentioned, being attached to the Request ID 05188832.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 12px;"&gt;Could you please reconfirm the reproducer, if not kindly attach the same so that we could work on your issue.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 12px;"&gt;Best Regards,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 12px;"&gt;Shanmukh.SS&lt;/SPAN&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 02 Sep 2021 11:37:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1311716#M31968</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-09-02T11:37:40Z</dc:date>
    </item>
    <item>
      <title>Re: possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1311787#M31969</link>
      <description>&lt;P&gt;crazy. I am able to see the file attached to the ticket:&lt;/P&gt;
&lt;P&gt;$ md5sum jukkr-kloop.tar.bz2 &lt;BR /&gt;7088e0191275695bf15707d12d429676 jukkr-kloop.tar.bz2&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;take a look at the screenshot!&lt;/P&gt;</description>
      <pubDate>Thu, 02 Sep 2021 18:28:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1311787#M31969</guid>
      <dc:creator>RWTHAachen</dc:creator>
      <dc:date>2021-09-02T18:28:18Z</dc:date>
    </item>
    <item>
      <title>Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1312383#M31991</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We have received the attachment. Apologies for inconvenience caused.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are working on your issue. We will get back to you soon.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Shanmukh.SS&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 06 Sep 2021 09:32:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1312383#M31991</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-09-06T09:32:06Z</dc:date>
    </item>
    <item>
      <title>Re: possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1312980#M32015</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for sharing the reproducer.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have tried compiling the source as per the steps shared. However, we faced some issues while running "compile_kloop_openmp.sh" in both 2019.1.3 and 2021.3.0 versions. Attached the error log with this thread. Could you please let us know if anything is being missed.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best Regards,&lt;/P&gt;
&lt;P&gt;Shanmukh.SS&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 08 Sep 2021 11:06:58 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1312980#M32015</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-09-08T11:06:58Z</dc:date>
    </item>
    <item>
      <title>Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1314754#M32060</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Reminder:&lt;/P&gt;&lt;P&gt;Could you please help us with exact steps to compile the earlier shared code, as we are facing errors(shared the error log in previous reply) while compiling the code with the steps shared. &lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Shanmukh.SS&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 15 Sep 2021 10:06:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1314754#M32060</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-09-15T10:06:41Z</dc:date>
    </item>
    <item>
      <title>Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1317996#M32113</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for letting us know the details regarding performance of MKL with various versions. It seems the issue is with 2019 (19.1.3) version of MKL and it got resolved in  version (2021.3). Since there is no issue in 2021.3, kindly let us know if we could close this issue.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Shanmukh.SS&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 29 Sep 2021 10:32:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1317996#M32113</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-09-29T10:32:02Z</dc:date>
    </item>
    <item>
      <title>Re:possible perfomance regression on 2019 MKL (solved in 2021.3?)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1319987#M32156</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We assume that your query has been answered. If you need any additional information, please submit a new question as this thread will no longer be monitored.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Shanmukh.SS&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 07 Oct 2021 10:55:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/possible-perfomance-regression-on-2019-MKL-solved-in-2021-3/m-p/1319987#M32156</guid>
      <dc:creator>ShanmukhS_Intel</dc:creator>
      <dc:date>2021-10-07T10:55:53Z</dc:date>
    </item>
  </channel>
</rss>

