<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Performance issue of dgemm on Gold 6230R CPU in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1392045#M33274</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;When the number of threads is large, the performance curve will rise and then fall&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;Could you please provide us with the MKL version being used in this case?&lt;/P&gt;
&lt;P&gt;And by default, MKL utilizes all the available physical cores if you run it in parallel mode.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading.html" target="_blank" rel="noopener"&gt;https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;For Intel compilers the option is -qmkl=parallel&lt;/P&gt;
&lt;P&gt;Here are some more details about Managing Multi-core performance&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading/managing-multi-core-performance.html" target="_blank" rel="noopener"&gt;https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading/managing-multi-core-performance.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;You can also make use of the suggestions recommended by the &lt;A href="https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl-link-line-advisor.html#gs.2yw3hp" target="_self"&gt;Link Line Advisor&lt;/A&gt;&amp;nbsp;for compiling and linking options depending on the environment you are working with.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Additionally, could you please provide us with the sample reproducer and the command you are using for compiling and executing so that we could test it from our end as well?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Vidya.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 13 Jun 2022 11:29:52 GMT</pubDate>
    <dc:creator>VidyalathaB_Intel</dc:creator>
    <dc:date>2022-06-13T11:29:52Z</dc:date>
    <item>
      <title>Performance issue of dgemm on Gold 6230R CPU</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1391770#M33267</link>
      <description>&lt;P&gt;Hi, I have some questions about &lt;STRONG&gt;the performance of dgemm on Intel(R) Xeon(R) Gold 6230R CPU&lt;/STRONG&gt;.&amp;nbsp; On my machine, the performance of DGEMM seems weird. When the number of threads is large, the performance curve will rise and then fall, which is very difficult to explain. Below are some details. I really hope to get your help, thanks.&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;&lt;STRONG&gt;My Machine&lt;/STRONG&gt;&lt;BR /&gt;CPU(s): 104&lt;BR /&gt;On-line CPU(s) list: 0-103&lt;BR /&gt;Thread(s) per core: 2&lt;BR /&gt;Core(s) per socket: 26&lt;BR /&gt;Socket(s): 2&lt;BR /&gt;NUMA node(s): 2&lt;BR /&gt;CPU family: 6&lt;BR /&gt;Model: 85&lt;BR /&gt;Model name: Intel(R) Xeon(R) Gold 6230R CPU @ 2.10GHz&lt;BR /&gt;L1d cache: 32K&lt;BR /&gt;L1i cache: 32K&lt;BR /&gt;L2 cache: 1024K&lt;BR /&gt;L3 cache: 36608K&lt;BR /&gt;NUMA node0 CPU(s): 0-25,52-77&lt;BR /&gt;NUMA node1 CPU(s): 26-51,78-103&lt;/P&gt;
&lt;P&gt;Core topology: two sockets, 26 cores per socket, 52 cores total&lt;BR /&gt;SMT status: enabled, but not utilized&lt;BR /&gt;Max clock rate:&lt;STRONG&gt; 2.0GHz(single-core and multicore)&lt;/STRONG&gt; &lt;BR /&gt;Peak performance:&lt;BR /&gt;--single-core: &lt;STRONG&gt;64 GFLOPS(double-precision)&lt;/STRONG&gt;&lt;BR /&gt;--multicore: &lt;STRONG&gt;64 GFLOPS/core (double-precision)&lt;/STRONG&gt;&lt;BR /&gt;I have fixed the frequency of the CPU at 2.0GHz by commands: sudo cpupower -c all frequency-set -u 2.0GHz, sudo cpupower -c all frequency-set -d 2.0GHz&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;The dgemm performance on my machine&lt;/STRONG&gt;&lt;BR /&gt;Multithreaded (8 core) execution&lt;/P&gt;
&lt;P&gt;export GOMP_CPU_AFFINITY="0-7:1" MKL_NUM_THREADS=8&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Multithreaded (13 core) execution &lt;BR /&gt;export GOMP_CPU_AFFINITY="0-12:1" MKL_NUM_THREADS=13&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Multithreaded (26 core) execution&lt;/P&gt;
&lt;P&gt;export GOMP_CPU_AFFINITY="0-25:1" MKL_NUM_THREADS=26&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Multithreaded (52 core) execution&lt;/P&gt;
&lt;P&gt;export GOMP_CPU_AFFINITY="0-51:1" MKL_NUM_THREADS=52&lt;/P&gt;
&lt;P&gt; &lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="core_1.jpg" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/30451iD35978299A36B56C/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="core_1.jpg" alt="core_1.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="core_8.jpg" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/30445iE82B26A9CB3D0B95/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="core_8.jpg" alt="core_8.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="core_13.jpg" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/30446i6BC3DD9CE4812CDF/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="core_13.jpg" alt="core_13.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="core_26.jpg" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/30449i00B5672DEBC10054/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="core_26.jpg" alt="core_26.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt; &lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="core_52.jpg" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/30450iED70784E6539DE5F/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="core_52.jpg" alt="core_52.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt; &lt;/P&gt;
&lt;P&gt; &lt;/P&gt;
&lt;P&gt; &lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 11 Jun 2022 02:05:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1391770#M33267</guid>
      <dc:creator>lianchen</dc:creator>
      <dc:date>2022-06-11T02:05:44Z</dc:date>
    </item>
    <item>
      <title>Re: Performance issue of dgemm on Gold 6230R CPU</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1392045#M33274</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;When the number of threads is large, the performance curve will rise and then fall&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;Could you please provide us with the MKL version being used in this case?&lt;/P&gt;
&lt;P&gt;And by default, MKL utilizes all the available physical cores if you run it in parallel mode.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading.html" target="_blank" rel="noopener"&gt;https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;For Intel compilers the option is -qmkl=parallel&lt;/P&gt;
&lt;P&gt;Here are some more details about Managing Multi-core performance&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading/managing-multi-core-performance.html" target="_blank" rel="noopener"&gt;https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/managing-performance-and-memory/improving-performance-with-threading/managing-multi-core-performance.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;You can also make use of the suggestions recommended by the &lt;A href="https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl-link-line-advisor.html#gs.2yw3hp" target="_self"&gt;Link Line Advisor&lt;/A&gt;&amp;nbsp;for compiling and linking options depending on the environment you are working with.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Additionally, could you please provide us with the sample reproducer and the command you are using for compiling and executing so that we could test it from our end as well?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Vidya.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jun 2022 11:29:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1392045#M33274</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-06-13T11:29:52Z</dc:date>
    </item>
    <item>
      <title>Re:Performance issue of dgemm on Gold 6230R CPU</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1393754#M33297</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Reminder:&lt;/P&gt;&lt;P&gt;Could you please provide us with an update regarding your issue? Please provide us with the above-mentioned details if your issue still persists.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 20 Jun 2022 04:54:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1393754#M33297</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-06-20T04:54:54Z</dc:date>
    </item>
    <item>
      <title>Re:Performance issue of dgemm on Gold 6230R CPU</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1395537#M33321</link>
      <description>&lt;P&gt;As we haven't heard back from you, we are closing this thread. Please post a new question if you need any additional assistance from Intel as this thread will no longer be monitored.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 27 Jun 2022 04:13:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-issue-of-dgemm-on-Gold-6230R-CPU/m-p/1395537#M33321</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-06-27T04:13:24Z</dc:date>
    </item>
  </channel>
</rss>

