<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic I don't understand. in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005142#M18828</link>
    <description>&lt;P&gt;I don't understand.&lt;/P&gt;</description>
    <pubDate>Thu, 01 May 2014 06:49:03 GMT</pubDate>
    <dc:creator>Dave_O_</dc:creator>
    <dc:date>2014-05-01T06:49:03Z</dc:date>
    <item>
      <title>MKL spurns only minimal number of threads for SGEMM</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005140#M18826</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;

&lt;P&gt;I have a Xeon E5-2620 processor and benchmarking with SGEMM. Why does MKL spurn only 6 threads (hardware threads) instead of the expected 12 threads (hardware plus software threads)?&lt;/P&gt;

&lt;P&gt;The same code on xeon Phi spurns the entire 240 threads (hardware and software).&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Apr 2014 20:24:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005140#M18826</guid>
      <dc:creator>Dave_O_</dc:creator>
      <dc:date>2014-04-30T20:24:47Z</dc:date>
    </item>
    <item>
      <title>Most of the basic explanation</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005141#M18827</link>
      <description>Most of the basic explanation is in the section about MKL_DYNAMIC in the MKL user guide.  The default is intended to maximize performance rather than number of hyperthreads in use or power consumption.  Of course, a single 6-core CPU may not have been tested to the extent that 8 and 10-core dual CPU platforms have, so you are welcome to change the setting to see if it helps your case, or even if you simply like to peg your displays.   
While FPU performance on Xeon peaks with 1 thread per core, on Intel(r) Xeon Phi(tm), 3 threads per core are needed to reach full VPU performance, and the MKL can use all 4 threads per core effectively (on large enough problems) when considering data shuffling optimizations.</description>
      <pubDate>Thu, 01 May 2014 01:12:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005141#M18827</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2014-05-01T01:12:21Z</dc:date>
    </item>
    <item>
      <title>I don't understand.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005142#M18828</link>
      <description>&lt;P&gt;I don't understand.&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 06:49:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005142#M18828</guid>
      <dc:creator>Dave_O_</dc:creator>
      <dc:date>2014-05-01T06:49:03Z</dc:date>
    </item>
    <item>
      <title>Do you have specific</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005143#M18829</link>
      <description>Do you have specific questions; did you try setting MKL_DYNAMIC=false and setting specific numbers of threads and affinity schemes?
I don't think it makes sense to repeat the entire story about MKL_DYNAMIC here when you can easily look it up, or to guess which aspect of hyperthreading you may want explained.
Are you "spurning" the documentation as well as the dictionary definition of the word?</description>
      <pubDate>Thu, 01 May 2014 11:23:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005143#M18829</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2014-05-01T11:23:41Z</dc:date>
    </item>
    <item>
      <title>men, what are you talking</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005144#M18830</link>
      <description>&lt;P&gt;men, what are you talking about.&lt;/P&gt;</description>
      <pubDate>Thu, 01 May 2014 17:10:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005144#M18830</guid>
      <dc:creator>Dave_O_</dc:creator>
      <dc:date>2014-05-01T17:10:42Z</dc:date>
    </item>
    <item>
      <title>Hi Dave, </title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005145#M18831</link>
      <description>&lt;P&gt;Hi Dave,&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Sorry for the taketive:) &amp;nbsp; Roughly speaking, &amp;nbsp;yes, in order to get better performance, MKL spurn the threads based on &lt;STRONG&gt;hardware resource and&amp;nbsp;&lt;/STRONG&gt;&amp;nbsp;experience. &amp;nbsp;&lt;/P&gt;

&lt;UL sizcache09952467724460774="8238" sizset="0"&gt;&lt;/UL&gt;

&lt;UL&gt;
	&lt;LI&gt;-on Xeon machine, &amp;nbsp;use hardware core number&amp;nbsp;by default. &amp;nbsp;In your xeon machine, it is 6. &amp;nbsp;( And the 12 software threads, we call it Hyper-Threading thread in Xeon processor)&lt;/LI&gt;
	&lt;LI&gt;on Xeon phi machine, use Hardware + software threads. &amp;nbsp;It is 60x4 = 240, &amp;nbsp;( total 61 core , and 1 core was reserved)&amp;nbsp;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;The reason on Xeon machine is that &lt;/SPAN&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;Hyper-Threading Technology (HT Technology) &amp;nbsp;On Xeon is &amp;nbsp;only effective when each thread is performing &lt;STRONG&gt;different &lt;/STRONG&gt;types of operations and when there are under-utilized resources on the processor. &amp;nbsp;The threads in Intel MKL do exact same operation, so it can't benefit from HT thread. &amp;nbsp;As a result, MKL fork 6 threads instead 12.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Same reason, but the HT technology on Xeon phi was implemented in different way than Xeon, &amp;nbsp;it require at least 3 or 4 to feed all computing resources. in order to get better performance on Xeon Phi, MKL fork 240 thread be default.&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;Please see more in&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&lt;A href="https://software.intel.com/en-us/forums/topic/294954" target="_blank"&gt;https://software.intel.com/en-us/forums/topic/294954&lt;/A&gt; &amp;nbsp;for Xeon&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;and you can &amp;nbsp;change the thread number by MKL_DYNAMIC and MKL_SET_NUM_THREADS setting.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Best Regards,&lt;/P&gt;

&lt;P&gt;Ying&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 04 May 2014 08:54:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-spurns-only-minimal-number-of-threads-for-SGEMM/m-p/1005145#M18831</guid>
      <dc:creator>Ying_H_Intel</dc:creator>
      <dc:date>2014-05-04T08:54:57Z</dc:date>
    </item>
  </channel>
</rss>

