<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Are you heeding documentation in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029553#M20066</link>
    <description>&lt;P&gt;Are you heeding documentation of mkl_dynamic ? &lt;A href="https://software.intel.com/en-us/node/528547" target="_blank"&gt;https://software.intel.com/en-us/node/528547&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Did you compare performance of your task at various num_threads settings?&amp;nbsp; On 6-core Westmere it would be particularly difficult to avoid performance degradation when using all hyperthreads, but X5650 normally has 2 6-core CPUs, so you would need 24 threads to take full disadvantage of hyperthreading.&amp;nbsp; 2 of the 6 cores which have their own path to last level cache are better able to support hyperthreading than the other 4, so the Westmere has some unique characteristics not shared by other models.&lt;/P&gt;</description>
    <pubDate>Sun, 07 Jun 2015 00:55:37 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2015-06-07T00:55:37Z</dc:date>
    <item>
      <title>Why mkl 11.2.3 can not use all the cpu in Intel(R) Core(TM) i7-4700HQ CPU @ 2.40GHz</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029552#M20065</link>
      <description>&lt;P&gt;Hi:&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp; I use mkl_dss to solve a problem. I already use mkl_set_num_threads(8) to set the maximum threads of my computer. But when I run the program, I use top command and just can see only 4 cpu are 100% running, the other 4 cpu just 1%. My cpu is Intel(R) Core(TM) i7-4700HQ CPU @ 2.40GHz, 4 cores and 8 threads.&lt;/P&gt;

&lt;P&gt;However, I run the same program on the other machine Intel(R) Xeon(R) CPU&amp;nbsp; X5650&amp;nbsp; @ 2.67GHz, 6 cores 12 threads. I use mkl_set_num_threads(12) to set the maximum threads. The program can take full advantage of all 12 cpu.&lt;/P&gt;

&lt;P&gt;I use g++ -std=c++14 -O2 -march=native source.cpp -lmkl_rt to compile my program.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 07 Jun 2015 00:00:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029552#M20065</guid>
      <dc:creator>Chaowen_G_</dc:creator>
      <dc:date>2015-06-07T00:00:15Z</dc:date>
    </item>
    <item>
      <title>Are you heeding documentation</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029553#M20066</link>
      <description>&lt;P&gt;Are you heeding documentation of mkl_dynamic ? &lt;A href="https://software.intel.com/en-us/node/528547" target="_blank"&gt;https://software.intel.com/en-us/node/528547&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Did you compare performance of your task at various num_threads settings?&amp;nbsp; On 6-core Westmere it would be particularly difficult to avoid performance degradation when using all hyperthreads, but X5650 normally has 2 6-core CPUs, so you would need 24 threads to take full disadvantage of hyperthreading.&amp;nbsp; 2 of the 6 cores which have their own path to last level cache are better able to support hyperthreading than the other 4, so the Westmere has some unique characteristics not shared by other models.&lt;/P&gt;</description>
      <pubDate>Sun, 07 Jun 2015 00:55:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029553#M20066</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2015-06-07T00:55:37Z</dc:date>
    </item>
    <item>
      <title>Well, that is not the case. I</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029554#M20067</link>
      <description>&lt;P&gt;Well, that is not the case. I think it is the problem with mkl 11.2.3. Because Intel(R) Core(TM) i7-4700HQ CPU @ 2.40GHz can use all the 8 threads in older version of mkl.&lt;/P&gt;</description>
      <pubDate>Sun, 07 Jun 2015 20:19:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029554#M20067</guid>
      <dc:creator>Chaowen_G_</dc:creator>
      <dc:date>2015-06-07T20:19:30Z</dc:date>
    </item>
    <item>
      <title>Ok, I figure out the problem.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029555#M20068</link>
      <description>&lt;P&gt;Ok, I figure out the problem. Thanks to Tim Prince, he is right. I did not pay attention to MKL_DYNAMIC. After I add the statement mkl_set_dynamic(false), Intel(R) Core(TM) i7-4700HQ CPU @ 2.40GHz can use all the threads. I just need to compare performance of my program at various num_threads settings.&lt;/P&gt;</description>
      <pubDate>Sun, 07 Jun 2015 21:48:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Why-mkl-11-2-3-can-not-use-all-the-cpu-in-Intel-R-Core-TM-i7/m-p/1029555#M20068</guid>
      <dc:creator>Chaowen_G_</dc:creator>
      <dc:date>2015-06-07T21:48:52Z</dc:date>
    </item>
  </channel>
</rss>

