<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic using Logical core gots more slow by  mkl. in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/using-Logical-core-gots-more-slow-by-mkl/m-p/766317#M273</link>
    <description>You may find some answers to your questions in this earlier thread:&lt;BR /&gt;&lt;BR /&gt; &lt;A href="http://software.intel.com/en-us/forums/showthread.php?t=77753&amp;amp;o=a&amp;amp;s=lr" target="_blank"&gt;http://software.intel.com/en-us/forums/showthread.php?t=77753&amp;amp;o=a&amp;amp;s=lr&lt;/A&gt;</description>
    <pubDate>Fri, 08 Apr 2011 20:03:34 GMT</pubDate>
    <dc:creator>mecej4</dc:creator>
    <dc:date>2011-04-08T20:03:34Z</dc:date>
    <item>
      <title>using Logical core gots more slow by  mkl.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/using-Logical-core-gots-more-slow-by-mkl/m-p/766316#M272</link>
      <description>Hi:&lt;BR /&gt;&lt;BR /&gt;I have using MKL to replace lapack and fftw3 by run the &lt;A href="http://www.openmx-square.org/"&gt;openmx&lt;/A&gt;, a material simulation code.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;gcc -msse4.2 for the openmx &lt;BR /&gt;( the openmx could not been build by icc, or you would got wrong result.)&lt;BR /&gt;&lt;BR /&gt;======================&lt;BR /&gt;case 1:&lt;BR /&gt;ifort -msse4.2 for lapack and blas&lt;BR /&gt;&lt;BR /&gt;icc -msse2 -openmp for fftw3&lt;BR /&gt;&lt;BR /&gt;vs &lt;BR /&gt;&lt;BR /&gt;case2:&lt;BR /&gt;mkl&lt;BR /&gt;========================&lt;BR /&gt;&lt;BR /&gt;my cpu is i3 330M, as you know , it is 2 true core with 4 logical core.&lt;BR /&gt;input is in the work folder.&lt;BR /&gt;&lt;BR /&gt;case 1 with 4 thread :&lt;BR /&gt;&lt;BR /&gt;Met.dat : 41s&lt;BR /&gt;&lt;B&gt;GaAs : 347s&lt;/B&gt;&lt;BR /&gt;C60: 81s&lt;BR /&gt;&lt;BR /&gt;case1 with 2 thread:&lt;BR /&gt;&lt;BR /&gt;Met.dat : 41s&lt;BR /&gt;
&lt;B&gt;GaAs : 290s&lt;/B&gt;&lt;BR /&gt;
C60: 87s&lt;BR /&gt;&lt;BR /&gt;case2 with 4 thread:&lt;BR /&gt;&lt;BR /&gt;Met.dat : 40s&lt;BR /&gt;

&lt;B&gt;GaAs : 327s&lt;/B&gt;&lt;BR /&gt;

C60: 94s&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;I do not know why the mkl would slow than auto-vectorizing by using 4 threads.&lt;BR /&gt;MKL is vectorizing lapack/blas by hand, not by compiler, it should better than machine done that.&lt;BR /&gt;Is it means, application should using number of true core instead of logical core by using MKL?&lt;BR /&gt;&lt;BR /&gt;thank you.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 08 Apr 2011 18:30:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/using-Logical-core-gots-more-slow-by-mkl/m-p/766316#M272</guid>
      <dc:creator>Gaiger_Chen</dc:creator>
      <dc:date>2011-04-08T18:30:40Z</dc:date>
    </item>
    <item>
      <title>using Logical core gots more slow by  mkl.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/using-Logical-core-gots-more-slow-by-mkl/m-p/766317#M273</link>
      <description>You may find some answers to your questions in this earlier thread:&lt;BR /&gt;&lt;BR /&gt; &lt;A href="http://software.intel.com/en-us/forums/showthread.php?t=77753&amp;amp;o=a&amp;amp;s=lr" target="_blank"&gt;http://software.intel.com/en-us/forums/showthread.php?t=77753&amp;amp;o=a&amp;amp;s=lr&lt;/A&gt;</description>
      <pubDate>Fri, 08 Apr 2011 20:03:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/using-Logical-core-gots-more-slow-by-mkl/m-p/766317#M273</guid>
      <dc:creator>mecej4</dc:creator>
      <dc:date>2011-04-08T20:03:34Z</dc:date>
    </item>
  </channel>
</rss>

