<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Severe slowdown with  PDSYGST and PDSYEVX for 64 cores in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775326#M954</link>
    <description>Thanks. I will also ask that they install the latest openmpi. They are currently doing some tests on the system to see if there is anything else going wrong.&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;N.B., hyperthreading is off, and I am using the sequential libraries.&lt;/DIV&gt;&lt;/DIV&gt;</description>
    <pubDate>Thu, 02 Aug 2012 14:27:29 GMT</pubDate>
    <dc:creator>L__D__Marks</dc:creator>
    <dc:date>2012-08-02T14:27:29Z</dc:date>
    <item>
      <title>Severe slowdown with  PDSYGST and PDSYEVX for 64 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775322#M950</link>
      <description>I am currently benchmarking a small cluster on a vendor's system, and noticed a very severe slowdown with 64 cores, and even slightly for 32 cores. The slowdown is specific to both of these routines, and they take twice as long with 64 cores as they do for 32. And, yes, I really do mean twice as long.&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;P&gt;&lt;BR /&gt;

The vendor has scomposer_xe_2011_sp1.11.339which I used for the tests. The mkl fromcomposerxe-2011.3.174 (which I had access to) is slightly better, but not a lot. From /proc/cpuinfo these areIntel Xeon CPU E5-2660 0 @ 2.20GHz machines, 16 cores per node with IB, openmpi-1.4.5.&lt;/P&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Any suggestions? (It is not a coding issue or anything else, the code being used is a DFT standard.)&lt;/DIV&gt;</description>
      <pubDate>Wed, 01 Aug 2012 23:24:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775322#M950</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2012-08-01T23:24:38Z</dc:date>
    </item>
    <item>
      <title>Severe slowdown with  PDSYGST and PDSYEVX for 64 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775323#M951</link>
      <description>Could you please try to run these tests with IntelMPI as well?</description>
      <pubDate>Thu, 02 Aug 2012 03:53:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775323#M951</guid>
      <dc:creator>Andrei_Moskalev__Int</dc:creator>
      <dc:date>2012-08-02T03:53:14Z</dc:date>
    </item>
    <item>
      <title>Severe slowdown with  PDSYGST and PDSYEVX for 64 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775324#M952</link>
      <description>I have requested that the vendor install it. I will provide more information later today.</description>
      <pubDate>Thu, 02 Aug 2012 12:07:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775324#M952</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2012-08-02T12:07:52Z</dc:date>
    </item>
    <item>
      <title>Severe slowdown with  PDSYGST and PDSYEVX for 64 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775325#M953</link>
      <description>If you are using an MPI process per core, you should activate the core pinning option of OpenMPI, if you haven't done so, as well as using the mkl_sequential. Latest development versions of OpenMPI should include options to support multiple MPI/OpenMP hybrid processes per node, as Intel MPI has done for several years. Intel MPI also provides for recognition of active HyperThreading and using a single process per core; I'm doubtful of OpenMPI in that mode.</description>
      <pubDate>Thu, 02 Aug 2012 13:44:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775325#M953</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2012-08-02T13:44:03Z</dc:date>
    </item>
    <item>
      <title>Severe slowdown with  PDSYGST and PDSYEVX for 64 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775326#M954</link>
      <description>Thanks. I will also ask that they install the latest openmpi. They are currently doing some tests on the system to see if there is anything else going wrong.&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;N.B., hyperthreading is off, and I am using the sequential libraries.&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 02 Aug 2012 14:27:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775326#M954</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2012-08-02T14:27:29Z</dc:date>
    </item>
    <item>
      <title>Severe slowdown with  PDSYGST and PDSYEVX for 64 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775327#M955</link>
      <description>Hmmm. The Intel MPI appears to be both substantially faster and not have the same scaling problems. Since I am borrowing use of a test cluster I cannot say exactly what the issue was.&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;BR /&gt;I have posted information on a listerver for the specific DFT code (Wien2k) since others may want to start using Intel MPI a the current version seems to be rather good.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV id="_mcePaste"&gt;For the record, the timings are with the first number the cores, the second nodes:&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;16 1:    TIME HAMILT (CPU) =   7.5, HNS =   8.0, DIAG =  61.3&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;32 2:    TIME HAMILT (CPU) =   5.1, HNS =   4.4, DIAG =  40.8&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;48 3:    TIME HAMILT (CPU) =   4.1, HNS =   3.2, DIAG =  31.8&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;64 4:    TIME HAMILT (CPU) =   3.4, HNS =   2.6, DIAG =  25.1&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;&lt;BR /&gt;The "HAMILT" and "HNS" parts of the code are mainly simple mpi, i.e. spliiting of the effort over different machines. Both scale well with both Intel MPI and openmpi, with openmpi being perhaps slightly faster although the difference was small enough to be noise.&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;&lt;/DIV&gt;&lt;DIV id="_mcePaste"&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV id="_mcePaste"&gt;The "DIAG" part of the code is dominated by the scalapack calls PDSYGST &amp;amp; PDSYEVX. This does not scale quite as well as the others, but does scale relatively well. These were scaling badly with the version of openmpi that I was provided with.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;N.B., if anyone wants to provide additional options to test to see if they make any difference I may still have access to the test cluster for a bit.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 03 Aug 2012 14:18:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Severe-slowdown-with-PDSYGST-and-PDSYEVX-for-64-cores/m-p/775327#M955</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2012-08-03T14:18:33Z</dc:date>
    </item>
  </channel>
</rss>

