<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Get Very Low Performance with MP Linpack benchmark in HPC cluster in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/Get-Very-Low-Performance-with-MP-Linpack-benchmark-in-HPC/m-p/1099806#M5825</link>
    <description>&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;Dear all,&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;I have a problem with the result of MKL MP_Linkpack. In my system, I have 24 compute nodes with both Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz and Xeon Phi Q7200, RAM 256GB. On each node, I run ./runme_intel64, the performance is good ~ 700-900 GFlops (only Xeon CPU).&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;But when I run HPL on 4 nodes, 8 nodes or more, the result is very bad, sometimes it cannot return the result with the error: MPI TERMINATED,... After that, I run the test (runme_intel64) on each node again, and the performance is very low:&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;~ 11,243 GFLops,&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;~ 10,845 GFlops,&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;....&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;But I don't know the reason&amp;nbsp;why,&amp;nbsp;I guess the reason is the&amp;nbsp;power&amp;nbsp;of cluster (it is not enough for a whole system) and HPE Bios configured is Balanced Mode for the cluster (automatically change to lower power mode when the system cannot get enough the power). But when I just run on some nodes and configure the power is maximum, the problem is still not solved.&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;Please help me&amp;nbsp;about&amp;nbsp;this problem, thank you all!&lt;/P&gt;</description>
    <pubDate>Tue, 21 Feb 2017 04:26:34 GMT</pubDate>
    <dc:creator>MChun4</dc:creator>
    <dc:date>2017-02-21T04:26:34Z</dc:date>
    <item>
      <title>Get Very Low Performance with MP Linpack benchmark in HPC cluster</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Get-Very-Low-Performance-with-MP-Linpack-benchmark-in-HPC/m-p/1099806#M5825</link>
      <description>&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;Dear all,&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;I have a problem with the result of MKL MP_Linkpack. In my system, I have 24 compute nodes with both Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz and Xeon Phi Q7200, RAM 256GB. On each node, I run ./runme_intel64, the performance is good ~ 700-900 GFlops (only Xeon CPU).&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;But when I run HPL on 4 nodes, 8 nodes or more, the result is very bad, sometimes it cannot return the result with the error: MPI TERMINATED,... After that, I run the test (runme_intel64) on each node again, and the performance is very low:&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;~ 11,243 GFLops,&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;~ 10,845 GFlops,&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;....&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;But I don't know the reason&amp;nbsp;why,&amp;nbsp;I guess the reason is the&amp;nbsp;power&amp;nbsp;of cluster (it is not enough for a whole system) and HPE Bios configured is Balanced Mode for the cluster (automatically change to lower power mode when the system cannot get enough the power). But when I just run on some nodes and configure the power is maximum, the problem is still not solved.&lt;/P&gt;

&lt;P style="word-wrap: break-word; font-size: 12px;"&gt;Please help me&amp;nbsp;about&amp;nbsp;this problem, thank you all!&lt;/P&gt;</description>
      <pubDate>Tue, 21 Feb 2017 04:26:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Get-Very-Low-Performance-with-MP-Linpack-benchmark-in-HPC/m-p/1099806#M5825</guid>
      <dc:creator>MChun4</dc:creator>
      <dc:date>2017-02-21T04:26:34Z</dc:date>
    </item>
    <item>
      <title>Hi Minh</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Get-Very-Low-Performance-with-MP-Linpack-benchmark-in-HPC/m-p/1099807#M5826</link>
      <description>&lt;P&gt;Hi Minh&lt;/P&gt;

&lt;P&gt;&lt;SPAN id="result_box" lang="en"&gt;&lt;SPAN&gt;In my opinion, if the problem was in the power, then OS will be send like "Power Throttle" in /var/log/messages. Some servers send like this such message when you take out the second power supply&lt;/SPAN&gt;&lt;/SPAN&gt;.&lt;/P&gt;

&lt;P&gt;if one node linpack work fine then (I think) low performance may be in some situations:&lt;/P&gt;

&lt;P&gt;- wrong P Q in HPL.dat&lt;/P&gt;

&lt;P&gt;- problems with interconnect&lt;/P&gt;

&lt;P&gt;- low mesh use in HPL.dat. Low memory usage. It will be not less 85% of summary memory of all nodes&lt;/P&gt;

&lt;P&gt;For max performance you need setup in BIOS and /proc/cpu_freq - "max performance" and&lt;/P&gt;

&lt;PRE&gt;for c in ./cpu[0-9]* ; do
  echo $maxFreq &amp;gt;${c}/cpufreq/scaling_max_freq
  echo $maxFreq &amp;gt;${c}/cpufreq/scaling_min_freq
done&lt;/PRE&gt;</description>
      <pubDate>Fri, 21 Apr 2017 19:26:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Get-Very-Low-Performance-with-MP-Linpack-benchmark-in-HPC/m-p/1099807#M5826</guid>
      <dc:creator>SB17</dc:creator>
      <dc:date>2017-04-21T19:26:00Z</dc:date>
    </item>
  </channel>
</rss>

