<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic It turns out that the problem in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-performance-problem-on-inter-switch-connection/m-p/1063858#M4547</link>
    <description>&lt;P&gt;It turns out that the problem is&amp;nbsp;link contention between the two switches.&lt;/P&gt;</description>
    <pubDate>Sun, 08 Jan 2017 05:28:39 GMT</pubDate>
    <dc:creator>seongyun_k_</dc:creator>
    <dc:date>2017-01-08T05:28:39Z</dc:date>
    <item>
      <title>MPI performance problem on inter-switch connection</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-performance-problem-on-inter-switch-connection/m-p/1063857#M4546</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I have a cluster with 32 machines. The first 25 machines are on the first rack and the rest 7 machines are on the second rack.&lt;BR /&gt;
	Each rack has a 1Gbps Ethernet switch.&lt;/P&gt;

&lt;P&gt;I run a MPI application which uses 32 machines (1 process per host machine).&lt;BR /&gt;
	&lt;SPAN style="font-size: 1em;"&gt;When I used the network performance benchmark tool like 'iperf' to measure the network speed between the machines, there is no problem (all point-to-point connection within 32 machines can exploit the full bandwidth).&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;In my application (MPI_Send/MPI_Recv), each mpi process sends a few 4MB sized data to the other machines. (so it is not the message size problem)&lt;BR /&gt;
	&lt;SPAN style="font-size: 13px;"&gt;I found that the communication speed between the first 25 machines and the next 7 machines was very poor (~ 10 ~ 20 MB/sec)&lt;/SPAN&gt;&lt;BR /&gt;
	&lt;SPAN style="font-size: 13px;"&gt;(The communication speed within the first 25 machines and the next 7 machines are fast; 100 ~ 110 MB/sec)&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 13px;"&gt;What is the possible cause here? Is the latency killing it?&lt;/SPAN&gt;&lt;BR /&gt;
	&lt;SPAN style="font-size: 1em;"&gt;What can I do here to improve the performance?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Is there any suggested optimization?&lt;/P&gt;</description>
      <pubDate>Sat, 07 Jan 2017 18:49:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-performance-problem-on-inter-switch-connection/m-p/1063857#M4546</guid>
      <dc:creator>seongyun_k_</dc:creator>
      <dc:date>2017-01-07T18:49:38Z</dc:date>
    </item>
    <item>
      <title>It turns out that the problem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-performance-problem-on-inter-switch-connection/m-p/1063858#M4547</link>
      <description>&lt;P&gt;It turns out that the problem is&amp;nbsp;link contention between the two switches.&lt;/P&gt;</description>
      <pubDate>Sun, 08 Jan 2017 05:28:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-performance-problem-on-inter-switch-connection/m-p/1063858#M4547</guid>
      <dc:creator>seongyun_k_</dc:creator>
      <dc:date>2017-01-08T05:28:39Z</dc:date>
    </item>
  </channel>
</rss>

