<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic good point - for Phi you need in Intel® Moderncode for Parallel Architectures</title>
    <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052330#M6816</link>
    <description>&lt;P&gt;good point - for Phi you need 2 or more threads per core to get peak out of the core. &amp;nbsp;if the MPI processes are single threaded, the performance on Phi may be disappointing.&lt;/P&gt;

&lt;P&gt;ron&lt;/P&gt;</description>
    <pubDate>Mon, 17 Nov 2014 23:29:58 GMT</pubDate>
    <dc:creator>Ron_Green</dc:creator>
    <dc:date>2014-11-17T23:29:58Z</dc:date>
    <item>
      <title>Intel MPI for Phi tuning tips?</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052327#M6813</link>
      <description>&lt;P style="font-size: 12px;"&gt;Does setting&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp;I_MPI_MIC=enable&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;change other MPI environment variables, particularly any&amp;nbsp;that would tune MPI for the MIC system architecture? &amp;nbsp;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;As a side question, has anyone written a Tuning and Tweaking guide for IMPI for Phi? &amp;nbsp;For example, what I_MPI variables could one use to help tune an app targeting 480 ranks across 8 Phis?&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;Thanks&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;Ron&lt;/P&gt;</description>
      <pubDate>Mon, 10 Nov 2014 14:58:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052327#M6813</guid>
      <dc:creator>Ron_Green</dc:creator>
      <dc:date>2014-11-10T14:58:56Z</dc:date>
    </item>
    <item>
      <title>Are the MPI processes single</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052328#M6814</link>
      <description>&lt;P&gt;Are the MPI processes single threaded?&lt;/P&gt;

&lt;P&gt;If yes, then you should realize that 480 ranks over 8 Phis results in one thread per core (assuming you somehow restrict one process per core).&lt;/P&gt;

&lt;P&gt;With Xeon Phi, a second hardware thread running within a core is almost free. Therefore, consider using 960 ranks over 8 Phis (also try 1440).&lt;/P&gt;

&lt;P&gt;While I haven't done this, you might try I_MPI_PIN_DOMAIN=core, or, I_MPI_PIN_DOMAIN=480:scatter&lt;/P&gt;

&lt;P&gt;I am not sure how this applies when you have multiple MIC's (as to if this is also multiple nodes).&lt;/P&gt;

&lt;P&gt;What you&amp;nbsp;asked for (480 ranks) is one process per core across 8 MICs.&lt;/P&gt;

&lt;P&gt;Tim Prince may be able to answer this better.&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;</description>
      <pubDate>Wed, 12 Nov 2014 18:55:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052328#M6814</guid>
      <dc:creator>jimdempseyatthecove</dc:creator>
      <dc:date>2014-11-12T18:55:00Z</dc:date>
    </item>
    <item>
      <title>Intel mpi defaults are</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052329#M6815</link>
      <description>&lt;P&gt;Intel mpi defaults are generally effective. A second or 3 threads per &amp;nbsp;core are well worth while if done by openmp or equivalent inside mpi with cache locality&lt;/P&gt;</description>
      <pubDate>Sun, 16 Nov 2014 22:55:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052329#M6815</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2014-11-16T22:55:00Z</dc:date>
    </item>
    <item>
      <title>good point - for Phi you need</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052330#M6816</link>
      <description>&lt;P&gt;good point - for Phi you need 2 or more threads per core to get peak out of the core. &amp;nbsp;if the MPI processes are single threaded, the performance on Phi may be disappointing.&lt;/P&gt;

&lt;P&gt;ron&lt;/P&gt;</description>
      <pubDate>Mon, 17 Nov 2014 23:29:58 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Intel-MPI-for-Phi-tuning-tips/m-p/1052330#M6816</guid>
      <dc:creator>Ron_Green</dc:creator>
      <dc:date>2014-11-17T23:29:58Z</dc:date>
    </item>
  </channel>
</rss>

