<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: IMPI oversubscribing CPUs to ranks in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314704#M8743</link>
    <description>&lt;P&gt;Managed to fix this problem.&lt;/P&gt;
&lt;P&gt;I thought it was an issue with MPI, since it only occurred after we updated from Intel MPI 2018 to Intel MPI 2019, but it was actually caused by PBS / cgroups: the scheduler was forcing the job to use only 32 cores.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 15 Sep 2021 07:28:47 GMT</pubDate>
    <dc:creator>Tim_Pook</dc:creator>
    <dc:date>2021-09-15T07:28:47Z</dc:date>
    <item>
      <title>IMPI oversubscribing CPUs to ranks</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314649#M8742</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Context:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Running a job via the PBS Pro batch scheduler on a compute node with 128 cores.&lt;/P&gt;
&lt;P&gt;When requesting 64 cores for the job, only 32 cores are used (found via htop).&lt;/P&gt;
&lt;P&gt;When requesting 128 cores for the same job, it uses all 128 cores.&lt;/P&gt;
&lt;P&gt;No hyperthreading.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;64 core job:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Pins CPU 0 to ranks 0 and 32, CPU 1 to ranks 1 and 33, etc. Thus, CPUs 32-63 are ignored.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="64 cores - MPI debug" style="width: 999px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/19398iC461E53374DDE732/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="64core_job.png" alt="64 cores - MPI debug" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;64 cores - MPI debug&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;These are the other enabled environment variables:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-left" image-alt="Other envvars" style="width: 999px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/19400i44C5562F2040BF1C/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="Screenshot 2021-09-15 at 12.46.03 PM.png" alt="Other envvars" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;Other envvars&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;So far I've worked around this with I_MPI_HYDRA_TOPOLIB=ipl, but that causes other issues when running jobs over InfiniBand, so it isn't ideal. The resulting pinning behaviour also isn't desirable, as shown in the screenshot below.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2021-09-15 at 12.51.33 PM.png" style="width: 835px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/19401iA91EF09B78547C6B/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="Screenshot 2021-09-15 at 12.51.33 PM.png" alt="Screenshot 2021-09-15 at 12.51.33 PM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Any advice on how to enforce proper process pinning would be very helpful.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Sep 2021 04:55:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314649#M8742</guid>
      <dc:creator>Tim_Pook</dc:creator>
      <dc:date>2021-09-15T04:55:24Z</dc:date>
    </item>
    <item>
      <title>Re: IMPI oversubscribing CPUs to ranks</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314704#M8743</link>
      <description>&lt;P&gt;Managed to fix this problem.&lt;/P&gt;
&lt;P&gt;I thought it was an issue with MPI, since it only occurred after we updated from Intel MPI 2018 to Intel MPI 2019, but it was actually caused by PBS / cgroups: the scheduler was forcing the job to use only 32 cores.&lt;/P&gt;
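&lt;P&gt;One way to confirm this kind of restriction from inside the job is to compare the CPUs the process is allowed to run on with the number of cores requested. This is a minimal sketch, assuming a Linux node with Python 3; the helper name cpu_shortfall and the value of 64 requested cores are illustrative and not part of PBS or Intel MPI:&lt;/P&gt;

```python
import os

# Assumption for illustration: the job asked the scheduler for 64 cores.
REQUESTED_CORES = 64


def cpu_shortfall(allowed_cpus, requested_cores):
    """Return how many requested cores are missing from the allowed CPU set."""
    return max(0, requested_cores - len(allowed_cpus))


if __name__ == "__main__" and hasattr(os, "sched_getaffinity"):
    # Linux-only call: the set of CPUs this process may be scheduled on.
    # A cpuset imposed on the job narrows this set.
    allowed = os.sched_getaffinity(0)
    missing = cpu_shortfall(allowed, REQUESTED_CORES)
    print(f"allowed CPUs: {len(allowed)}, requested: {REQUESTED_CORES}")
    if missing > 0:
        print(f"{missing} requested cores are not available to this process")
```

&lt;P&gt;Run inside the batch job, a non-zero shortfall points at the scheduler or cgroup configuration rather than at the MPI library.&lt;/P&gt;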
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Sep 2021 07:28:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314704#M8743</guid>
      <dc:creator>Tim_Pook</dc:creator>
      <dc:date>2021-09-15T07:28:47Z</dc:date>
    </item>
    <item>
      <title>Re: IMPI oversubscribing CPUs to ranks</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314777#M8745</link>
      <description>&lt;P&gt;&lt;FONT size="3"&gt;Hi,&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;Thanks for reaching out to us.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;Glad to know that your issue is resolved. Thanks for sharing the solution with us. If you need any additional information, &lt;SPAN style="background-color: var(--lwc-colorbackgroundinput,#ffffff); color: var(--lwc-colortextweak,#3e3e3c); font-size: var(--lwc-fontSize3,0.8125rem); font-family: var(--lwc-fontFamily,-apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif, 'Apple Color Emoji', 'Segoe UI Emoji', 'Segoe UI Symbol');"&gt;&amp;nbsp;&lt;/SPAN&gt;please post a new question as this thread will no longer be monitored by Intel.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;Thanks &amp;amp; Regards&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;Shivani&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Sep 2021 12:44:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IMPI-oversubscribing-CPUs-to-ranks/m-p/1314777#M8745</guid>
      <dc:creator>ShivaniK_Intel</dc:creator>
      <dc:date>2021-09-15T12:44:16Z</dc:date>
    </item>
  </channel>
</rss>

