<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Re:IntelMPI not following machinefile with slurm in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1223922#M7286</link>
    <description>&lt;P&gt;Hi James,&lt;/P&gt;
&lt;P&gt;Do you know why the behavior differs between your internal run and ours? Is there a setting we're missing for our run?&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Erica&lt;/P&gt;</description>
    <pubDate>Thu, 29 Oct 2020 19:51:15 GMT</pubDate>
    <dc:creator>mpiuser1</dc:creator>
    <dc:date>2020-10-29T19:51:15Z</dc:date>
    <item>
      <title>IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218055#M7237</link>
      <description>&lt;P&gt;Hi Intel community,&lt;/P&gt;
&lt;P&gt;I am using IntelMPI 2019.8 with slurm. When we run with a machinefile, process placement does not follow the assigned nodes exactly. For example, all the processes assigned to node1 end up on node2, and all the processes assigned to node2 end up on another node. How do we make it follow the machinefile exactly? I am attaching the sample program we use to test the machinefile, along with the slurm script.&lt;/P&gt;
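&lt;P&gt;A minimal sketch of how the expected placement could be derived from a machinefile for comparison, assuming one host per line with an optional host:count suffix and one rank per slot in file order. The function name expected_placement and the file actual.txt are hypothetical and are not part of the attached files.&lt;/P&gt;

```shell
# Hypothetical helper (not from the attached script): expand a machinefile
# into one "rank host" pair per line, in the order the file lists the hosts.
# Assumes one host per line, optionally written as host:count; empty lines
# are skipped.
expected_placement() {
    awk -F: '
        NF == 0 { next }
        {
            n = ($2 == "" ? 1 : $2)
            for (i = n; i > 0; i--) print rank++, $1
        }
    ' "$1"
}

# Example use (actual.txt is a hypothetical file holding the "rank host"
# pairs printed by the test program):
#   expected_placement machinefile.txt | diff - actual.txt
```

&lt;P&gt;If the test program prints each rank with its hostname in the same "rank host" form, any line reported by diff would point at a rank that landed on a different node than the machinefile requested.&lt;/P&gt;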
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Erica&lt;/P&gt;</description>
      <pubDate>Thu, 15 Oct 2020 14:58:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218055#M7237</guid>
      <dc:creator>mpiuser1</dc:creator>
      <dc:date>2020-10-15T14:58:41Z</dc:date>
    </item>
    <item>
      <title>Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218363#M7239</link>
      <description>&lt;P&gt;Hi Erica,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for reporting this to us.&lt;/P&gt;&lt;P&gt;We have observed similar behaviour with SLURM. Process placement is correct with other job schedulers (we checked PBS).&lt;/P&gt;&lt;P&gt;So, we are transferring this to our internal team for further support.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 16 Oct 2020 11:42:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218363#M7239</guid>
      <dc:creator>PrasanthD_intel</dc:creator>
      <dc:date>2020-10-16T11:42:35Z</dc:date>
    </item>
    <item>
      <title>Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218464#M7241</link>
      <description>&lt;P&gt;When I tested with 2019 Update 8 on an internal cluster, I saw the expected behavior. Can you please send the full output with I_MPI_DEBUG=16?&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 16 Oct 2020 17:48:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218464#M7241</guid>
      <dc:creator>James_T_Intel</dc:creator>
      <dc:date>2020-10-16T17:48:12Z</dc:date>
    </item>
    <item>
      <title>Re: Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218531#M7242</link>
      <description>&lt;P&gt;Hi James,&lt;/P&gt;
&lt;P&gt;Here is the output with the corresponding machinefile.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Erica&lt;/P&gt;</description>
      <pubDate>Fri, 16 Oct 2020 20:32:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1218531#M7242</guid>
      <dc:creator>mpiuser1</dc:creator>
      <dc:date>2020-10-16T20:32:52Z</dc:date>
    </item>
    <item>
      <title>Re: Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1223922#M7286</link>
      <description>&lt;P&gt;Hi James,&lt;/P&gt;
&lt;P&gt;Do you know why the behavior differs between your internal run and ours? Is there a setting we're missing for our run?&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Erica&lt;/P&gt;</description>
      <pubDate>Thu, 29 Oct 2020 19:51:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1223922#M7286</guid>
      <dc:creator>mpiuser1</dc:creator>
      <dc:date>2020-10-29T19:51:15Z</dc:date>
    </item>
    <item>
      <title>Re: Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1223923#M7287</link>
      <description>&lt;P&gt;Hi James,&lt;/P&gt;
&lt;P&gt;Could you share your slurm job script with us so we can test it? Which version of slurm did you test it on?&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Erica&lt;/P&gt;</description>
      <pubDate>Thu, 29 Oct 2020 19:53:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1223923#M7287</guid>
      <dc:creator>mpiuser1</dc:creator>
      <dc:date>2020-10-29T19:53:08Z</dc:date>
    </item>
    <item>
      <title>Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1300915#M8630</link>
      <description>&lt;P&gt;I apologize for dropping this. Here is the script I used for testing. I randomized the order of the hosts to confirm that the machinefile is being used rather than the SLURM nodelist. I tested on a customized version of SLURM 20.11.7, and the output matches the order in the machinefile.&lt;/P&gt;
&lt;PRE&gt;#!/bin/bash
#SBATCH -N 8

# Write every allocated host twice, each pass in shuffled order.
scontrol show hostnames "$SLURM_JOB_NODELIST" | shuf &amp;gt; machinefile.txt
scontrol show hostnames "$SLURM_JOB_NODELIST" | shuf &amp;gt;&amp;gt; machinefile.txt

source /opt/intel/oneAPI/latest/setvars.sh

# Launch 16 ranks using the machinefile, bootstrapping over ssh.
mpirun -n 16 -machinefile machinefile.txt -genv I_MPI_DEBUG 3 -bootstrap ssh ./a.out
&lt;/PRE&gt;</description>
      <pubDate>Fri, 23 Jul 2021 17:33:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1300915#M8630</guid>
      <dc:creator>James_T_Intel</dc:creator>
      <dc:date>2021-07-23T17:33:41Z</dc:date>
    </item>
    <item>
      <title>Re:IntelMPI not following machinefile with slurm</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1303697#M8651</link>
      <description>&lt;P&gt;I am closing the Intel support case related to this thread.  Everything appears to be functioning as expected in multiple test scenarios.  Any further replies on this thread will be considered community only.  If you require additional support assistance on this issue, please start a new thread with current details and logs.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 04 Aug 2021 17:36:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/IntelMPI-not-following-machinefile-with-slurm/m-p/1303697#M8651</guid>
      <dc:creator>James_T_Intel</dc:creator>
      <dc:date>2021-08-04T17:36:46Z</dc:date>
    </item>
  </channel>
</rss>

