<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic $SLURM_NODELIST not interpreted correctly in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795050#M679</link>
    <description>Dmitry,&lt;BR /&gt;&lt;BR /&gt;I'm glad to know it was a known problem.&lt;BR /&gt;&lt;BR /&gt;Thanks for your assitance. I have submited the tracker in Premier Support as you requested.&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Wed, 03 Nov 2010 15:11:19 GMT</pubDate>
    <dc:creator>ccladmin</dc:creator>
    <dc:date>2010-11-03T15:11:19Z</dc:date>
    <item>
      <title>$SLURM_NODELIST not interpreted correctly</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795047#M676</link>
      <description>Hello,&lt;BR /&gt;&lt;BR /&gt;I've recently setup a new cluster that uses slurm for resource allocation. Upon starting a new mpi job with:&lt;BR /&gt;$ salloc -n 32 sh&lt;BR /&gt;$ mpirun -np 32 -nolocal a.out&lt;BR /&gt;&lt;BR /&gt;I get the following errors:&lt;BR /&gt;&lt;BR /&gt;failed to connect to the socket (sock2): {socket.gaierror, (-2, 'Name or service not known')}. Probable reason: host "node2" is invalid&lt;BR /&gt;(mpdboot 494): failed to connect to the socket (sock2): {socket.error, (9, 'Bad file descriptor')}. Probable reason: host "node2" is invalid&lt;BR /&gt;&lt;THIS repeats="" a="" number="" of="" times=""&gt;&lt;BR /&gt;totalnum=3 numhosts=2&lt;BR /&gt;there are not enough hosts on which to start all processes&lt;BR /&gt;&lt;BR /&gt;I then checked to see if slurm nodelist is set properly with:&lt;BR /&gt;$ echo $SLURM_NODELIST&lt;BR /&gt;node[01-02]&lt;BR /&gt;and to confirm its intel mpi:&lt;BR /&gt;$ which mpirun&lt;BR /&gt;/opt/intel/impi/4.0.0.028/intel64/bin/mpirun&lt;BR /&gt;&lt;BR /&gt;The nodes below 10 have a leading zero in their hostname. It looks like mpirun is dropping this leading zero in the hostname.&lt;BR /&gt;&lt;BR /&gt;Am I missing something really simple and there is a straightforward fix?&lt;/THIS&gt;</description>
      <pubDate>Wed, 03 Nov 2010 05:26:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795047#M676</guid>
      <dc:creator>ccladmin</dc:creator>
      <dc:date>2010-11-03T05:26:53Z</dc:date>
    </item>
    <item>
      <title>$SLURM_NODELIST not interpreted correctly</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795048#M677</link>
      <description>Hi ccladmin,&lt;BR /&gt;&lt;BR /&gt;&amp;gt;It looks like mpirun is dropping this leading zero in the hostname.&lt;BR /&gt;Unfortunately this is an mpirun's bug.&lt;BR /&gt;Could you rename your nodes starting from 100 (or maybe from 1000) - it's the easiest workaround for this issue so far.&lt;BR /&gt;&lt;BR /&gt;Regards!&lt;BR /&gt; Dmitry&lt;BR /&gt;</description>
      <pubDate>Wed, 03 Nov 2010 06:56:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795048#M677</guid>
      <dc:creator>Dmitry_K_Intel2</dc:creator>
      <dc:date>2010-11-03T06:56:06Z</dc:date>
    </item>
    <item>
      <title>$SLURM_NODELIST not interpreted correctly</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795049#M678</link>
      <description>ccladmin,&lt;BR /&gt;&lt;BR /&gt;Could you submit a tracker through Premier Support so I will be able to provide you a patch.&lt;BR /&gt;&lt;BR /&gt;Regards!&lt;BR /&gt; Dmitry&lt;BR /&gt;</description>
      <pubDate>Wed, 03 Nov 2010 08:50:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795049#M678</guid>
      <dc:creator>Dmitry_K_Intel2</dc:creator>
      <dc:date>2010-11-03T08:50:07Z</dc:date>
    </item>
    <item>
      <title>$SLURM_NODELIST not interpreted correctly</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795050#M679</link>
      <description>Dmitry,&lt;BR /&gt;&lt;BR /&gt;I'm glad to know it was a known problem.&lt;BR /&gt;&lt;BR /&gt;Thanks for your assitance. I have submited the tracker in Premier Support as you requested.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 03 Nov 2010 15:11:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/SLURM-NODELIST-not-interpreted-correctly/m-p/795050#M679</guid>
      <dc:creator>ccladmin</dc:creator>
      <dc:date>2010-11-03T15:11:19Z</dc:date>
    </item>
  </channel>
</rss>

