<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic problem to use 4 nodes in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778938#M341</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Welcome to the Intel HPC forums!&lt;/P&gt;&lt;P&gt;It seems like your PBS script is fine but the "caused collective abort of all ranks" error is fairly generic. It mostly means your application failed. It would be great if you could provide your full PBS script, with your mpirun/mpiexec command line, etc. Also, any info on your cluster (e.g. OS version, using InfiniBand or Ethernet, MPI library version, math library version - MKL or something else, etc) would be helpful.&lt;/P&gt;&lt;P&gt;Looking forward to hearing back.&lt;/P&gt;&lt;P&gt;Regards,&lt;BR /&gt;~Gergana&lt;/P&gt;</description>
    <pubDate>Thu, 15 Jul 2010 17:11:04 GMT</pubDate>
    <dc:creator>Gergana_S_Intel</dc:creator>
    <dc:date>2010-07-15T17:11:04Z</dc:date>
    <item>
      <title>problem to use 4 nodes</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778937#M340</link>
      <description>I am trying to calculate a function's gradient vector of size 12 on a PC cluster. It seems to run fine if I request one node with 4 cores by using:&lt;BR /&gt;#PBS -l select=1:ncpus=4:mem=1200mb&lt;BR /&gt;This way, each processor calculates 3 elements of the gradient. &lt;BR /&gt;&lt;BR /&gt;Now I want to use 12 processors to calculate the 12-element gradient vector in one go, so I tried to request 3 nodes by using:&lt;BR /&gt;#PBS -l select=3:ncpus=4:mem=1200mb&lt;BR /&gt;It complained:&lt;BR /&gt;&amp;gt;&amp;gt;&amp;gt; rank 7 in job 1 cx1-5-3-2.cx1.hpc.ic.ac.uk_49216 caused collective abort of all ranks exit status of rank 7: return code 29 &lt;BR /&gt;...... etc.&lt;BR /&gt;&lt;BR /&gt;I am new to MPI. Is there anything I should be aware of when requesting multiple nodes? Many thanks for reading my thread. &lt;BR /&gt;</description>
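      <!--
The post above splits a 12-element gradient over 4 processes (3 elements each) and then over 12 processes (1 element each). A minimal sketch of that block split follows. The function name elements_for_rank and its parameters are hypothetical and are not taken from the poster's program, which the thread does not show.

```python
def elements_for_rank(rank, size, n_elements=12):
    """Return the gradient indices that `rank` computes out of `size` ranks.

    Hypothetical helper for illustration; the original application code
    is not shown in the thread.
    """
    base, extra = divmod(n_elements, size)
    # When the split is uneven, the first `extra` ranks take one more element.
    start = rank * base + min(rank, extra)
    count = base + (1 if rank < extra else 0)
    return list(range(start, start + count))


if __name__ == "__main__":
    # 4 ranks as in the single-node job, 12 ranks as in the three-node job.
    for size in (4, 12):
        print(size, [elements_for_rank(r, size) for r in range(size)])
```

In an MPI program the rank and size would come from the communicator; here they are plain arguments so the split can be checked without a cluster.
      -->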
      <pubDate>Thu, 15 Jul 2010 16:10:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778937#M340</guid>
      <dc:creator>yyxt11a</dc:creator>
      <dc:date>2010-07-15T16:10:08Z</dc:date>
    </item>
    <item>
      <title>problem to use 4 nodes</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778938#M341</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Welcome to the Intel HPC forums!&lt;/P&gt;&lt;P&gt;It seems like your PBS script is fine but the "caused collective abort of all ranks" error is fairly generic. It mostly means your application failed. It would be great if you could provide your full PBS script, with your mpirun/mpiexec command line, etc. Also, any info on your cluster (e.g. OS version, using InfiniBand or Ethernet, MPI library version, math library version - MKL or something else, etc) would be helpful.&lt;/P&gt;&lt;P&gt;Looking forward to hearing back.&lt;/P&gt;&lt;P&gt;Regards,&lt;BR /&gt;~Gergana&lt;/P&gt;</description>
      <pubDate>Thu, 15 Jul 2010 17:11:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778938#M341</guid>
      <dc:creator>Gergana_S_Intel</dc:creator>
      <dc:date>2010-07-15T17:11:04Z</dc:date>
    </item>
    <item>
      <title>problem to use 4 nodes</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778939#M342</link>
      <description>I have asked the HPC administrator. He said each node has its own hard drive. Because all MPI ranks need the input data files, I have to copy the files to the local hard drives of all nodes in the job by using the "pbsdsh" command in the job script. &lt;BR /&gt;&lt;BR /&gt;This seems to have solved my problem.</description>
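      <!--
The fix above depends on every node holding its own copy of the input files. As a hedged illustration (not the poster's code), each rank could run a small check at startup that reports which files are missing on its host, so a staging problem gives a readable message rather than a generic collective abort. The names missing_inputs and check_inputs are hypothetical.

```python
import os
import socket


def missing_inputs(paths):
    """Return the entries of `paths` that are not readable files on this host."""
    return [p for p in paths if not (os.path.isfile(p) and os.access(p, os.R_OK))]


def check_inputs(paths):
    """Raise an error naming this host and the input files it cannot read.

    Hypothetical helper; call it at startup on every rank, before any
    file is opened.
    """
    missing = missing_inputs(paths)
    if missing:
        host = socket.gethostname()
        raise FileNotFoundError(f"{host}: missing input files: {', '.join(missing)}")
```

Which paths to pass depends on where the job script stages the data; the thread does not give those locations.
      -->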
      <pubDate>Fri, 16 Jul 2010 12:55:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/problem-to-use-4-nodes/m-p/778939#M342</guid>
      <dc:creator>yyxt11a</dc:creator>
      <dc:date>2010-07-16T12:55:38Z</dc:date>
    </item>
  </channel>
</rss>

