<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MPI run crashes on more than one node in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-run-crashes-on-more-than-one-node/m-p/970159#M3283</link>
    <description>&lt;P&gt;Hi everyone,&lt;/P&gt;
&lt;P&gt;I'm using MPICH2 v1.5 to run my WRF model on INTEL Xeon Processors. I can run on one node with as many cores as I want but if it exceeds the number of porcessors in a core it will crash with following error:&lt;/P&gt;
&lt;P&gt;*********************************************************************************************************************************************&lt;/P&gt;
&lt;P&gt;[proxy:0:0@hpc1934] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:883): assert (!closed) failed&lt;BR /&gt;[proxy:0:0@hpc1934] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status&lt;BR /&gt;[proxy:0:0@hpc1934] main (./pm/pmiserv/pmip.c:210): demux engine error waiting for event&lt;BR /&gt;[proxy:0:7@hpc1945] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:883): assert (!closed) failed&lt;BR /&gt;[proxy:0:7@hpc1945] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status&lt;BR /&gt;[proxy:0:7@hpc1945] main (./pm/pmiserv/pmip.c:210): demux engine error waiting for event&lt;BR /&gt;[proxy:0:5@hpc1940] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:883): assert (!closed) failed&lt;BR /&gt;[proxy:0:5@hpc1940] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status&lt;BR /&gt;[proxy:0:5@hpc1940] main (./pm/pmiserv/pmip.c:210): demux engine error waiting for event&lt;BR /&gt;[mpiexec@hpc1934] HYDT_bscu_wait_for_completion (./tools/bootstrap/utils/bscu_wait.c:76): one of the processes terminated badly; aborting&lt;BR /&gt;[mpiexec@hpc1934] HYDT_bsci_wait_for_completion (./tools/bootstrap/src/bsci_wait.c:23): launcher returned error waiting for completion&lt;BR /&gt;[mpiexec@hpc1934] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:216): launcher returned error waiting for completion&lt;BR /&gt;[mpiexec@hpc1934] main (./ui/mpich/mpiexec.c:325): process manager error waiting for completion&lt;/P&gt;
&lt;P&gt;**********************************************************************************************************************************************&lt;/P&gt;
&lt;P&gt;and this is how I run the model:&lt;/P&gt;
&lt;P&gt;$ulimit -s unlimited&lt;/P&gt;
&lt;P&gt;$source ~/setup-intel.sh&lt;/P&gt;
&lt;P&gt;$mpiexec -np nproc ./wrf.exe &amp;gt;&amp;amp; benchmark#n.log&lt;/P&gt;
&lt;P&gt;I would appreciate any help in this regard.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Bests,&lt;/P&gt;
&lt;P&gt;Arash&lt;/P&gt;</description>
    <pubDate>Wed, 30 Oct 2013 21:25:37 GMT</pubDate>
    <dc:creator>arash_m_</dc:creator>
    <dc:date>2013-10-30T21:25:37Z</dc:date>
    <item>
      <title>MPI run crashes on more than one node</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-run-crashes-on-more-than-one-node/m-p/970159#M3283</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;
&lt;P&gt;I'm using MPICH2 v1.5 to run my WRF model on INTEL Xeon Processors. I can run on one node with as many cores as I want but if it exceeds the number of porcessors in a core it will crash with following error:&lt;/P&gt;
&lt;P&gt;*********************************************************************************************************************************************&lt;/P&gt;
&lt;P&gt;[proxy:0:0@hpc1934] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:883): assert (!closed) failed&lt;BR /&gt;[proxy:0:0@hpc1934] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status&lt;BR /&gt;[proxy:0:0@hpc1934] main (./pm/pmiserv/pmip.c:210): demux engine error waiting for event&lt;BR /&gt;[proxy:0:7@hpc1945] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:883): assert (!closed) failed&lt;BR /&gt;[proxy:0:7@hpc1945] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status&lt;BR /&gt;[proxy:0:7@hpc1945] main (./pm/pmiserv/pmip.c:210): demux engine error waiting for event&lt;BR /&gt;[proxy:0:5@hpc1940] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:883): assert (!closed) failed&lt;BR /&gt;[proxy:0:5@hpc1940] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status&lt;BR /&gt;[proxy:0:5@hpc1940] main (./pm/pmiserv/pmip.c:210): demux engine error waiting for event&lt;BR /&gt;[mpiexec@hpc1934] HYDT_bscu_wait_for_completion (./tools/bootstrap/utils/bscu_wait.c:76): one of the processes terminated badly; aborting&lt;BR /&gt;[mpiexec@hpc1934] HYDT_bsci_wait_for_completion (./tools/bootstrap/src/bsci_wait.c:23): launcher returned error waiting for completion&lt;BR /&gt;[mpiexec@hpc1934] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:216): launcher returned error waiting for completion&lt;BR /&gt;[mpiexec@hpc1934] main (./ui/mpich/mpiexec.c:325): process manager error waiting for completion&lt;/P&gt;
&lt;P&gt;**********************************************************************************************************************************************&lt;/P&gt;
&lt;P&gt;and this is how I run the model:&lt;/P&gt;
&lt;P&gt;$ulimit -s unlimited&lt;/P&gt;
&lt;P&gt;$source ~/setup-intel.sh&lt;/P&gt;
&lt;P&gt;$mpiexec -np nproc ./wrf.exe &amp;gt;&amp;amp; benchmark#n.log&lt;/P&gt;
&lt;P&gt;I would appreciate any help in this regard.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Bests,&lt;/P&gt;
&lt;P&gt;Arash&lt;/P&gt;</description>
      <pubDate>Wed, 30 Oct 2013 21:25:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-run-crashes-on-more-than-one-node/m-p/970159#M3283</guid>
      <dc:creator>arash_m_</dc:creator>
      <dc:date>2013-10-30T21:25:37Z</dc:date>
    </item>
  </channel>
</rss>

