<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hi Richard, in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936950#M2629</link>
    <description>&lt;P&gt;Hi Richard,&lt;/P&gt;
&lt;P&gt;Are you able to run with Hydra?&amp;nbsp; Please use either mpirun or mpiexec.hydra instead of mpiexec, and all of the other options should remain the same.&lt;/P&gt;
&lt;P&gt;Sincerely,&lt;BR /&gt; James Tullos&lt;BR /&gt; Technical Consulting Engineer&lt;BR /&gt; Intel® Cluster Tools&lt;/P&gt;</description>
    <pubDate>Mon, 23 Sep 2013 19:38:44 GMT</pubDate>
    <dc:creator>James_T_Intel</dc:creator>
    <dc:date>2013-09-23T19:38:44Z</dc:date>
    <item>
      <title>Intermittently Cannot Connect To Local MPD</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936949#M2628</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are intermittently seeing this error message when running an MPI job with the latest MPI Run-Time Library V4:&lt;/P&gt;&lt;P&gt;/usr/diags/mpi/impi/4.1.1.036/bin64/mpiexec -genv LD_LIBRARY_PATH /usr/diags/mpi/impi/4.1.1.036/lib64 -machinefile /tmp/mymachlist.103060.run -n 32 /usr/diags/mpi/intel/intel/bin/olconft.intel RUNTIME=2&lt;BR /&gt;mpdroot: cannot connect to local mpd at: /tmp/mpd2.console_root&lt;BR /&gt;probable cause: no mpd daemon on this machine&lt;BR /&gt;possible cause: unix socket /tmp/mpd2.console_root has been removed&lt;BR /&gt;mpiexec_A00A6D99 (__init__ 1524): forked process failed; status=255&lt;/P&gt;&lt;P&gt;Any idea what caues this error or can you help us determine the exact reason for the fork failure?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Sep 2013 19:24:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936949#M2628</guid>
      <dc:creator>Richard_R_4</dc:creator>
      <dc:date>2013-09-23T19:24:44Z</dc:date>
    </item>
    <item>
      <title>Hi Richard,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936950#M2629</link>
      <description>&lt;P&gt;Hi Richard,&lt;/P&gt;
&lt;P&gt;Are you able to run with Hydra?&amp;nbsp; Please use either mpirun or mpiexec.hydra instead of mpiexec, and all of the other options should remain the same.&lt;/P&gt;
&lt;P&gt;Sincerely,&lt;BR /&gt; James Tullos&lt;BR /&gt; Technical Consulting Engineer&lt;BR /&gt; Intel® Cluster Tools&lt;/P&gt;</description>
      <pubDate>Mon, 23 Sep 2013 19:38:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936950#M2629</guid>
      <dc:creator>James_T_Intel</dc:creator>
      <dc:date>2013-09-23T19:38:44Z</dc:date>
    </item>
    <item>
      <title>I updated our olconft Perl</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936951#M2630</link>
      <description>&lt;P&gt;I updated our olconft Perl script to use Hydra instead of MPD by setting this evnironemnt variable before executing mpboot:&lt;/P&gt;
&lt;P&gt;$ENV{"I_MPI_PROCESS_MANAGER"} = "hydra";&lt;/P&gt;
&lt;P&gt;olconf&amp;nbsp; Start time: Mon Sep 23 15:09:48 CDT 2013 &lt;BR /&gt;Running /usr/diags/mpi/intel/intel/bin/olconft.intel on nodes: A00A6D61.&lt;BR /&gt;&lt;BR /&gt;/usr/diags/mpi/impi/4.1.1.036/bin64/mpiexec -genv LD_LIBRARY_PATH /usr/diags/mpi/impi/4.1.1.036/lib64&amp;nbsp; -machinefile /tmp/mymachlist.42867.run -n 32 /usr/diags/mpi/intel/intel/bin/olconft.intel RUNTIME=2 &amp;nbsp;&lt;BR /&gt;mpdroot: cannot connect to local mpd at: /tmp/mpd2.console_root&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; probable cause:&amp;nbsp; no mpd daemon on this machine&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; possible cause:&amp;nbsp; unix socket /tmp/mpd2.console_root has been removed&lt;BR /&gt;mpiexec_A00A6D61 (__init__ 1524): forked process failed; status=255&lt;BR /&gt;Error: Return Status for /usr/diags/mpi/intel/intel/bin/olconft.intel is: 65280 &lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Sep 2013 20:14:43 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936951#M2630</guid>
      <dc:creator>Richard_R_4</dc:creator>
      <dc:date>2013-09-23T20:14:43Z</dc:date>
    </item>
    <item>
      <title>We found that by removing</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936952#M2631</link>
      <description>&lt;P&gt;We found that by removing this sysctl command from our MPI execution script the MPD problem was eliminated:&lt;/P&gt;
&lt;P&gt;sysctl -w vm.drop_caches=3&lt;/P&gt;
&lt;P&gt;This command clears page cache and inode/dentry cache and was used as a work-around for another unrelated problem about a year ago.&amp;nbsp; We do not know why running this command seems to cause the fork error seen as the MPD connect failure.&lt;/P&gt;
&lt;P&gt;However, we also are intermittently seeing a second error related to MPD:&lt;/P&gt;
&lt;P&gt;mpdboot failed: Inappropriate ioctl for device at &lt;I&gt;/usr/diags/bin/&lt;/I&gt;../lib/mpi_setup.pm line 173.&lt;/P&gt;
&lt;P&gt;when attempting to execute this mpdboot command from our Perl script:&lt;/P&gt;
&lt;P&gt;system("$MPDBOOT --rsh=/usr/bin/ssh --totalnum=$nhosts_plus1 --file=$NODE_LIST_FIL&amp;nbsp;&amp;nbsp;&amp;nbsp; E&amp;nbsp; &amp;gt; /dev/null 2&amp;gt;&amp;amp;1 ") &amp;amp;&amp;amp; die "mpdboot failed: $!";&lt;/P&gt;
&lt;P&gt;This problem also goes away when the sysctl command is removed, but still occurs with the sysctl command and running Hydra.&lt;/P&gt;
&lt;P&gt;In summary, running the sysctl to clear caches intermittently causes two distinct MPD-related errors.&amp;nbsp; Removing the sysctl command eliminates both errors.&lt;/P&gt;
&lt;P&gt;Leaving the sysctl command in our script and running with Hydra eliminates the connect failure caused by the fork error, but not the ioctl MPD error.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Sep 2013 22:08:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intermittently-Cannot-Connect-To-Local-MPD/m-p/936952#M2631</guid>
      <dc:creator>Richard_R_4</dc:creator>
      <dc:date>2013-09-25T22:08:44Z</dc:date>
    </item>
  </channel>
</rss>

