<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Problems mpdboot  in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858440#M1578</link>
    <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
Does ssh without password connect to that node, or does it refuse to connect? This can be as simple as stale entries in ~/.ssh/known_hosts or a disconnected or powered off component.&lt;BR /&gt;</description>
    <pubDate>Wed, 15 Apr 2009 22:09:23 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2009-04-15T22:09:23Z</dc:date>
    <item>
      <title>Problems mpdboot</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858439#M1577</link>
      <description>Hello,&lt;BR /&gt;&lt;BR /&gt;I am having a the following problem when executing mpdboot:&lt;BR /&gt;&lt;BR /&gt;$ mpdboot -n 2 -f /home/comsol/mpd.hosts -r ssh&lt;BR /&gt;mpdboot_cluster (handle_mpd_output 672): Failed to establish a socket connection with cl1n001:42406 : (111, 'Connection refused')&lt;BR /&gt;mpdboot_cluster (handle_mpd_output 689): failed to connect to mpd on cl1n001&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;I need to utilize mpi to be able to make Comsol 3.5 work in parallel form.&lt;BR /&gt;Comsol is paralleled in the following form:&lt;BR /&gt;cluster comsol35/bin&amp;gt; ./comsol -nn 2 mpd boot -f /home/comsol/mpd.hosts &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;The error I get is:&lt;BR /&gt;&lt;BR /&gt;mpdboot_cluster (handle_mpd_output 725): from mpd on cl1n001, invalid port info:&lt;BR /&gt;cl1n001: Connection refused&lt;BR /&gt;&lt;BR /&gt;Information:&lt;BR /&gt;Operating System: SLES 10 sp2&lt;BR /&gt;Version Intel Mpi: 3.1&lt;BR /&gt;&lt;BR /&gt;I really hope someone can help me.&lt;BR /&gt;&lt;BR /&gt;Thank you.&lt;BR /&gt;</description>
      <pubDate>Wed, 15 Apr 2009 19:39:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858439#M1577</guid>
      <dc:creator>carlos_veralive_cl</dc:creator>
      <dc:date>2009-04-15T19:39:26Z</dc:date>
    </item>
    <item>
      <title>Re: Problems mpdboot</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858440#M1578</link>
      <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
Does ssh without password connect to that node, or does it refuse to connect? This can be as simple as stale entries in ~/.ssh/known_hosts or a disconnected or powered off component.&lt;BR /&gt;</description>
      <pubDate>Wed, 15 Apr 2009 22:09:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858440#M1578</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2009-04-15T22:09:23Z</dc:date>
    </item>
    <item>
      <title>Re: Problems mpdboot</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858441#M1579</link>
      <description>Hi Carlos,&lt;BR /&gt;&lt;BR /&gt;The issue here is that, when you try to start the MPD daemons from the '&lt;STRONG&gt;cluster&lt;/STRONG&gt;' node, it's unable to connect to the '&lt;STRONG&gt;cl1n001&lt;/STRONG&gt;' node.&lt;BR /&gt;&lt;BR /&gt;As Tim mentioned, can you verify that passwordless SSH is setup on the cluster? Meaning that you can ssh from &lt;STRONG&gt;cluster&lt;/STRONG&gt; to &lt;STRONG&gt;cl1n001&lt;/STRONG&gt; without being prompted for a password? That's a requirement for the Intel MPI Library.&lt;BR /&gt;&lt;BR /&gt;Also, make sure that no old MPD daemons are running on the cluster. To do so, execute:&lt;BR /&gt;&lt;BR /&gt;&lt;CODE&gt;$ ps aux | grep mpd&lt;/CODE&gt;&lt;BR /&gt;&lt;BR /&gt;If you see a listing of any 'mpd' python processes running under your account, kill -9 those to clear out the port Intel MPI is trying to use (both for &lt;STRONG&gt;cluster&lt;/STRONG&gt; and &lt;STRONG&gt;cl1n001&lt;/STRONG&gt;).&lt;BR /&gt;&lt;BR /&gt;Finally, this could be an issue where Intel MPI tries to create the initial mpd logfile but it can't. By default, this will be done in /tmp on the node. Can you verify that you have access and can indeed write into /tmp, or if there is a file called /tmp/mpd2.logfile_&lt;USER&gt;?&lt;BR /&gt;&lt;BR /&gt;Generally, I would also recommend upgrading to the latest Intel MPI Library 3.2 Update 1.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;~Gergana&lt;/USER&gt;</description>
      <pubDate>Wed, 15 Apr 2009 22:21:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problems-mpdboot/m-p/858441#M1579</guid>
      <dc:creator>Gergana_S_Intel</dc:creator>
      <dc:date>2009-04-15T22:21:52Z</dc:date>
    </item>
  </channel>
</rss>

