<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: impi/mkl_scalapack causing kernel panic in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863564#M1662</link>
    <description>&lt;OL&gt;
&lt;LI&gt;Is it allowed to establish connection from compute nodes to the head node?mpdboot alwaysstart mpd daemon on local node first .After that remote mpd daemonst attemt to perfrom connection to it?&lt;/LI&gt;
&lt;LI&gt;Do you able to start mpd manually?&lt;/LI&gt;
&lt;UL&gt;
&lt;LI&gt;Run the mpd -e -d command on the head node. The port number will be printed on stdout. &lt;/LI&gt;
&lt;LI&gt;Run the mpd -h head_node -p &lt;PORT_NUMBER&gt; -d command to establish MPD ring. Use the port number printed at pervious step.&lt;/PORT_NUMBER&gt;&lt;/LI&gt;
&lt;LI&gt;Check if ring was established succesfully. Run the mpdtrace command for that.&lt;/LI&gt;&lt;/UL&gt;&lt;/OL&gt;
&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 30 Jan 2008 12:39:42 GMT</pubDate>
    <dc:creator>Andrey_D_Intel</dc:creator>
    <dc:date>2008-01-30T12:39:42Z</dc:date>
    <item>
      <title>impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863555#M1653</link>
      <description>I'm testing out the cluster toolkit for Linux on em64t and the scalapack Cholesky factorization routine (pdpotrf_) causes a Machine Check Exception and kernel panic very predictably. No other scalapack/pblas routine so far has caused any problems.
&lt;BR /&gt;
&lt;BR /&gt;Additionally, each time I start an mpi job using 'mpiexec', the mpd daemon outputs the following:
&lt;BR /&gt;unable to parse pmi message from the process :cmd=put kvsname=kvs_nerf_4268_1_0 key=DAPL_PROVIDER value=
&lt;BR /&gt;
&lt;BR /&gt;It may not matter, but the only way I've found to get the mpd ring running is:
&lt;BR /&gt;[host]$ mpd --ifhn=10.0.0.1 -l 4268 &amp;amp;
&lt;BR /&gt;[node]$ mpd -h nerf -p 4268 &amp;amp;
&lt;BR /&gt;
&lt;BR /&gt;Any help clearing these up would be greatly appreciated, as I won't be purchasing the software otherwise. MCEs are unacceptable.</description>
      <pubDate>Mon, 21 Jan 2008 19:26:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863555#M1653</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-01-21T19:26:21Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863556#M1654</link>
      <description>The forum pulled out the last part of the error message:
"value="</description>
      <pubDate>Mon, 21 Jan 2008 19:28:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863556#M1654</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-01-21T19:28:04Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863557#M1655</link>
      <description>In brackets, "NULL string"</description>
      <pubDate>Mon, 21 Jan 2008 19:28:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863557#M1655</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-01-21T19:28:38Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863558#M1656</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;Could you clarify the Intel MPI Library version you use? Please check package ID information in the mpisupport.txt file.&lt;/P&gt;
&lt;P&gt;By the way, you can fill a bug report at &lt;A href="https://primer.intel.com"&gt;https://primer.intel.com&lt;/A&gt; to get a technical assistance.&lt;/P&gt;
&lt;P&gt;Best regards,&lt;/P&gt;
&lt;P&gt;Andrey&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Jan 2008 12:21:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863558#M1656</guid>
      <dc:creator>Andrey_D_Intel</dc:creator>
      <dc:date>2008-01-24T12:21:29Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863559#M1657</link>
      <description>Package ID: l_mpi_p_3.1.026
&lt;BR /&gt;
&lt;BR /&gt;If I had to guess, I would say it's in pdsyrk. I heavily performance tested pdtrsm and dpotrf before trying pdpotrf.</description>
      <pubDate>Thu, 24 Jan 2008 19:54:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863559#M1657</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-01-24T19:54:36Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863560#M1658</link>
      <description>That link doesn't work.</description>
      <pubDate>Fri, 25 Jan 2008 15:19:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863560#M1658</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-01-25T15:19:54Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863561#M1659</link>
      <description>Could you give more details on cluster configuration? I'd like to understand why you was not able to use mpdboot to launch MPD ring.</description>
      <pubDate>Fri, 25 Jan 2008 15:27:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863561#M1659</guid>
      <dc:creator>Andrey_D_Intel</dc:creator>
      <dc:date>2008-01-25T15:27:45Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863562#M1660</link>
      <description>I did a misprint. Sorry. The right link is &lt;A href="https://premier.intel.com"&gt;https://premier.intel.com&lt;/A&gt;</description>
      <pubDate>Fri, 25 Jan 2008 15:29:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863562#M1660</guid>
      <dc:creator>Andrey_D_Intel</dc:creator>
      <dc:date>2008-01-25T15:29:53Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863563#M1661</link>
      <description>The test cluster consists of 2 4 processor machines behind a firewall. The headnode, nerf, has two ethernet ports, one connected to the firewall, one to the node, ball. All IPs are in the 10.0.0.0 network. 
&lt;BR /&gt;
&lt;BR /&gt;When I try:
&lt;BR /&gt;mpdboot --totalnum=2 --file=./mpd.hosts --rsh=ssh
&lt;BR /&gt;
&lt;BR /&gt;the output is:
&lt;BR /&gt;mpdboot_nerf (handle_mpd_output 681): failed to ping mpd on ball; received output={}
&lt;BR /&gt;
&lt;BR /&gt;Also, the premier support link won't let me in, as I'm only evaluating the software right now.</description>
      <pubDate>Fri, 25 Jan 2008 16:23:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863563#M1661</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-01-25T16:23:40Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863564#M1662</link>
      <description>&lt;OL&gt;
&lt;LI&gt;Is it allowed to establish connection from compute nodes to the head node?mpdboot alwaysstart mpd daemon on local node first .After that remote mpd daemonst attemt to perfrom connection to it?&lt;/LI&gt;
&lt;LI&gt;Do you able to start mpd manually?&lt;/LI&gt;
&lt;UL&gt;
&lt;LI&gt;Run the mpd -e -d command on the head node. The port number will be printed on stdout. &lt;/LI&gt;
&lt;LI&gt;Run the mpd -h head_node -p &lt;PORT_NUMBER&gt; -d command to establish MPD ring. Use the port number printed at pervious step.&lt;/PORT_NUMBER&gt;&lt;/LI&gt;
&lt;LI&gt;Check if ring was established succesfully. Run the mpdtrace command for that.&lt;/LI&gt;&lt;/UL&gt;&lt;/OL&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Jan 2008 12:39:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863564#M1662</guid>
      <dc:creator>Andrey_D_Intel</dc:creator>
      <dc:date>2008-01-30T12:39:42Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863565#M1663</link>
      <description>Ops! I see that you can start ring manually. Could you share the content of your mpd.hosts file? Could you share the output from mpdboot -d -v... command? Is there any useful information in /tmp/mpd2.logfile_&lt;YOUR_USERID&gt;&lt;/YOUR_USERID&gt;</description>
      <pubDate>Wed, 30 Jan 2008 12:46:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863565#M1663</guid>
      <dc:creator>Andrey_D_Intel</dc:creator>
      <dc:date>2008-01-30T12:46:32Z</dc:date>
    </item>
    <item>
      <title>Re: impi/mkl_scalapack causing kernel panic</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863566#M1664</link>
      <description>After reconfiguring the network settings several times, and reorganizing all of my environment variables (I had several MPI implementations installed), the problem went away, and I could boot up the MPD daemons via:
&lt;BR /&gt;mpdboot --file= --rsh=ssh
&lt;BR /&gt;
&lt;BR /&gt;I wish I could explain more specifically, but I changed far too many things in the process of compiling ScaLAPACK from scratch for several MPI implementations.</description>
      <pubDate>Sat, 02 Feb 2008 01:33:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/impi-mkl-scalapack-causing-kernel-panic/m-p/863566#M1664</guid>
      <dc:creator>poulson_jack</dc:creator>
      <dc:date>2008-02-02T01:33:00Z</dc:date>
    </item>
  </channel>
</rss>

