<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Problem with intelmpi 4.0, process desapear, will be zombies or in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770976#M179</link>
    <description>The options that appear in the doccument '--mca' not work for me ... it is normal?&lt;BR /&gt;&lt;BR /&gt;Thank you ro the answer.&lt;BR /&gt;JP</description>
    <pubDate>Thu, 29 Jul 2010 20:08:47 GMT</pubDate>
    <dc:creator>jperaltac</dc:creator>
    <dc:date>2010-07-29T20:08:47Z</dc:date>
    <item>
      <title>Problem with intelmpi 4.0, process desapear, will be zombies or just finish</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770974#M177</link>
      <description>Dear Support team&lt;BR /&gt;&lt;BR /&gt;i have some problems using intelmpi, sometimes the process work fine without problems, (I use quantum-espresso software) but other times the process just desappear of the nodes and the queue system (torque) not finish the job. By the way, some works that are working (and in the nodes the process appears R) not continue writing in my ouput file&lt;BR /&gt;&lt;BR /&gt;i use qsub to send the pbs system this is a example of the 'principal part' of the pbs file :&lt;BR /&gt;&lt;BR /&gt;tmpfile=nodelist&lt;BR /&gt;rm -f ${tmpfile}&lt;BR /&gt;for s in `sort &amp;lt; ${PBS_NODEFILE} | uniq `&lt;BR /&gt;do echo " ${s}" &amp;gt;&amp;gt; ${tmpfile} ; numcoresf=`expr ${numcoresf} + ${NCORES}`; done&lt;BR /&gt;:&lt;BR /&gt;source /lustre/jperalta/intel/impi/4.0.0.028/intel64/bin/mpivars.sh&lt;BR /&gt;export I_MPI_PERHOST=8&lt;BR /&gt;export I_MPI_FABRIC="shm:dapl"&lt;BR /&gt;export I_MPI_DAPL_PROVIDER="ofa-v2-mlx4_0-2"&lt;BR /&gt;# DEFINE THE COMMAND&lt;BR /&gt;PWCOMMAND="mpirun -f ${tmpfile} -n ${numcoresf} /lustre/jperalta/src/espresso-4.2.1/bin-impi/qe_pw.x "&lt;BR /&gt;echo Final executable command $PWCOMMAND&lt;BR /&gt;&lt;BR /&gt;# EXECUTE THE COMMAND&lt;BR /&gt;${PWCOMMAND} &amp;lt; ${INPUTFILE} &amp;gt;&amp;gt; ${OUTPUTFILE}&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Sometimes the work finish well other times not, and others send me messages like as :&lt;BR /&gt;&lt;BR /&gt;mpdboot_n13 (handle_mpd_output 883): Failed to establish a socket connection with n9:53000 : (111, 'Connection refused')&lt;BR /&gt;mpdboot_n13 (handle_mpd_output 900): failed to connect to mpd on n9&lt;BR /&gt;&lt;BR /&gt;But if i send again .. this run! .. i don't know what happend.&lt;BR /&gt;&lt;BR /&gt;If this is a problem of the cluster, What i should say to the admin?&lt;BR /&gt;&lt;BR /&gt;And the last .. i have a 'very strange?' excellent performance vs openmpi 1.4 using espresso ... from 1d6h to 3 hours! .. so is very important to my try to correct and take the desicion of buy (i'm in my trial period) (if i buy intel-mpi i have upgrades free too?)&lt;BR /&gt;&lt;BR /&gt;Regards&lt;BR /&gt;JP</description>
      <pubDate>Thu, 29 Jul 2010 16:36:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770974#M177</guid>
      <dc:creator>jperaltac</dc:creator>
      <dc:date>2010-07-29T16:36:01Z</dc:date>
    </item>
    <item>
      <title>Problem with intelmpi 4.0, process desapear, will be zombies or</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770975#M178</link>
      <description>&lt;DIV&gt;If you are using expresso, you may find interesting the following document.&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;A href="http://www.hpcadvisorycouncil.com/pdf/ESPRESSO_Best_Practices.pdf" target="_blank"&gt;http://www.hpcadvisorycouncil.com/pdf/ESPRESSO_Best_Practices.pdf&lt;/A&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;Hope it helps.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="font-family: Verdana, Arial, Helvetica, sans-serif;"&gt;-- Andres&lt;/SPAN&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 29 Jul 2010 17:59:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770975#M178</guid>
      <dc:creator>Andres_M_Intel4</dc:creator>
      <dc:date>2010-07-29T17:59:26Z</dc:date>
    </item>
    <item>
      <title>Problem with intelmpi 4.0, process desapear, will be zombies or</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770976#M179</link>
      <description>The options that appear in the doccument '--mca' not work for me ... it is normal?&lt;BR /&gt;&lt;BR /&gt;Thank you ro the answer.&lt;BR /&gt;JP</description>
      <pubDate>Thu, 29 Jul 2010 20:08:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770976#M179</guid>
      <dc:creator>jperaltac</dc:creator>
      <dc:date>2010-07-29T20:08:47Z</dc:date>
    </item>
    <item>
      <title>Problem with intelmpi 4.0, process desapear, will be zombies or</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770977#M180</link>
      <description>--mca option is specific to openmpi. I don't know that doc, but you can use only advice which is good for MPI in general or given specifically for Intel MPI. If you are having difficulty understanding the Intel MPI equivalent of one of the common --mca options, you could likely get help here if you would explain what you want.</description>
      <pubDate>Thu, 29 Jul 2010 22:05:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770977#M180</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2010-07-29T22:05:08Z</dc:date>
    </item>
    <item>
      <title>Problem with intelmpi 4.0, process desapear, will be zombies or</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770978#M181</link>
      <description>Thank you.&lt;BR /&gt;&lt;BR /&gt;The administrator was clean and restart all nodes, now some jobs work better but sometimes if i send a work and this fails (for technical reasons, like a input wrong or similars) the job not finished in torque. The job still 'R' and then I kill this (by hand using qdel) but sometimes i recibe this information and the node continue with this process in Zombie status (with PPID=1).&lt;BR /&gt;&lt;BR /&gt;257 Traceback (most recent call last):&lt;BR /&gt;258 File "/lustre/jperalta/intel/impi/4.0.0.028/intel64/bin/mpdcleanup", line 239, in ?&lt;BR /&gt;259 mpdcleanup()&lt;BR /&gt;260 File "/lustre/jperalta/intel/impi/4.0.0.028/intel64/bin/mpdcleanup", line 215, in mpdcleanup&lt;BR /&gt;261 pid = re.split(r'\s+', first_string)[5]&lt;BR /&gt;262 IndexError: list index out of range&lt;BR /&gt;&lt;BR /&gt;Anybody can help me, in order to avoid leave Zombie process in the nodes? How i can kill the jobs and make a deep clean of mpdboot before start en each node?&lt;BR /&gt;&lt;BR /&gt;Thanks in advance&lt;BR /&gt;Joaquin</description>
      <pubDate>Tue, 03 Aug 2010 20:57:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Problem-with-intelmpi-4-0-process-desapear-will-be-zombies-or/m-p/770978#M181</guid>
      <dc:creator>jperaltac</dc:creator>
      <dc:date>2010-08-03T20:57:19Z</dc:date>
    </item>
  </channel>
</rss>

