<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic HI TimP, in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983135#M3512</link>
    <description>&lt;P&gt;HI TimP,&lt;/P&gt;
&lt;P&gt;I have tested on one node by&lt;/P&gt;
&lt;P&gt;mpiexec -n 5 test&lt;/P&gt;
&lt;P&gt;with and without setting I_MPI_PIN_DOMAIN=omp. If don't set that, the total CPU usage is only 80% and if set that, the total CPU usage is 100%. The program runs a little faster when setting I_MPI_PIN_DOMAIN=omp.&lt;/P&gt;
&lt;P&gt;Now the problem is that the mpiexec can't work for 10-node cluster after set I_MPI_PIN_DOMAIN=omp.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Zhanghong Tang&lt;/P&gt;</description>
    <pubDate>Sun, 25 Aug 2013 15:19:08 GMT</pubDate>
    <dc:creator>Laoya__Tang</dc:creator>
    <dc:date>2013-08-25T15:19:08Z</dc:date>
    <item>
      <title>question about Hybrid MPI/OpenMP</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983132#M3509</link>
      <description>&lt;P&gt;Dear all,&lt;/P&gt;
&lt;P&gt;I run the program by the following command:&lt;/P&gt;
&lt;P&gt;mpiexec -wdir z:\directional -mapall -hosts 10 n01 5 n02 5 n03 5 n04 5 n05 5 n06 5 n07 5 n08 5 n09 5 n10 5 test&lt;/P&gt;
&lt;P&gt;The cluster has 10 nodes with 24 logical cores ( 2*Intel(R) Xeon(R) CPU x5675) on every node. The program test have openMP based parallel calculation in some part, but also a considerable part is not parallelized. However, the problem is that the program 'test' only use 4 cores when running in parallel part (total CPU usage is only 80%), I noticed that when set I_MPI_PIN_DOMAIN=omp, every process 'test' will use all 24 cores. I have tested the program 'test' on one node by&lt;/P&gt;
&lt;P&gt;mpiexec -wdir z:\directional -mapall -n 5 test&lt;/P&gt;
&lt;P&gt;The program 'test' runs what I wanted (total CPU usage is 100% when in parallel part).&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;Now the problem is that the first command failed after I set I_MPI_PIN_DOMAIN=omp:&lt;/P&gt;
&lt;P&gt;Fatal error in MPI_Init: Other MPI error, error stack:&lt;BR /&gt;MPIR_Init_thread(658)................:&lt;BR /&gt;MPID_Init(195).......................: channel initialization failed&lt;BR /&gt;MPIDI_CH3_Init(104)..................:&lt;BR /&gt;MPID_nem_tcp_post_init(345)..........:&lt;BR /&gt;MPID_nem_newtcp_module_connpoll(3102):&lt;BR /&gt;gen_read_fail_handler(1196)..........: read from socket failed - The specified network name is no longer available.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;What should I do to let the program use 100% CPU on every node?&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Zhanghong Tang&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2013 12:53:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983132#M3509</guid>
      <dc:creator>Laoya__Tang</dc:creator>
      <dc:date>2013-08-25T12:53:50Z</dc:date>
    </item>
    <item>
      <title>omp_num_threads=4</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983133#M3510</link>
      <description>&lt;P&gt;omp_num_threads=4&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2013 15:04:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983133#M3510</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2013-08-25T15:04:46Z</dc:date>
    </item>
    <item>
      <title>HI TimP,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983134#M3511</link>
      <description>&lt;P&gt;HI TimP,&lt;/P&gt;
&lt;P&gt;Thanks for your quick reply. I have already set omp_num_threads=24 on every node which means that the parallel part of the program can use 100% CPU resource.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Zhanghong Tang&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2013 15:07:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983134#M3511</guid>
      <dc:creator>Laoya__Tang</dc:creator>
      <dc:date>2013-08-25T15:07:48Z</dc:date>
    </item>
    <item>
      <title>HI TimP,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983135#M3512</link>
      <description>&lt;P&gt;HI TimP,&lt;/P&gt;
&lt;P&gt;I have tested on one node by&lt;/P&gt;
&lt;P&gt;mpiexec -n 5 test&lt;/P&gt;
&lt;P&gt;with and without setting I_MPI_PIN_DOMAIN=omp. If don't set that, the total CPU usage is only 80% and if set that, the total CPU usage is 100%. The program runs a little faster when setting I_MPI_PIN_DOMAIN=omp.&lt;/P&gt;
&lt;P&gt;Now the problem is that the mpiexec can't work for 10-node cluster after set I_MPI_PIN_DOMAIN=omp.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Zhanghong Tang&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2013 15:19:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/question-about-Hybrid-MPI-OpenMP/m-p/983135#M3512</guid>
      <dc:creator>Laoya__Tang</dc:creator>
      <dc:date>2013-08-25T15:19:08Z</dc:date>
    </item>
  </channel>
</rss>

