<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic root caused why libiomp5.so in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144563#M5968</link>
    <description>&lt;P&gt;root caused why libiomp5.so got much hotter,&lt;/P&gt;

&lt;P style="margin: 0in 0in 0pt;"&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;,&amp;nbsp; set command as above, tring to make MPI communication thread pin on core 67 and openmp threads pin on core 0-core66, Vtune shows that MPI communication did pined on core 67 and OPenmp has 67 threads, but OMP_thread66 pined on core 67, so it lag the whole performance, making libiomp,so has lots of spin time.But I still didn’t figure out how to making it work correctly… &lt;/FONT&gt;&lt;/P&gt;

&lt;P style="margin: 0in 0in 0pt;"&gt;&amp;nbsp;&lt;/P&gt;

&lt;P style="margin: 0in 0in 0pt;"&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;​any idea? thanks&lt;/FONT&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 12 Jul 2017 01:18:55 GMT</pubDate>
    <dc:creator>Zhoulong_J_Intel</dc:creator>
    <dc:date>2017-07-12T01:18:55Z</dc:date>
    <item>
      <title>openmp application performance dropped with I_MPI_ASYNC_PROGRESS=enable</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144562#M5967</link>
      <description>&lt;P style="font-size: 13.008px;"&gt;Hi,&lt;/P&gt;

&lt;P style="font-size: 13.008px;"&gt;I tried MPI/openmp process pining, it seems that When I use non-blocking api(Iallreduce) and specific I_MPI_ASYNC_PROGRESS like the following command, it I set&amp;nbsp;&lt;SPAN style="font-size: 13.008px;"&gt;I_MPI_ASYNC_PROGRESS=enable, then application will spent much more time on libiomp.so(kmp_hyper_barrier_release), and vmlinux also got a little hotter, compare with (I_MPI_ASYNC_PROGRESS=disable), is there any issue with my configuration? I use vtune and it shows that all the cores are pin in the right cores. the only difference is core 67 is used by MPI communication thread.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="font-size: 13.008px;"&gt;========command=================&lt;/P&gt;

&lt;P style="font-size: 13.008px;"&gt;mpirun &amp;nbsp; &amp;nbsp;-n 2 -ppn 1 &amp;nbsp; &amp;nbsp;-genv OMP_PROC_BIND=true -genv &amp;nbsp;I_MPI_ASYNC_PROGRESS= -genv I_MPI_ASYNC_PROGRESS_PIN=67 -genv I_MPI_PIN_PROCS=0-66 -genv OMP_NUM_THREADS=67 &amp;nbsp;-genv I_MPI_PIN_DOMAIN=sock -genv I_MPI_FABRICS=ofi -f ./hostfile &amp;nbsp; python train_imagenet_cpu.py &amp;nbsp;--arch alex --batchsize 256 --loaderjob 68 &amp;nbsp;--epoch 100 --train_root /home/jiangzho/imagenet/ILSVRC2012_img_train --val_root /home/jiangzho/imagenet/ILSVRC2012_img_val --communicator naive /home/jiangzho/train.txt /home/jiangzho/val.txt&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jul 2017 11:19:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144562#M5967</guid>
      <dc:creator>Zhoulong_J_Intel</dc:creator>
      <dc:date>2017-07-11T11:19:03Z</dc:date>
    </item>
    <item>
      <title>root caused why libiomp5.so</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144563#M5968</link>
      <description>&lt;P&gt;root caused why libiomp5.so got much hotter,&lt;/P&gt;

&lt;P style="margin: 0in 0in 0pt;"&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;,&amp;nbsp; set command as above, tring to make MPI communication thread pin on core 67 and openmp threads pin on core 0-core66, Vtune shows that MPI communication did pined on core 67 and OPenmp has 67 threads, but OMP_thread66 pined on core 67, so it lag the whole performance, making libiomp,so has lots of spin time.But I still didn’t figure out how to making it work correctly… &lt;/FONT&gt;&lt;/P&gt;

&lt;P style="margin: 0in 0in 0pt;"&gt;&amp;nbsp;&lt;/P&gt;

&lt;P style="margin: 0in 0in 0pt;"&gt;&lt;FONT color="#000000" face="Calibri" size="3"&gt;​any idea? thanks&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jul 2017 01:18:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144563#M5968</guid>
      <dc:creator>Zhoulong_J_Intel</dc:creator>
      <dc:date>2017-07-12T01:18:55Z</dc:date>
    </item>
    <item>
      <title>root caused, thanks</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144564#M5969</link>
      <description>&lt;P&gt;root caused, thanks&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jul 2017 05:58:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/openmp-application-performance-dropped-with-I-MPI-ASYNC-PROGRESS/m-p/1144564#M5969</guid>
      <dc:creator>Zhoulong_J_Intel</dc:creator>
      <dc:date>2017-07-12T05:58:06Z</dc:date>
    </item>
  </channel>
</rss>

