<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:Time-consuming issue at MPI startup in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273490#M8088</link>
    <description>&lt;P&gt;Hii,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are working on it and will get back to you soon.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards&lt;/P&gt;&lt;P&gt;Shivani&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Wed, 14 Apr 2021 11:46:40 GMT</pubDate>
    <dc:creator>ShivaniK_Intel</dc:creator>
    <dc:date>2021-04-14T11:46:40Z</dc:date>
    <item>
      <title>Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273221#M8076</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hello All,&lt;/P&gt;
&lt;P&gt;There is a problem that takes a lot of time during MPI startup, so I ask you a question.&lt;BR /&gt;The section that takes time is: library kind -&amp;gt; libfabric version -&amp;gt; libfabric provider -&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;(which takes the most)&lt;/STRONG&gt;&lt;/FONT&gt;&amp;gt; load tuning file.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;Tue Apr 13 13:50:08 UTC 2021&lt;BR /&gt;[0] MPI startup(): Intel(R) MPI Library, Version 2021.2 Build 20210302 (id: f4f7c92cd)&lt;BR /&gt;[0] MPI startup(): Copyright (C) 2003-2021 Intel Corporation. All rights reserved.&lt;BR /&gt;&lt;STRONG&gt;[0] MPI startup(): library kind: release&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;[0] MPI startup(): libfabric version: 1.11.0-impi&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;[0] MPI startup(): libfabric provider: mlx&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;[0] MPI startup(): Load tuning file: "/opt/local/mpi/2021.2.0/etc/tuning_icx_shm-ofi_mlx.dat"&lt;/STRONG&gt;&lt;BR /&gt;[0] MPI startup(): Rank Pid Node name Pin cpu&lt;BR /&gt;[0] MPI startup(): 0 276153 ****0721.maru 0&lt;BR /&gt;[0] MPI startup(): 1 276154 ****0721.maru 1&lt;BR /&gt;[0] MPI startup(): 2 276155 ****0721.maru 2&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The executed executable file is IMB-MPI1 and the execution script is as follows.&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;export I_MPI_HYDRA_PMI_CONNECT=alltoall&lt;BR /&gt;export I_MPI_DEBUG=5&lt;BR /&gt;export I_MPI_FABRICS=shm:ofi&lt;BR /&gt;export I_MPI_PIN=1&lt;BR /&gt;export I_MPI_PIN_PROCESSOR_LIST=0-75&lt;BR /&gt;export FI_PROVIDER=mlx&lt;BR /&gt;export UCX_TLS=rc,dc_mlx5,sm,self&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;{time mpiexec.hydra -genvall -f ./hostlist -n 33972 -ppn 76 IMB-MPI1 Bcast Allreduce -npmin 33972; } &amp;gt;&amp;gt; ${OUTFILE} 2&amp;gt;&amp;amp;1&lt;BR /&gt;#{ time mpiexec.hydra -genvall -f ./hostlist -n 67944 -ppn 76 IMB-MPI1 Bcast Allreduce -npmin 67944; } &amp;gt;&amp;gt; ${OUTFILE} 2&amp;gt;&amp;amp;1&lt;BR /&gt;#{ time mpiexec.hydra -genvall -f ./hostlist -n 131 328 -ppn 76 IMB-MPI1 Bcast Allreduce -npmin 67944; } &amp;gt;&amp;gt; ${OUTFILE} 2&amp;gt;&amp;amp;1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I testd 3 cases of mpirank (33,972 ranks, 67,944 ranks, 131,328 ranks), and it took about 33 seconds, 79 seconds and 131 seconds respectively. Startup takes a large part of the overall execution time, so please give us your opinion on what work to do to reduce it.&lt;/P&gt;
&lt;P&gt;*Intel MPI version is 2021.2.0, UCX is 1.10.0 &amp;amp; MOFED 5.2-1.0.4.0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks, Kihang&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 13 Apr 2021 14:24:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273221#M8076</guid>
      <dc:creator>youn__kihang</dc:creator>
      <dc:date>2021-04-13T14:24:29Z</dc:date>
    </item>
    <item>
      <title>Re: Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273389#M8083</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For information,&lt;/P&gt;
&lt;P&gt;I can reproduce with MPI_INIT function only.&lt;BR /&gt;When I use 76,000 mpi rank, it take 68~72 seconds in MPI startup(MPI_INIT).&lt;BR /&gt;Here is more detailed log(I_MPI_DEBUG=1000).&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;[0] MPI startup(): libfabric version: 1.11.0-impi&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_CUDA not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ROCR not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ZE not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: ofi_rxm (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_CUDA not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ROCR not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ZE not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: verbs (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "verbs" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: tcp (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "tcp" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: mlx (1.4)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: shm (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "shm" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: sockets (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "sockets" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: ofi_hook_noop (111.0)&lt;BR /&gt;libfabric:361875:core:core:fi_getinfo_():1117&amp;lt;info&amp;gt; Found provider with the highest priority mlx, must_use_util_prov = 0&lt;BR /&gt;libfabric:361875:core:core:fi_getinfo_():1117&amp;lt;info&amp;gt; Found provider with the highest priority mlx, must_use_util_prov = 0&lt;BR /&gt;[0] MPI startup(): libfabric provider: mlx&lt;BR /&gt;libfabric:361875:core:core:fi_fabric_():1406&amp;lt;info&amp;gt; Opened fabric: mlx&lt;BR /&gt;[0] MPI startup(): max_ch4_vcis: 1, max_reg_eps 1, enable_sep 0, enable_shared_ctxs 0, do_av_insert 1&lt;BR /&gt;[0] MPI startup(): addrnamelen: 1024&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;&amp;lt;Here is most time-consuming part&amp;gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;BR /&gt;[3323] MPI startup(): selected platform: icx&lt;BR /&gt;[2667] MPI startup(): selected platform: icx&lt;BR /&gt;[3561] MPI startup(): selected platform: icx&lt;BR /&gt;[2743] MPI startup(): selected platform: icx&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please let me know if there are any suggestions.&lt;/P&gt;</description>
      <pubDate>Wed, 14 Apr 2021 02:47:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273389#M8083</guid>
      <dc:creator>youn__kihang</dc:creator>
      <dc:date>2021-04-14T02:47:38Z</dc:date>
    </item>
    <item>
      <title>Re: Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273390#M8084</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For information,&lt;/P&gt;
&lt;P&gt;I can reproduce with MPI_INIT function only.&lt;BR /&gt;When I use 76,000 mpi rank, it take 68~72 seconds in MPI startup(MPI_INIT).&lt;BR /&gt;Here is more detailed log(I_MPI_DEBUG=1000).&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;[0] MPI startup(): libfabric version: 1.11.0-impi&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_CUDA not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ROCR not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ZE not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: ofi_rxm (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_CUDA not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ROCR not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_hmem_init():202&amp;lt;info&amp;gt; Hmem iface FI_HMEM_ZE not supported&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: verbs (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "verbs" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: tcp (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "tcp" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: mlx (1.4)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: shm (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "shm" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: sockets (111.0)&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():455&amp;lt;info&amp;gt; "sockets" filtered by provider include/exclude list, skipping&lt;BR /&gt;libfabric:361875:core:core:ofi_register_provider():427&amp;lt;info&amp;gt; registering provider: ofi_hook_noop (111.0)&lt;BR /&gt;libfabric:361875:core:core:fi_getinfo_():1117&amp;lt;info&amp;gt; Found provider with the highest priority mlx, must_use_util_prov = 0&lt;BR /&gt;libfabric:361875:core:core:fi_getinfo_():1117&amp;lt;info&amp;gt; Found provider with the highest priority mlx, must_use_util_prov = 0&lt;BR /&gt;[0] MPI startup(): libfabric provider: mlx&lt;BR /&gt;libfabric:361875:core:core:fi_fabric_():1406&amp;lt;info&amp;gt; Opened fabric: mlx&lt;BR /&gt;[0] MPI startup(): max_ch4_vcis: 1, max_reg_eps 1, enable_sep 0, enable_shared_ctxs 0, do_av_insert 1&lt;BR /&gt;[0] MPI startup(): addrnamelen: 1024&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;&amp;lt;Here is most time-consuming part&amp;gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;BR /&gt;[3323] MPI startup(): selected platform: icx&lt;BR /&gt;[2667] MPI startup(): selected platform: icx&lt;BR /&gt;[3561] MPI startup(): selected platform: icx&lt;BR /&gt;[2743] MPI startup(): selected platform: icx&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please let me know if there are any suggestions.&lt;BR /&gt;Thanks in advance, Kihang&lt;/P&gt;</description>
      <pubDate>Wed, 14 Apr 2021 02:48:43 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273390#M8084</guid>
      <dc:creator>youn__kihang</dc:creator>
      <dc:date>2021-04-14T02:48:43Z</dc:date>
    </item>
    <item>
      <title>Re:Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273490#M8088</link>
      <description>&lt;P&gt;Hii,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are working on it and will get back to you soon.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards&lt;/P&gt;&lt;P&gt;Shivani&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 14 Apr 2021 11:46:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1273490#M8088</guid>
      <dc:creator>ShivaniK_Intel</dc:creator>
      <dc:date>2021-04-14T11:46:40Z</dc:date>
    </item>
    <item>
      <title>Re:Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1274661#M8117</link>
      <description>&lt;P&gt;Hi Kihang,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I was sending a request to the IMPI architect for better startup parameters.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;My first idea would be:&lt;/P&gt;&lt;P&gt;Do you have a file system that is faster than /opt/kma_local/mpi/2021.2.0/etc/ ?&lt;/P&gt;&lt;P&gt;Maybe it helps to read the tuning file from another file system. You may try to use variables from the autotuner to provide a different location for the tuning file:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;$ export I_MPI_TUNING_BIN=&amp;lt;tuning-results.dat&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;see:&lt;/P&gt;&lt;P&gt;&lt;A href="https://software.intel.com/content/www/us/en/develop/documentation/mpi-developer-reference-linux/top/environment-variable-reference/tuning-environment-variables/autotuning.html" target="_blank"&gt;https://software.intel.com/content/www/us/en/develop/documentation/mpi-developer-reference-linux/top/environment-variable-reference/tuning-environment-variables/autotuning.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;best regards,&lt;/P&gt;&lt;P&gt;Heinrich&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 19 Apr 2021 09:19:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1274661#M8117</guid>
      <dc:creator>Heinrich_B_Intel</dc:creator>
      <dc:date>2021-04-19T09:19:10Z</dc:date>
    </item>
    <item>
      <title>Re:Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1274669#M8118</link>
      <description>&lt;P&gt;Reply from our architect:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;I would recommend to try UD (you may simply remove UCX_TLS) as a way to improve startup time&lt;/LI&gt;&lt;LI&gt;Could you please ask them to clarify the way they come to conclusions that it is the tuning file reading?&lt;/LI&gt;&lt;LI&gt;Please ask them to try:&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;export I_MPI_STARTUP_MODE=pmi_shm_netmod&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;best regards,&lt;/P&gt;&lt;P&gt;Heinrich&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 19 Apr 2021 10:08:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1274669#M8118</guid>
      <dc:creator>Heinrich_B_Intel</dc:creator>
      <dc:date>2021-04-19T10:08:21Z</dc:date>
    </item>
    <item>
      <title>Re: Re:Time-consuming issue at MPI startup</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1274716#M8119</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hi Heinrich,&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;The option "I_MPI_STARTUP_MODE=pmi_shm_netmod" you recommend is works!&lt;/P&gt;
&lt;P&gt;Could you explain the pmi_shm_netmod means? or Is there any manual about that?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks, Kihang&lt;/P&gt;</description>
      <pubDate>Mon, 19 Apr 2021 13:07:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Time-consuming-issue-at-MPI-startup/m-p/1274716#M8119</guid>
      <dc:creator>youn__kihang</dc:creator>
      <dc:date>2021-04-19T13:07:12Z</dc:date>
    </item>
  </channel>
</rss>

