<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Intel MPI multinode jobs running problems in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1555013#M11294</link>
    <description>&lt;P&gt;&lt;FONT size="4"&gt;Hi,&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Thanks for posting in Intel communities!&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Can you please try setting up the following Environment variable:&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;export I_MPI_HYDRA_IFACE="ib0"&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;After implementing this change, kindly execute the process in a multinode environment and share the results with us.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Additionally, we kindly request the following details for further investigation:&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Reproducer code.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Recreation steps.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Interconnect hardware details.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;FI_PROVIDER information.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Logs generated after running the Intel® MPI Benchmark (IMB) with the same number of nodes as in the case where the issue occurred.&lt;/FONT&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;We thank you in advance for your cooperation.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Regards,&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Veena&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 18 Dec 2023 10:46:45 GMT</pubDate>
    <dc:creator>VeenaJ_Intel</dc:creator>
    <dc:date>2023-12-18T10:46:45Z</dc:date>
    <item>
      <title>Intel MPI multinode jobs running problems</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1553858#M11273</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I have a four-node cluster connected by infiniband switch. I have a common NFS home directory for each node. The startup sequence for each node (bashrc, etc.) is thus identical. Let's call the nodes, node1, node2, and node3. I have disabled the firewalls on each server (which are running OS version of Rocky 8.7).&lt;BR /&gt;We have a PBS PRO (&lt;SPAN&gt;pbs_version = 2022.1.4.20231010124201&lt;BR /&gt;&lt;/SPAN&gt;) queuing system and when we run jobs on a single node everything works fine, but when we try to run jobs in multinode mode we get the following errors:&lt;BR /&gt;check_exit_codes (../../../../../src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:117): unable to run bstrap_proxy on node1&lt;BR /&gt;poll_for_event (../../../../../src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:159): check exit codes error&lt;BR /&gt;HYD_dmx_poll_wait_for_proxy_event (../../../../../src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:212): poll for event error&lt;BR /&gt;HYD_bstrap_setup (../../../../../src/pm/i_hydra/libhydra/bstrap/src/intel/i_hydra_bstrap.c:1065): error waiting for event&lt;BR /&gt;HYD_print_bstrap_setup_error_message (../../../../../src/pm/i_hydra/mpiexec/intel/i_mpiexec.c:1026): error setting up the bootstrap proxies&lt;/P&gt;&lt;P&gt;Also in cluster&amp;nbsp;&lt;SPAN&gt;Intel® oneAPI HPC Toolkit 2023 installed.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I would be happy to provide additional information if needed.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Dec 2023 08:13:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1553858#M11273</guid>
      <dc:creator>Jenya</dc:creator>
      <dc:date>2023-12-14T08:13:40Z</dc:date>
    </item>
    <item>
      <title>Re: Intel MPI multinode jobs running problems</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1555013#M11294</link>
      <description>&lt;P&gt;&lt;FONT size="4"&gt;Hi,&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Thanks for posting in Intel communities!&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Can you please try setting up the following Environment variable:&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;export I_MPI_HYDRA_IFACE="ib0"&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;After implementing this change, kindly execute the process in a multinode environment and share the results with us.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Additionally, we kindly request the following details for further investigation:&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Reproducer code.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Recreation steps.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Interconnect hardware details.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;FI_PROVIDER information.&lt;/FONT&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;FONT size="4"&gt;Logs generated after running the Intel® MPI Benchmark (IMB) with the same number of nodes as in the case where the issue occurred.&lt;/FONT&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;We thank you in advance for your cooperation.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Regards,&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="4"&gt;Veena&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Dec 2023 10:46:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1555013#M11294</guid>
      <dc:creator>VeenaJ_Intel</dc:creator>
      <dc:date>2023-12-18T10:46:45Z</dc:date>
    </item>
    <item>
      <title>Re: Intel MPI multinode jobs running problems</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1555815#M11305</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;SPAN&gt;Veena,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;We are going to check what you wrote earlier and provide you with the results.&lt;/P&gt;&lt;P&gt;BR,&lt;/P&gt;&lt;P&gt;Jenya.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Dec 2023 07:58:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1555815#M11305</guid>
      <dc:creator>Jenya</dc:creator>
      <dc:date>2023-12-20T07:58:47Z</dc:date>
    </item>
    <item>
      <title>Re:Intel MPI multinode jobs running problems</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1564918#M11444</link>
      <description>&lt;P&gt;&lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/328828"&gt;@Jenya&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;in case you still have problems, please open a new thread.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 22 Jan 2024 09:13:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-multinode-jobs-running-problems/m-p/1564918#M11444</guid>
      <dc:creator>TobiasK</dc:creator>
      <dc:date>2024-01-22T09:13:17Z</dc:date>
    </item>
  </channel>
</rss>

