<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Intel MPI NTERNAL ERROR: invalid error code (Ring Index out of range) MPIDI_NM_mpi_allgather in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1394240#M9617</link>
    <description>&lt;P&gt;Thank you for the quick reply.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1.&amp;nbsp; The OS and CPU are:&amp;nbsp;&amp;nbsp;Red Hat Enterprise Linux release 8 running on&amp;nbsp;AMD EPYC 7713 64-Core Processor (dual) compute nodes&lt;/P&gt;
&lt;P&gt;2.&amp;nbsp; With 32 nodes (4096 tasks), the code succeeds.&amp;nbsp; With 33 nodes (4224 tasks), the code generates the errors listed in my original report, above.&lt;/P&gt;
&lt;P&gt;3.&amp;nbsp; The OFI provider is mlx, as shown in the output below from a run with I_MPI_DEBUG=4:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;[0] MPI startup(): Intel(R) MPI Library, Version 2021.5  Build 20211102 (id: 9279b7d62)
[0] MPI startup(): Copyright (C) 2003-2021 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): libfabric version: 1.13.2rc1-impi
[0] MPI startup(): libfabric provider: mlx
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;Thank you!&lt;/P&gt;
&lt;P&gt;John&lt;/P&gt;</description>
    <pubDate>Tue, 21 Jun 2022 17:43:26 GMT</pubDate>
    <dc:creator>John_Michalakes</dc:creator>
    <dc:date>2022-06-21T17:43:26Z</dc:date>
    <item>
      <title>Intel MPI NTERNAL ERROR: invalid error code (Ring Index out of range) MPIDI_NM_mpi_allgather</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1393470#M9602</link>
      <description>&lt;P&gt;We are seeing the following error on our cluster running Intel MPI, oneAPI version 2021.5.1:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;INTERNAL ERROR: invalid error code ffffffff (Ring Index out of range) in MPIDI_NM_mpi_allgather:202&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The error occurs on every task when running on more than 4096 tasks.&lt;/P&gt;
&lt;P&gt;I am attaching a session log which includes the listing for a short (43 line) reproducer program.&lt;/P&gt;
&lt;P&gt;Please let me know if you have questions or need additional information.&lt;/P&gt;
&lt;P&gt;Thank you,&lt;/P&gt;
&lt;P&gt;John Michalakes, UCAR&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2022 18:52:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1393470#M9602</guid>
      <dc:creator>John_Michalakes</dc:creator>
      <dc:date>2022-06-17T18:52:40Z</dc:date>
    </item>
    <item>
      <title>Re: Intel MPI NTERNAL ERROR: invalid error code (Ring Index out of range) MPIDI_NM_mpi_allgather</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1393814#M9606</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you for posting in Intel Communities.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could you please provide the following details so that we can investigate your issue further?&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Operating System &amp;amp; CPU details.&lt;/LI&gt;
&lt;LI&gt;The number of nodes you are using to launch the MPI job.&lt;/LI&gt;
&lt;LI&gt;The &lt;A href="https://www.intel.com/content/www/us/en/develop/documentation/mpi-developer-guide-linux/top/running-applications/fabrics-control/ofi-providers-support.html" target="_blank"&gt;OFI provider&lt;/A&gt; (tcp, mlx, psm2, etc.) you are using.&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Santosh&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2022 09:27:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1393814#M9606</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-06-20T09:27:45Z</dc:date>
    </item>
    <item>
      <title>Re: Intel MPI NTERNAL ERROR: invalid error code (Ring Index out of range) MPIDI_NM_mpi_allgather</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1394240#M9617</link>
      <description>&lt;P&gt;Thank you for the quick reply.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1.&amp;nbsp; The OS and CPU are:&amp;nbsp;&amp;nbsp;Red Hat Enterprise Linux release 8 running on&amp;nbsp;AMD EPYC 7713 64-Core Processor (dual) compute nodes&lt;/P&gt;
&lt;P&gt;2.&amp;nbsp; With 32 nodes (4096 tasks), the code succeeds.&amp;nbsp; With 33 nodes (4224 tasks), the code generates the errors listed in my original report, above.&lt;/P&gt;
&lt;P&gt;3.&amp;nbsp; The OFI provider is mlx, as shown in the output below from a run with I_MPI_DEBUG=4:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;[0] MPI startup(): Intel(R) MPI Library, Version 2021.5  Build 20211102 (id: 9279b7d62)
[0] MPI startup(): Copyright (C) 2003-2021 Intel Corporation.  All rights reserved.
[0] MPI startup(): library kind: release
[0] MPI startup(): libfabric version: 1.13.2rc1-impi
[0] MPI startup(): libfabric provider: mlx
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;Thank you!&lt;/P&gt;
&lt;P&gt;John&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jun 2022 17:43:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1394240#M9617</guid>
      <dc:creator>John_Michalakes</dc:creator>
      <dc:date>2022-06-21T17:43:26Z</dc:date>
    </item>
    <item>
      <title>Re:Intel MPI NTERNAL ERROR: invalid error code (Ring Index out of range) MPIDI_NM_mpi_allgather</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1403176#M9719</link>
      <description>&lt;P&gt;Hi John,&lt;/P&gt;
&lt;P&gt;Thank you for your inquiry.&amp;nbsp;We offer support for hardware platforms that the Intel® oneAPI product supports.&amp;nbsp;These platforms include those that are part of the Intel® Core™ processor family or higher, the Intel® Xeon® processor family, the Intel® Xeon® Scalable processor family, and others, which are listed here: &lt;A href="https://software.intel.com/content/www/us/en/develop/articles/intel-oneapi-base-toolkit-system-requirements.html" rel="noopener noreferrer" target="_blank"&gt;Intel® oneAPI Base Toolkit System Requirements&lt;/A&gt;, &lt;A href="https://software.intel.com/content/www/us/en/develop/articles/intel-oneapi-hpc-toolkit-system-requirements.html" rel="noopener noreferrer" target="_blank"&gt;Intel® oneAPI HPC Toolkit System Requirements&lt;/A&gt;, &lt;A href="https://software.intel.com/content/www/us/en/develop/articles/intel-oneapi-iot-toolkit-system-requirements.html" rel="noopener noreferrer" target="_blank"&gt;Intel® oneAPI IoT Toolkit System Requirements&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;If you wish to use oneAPI on hardware that is not listed at one of the sites above, we encourage you to visit and contribute to the open oneAPI specification: &lt;A href="https://www.oneapi.io/spec/" rel="noopener noreferrer" target="_blank"&gt;https://www.oneapi.io/spec/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Best regards,&lt;/P&gt;
&lt;P&gt;Jyotsna&lt;/P&gt;</description>
      <pubDate>Mon, 25 Jul 2022 16:30:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1403176#M9719</guid>
      <dc:creator>JyotsnaK_Intel</dc:creator>
      <dc:date>2022-07-25T16:30:48Z</dc:date>
    </item>
    <item>
      <title>Re:Intel MPI NTERNAL ERROR: invalid error code (Ring Index out of range) MPIDI_NM_mpi_allgather</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1405029#M9733</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are closing this issue. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 01 Aug 2022 06:04:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-MPI-NTERNAL-ERROR-invalid-error-code-Ring-Index-out-of/m-p/1405029#M9733</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-08-01T06:04:27Z</dc:date>
    </item>
  </channel>
</rss>

