<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:fpe0 and mpi_init fails in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567962#M11507</link>
    <description>&lt;P&gt;&lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/337918"&gt;@dr_jfloyd&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Can you please try with:&lt;/P&gt;&lt;P&gt;export I_MPI_FABRICS=shm:ofi FI_PROVIDER=psm3 ?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I am still not able to reproduce this issue.&lt;/P&gt;&lt;P&gt;Can you please provide the output of lscpu?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Wed, 31 Jan 2024 16:06:54 GMT</pubDate>
    <dc:creator>TobiasK</dc:creator>
    <dc:date>2024-01-31T16:06:54Z</dc:date>
    <item>
      <title>fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567597#M11491</link>
      <description>&lt;P&gt;Cross-posting from the Fortran forum:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Have a large mpi application (cfd model) where we use fpe0 when running our verification suite in debug.&amp;nbsp;After updating to the current OneAPI compilers, using either ifort or ifx the following program fails at MPI_INIT if fpe0 is used as a compiler option.&amp;nbsp; This means no verificaiton cases can be run in debug, as they all require intializing MPI (the actual application uses MPI_INIT_THREAD but that also fails). The error message follows the source code.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;program test_mpi&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;use mpi_f08&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;implicit none&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;integer i, size, rank, namelen, ierr&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;character (len=MPI_MAX_PROCESSOR_NAME) :: name&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;type(mpi_status) :: stat&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;call MPI_INIT (ierr)&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;call MPI_COMM_SIZE (MPI_COMM_WORLD, size, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_COMM_RANK (MPI_COMM_WORLD, rank, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_GET_PROCESSOR_NAME (name, namelen, ierr)&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;if (rank.eq.0) then&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;print *, 'Hello world: rank ', rank, ' of ', size, ' running on ', name&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;do i = 1, size - 1&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_RECV (rank, 1, MPI_INTEGER, i, 1, MPI_COMM_WORLD, stat, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_RECV (size, 1, MPI_INTEGER, i, 1, MPI_COMM_WORLD, stat, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_RECV (namelen, 1, MPI_INTEGER, i, 1, MPI_COMM_WORLD, stat, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;name = ''&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_RECV (name, namelen, MPI_CHARACTER, i, 1, MPI_COMM_WORLD, stat, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;print *, 'Hello world: rank ', rank, ' of ', size, ' running on ', name&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;enddo&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;else&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;call MPI_SEND (rank, 1, MPI_INTEGER, 0, 1, MPI_COMM_WORLD, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_SEND (size, 1, MPI_INTEGER, 0, 1, MPI_COMM_WORLD, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_SEND (namelen, 1, MPI_INTEGER, 0, 1, MPI_COMM_WORLD, ierr)&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;call MPI_SEND (name, namelen, MPI_CHARACTER, 0, 1, MPI_COMM_WORLD, ierr)&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;endif&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;call MPI_FINALIZE (ierr)&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;end&lt;/SPAN&gt;&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;forrtl: error (65): floating invalid&lt;/P&gt;&lt;P class=""&gt;&lt;SPAN class=""&gt;Image PC Routine Line Source&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libc.so.6 000014D70BA54DB0 Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2E717E ucp_proto_perf_en Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2E881D ucp_proto_init_pa Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2EFDF7 ucp_proto_common_ Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2F15F5 ucp_proto_multi_i Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C316404 Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2F2D42 Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2F4561 ucp_proto_select_ Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2F4A25 ucp_proto_select_ Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2E9A89 ucp_worker_get_ep Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C33F39C ucp_wireup_init_l Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2D19CE ucp_ep_create_to_ Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libucp.so.0.0.0 000014D70C2D2B33 ucp_ep_create Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libmlx-fi.so 000014D709A08460 Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libmpi.so.12.0.0 000014D70CB7295E Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libmpi.so.12.0.0 000014D70C71C60A Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libmpi.so.12.0.0 000014D70C9E414E Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libmpi.so.12.0.0 000014D70C9E396B MPI_Init Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libmpifort.so.12. 000014D70E0B90A6 mpi_init_f08_ Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;a.out 00000000004052BF Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;a.out 000000000040521D Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libc.so.6 000014D70BA3FEB0 Unknown Unknown Unknown&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class=""&gt;libc.so.6 000014D70BA3FF60 __libc_start_main Unknown Unknown&lt;/SPAN&gt;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Tue, 30 Jan 2024 19:40:58 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567597#M11491</guid>
      <dc:creator>dr_jfloyd</dc:creator>
      <dc:date>2024-01-30T19:40:58Z</dc:date>
    </item>
    <item>
      <title>Re:fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567877#M11496</link>
      <description>&lt;P&gt;&lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/337918"&gt;@dr_jfloyd&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I tested your code and it compiles / runs fine.&lt;/P&gt;&lt;P&gt;Can you please post your full compilation/link line and the execution command?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best&lt;/P&gt;&lt;P&gt;Tobias&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 31 Jan 2024 11:52:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567877#M11496</guid>
      <dc:creator>TobiasK</dc:creator>
      <dc:date>2024-01-31T11:52:15Z</dc:date>
    </item>
    <item>
      <title>Re: fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567898#M11502</link>
      <description>&lt;P&gt;&lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/245425"&gt;@TobiasK&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for taking the time to look at this.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Compile:&lt;/P&gt;&lt;P&gt;% mpiifx -fpe0 test_mpi.f90&lt;/P&gt;&lt;P&gt;Run:&lt;/P&gt;&lt;P&gt;% mpiexec -n 2 ./a.out&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Compile:&lt;/P&gt;&lt;P&gt;% mpiifort -diag-disable=10448 -fpe0 test_mpi.f90&lt;/P&gt;&lt;P&gt;Run:&lt;/P&gt;&lt;P&gt;% mpiexec -n 2 ./a.out&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Without -fpe0 these run.&amp;nbsp; Our system is running RHEL 9.3.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Jason&lt;/P&gt;</description>
      <pubDate>Wed, 31 Jan 2024 13:12:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567898#M11502</guid>
      <dc:creator>dr_jfloyd</dc:creator>
      <dc:date>2024-01-31T13:12:55Z</dc:date>
    </item>
    <item>
      <title>Re:fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567912#M11503</link>
      <description>&lt;P&gt;&lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/337918"&gt;@dr_jfloyd&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Can you please run with:&lt;/P&gt;&lt;P&gt;export I_MPI_DEBUG=10&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;1)&lt;/P&gt;&lt;P&gt;export I_MPI_FABRICS=shm&lt;/P&gt;&lt;P&gt;If that works, please also run with&lt;/P&gt;&lt;P&gt;2)&lt;/P&gt;&lt;P&gt;export I_MPI_FABRICS=shm:ofi FI_PROVIDER=psm3&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 31 Jan 2024 14:11:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567912#M11503</guid>
      <dc:creator>TobiasK</dc:creator>
      <dc:date>2024-01-31T14:11:06Z</dc:date>
    </item>
    <item>
      <title>Re: fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567941#M11505</link>
      <description>&lt;P&gt;Both worked with the test program.&lt;/P&gt;&lt;P&gt;Using the full application running on the head node both worked.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Trying the full application on the compute nodes (InfiniBand):&lt;/P&gt;&lt;P&gt;-Option 1)&lt;/P&gt;&lt;P&gt;--worked as long as I kept the core count to one node.&amp;nbsp;&lt;/P&gt;&lt;P&gt;--did not work over multiple nodes and gives the following (line 89 is the call to MPI_INIT_THREAD)&lt;/P&gt;&lt;P&gt;forrtl: severe (71): integer divide by zero&lt;BR /&gt;Image PC Routine Line Source&lt;BR /&gt;libc.so.6 00001471E4454DB0 Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4DDCADD Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4B14A64 Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4ABBB45 Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4AB9F7A Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4C490E1 Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4C46C2B Unknown Unknown Unknown&lt;BR /&gt;libmpi.so.12.0.0 00001471E4C4ACD5 PMPI_Init_thread Unknown Unknown&lt;BR /&gt;libmpifort.so.12. 00001471E6258110 mpi_init_thread_f Unknown Unknown&lt;BR /&gt;fds_impi_intel_li 0000000004179918 MAIN__ 89 main.f90&lt;BR /&gt;fds_impi_intel_li 0000000000407A1D Unknown Unknown Unknown&lt;BR /&gt;libc.so.6 00001471E443FEB0 Unknown Unknown Unknown&lt;BR /&gt;libc.so.6 00001471E443FF60 __libc_start_main Unknown Unknown&lt;BR /&gt;fds_impi_intel_li 0000000000407935 Unknown Unknown Unknown&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-Option 2) did not work either way&lt;/P&gt;&lt;P&gt;Error is below. The underlined lines repeated mulitple times.&lt;/P&gt;&lt;P&gt;Abort(1614479) on node 0 (rank 0 in comm 0): Fatal error in PMPI_Init_thread: Unknown error class, error stack:&lt;BR /&gt;&lt;U&gt;MPIR_Init_thread(189)........: &lt;/U&gt;&lt;BR /&gt;&lt;U&gt;MPID_Init(1561)..............: &lt;/U&gt;&lt;BR /&gt;&lt;U&gt;MPIDI_OFI_mpi_init_hook(1624): &lt;/U&gt;&lt;BR /&gt;&lt;U&gt;create_vni_context(2221).....: OFI endpoint open failed (ofi_init.c:2221:create_vni_context:Invalid argument)&lt;/U&gt;&lt;BR /&gt;&lt;U&gt;Abort(1614479) on node 0 (rank 0 in comm 0): Fatal error in PMPI_Init_thread: Unknown error class, error stack:&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 31 Jan 2024 15:14:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567941#M11505</guid>
      <dc:creator>dr_jfloyd</dc:creator>
      <dc:date>2024-01-31T15:14:16Z</dc:date>
    </item>
    <item>
      <title>Re:fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567962#M11507</link>
      <description>&lt;P&gt;&lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/337918"&gt;@dr_jfloyd&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Can you please try with:&lt;/P&gt;&lt;P&gt;export I_MPI_FABRICS=shm:ofi FI_PROVIDER=psm3 ?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I am still not able to reproduce this issue.&lt;/P&gt;&lt;P&gt;Can you please provide the output of lscpu?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 31 Jan 2024 16:06:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567962#M11507</guid>
      <dc:creator>TobiasK</dc:creator>
      <dc:date>2024-01-31T16:06:54Z</dc:date>
    </item>
    <item>
      <title>Re: fpe0 and mpi_init fails</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567990#M11508</link>
      <description>&lt;P&gt;I did try "&lt;SPAN&gt;export I_MPI_FABRICS=shm:ofi FI_PROVIDER=psm3".&amp;nbsp; That was the option 2) result in my prior post.&amp;nbsp; &amp;nbsp;But since had some success with just the shm, I felt you had pointed me in the right direction.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;I started playing with various options and&amp;nbsp; just now had success with both:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;export I_MPI_FABRICS=shm:ofi FI_PROVIDER=verbs&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;and&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;export I_MPI_FABRICS=shm:ofa FI_PROVIDER=verbs&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks for all the help!&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 31 Jan 2024 17:06:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/fpe0-and-mpi-init-fails/m-p/1567990#M11508</guid>
      <dc:creator>dr_jfloyd</dc:creator>
      <dc:date>2024-01-31T17:06:54Z</dc:date>
    </item>
  </channel>
</rss>

