<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:Segfault testing parallel HDF5 with Intel MPI in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1390155#M9561</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We assume that your issue is resolved. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Mon, 06 Jun 2022 06:25:44 GMT</pubDate>
    <dc:creator>SantoshY_Intel</dc:creator>
    <dc:date>2022-06-06T06:25:44Z</dc:date>
    <item>
      <title>Segfault testing parallel HDF5 with Intel MPI</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1344762#M9027</link>
      <description>&lt;P&gt;I get a segmentation fault when running the test suite for parallel HDF5 (1.10.8) when compiling with oneAPI 2021 update 4. The tests pass OK with the Gnu compiler (4.8.5 or 9.1), with an older Intel compiler (2019 update 5 with 2019 update 8 MPI), or when using OpenMPI 4.1.2 compiled with the oneAPI compiler. The system specs are&lt;/P&gt;
&lt;P&gt;% cat /etc/redhat-release&lt;BR /&gt;Red Hat Enterprise Linux Server release 7.9 (Maipo)&lt;/P&gt;
&lt;P&gt;% ucx_info -v&lt;BR /&gt;# UCT version=1.8.1 revision 6b29558&lt;BR /&gt;# configured with: --build=x86_64-redhat-linux-gnu --host=x86_64-redhat-linux-gnu --program-prefix= --disable-dependency-tracking --prefix=/usr --exec-prefix=/usr --bindir=/usr/bin --sbindir=/usr/sbin --sysconfdir=/etc --datadir=/usr/share --includedir=/usr/include --libdir=/usr/lib64 --libexecdir=/usr/libexec --localstatedir=/var --sharedstatedir=/var/lib --mandir=/usr/share/man --infodir=/usr/share/info --disable-optimizations --disable-logging --disable-debug --disable-assertions --disable-params-check --enable-examples --without-java --enable-cma --without-cuda --without-gdrcopy --with-verbs --without-cm --without-knem --with-rdmacm --without-rocm --without-xpmem --without-ugni&lt;/P&gt;
&lt;P&gt;From the hdf5 directory, the steps to produce the error are:&lt;/P&gt;
&lt;P&gt;setenv CC icc&lt;BR /&gt;setenv CXX icpc&lt;BR /&gt;setenv FC ifort&lt;BR /&gt;setenv F77 ifort&lt;BR /&gt;setenv F90 ifort&lt;BR /&gt;setenv CFLAGS "-O3 -fp-model precise"&lt;BR /&gt;setenv CXXFLAGS "-O3 -fp-model precise"&lt;BR /&gt;setenv FFLAGS "-O3 -fp-model precise"&lt;BR /&gt;setenv FCFLAGS "-O3 -fp-model precise"&lt;BR /&gt;setenv RUNPARALLEL "mpirun -np 4"&lt;/P&gt;
&lt;P&gt;./configure --prefix=/usr/local/apps/phdf5-1.10.8/intel-21.4 --enable-fortran --enable-shared --enable-parallel --with-pic CC=mpiicc FC=mpiifort&lt;/P&gt;
&lt;P&gt;make &amp;gt;&amp;amp; make.pintelmpi.log&lt;BR /&gt;make check &amp;gt;&amp;amp; make.pintelmpi.check&lt;/P&gt;
&lt;P&gt;All is fine until the test for t_bigio. It can be run standalone from the testpar directory:&lt;/P&gt;
&lt;P&gt;% mpirun -np 4 ./t_bigio&lt;/P&gt;
&lt;P&gt;Testing Dataset1 write by ROW&lt;/P&gt;
&lt;P&gt;Testing Dataset2 write by COL&lt;/P&gt;
&lt;P&gt;Testing Dataset3 write select ALL proc 0, NONE others&lt;/P&gt;
&lt;P&gt;Testing Dataset4 write point selection&lt;/P&gt;
&lt;P&gt;Read Testing Dataset1 by COL&lt;BR /&gt;[atmos5:24597:0:24597] Caught signal 11 (Segmentation fault: address not mapped to object at address (nil))&lt;BR /&gt;==== backtrace (tid: 24597) ====&lt;BR /&gt;0 0x000000000004d445 ucs_debug_print_backtrace() ???:0&lt;BR /&gt;1 0x00000000006e5850 MPII_Dataloop_stackelm_offset() /build/impi/_buildspace/release/../../src/mpi/datatype/typerep/dataloop/segment.c:1026&lt;BR /&gt;2 0x00000000005216b9 segment_init() /build/impi/_buildspace/release/../../src/mpi/datatype/typerep/dataloop/looputil.c:318&lt;BR /&gt;3 0x00000000005216b9 MPIR_Segment_alloc() /build/impi/_buildspace/release/../../src/mpi/datatype/typerep/dataloop/looputil.c:369&lt;BR /&gt;4 0x000000000078fbef MPIR_Typerep_pack() /build/impi/_buildspace/release/../../src/mpi/datatype/typerep/src/typerep_pack.c:64&lt;BR /&gt;5 0x000000000081d215 pack_frame() /build/impi/_buildspace/release/../../src/mpid/ch4/shm/posix/eager/include/intel_transport_send.h:560&lt;BR /&gt;6 0x000000000081d215 progress_serialization_frame() /build/impi/_buildspace/release/../../src/mpid/ch4/shm/posix/eager/include/intel_transport_send.h:1235&lt;BR /&gt;7 0x000000000081603d impi_shm_progress() /build/impi/_buildspace/release/../../src/mpid/ch4/shm/posix/eager/include/intel_transport_progress.h:198&lt;BR /&gt;8 0x00000000001de023 MPIDI_SHM_progress() /build/impi/_buildspace/release/../../src/mpid/ch4/shm/src/../src/shm_noinline.h:106&lt;BR /&gt;9 0x00000000001de023 MPIDI_Progress_test() /build/impi/_buildspace/release/../../src/mpid/ch4/src/ch4_progress.c:152&lt;BR /&gt;10 0x00000000001de023 MPID_Progress_test() /build/impi/_buildspace/release/../../src/mpid/ch4/src/ch4_progress.c:218&lt;BR /&gt;11 0x00000000001de023 MPID_Progress_wait() /build/impi/_buildspace/release/../../src/mpid/ch4/src/ch4_progress.c:279&lt;BR /&gt;12 0x000000000079c74a MPIR_Waitall_impl() /build/impi/_buildspace/release/../../src/mpi/request/waitall.c:46&lt;BR /&gt;13 0x000000000079c74a MPID_Waitall() /build/impi/_buildspace/release/../../src/mpid/ch4/include/mpidpost.h:186&lt;BR /&gt;14 0x000000000079c74a MPIR_Waitall() /build/impi/_buildspace/release/../../src/mpi/request/waitall.c:173&lt;BR /&gt;15 0x000000000079c74a PMPI_Waitall() /build/impi/_buildspace/release/../../src/mpi/request/waitall.c:331&lt;BR /&gt;16 0x0000000000088824 ADIOI_R_Exchange_data() /build/impi/_buildspace/release/src/mpi/romio/../../../../../src/mpi/romio/adio/common/ad_read_coll.c:991&lt;BR /&gt;17 0x000000000008743d ADIOI_Read_and_exch() /build/impi/_buildspace/release/src/mpi/romio/../../../../../src/mpi/romio/adio/common/ad_read_coll.c:802&lt;BR /&gt;18 0x000000000008743d ADIOI_GEN_ReadStridedColl() /build/impi/_buildspace/release/src/mpi/romio/../../../../../src/mpi/romio/adio/common/ad_read_coll.c:247&lt;BR /&gt;19 0x0000000000adc6d9 MPIOI_File_read_all() /build/impi/_buildspace/release/src/mpi/romio/../../../../../src/mpi/romio/mpi-io/read_all.c:115&lt;BR /&gt;20 0x0000000000adc992 PMPI_File_read_at_all() /build/impi/_buildspace/release/src/mpi/romio/../../../../../src/mpi/romio/mpi-io/read_atall.c:58&lt;BR /&gt;21 0x00000000003b222a H5FD_get_mpio_atomicity() ???:0&lt;BR /&gt;22 0x000000000014ef20 H5FD_read() ???:0&lt;BR /&gt;23 0x0000000000126725 H5F__accum_read() ???:0&lt;BR /&gt;24 0x00000000002613e9 H5PB_read() ???:0&lt;BR /&gt;25 0x0000000000134959 H5F_block_read() ???:0&lt;BR /&gt;26 0x00000000003a3bd2 H5D__mpio_select_read() ???:0&lt;BR /&gt;27 0x00000000003a3e32 H5D__contig_collective_read() ???:0&lt;BR /&gt;28 0x00000000000eea22 H5D__read() ???:0&lt;BR /&gt;29 0x00000000000ee1cf H5Dread() ???:0&lt;BR /&gt;30 0x000000000040a86c ???() /work/STAFF/eanderso/hdf5-1.10.8/testpar/.libs/t_bigio:0&lt;BR /&gt;31 0x0000000000404436 ???() /work/STAFF/eanderso/hdf5-1.10.8/testpar/.libs/t_bigio:0&lt;BR /&gt;32 0x0000000000022555 __libc_start_main() ???:0&lt;BR /&gt;33 0x0000000000404269 ???() /work/STAFF/eanderso/hdf5-1.10.8/testpar/.libs/t_bigio:0&lt;BR /&gt;=================================&lt;/P&gt;
&lt;P&gt;===================================================================================&lt;BR /&gt;= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES&lt;BR /&gt;= RANK 0 PID 24597 RUNNING AT atmos5&lt;BR /&gt;= KILLED BY SIGNAL: 11 (Segmentation fault)&lt;BR /&gt;===================================================================================&lt;/P&gt;
&lt;P&gt;===================================================================================&lt;BR /&gt;= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES&lt;BR /&gt;= RANK 1 PID 24598 RUNNING AT atmos5&lt;BR /&gt;= KILLED BY SIGNAL: 9 (Killed)&lt;BR /&gt;===================================================================================&lt;/P&gt;
&lt;P&gt;===================================================================================&lt;BR /&gt;= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES&lt;BR /&gt;= RANK 2 PID 24599 RUNNING AT atmos5&lt;BR /&gt;= KILLED BY SIGNAL: 9 (Killed)&lt;BR /&gt;===================================================================================&lt;/P&gt;
&lt;P&gt;===================================================================================&lt;BR /&gt;= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES&lt;BR /&gt;= RANK 3 PID 24600 RUNNING AT atmos5&lt;BR /&gt;= KILLED BY SIGNAL: 9 (Killed)&lt;BR /&gt;===================================================================================&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Dec 2021 21:49:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1344762#M9027</guid>
      <dc:creator>ecanders</dc:creator>
      <dc:date>2021-12-16T21:49:03Z</dc:date>
    </item>
    <item>
      <title>Re: Segfault testing parallel HDF5 with Intel MPI</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1345346#M9033</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for providing the steps to reproduce your issue at our end.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have tried to reproduce your issue on both Ubuntu 18.04 and Rocky Linux 8.5 machines using oneAPI 2021.3 and oneAPI 2021.4 by following the below steps:&lt;/P&gt;
&lt;P&gt;1. git clone &lt;A href="https://github.com/HDFGroup/hdf5.git" target="_blank"&gt;https://github.com/HDFGroup/hdf5.git&lt;/A&gt;&lt;BR /&gt;2. cd hdf5&lt;BR /&gt;3. source /opt/intel/oneapi/setvars.sh&lt;BR /&gt;4. CC=icc CXX=icpc FC=ifort F77=ifort F90=ifort CFLAGS="-O3 -fp-model precise" CXXFLAGS="-O3 -fp-model precise" FFLAGS="-O3 -fp-model precise" FCFLAGS="-O3 -fp-model precise" RUNPARALLEL="mpirun -np 4" ./configure --prefix=/usr/local/apps/phdf5-1.10.8/intel-21.4 --enable-fortran --enable-shared --enable-parallel --with-pic CC=mpiicc FC=mpiifort&lt;BR /&gt;5. make &amp;gt;&amp;amp; make.pintelmpi.log &lt;BR /&gt;6. make check &amp;gt;&amp;amp; make.pintelmpi.check &lt;BR /&gt;(or)&lt;BR /&gt;cd testpar/ &amp;amp;&amp;amp; mpirun -np 4 ./t_bigio&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;**Using oneAPI &lt;STRONG&gt;2021.3&lt;/STRONG&gt;, it is &lt;STRONG&gt;working&lt;/STRONG&gt; fine on both Ubuntu 18.04 and Rocky Linux 8.5 machines as shown in the below screenshot.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SantoshY_Intel_1-1639999173349.png" style="width: 444px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/24954i9CFED94FCAF37E81/image-dimensions/444x370?v=v2&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" width="444" height="370" role="button" title="SantoshY_Intel_1-1639999173349.png" alt="SantoshY_Intel_1-1639999173349.png" /&gt;&lt;/span&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But, using oneAPI &lt;STRONG&gt;2021.4&lt;/STRONG&gt;, we are getting the&amp;nbsp;&lt;STRONG&gt;segmentation&lt;/STRONG&gt; &lt;STRONG&gt;fault&lt;/STRONG&gt; error on both Ubuntu 18.04 and Rocky Linux 8.5 machines as shown in the below screenshot.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SantoshY_Intel_0-1639999128526.png" style="width: 543px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/24953i31624F7D0275D60F/image-dimensions/543x129?v=v2&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" width="543" height="129" role="button" title="SantoshY_Intel_0-1639999128526.png" alt="SantoshY_Intel_0-1639999128526.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We are working on your issue internally and will get back to you soon.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Santosh&lt;/P&gt;</description>
      <pubDate>Thu, 23 Dec 2021 07:53:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1345346#M9033</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2021-12-23T07:53:57Z</dc:date>
    </item>
    <item>
      <title>Re: Segfault testing parallel HDF5 with Intel MPI</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1346223#M9042</link>
      <description>&lt;P&gt;Hello!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As workaround for this issue I can suggest you to set I_MPI_SHM=off. We are working on fix for this.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Vadim.&lt;/P&gt;</description>
      <pubDate>Thu, 23 Dec 2021 09:57:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1346223#M9042</guid>
      <dc:creator>Vadim_Ku_intel</dc:creator>
      <dc:date>2021-12-23T09:57:22Z</dc:date>
    </item>
    <item>
      <title>Re:Segfault testing parallel HDF5 with Intel MPI</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1388752#M9541</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thank you for your patience. The issue raised by you has been fixed in Intel MPI 2021.6 version(HPC Toolkit 2022.2). Please download and let us know if this resolves your issue.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 31 May 2022 04:49:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1388752#M9541</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-05-31T04:49:27Z</dc:date>
    </item>
    <item>
      <title>Re:Segfault testing parallel HDF5 with Intel MPI</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1390155#M9561</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We assume that your issue is resolved. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 06 Jun 2022 06:25:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Segfault-testing-parallel-HDF5-with-Intel-MPI/m-p/1390155#M9561</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-06-06T06:25:44Z</dc:date>
    </item>
  </channel>
</rss>

