<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic   in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034746#M4223</link>
    <description>&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;James Tullos (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Can you provide a reproducer code?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Dear James,&lt;/P&gt;

&lt;P&gt;I am trying to narrow code. However, right now I am facing an another problem with derived datatypes. Enclosed please find a reproducer code.&amp;nbsp;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Just compile &amp;nbsp;it and pass &amp;nbsp;the following parameters:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:plain;"&gt;mpicc  mpi_tvec2_rma.c -o mpi_tvec2_rma
mpirun -np 8 ./mpi_tvec2_rma 128 40000&lt;/PRE&gt;

&lt;P&gt;When I am using the Intel MPI (Intel C compiler) v4.1.3.048 (v15.0.0) it crashes with the following error message:&lt;/P&gt;

&lt;PRE class="brush:plain;"&gt;Assertion failed in file src/mpid/ch3/src/ch3u_handle_send_req.c at line 61: win_ptr-&amp;gt;at_completion_counter &amp;gt;= 0
internal ABORT - process 0
&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;The MPICH developers claimed that this problem has been probably fixed in development&amp;nbsp;&lt;/SPAN&gt;version&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;of MPICH3. I will check it out. However, if I switch to Intel MPI v5.0.1.035, then it is getting more and more interesting:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:plain;"&gt;Fatal error in MPI_Win_lock: Other MPI error, error stack:
MPI_Win_lock(165)......................: MPI_Win_lock(lock_type=234, rank=1, assert=0, win=0xa0000000) failed
MPIDI_Win_lock(2702)...................: 
MPIDI_CH3I_Acquire_local_lock(3615)....:  Detected an error while in progress wait for RMA messages
MPIDI_CH3I_Progress(504)...............: 
MPID_nem_handle_pkt(1368)..............: 
MPIDI_CH3_PktHandler_EagerSend(748)....: failure occurred while posting a receive for message data (MPIDI_CH3_PKT_EAGER_SEND)
MPIDI_CH3U_Receive_data_unexpected(253): Out of memory (unable to allocate -1703399408 bytes)&lt;/PRE&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;It seems to me as an integer overflow problem somewhere inside Intel MPI. Could you please have a look at it?&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;&amp;nbsp;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;With best regards,&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;Victor.&lt;/P&gt;</description>
    <pubDate>Mon, 10 Nov 2014 16:02:28 GMT</pubDate>
    <dc:creator>Victor_V_</dc:creator>
    <dc:date>2014-11-10T16:02:28Z</dc:date>
    <item>
      <title>[6] Assertion failed in file ../../segment.c</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034744#M4221</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;we have compiled our parallel code by using the latest Intel's software stack. We do use a lot of passive RMA one-sided PUT/GET operations along with a derived datatypes.&amp;nbsp;Now we are expericincing problem that sometimes our application fails with either segmentation fault or with the following error message:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[6] Assertion failed in file ../../segment.c at line 669: cur_elmp-&amp;gt;curcount &amp;gt;= 0&lt;/P&gt;&lt;P&gt;[6] internal ABORT - process 6&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The Intel's inspector shows a problem inside the Intel MPI library:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;libmpi_dbg.so.4!MPID_Segment_blkidx_m2m - segment_packunpack.c:313&lt;BR /&gt;libmpi_dbg.so.4!MPID_Segment_manipulate - segment.c:552&lt;BR /&gt;libmpi_dbg.so.4!MPID_Segment_unpack - segment_packunpack.c:88&lt;BR /&gt;libmpi_dbg.so.4!MPIDI_CH3U_Receive_data_found - ch3u_handle_recv_pkt.c:190&lt;BR /&gt;libmpi_dbg.so.4!MPIDI_CH3_PktHandler_GetResp - ch3u_rma_sync.c:3691&lt;BR /&gt;libmpi_dbg.so.4!MPID_nem_handle_pkt - ch3_progress.c:1477&lt;BR /&gt;libmpi_dbg.so.4!MPIDI_CH3I_Progress - ch3_progress.c:498&lt;BR /&gt;libmpi_dbg.so.4!MPIDI_Win_unlock - ch3u_rma_sync.c:1959&lt;BR /&gt;libmpi_dbg.so.4!PMPI_Win_unlock - win_unlock.c:119&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does it mean that the something is wrong with the derived datatypes? If yes, how I can debug the problem? The problem never appears within OpenMPI.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The SW stack used:&lt;/P&gt;&lt;P&gt;Intel C/Fortran compilers v15.0.0.090&lt;/P&gt;&lt;P&gt;Intel MPI Library v5.0.1.035&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any help will be greatly appreciated!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best,&lt;/P&gt;&lt;P&gt;Victor.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 24 Oct 2014 13:30:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034744#M4221</guid>
      <dc:creator>Victor_V_</dc:creator>
      <dc:date>2014-10-24T13:30:53Z</dc:date>
    </item>
    <item>
      <title>Can you provide a reproducer</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034745#M4222</link>
      <description>&lt;P&gt;Can you provide a reproducer code?&lt;/P&gt;</description>
      <pubDate>Fri, 31 Oct 2014 19:23:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034745#M4222</guid>
      <dc:creator>James_T_Intel</dc:creator>
      <dc:date>2014-10-31T19:23:06Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034746#M4223</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;James Tullos (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Can you provide a reproducer code?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Dear James,&lt;/P&gt;

&lt;P&gt;I am trying to narrow code. However, right now I am facing an another problem with derived datatypes. Enclosed please find a reproducer code.&amp;nbsp;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Just compile &amp;nbsp;it and pass &amp;nbsp;the following parameters:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:plain;"&gt;mpicc  mpi_tvec2_rma.c -o mpi_tvec2_rma
mpirun -np 8 ./mpi_tvec2_rma 128 40000&lt;/PRE&gt;

&lt;P&gt;When I am using the Intel MPI (Intel C compiler) v4.1.3.048 (v15.0.0) it crashes with the following error message:&lt;/P&gt;

&lt;PRE class="brush:plain;"&gt;Assertion failed in file src/mpid/ch3/src/ch3u_handle_send_req.c at line 61: win_ptr-&amp;gt;at_completion_counter &amp;gt;= 0
internal ABORT - process 0
&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;The MPICH developers claimed that this problem has been probably fixed in development&amp;nbsp;&lt;/SPAN&gt;version&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;of MPICH3. I will check it out. However, if I switch to Intel MPI v5.0.1.035, then it is getting more and more interesting:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:plain;"&gt;Fatal error in MPI_Win_lock: Other MPI error, error stack:
MPI_Win_lock(165)......................: MPI_Win_lock(lock_type=234, rank=1, assert=0, win=0xa0000000) failed
MPIDI_Win_lock(2702)...................: 
MPIDI_CH3I_Acquire_local_lock(3615)....:  Detected an error while in progress wait for RMA messages
MPIDI_CH3I_Progress(504)...............: 
MPID_nem_handle_pkt(1368)..............: 
MPIDI_CH3_PktHandler_EagerSend(748)....: failure occurred while posting a receive for message data (MPIDI_CH3_PKT_EAGER_SEND)
MPIDI_CH3U_Receive_data_unexpected(253): Out of memory (unable to allocate -1703399408 bytes)&lt;/PRE&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;It seems to me as an integer overflow problem somewhere inside Intel MPI. Could you please have a look at it?&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;&amp;nbsp;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;With best regards,&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-size: 11px; line-height: normal; font-family: Menlo;"&gt;Victor.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Nov 2014 16:02:28 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/6-Assertion-failed-in-file-segment-c/m-p/1034746#M4223</guid>
      <dc:creator>Victor_V_</dc:creator>
      <dc:date>2014-11-10T16:02:28Z</dc:date>
    </item>
  </channel>
</rss>

