<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic This means default machine in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183367#M6747</link>
    <description>&lt;P&gt;This means the default machine topology detection library, hwloc, crashes on your system. You can switch to Update 3, which contains related fixes, if convenient, or run&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;export I_MPI_HYDRA_TOPOLIB=ipl&lt;/PRE&gt;

&lt;P&gt;at some point before starting mpirun.&lt;/P&gt;</description>
    <pubDate>Thu, 11 Apr 2019 14:55:10 GMT</pubDate>
    <dc:creator>Maksim_B_Intel</dc:creator>
    <dc:date>2019-04-11T14:55:10Z</dc:date>
    <item>
      <title>bug: mpiexec segmentation fault</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183364#M6744</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;Starting from Parallel Studio 2019 Update 1, mpiexec fails to run any executable. Example: "mpiexec -np 1 /bin/ls". Any call to mpiexec (except calls like "mpiexec -help") results in a segmentation fault.&lt;/P&gt;&lt;P&gt;Please help. I can provide additional information if necessary. However, testing is a bit complicated because I had to revert to the initial release, and Updates cannot be installed concurrently AFAIK, so please request testing only if absolutely necessary.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note 1: This is on Linux Mint 19. As you may know, this distribution is heavily based on Ubuntu 18.04. By "heavily" I mean that only cosmetic packages differ, like the desktop environment packages. System packages (libc and the like) are taken directly from the Ubuntu repositories.&lt;/P&gt;&lt;P&gt;Note 2: This problem was originally reported in the C++ compiler forum, &lt;A href="https://software.intel.com/en-us/node/805992"&gt;here&lt;/A&gt;. It was spotted on openSUSE (which shares most code with SLES, a distribution completely independent of Ubuntu).&lt;/P&gt;</description>
      <pubDate>Mon, 08 Apr 2019 10:08:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183364#M6744</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-04-08T10:08:03Z</dc:date>
    </item>
    <item>
      <title>Please try</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183365#M6745</link>
      <description>&lt;P&gt;Please try&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra --verbose -n 1 /bin/ls&lt;/PRE&gt;

&lt;P&gt;and if it fails too, could you provide a coredump or run&amp;nbsp;&lt;/P&gt;

&lt;PRE class="brush:bash; class-name:dark;"&gt;( export I_MPI_HYDRA_TOPOLIB=ipl; gdb --args mpiexec.hydra --verbose -n 1 /bin/ls )&lt;/PRE&gt;

&lt;P&gt;and post a backtrace?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Apr 2019 06:25:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183365#M6745</guid>
      <dc:creator>Maksim_B_Intel</dc:creator>
      <dc:date>2019-04-09T06:25:18Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183366#M6746</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Thank you for answering&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Maksim B. (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Please try&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra --verbose -n 1 /bin/ls&lt;/PRE&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;It works and outputs:&lt;/P&gt;

&lt;PRE class="brush:bash; class-name:dark;"&gt;[mpiexec@maxwell] Launch arguments: /opt/intel/compilers_and_libraries_2019.3.199/linux/mpi/intel64/bin//hydra_bstrap_proxy --upstream-fd 7 --pgid 0 --proxy-id 0 --node-id 0 --launcher ssh --base-path /opt/intel/compilers_and_libraries_2019.3.199/linux/mpi/intel64/bin/ --subtree-size 1 --tree-width 16 --tree-level 1 --time-left -1 --debug /opt/intel/compilers_and_libraries_2019.3.199/linux/mpi/intel64/bin//hydra_pmi_proxy --usize -1 --auto-cleanup 1&lt;/PRE&gt;

&lt;P&gt;&lt;EM&gt;&amp;lt;directory contents&amp;gt;&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So what now?&lt;/P&gt;</description>
      <pubDate>Thu, 11 Apr 2019 14:33:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183366#M6746</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-04-11T14:33:01Z</dc:date>
    </item>
    <item>
      <title>This means default machine</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183367#M6747</link>
      <description>&lt;P&gt;This means the default machine topology detection library, hwloc, crashes on your system. You can switch to Update 3, which contains related fixes, if convenient, or run&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;export I_MPI_HYDRA_TOPOLIB=ipl&lt;/PRE&gt;

&lt;P&gt;at some point before starting mpirun.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Apr 2019 14:55:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183367#M6747</guid>
      <dc:creator>Maksim_B_Intel</dc:creator>
      <dc:date>2019-04-11T14:55:10Z</dc:date>
    </item>
    <item>
      <title>I'm already running Update 3!</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183368#M6748</link>
      <description>&lt;P&gt;I'm already running Update 3!&lt;/P&gt;&lt;P&gt;Thank you. Is there anything to do at compile time?&lt;/P&gt;</description>
      <pubDate>Thu, 11 Apr 2019 15:14:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183368#M6748</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-04-11T15:14:42Z</dc:date>
    </item>
    <item>
      <title>No, compile-time switches do</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183369#M6749</link>
      <description>&lt;P&gt;No, compile-time switches do not affect the process manager.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Apr 2019 06:42:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183369#M6749</guid>
      <dc:creator>Maksim_B_Intel</dc:creator>
      <dc:date>2019-04-12T06:42:29Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183370#M6750</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;It looks like this impacts performance. Is this expected?&lt;/P&gt;&lt;P&gt;I compared the running times of the Quantum Espresso "pw" binary: (i) compiled with 2019 initial release and launched with the default topo lib, and (ii) compiled with 2019 update 1 and launched with the ipl topo lib. Results on three test cases, which differ only by parallelization parameters:&lt;/P&gt;&lt;P&gt;- case 1: (i) 2h24 (ii) 2h08&lt;/P&gt;&lt;P&gt;- case 2: (i) 2h29 (ii) 3h00&lt;/P&gt;&lt;P&gt;- case 3: (i) 1h20 (ii) 1h26&lt;/P&gt;&lt;P&gt;(note: the variability of these times is less than 1 minute so the differences are significant)&lt;/P&gt;&lt;P&gt;I should add that I'm setting CPU and memory affinity using the "cset" package tools, to ensure the software is running alone on reserved sockets of the server.&lt;/P&gt;</description>
      <pubDate>Fri, 19 Apr 2019 12:34:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183370#M6750</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-04-19T12:34:44Z</dc:date>
    </item>
    <item>
      <title>Hello,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183371#M6751</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;(Not sure if I should have started a new thread, forum moderators feel free to split this.)&lt;/P&gt;&lt;P&gt;Update on the situation. I have just installed PS XE 2019 Update 4 Cluster Edition and &lt;STRONG&gt;it is worse.&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;"raw" mpiexec still does not work&lt;/LI&gt;&lt;LI&gt;using I_MPI_HYDRA_TOPOLIB=ipl now fails for some -n values, but not all! Some tests:&lt;/LI&gt;&lt;/UL&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;(cd to some dir containing only a dir called "rpm")

&amp;gt; I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra --verbose -n 1 /bin/ls
[mpiexec@maxwell] Launch arguments: /opt/intel/compilers_and_libraries_2019.4.243/linux/mpi/intel64/bin//hydra_bstrap_proxy --upstream-host maxwell --upstream-port 45565 --pgid 0 --launcher ssh --launcher-number 0 --base-path /opt/intel/compilers_and_libraries_2019.4.243/linux/mpi/intel64/bin/ --tree-width 16 --tree-level 1 --time-left -1 --collective-launch 1 --debug --proxy-id 0 --node-id 0 --subtree-size 1 --upstream-fd 7 /opt/intel/compilers_and_libraries_2019.4.243/linux/mpi/intel64/bin//hydra_pmi_proxy --usize -1 --auto-cleanup 1 --abort-signal 9
rpm

&amp;gt; I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra  -n 2 /bin/ls
malloc(): memory corruption
[mpiexec@maxwell] wait_proxies_to_terminate (../../../../../src/pm/i_hydra/mpiexec/intel/i_mpiexec.c:538): downstream from host maxwell was killed by signal 6 (Aborted)
[mpiexec@maxwell] main (../../../../../src/pm/i_hydra/mpiexec/mpiexec.c:2041): assert (exitcodes != NULL) failed&lt;/PRE&gt;

&lt;P&gt;(same for -n ranging from 3 to 5; from 10 to 12; from 17 to 24)&lt;/P&gt;

&lt;PRE class="brush:bash; class-name:dark;"&gt;&amp;gt; I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra  -n 6 /bin/ls
rpm
rpm
rpm
rpm
rpm
rpm&lt;/PRE&gt;

&lt;P&gt;(OK as well for -n from 7 to 9, and from 25 upwards AFAICT)&lt;/P&gt;

&lt;PRE class="brush:bash; class-name:dark;"&gt;&amp;gt; I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra  -n 13 /bin/ls
rpm
rpm
rpm
rpm
rpm
rpm
rpm
rpm
rpm
double free or corruption (out)
[mpiexec@maxwell] wait_proxies_to_terminate (../../../../../src/pm/i_hydra/mpiexec/intel/i_mpiexec.c:538): downstream from host maxwell was killed by signal 6 (Aborted)
[mpiexec@maxwell] main (../../../../../src/pm/i_hydra/mpiexec/mpiexec.c:2041): assert (exitcodes != NULL) failed&lt;/PRE&gt;

&lt;P&gt;(same kind of failure with partial execution for -n from 14 to 16)&lt;/P&gt;
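&lt;P&gt;For anyone wanting to reproduce this list of passing and failing rank counts, the manual runs above can be scripted. This is only a sketch, assuming bash and the seq utility; sweep_ranks is a hypothetical helper name (not part of Intel MPI), and /bin/true stands in for /bin/ls to keep the output short:&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;# sweep_ranks LAUNCHER MAX (hypothetical helper, not an Intel MPI tool)
# Runs "LAUNCHER -n N /bin/true" for N = 1..MAX and prints one line per N:
# "N ok" on success, "N FAIL(status)" otherwise.
sweep_ranks() {
    local launcher=$1 max=$2 n
    for n in $(seq 1 "$max"); do
        if "$launcher" -n "$n" /bin/true; then
            echo "$n ok"
        else
            # In the else branch, $? still holds the launcher's exit status.
            echo "$n FAIL($?)"
        fi
    done
}&lt;/PRE&gt;
&lt;P&gt;Called as "I_MPI_HYDRA_TOPOLIB=ipl sweep_ranks mpiexec.hydra 30" from inside the cpuset, it should list which -n values abort.&lt;/P&gt;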
&lt;P&gt;See also &lt;A href="https://software.intel.com/en-us/forums/intel-clusters-and-hpc-technology/topic/808360"&gt;that bug report&lt;/A&gt; I made, which is why I installed Update 4 (it supposedly contains a fix for 4-socket systems).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please help.&lt;/P&gt;</description>
      <pubDate>Tue, 04 Jun 2019 15:22:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183371#M6751</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-06-04T15:22:18Z</dc:date>
    </item>
    <item>
      <title>Hi, Lucas. </title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183372#M6752</link>
      <description>&lt;P&gt;Hi, Lucas.&lt;/P&gt;&lt;P&gt;Could you set the HYDRA_BSTRAP_VALGRIND=1 variable, run the failing case, and send me the output? It can help us understand where the "memory corruption" and "double free or corruption" errors come from.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, from your earlier post, I see that you use cset to set CPU and memory affinity. Could you run without cset to check whether it affects the result?&lt;/P&gt;&lt;P&gt;--&lt;/P&gt;&lt;P&gt;Best regards, Anatoliy.&lt;/P&gt;</description>
      <pubDate>Wed, 05 Jun 2019 06:45:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183372#M6752</guid>
      <dc:creator>Anatoliy_R_Intel</dc:creator>
      <dc:date>2019-06-05T06:45:56Z</dc:date>
    </item>
    <item>
      <title>Hi Anatoliy,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183373#M6753</link>
      <description>&lt;P&gt;Hi Anatoliy,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for helping.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Anatoliy R. (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Could you set HYDRA_BSTRAP_VALGRIND=1 variable, run failed case and&amp;nbsp;provide me the output? It can help to understand where we have&amp;nbsp;memory corruption and&amp;nbsp;double&amp;nbsp;free&amp;nbsp;or corruption.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;&amp;gt; HYDRA_BSTRAP_VALGRIND=1 I_MPI_HYDRA_TOPOLIB=ipl mpiexec.hydra  -n 2 /bin/ls &amp;gt;&amp;amp; /tmp/log.log&lt;/PRE&gt;

&lt;P&gt;&lt;IMG alt="Binary Data" class="file-icon" src="https://community.intel.com/modules/file/icons/application-octet-stream.png" title="application/octet-stream" /&gt; &lt;A href="https://community.intel.com/legacyfs/online/drupal_files/managed/48/aa/log.log"&gt;log.log&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Also&amp;nbsp;from the letter above, I see that you use&amp;nbsp;cset for setting CPU and memory affinity. Could you run without "cset" to check that it does not affect?&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;You hit the spot: it does have an effect. There is no error when running "outside" the cpusets (i.e. in the root cpuset). But I need to be able to isolate tasks; is there a way other than cpusets, if the mpiexec bug cannot be fixed quickly?&lt;/P&gt;</description>
      <pubDate>Thu, 06 Jun 2019 20:11:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183373#M6753</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-06-06T20:11:00Z</dc:date>
    </item>
    <item>
      <title>I did not find something</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183374#M6754</link>
      <description>&lt;P&gt;I did not find anything wrong in the valgrind output.&lt;/P&gt;&lt;P&gt;We can also try the HYDRA_BSTRAP_XTERM=1 variable, which opens xterm windows with gdb already launched.&lt;/P&gt;&lt;P&gt;Please set this variable and run mpirun. You will then see xterm windows with gdb launched. Type `run` in each window, and you will see a failure in one of them. Then type `bt` there to show the backtrace. Please send me this backtrace.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;But I need to be able to isolate tasks; is there a way other than cpusets, if the mpiexec bug cannot be fixed quickly?&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;You can specify which CPUs the MPI processes run on via the I_MPI_PIN* variables. For example, I_MPI_PIN_PROCESSOR_LIST=0,1 will run rank 0 on CPU 0 and rank 1 on CPU 1:&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;$ I_MPI_PIN_PROCESSOR_LIST=0,1 mpiexec -n 2 -genv I_MPI_DEBUG=4 ./test.exe
..
[0] MPI startup(): Rank    Pid      Node name      Pin cpu
[0] MPI startup(): 0       64298    host           0
[0] MPI startup(): 1       64299    host           1&lt;/PRE&gt;</description>
      <pubDate>Fri, 07 Jun 2019 07:31:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183374#M6754</guid>
      <dc:creator>Anatoliy_R_Intel</dc:creator>
      <dc:date>2019-06-07T07:31:16Z</dc:date>
    </item>
    <item>
      <title>Quote:Anatoliy R. (Intel)</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183375#M6755</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Anatoliy R. (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I did not find something wrong in valgrind output.&amp;nbsp;&lt;/P&gt;&lt;P&gt;We can also try HYDRA_BSTRAP_XTERM=1 variable, that&amp;nbsp;will run&amp;nbsp;xterm windows with launched gdb.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please set this variable and run mpirun. After that you will see xterm windows with launched gdb. Then type `run` in each windows and you will see&amp;nbsp;fail in one of the windows. Then type `bt`, it will show backtrace. Please send me this&amp;nbsp;backtrace.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The "-n 2" run failed in the second xterm. Backtrace:&lt;/P&gt;
&lt;PRE class="brush:; class-name:dark;"&gt;(gdb) bt
#0  __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:51
#1  0x00007ffff704b801 in __GI_abort () at abort.c:79
#2  0x00007ffff7094897 in __libc_message (action=action@entry=do_abort, 
    fmt=fmt@entry=0x7ffff71c1b9a "%s\n") at ../sysdeps/posix/libc_fatal.c:181
#3  0x00007ffff709b90a in malloc_printerr (
    str=str@entry=0x7ffff71bfe0e "malloc(): memory corruption") at malloc.c:5350
#4  0x00007ffff709f994 in _int_malloc (av=av@entry=0x7ffff73f6c40 &amp;lt;main_arena&amp;gt;, 
    bytes=bytes@entry=8) at malloc.c:3738
#5  0x00007ffff70a20fc in __GI___libc_malloc (bytes=8) at malloc.c:3057
#6  0x0000000000435b47 in ipl_domain_ordering (info=0x2, ord=0x7fffffffa930, lord=0)
    at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/ipl_service.c:1505
#7  0x000000000043a02d in ipl_create_domains (pi=0x2, scale=-22224)
    at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/ipl_service.c:2233
#8  0x00000000004345e7 in ipl_one_to_many_pinning (info=0x2)
    at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/ipl_service.c:2281
#9  0x0000000000444fa4 in i_mpi_bind_init (
    binding=0x2 &amp;lt;error: Cannot access memory at address 0x2&amp;gt;, 
    bindlib=0x7fffffffa930 "", map=0x0, nrank=2)
    at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/i_mpi_bind.c:372
#10 0x000000000040a563 in launch_processes ()
    at ../../../../../src/pm/i_hydra/proxy/proxy.c:387
#11 0x0000000000408785 in main (argc=2, argv=0x7fffffffa930)
    at ../../../../../src/pm/i_hydra/proxy/proxy.c:895
&lt;/PRE&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;&lt;P&gt;But I need to be able to isolate tasks; is there a way other than cpusets, if the mpiexec bug cannot be fixed quickly?&lt;/P&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;You can specify on which cpus to run mpi processes via I_MPI_PIN* variables. For example&amp;nbsp;I_MPI_PIN_PROCESSOR_LIST=0,1 will run rank 0 on cpu 0 and rank 1 on cpu 1:&lt;/P&gt;
&lt;P&gt;$ I_MPI_PIN_PROCESSOR_LIST=0,1 mpiexec -n 2 -genv I_MPI_DEBUG=4 ./test.exe&lt;BR /&gt;..&lt;BR /&gt;[0] MPI startup(): Rank &amp;nbsp; &amp;nbsp;Pid &amp;nbsp; &amp;nbsp; &amp;nbsp;Node name&amp;nbsp; &amp;nbsp; &amp;nbsp; Pin cpu&lt;BR /&gt;[0] MPI startup():&amp;nbsp; &amp;nbsp;0 &amp;nbsp; &amp;nbsp; &amp;nbsp; 64298&amp;nbsp; &amp;nbsp; &amp;nbsp; host&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0&lt;BR /&gt;[0] MPI startup():&amp;nbsp; &amp;nbsp;1 &amp;nbsp; &amp;nbsp; &amp;nbsp; 64299&amp;nbsp; &amp;nbsp; &amp;nbsp; host&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I'd like the process to have exclusive access to some sockets (CPU+memory); with cset I can both pin the computation task to sockets 1-3 &lt;EM&gt;and&lt;/EM&gt; pin all other tasks (user and system tasks) to socket 0. If I pin the computation task to sockets 1-3 with your method, I suppose the CPU allocator of the OS will "naturally" move other tasks to socket 0 (if I disable hyperthreading); but what will the RAM allocator do? We are investigating RAM bandwidth issues, so I have to prevent other tasks from using the RAM of sockets 1-3.&lt;/P&gt;
&lt;P&gt;Thanks again for your help.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2019 09:33:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183375#M6755</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-06-07T09:33:50Z</dc:date>
    </item>
    <item>
      <title>Thank you for backtrace. I</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183376#M6756</link>
      <description>&lt;P&gt;Thank you for the backtrace. I will look into what is going wrong.&lt;/P&gt;&lt;P&gt;As a workaround, you can use the legacy Hydra process manager.&lt;/P&gt;&lt;P&gt;Please try running `PATH=${I_MPI_ROOT}/intel64/bin/legacy:${PATH} mpiexec.hydra ...`&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;--&lt;/P&gt;&lt;P&gt;Best regards, Anatoliy.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2019 09:42:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183376#M6756</guid>
      <dc:creator>Anatoliy_R_Intel</dc:creator>
      <dc:date>2019-06-07T09:42:33Z</dc:date>
    </item>
    <item>
      <title>Quote:Anatoliy R. (Intel)</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183377#M6757</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Anatoliy R. (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;As a workaround you can use legacy hydra process manager.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please try to run `PATH=${I_MPI_ROOT}/intel64/bin/legacy:${PATH} mpiexec.hydra ...`&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Great, it works! And I don't have to set the "topolib" anymore.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2019 09:51:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183377#M6757</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-06-07T09:51:06Z</dc:date>
    </item>
    <item>
      <title>Yes, ipl is the default</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183378#M6758</link>
      <description>&lt;P&gt;Yes, ipl is the default topolib in the legacy Hydra process manager.&lt;/P&gt;&lt;P&gt;I will create a ticket for the new Hydra process manager.&lt;/P&gt;&lt;P&gt;--&lt;/P&gt;&lt;P&gt;Best regards, Anatoliy.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2019 09:55:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183378#M6758</guid>
      <dc:creator>Anatoliy_R_Intel</dc:creator>
      <dc:date>2019-06-07T09:55:39Z</dc:date>
    </item>
    <item>
      <title>Hi, Lucas</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183379#M6759</link>
      <description>&lt;P&gt;Hi, Lucas&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could you also run lscpu?&lt;/P&gt;&lt;P&gt;--&lt;/P&gt;&lt;P&gt;Best regards, Anatoliy.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 12:40:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183379#M6759</guid>
      <dc:creator>Anatoliy_R_Intel</dc:creator>
      <dc:date>2019-06-11T12:40:05Z</dc:date>
    </item>
    <item>
      <title>Hi Anatoliy,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183380#M6760</link>
      <description>&lt;P&gt;Hi Anatoliy,&lt;/P&gt;&lt;P&gt;Here you go&lt;/P&gt;
&lt;PRE class="brush:; class-name:dark;"&gt;&amp;gt; lscpu
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              192
On-line CPU(s) list: 0-191
Thread(s) per core:  2
Core(s) per socket:  24
Socket(s):           4
NUMA node(s):        4
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8168 CPU @ 2.70GHz
Stepping:            4
CPU MHz:             3000.000
BogoMIPS:            5400.00
Virtualization:      VT-x
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            33792
NUMA node0 CPU(s):&amp;nbsp;&amp;nbsp; 0,4,8,12,16,20,24,28,32,36,40,44,48,52,56,60,64,68,72,76,80,
84,88,92,96,100,104,108,112,116,120,124,128,132,136,140,144,148,152,156,160,164,168,
172,176,180,184,188
NUMA node1 CPU(s):&amp;nbsp;&amp;nbsp; 1,5,9,13,17,21,25,29,33,37,41,45,49,53,57,61,65,69,73,77,81,
85,89,93,97,101,105,109,113,117,121,125,129,133,137,141,145,149,153,157,161,165,169,
173,177,181,185,189
NUMA node2 CPU(s):&amp;nbsp;&amp;nbsp; 2,6,10,14,18,22,26,30,34,38,42,46,50,54,58,62,66,70,74,78,82,
86,90,94,98,102,106,110,114,118,122,126,130,134,138,142,146,150,154,158,162,166,170,
174,178,182,186,190
NUMA node3 CPU(s):&amp;nbsp;&amp;nbsp; 3,7,11,15,19,23,27,31,35,39,43,47,51,55,59,63,67,71,75,79,83,
87,91,95,99,103,107,111,115,119,123,127,131,135,139,143,147,151,155,159,163,167,171,
175,179,183,187,191
Flags:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca
cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx
pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl
xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx
smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe
popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch
cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd mba ibrs ibpb
stibp tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2
smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap
clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1
xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln
pts pku ospke md_clear flush_l1d
&lt;/PRE&gt;
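
&lt;P&gt;The NUMA lines above show the CPU numbers interleaved across the four nodes: CPU i sits on node i modulo 4. A processor list for one node can therefore be generated rather than typed by hand, for instance as a candidate value for I_MPI_PIN_PROCESSOR_LIST. A minimal sketch, assuming the seq and paste utilities are available; cpus_on_node is a hypothetical helper name:&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;# cpus_on_node NODE NNODES NCPUS (hypothetical helper)
# Prints the comma-separated CPUs NODE, NODE+NNODES, ... below NCPUS,
# which matches the interleaved numbering in the lscpu output above.
cpus_on_node() {
    local node=$1 nnodes=$2 ncpus=$3
    seq "$node" "$nnodes" "$((ncpus - 1))" | paste -sd, -
}

# Example for this machine (4 nodes, 192 CPUs), CPUs of node 1:
# cpus_on_node 1 4 192&lt;/PRE&gt;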

&lt;P&gt;(Edit: the last lines of the lscpu output looked truncated, so I wrapped them.)&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 13:00:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183380#M6760</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-06-11T13:00:00Z</dc:date>
    </item>
    <item>
      <title>Also, please have a look at</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183381#M6761</link>
      <description>&lt;P&gt;Also, please have a look at &lt;A href="https://software.intel.com/en-us/forums/intel-clusters-and-hpc-technology/topic/808360#comment-1940365"&gt;my recent comment in the other thread&lt;/A&gt;, because my other problem turns out to be linked to CPU sets as well, and I have done some detailed testing: some CPU sets work but not all!&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2019 15:25:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183381#M6761</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-06-11T15:25:22Z</dc:date>
    </item>
    <item>
      <title>Hi Lucas,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183382#M6762</link>
      <description>&lt;P&gt;Hi Lucas,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could you check 2019 Update 5? The issue below should be fixed there.&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;The "-n 2" run failed in the second xterm. Backtrace:&lt;/P&gt;&lt;P&gt;(gdb) bt&lt;/P&gt;&lt;P&gt;#0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:51&lt;/P&gt;&lt;P&gt;#1 0x00007ffff704b801 in __GI_abort () at abort.c:79&lt;/P&gt;&lt;P&gt;#2 0x00007ffff7094897 in __libc_message (action=action@entry=do_abort, fmt=fmt@entry=0x7ffff71c1b9a "%s\n") at ../sysdeps/posix/libc_fatal.c:181&lt;/P&gt;&lt;P&gt;#3 0x00007ffff709b90a in malloc_printerr ( str=str@entry=0x7ffff71bfe0e "malloc(): memory corruption") at malloc.c:5350&lt;/P&gt;&lt;P&gt;#4 0x00007ffff709f994 in _int_malloc (av=av@entry=0x7ffff73f6c40 &amp;lt;main_arena&amp;gt;, bytes=bytes@entry=8) at malloc.c:3738&lt;/P&gt;&lt;P&gt;#5 0x00007ffff70a20fc in __GI___libc_malloc (bytes=8) at malloc.c:3057&lt;/P&gt;&lt;P&gt;#6 0x0000000000435b47 in ipl_domain_ordering (info=0x2, ord=0x7fffffffa930, lord=0) at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/ipl_service.c:1505&lt;/P&gt;&lt;P&gt;#7 0x000000000043a02d in ipl_create_domains (pi=0x2, scale=-22224) at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/ipl_service.c:2233&lt;/P&gt;&lt;P&gt;#8 0x00000000004345e7 in ipl_one_to_many_pinning (info=0x2) at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/ipl_service.c:2281&lt;/P&gt;&lt;P&gt;#9 0x0000000000444fa4 in i_mpi_bind_init ( binding=0x2 &amp;lt;error: Cannot access memory at address 0x2&amp;gt;, bindlib=0x7fffffffa930 "", map=0x0, nrank=2) at ../../../../../src/pm/i_hydra/../../intel/ipl/include/../src/i_mpi_bind.c:372&lt;/P&gt;&lt;P&gt;#10 0x000000000040a563 in launch_processes () at ../../../../../src/pm/i_hydra/proxy/proxy.c:387&lt;/P&gt;&lt;P&gt;#11 0x0000000000408785 in main (argc=2, argv=0x7fffffffa930) at ../../../../../src/pm/i_hydra/proxy/proxy.c:895&lt;/P&gt;&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Mon, 16 Sep 2019 07:38:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183382#M6762</guid>
      <dc:creator>Anatoliy_R_Intel</dc:creator>
      <dc:date>2019-09-16T07:38:00Z</dc:date>
    </item>
    <item>
      <title>Hello Anatoliy,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183383#M6763</link>
      <description>&lt;P&gt;Hello Anatoliy,&lt;/P&gt;&lt;P&gt;Thank you very much for your follow-up. I am keeping this topic bookmarked and will test as soon as possible, but it might have to wait one or two months (the server is currently loaded for production, so this is not the right time to run tests :-) ).&lt;/P&gt;</description>
      <pubDate>Mon, 16 Sep 2019 09:20:49 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/bug-mpiexec-segmentation-fault/m-p/1183383#M6763</guid>
      <dc:creator>Levrel__Lucas</dc:creator>
      <dc:date>2019-09-16T09:20:49Z</dc:date>
    </item>
  </channel>
</rss>

