<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: MPI MPMD jobs error in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1432057#M10075</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;":/" is a typing my mistake during typing. sorry..;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And, the error log file of the test for the proposed '-hosts' option is attached.&lt;BR /&gt;I think the '-hosts' option is disabled in the our LSF scheduler..&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;run script is:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;mpirun -hosts host0001,host0002 -n 2 -env OMP_NUM_THREADS=1 ./mpi-hello :\
       -hosts host0003,host0004 -n 2 -env OMP_NUM_THREADS=2 ./hybrid&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;</description>
    <pubDate>Tue, 22 Nov 2022 07:31:06 GMT</pubDate>
    <dc:creator>syhong</dc:creator>
    <dc:date>2022-11-22T07:31:06Z</dc:date>
    <item>
      <title>MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1429340#M10023</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have a MPMD job :&lt;/P&gt;
&lt;P&gt;mpiexec -f machinefile1 -n 2304 -ppn 48 ./apps1 &amp;gt; apps1.log 2&amp;gt;&amp;amp;1:\&lt;LI-EMOJI id="lia_confused-face" title=":confused_face:"&gt;&lt;/LI-EMOJI&gt;&lt;LI-EMOJI id="lia_confused-face" title=":confused_face:"&gt;&lt;/LI-EMOJI&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; -f machinefile2 -n 900 -ppn -env OMP_NUM_THREADS=4 ./apps2 &amp;gt; apps2.log 2&amp;gt;&amp;amp;1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;and module load :&lt;/P&gt;
&lt;P&gt;intel/18.4, impi/2021.2.0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;+ machinefile1 :&lt;/P&gt;
&lt;P&gt;host1001:48&lt;/P&gt;
&lt;P&gt;...&lt;/P&gt;
&lt;P&gt;host1048:48&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;+ machinefile2 :&lt;/P&gt;
&lt;P&gt;host1049:76&lt;/P&gt;
&lt;P&gt;...&lt;/P&gt;
&lt;P&gt;host1095:76&lt;/P&gt;
&lt;P&gt;host1096:28&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When each task(apps1 and apps2) is performed independently, it is normally performed.&lt;/P&gt;
&lt;P&gt;However, if I do the above, the following error occurs :&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;
&lt;P&gt;[mpiexec@...] HYD_arg_set_int (../../../../src/pm/i_hydra/libhydra/arg/hydra_arg.c:163): duplicate setting: ppn&lt;/P&gt;
&lt;P&gt;[mpiexec@...] ppn_fn (../../../../src/pm/i_hydra/mpiexec/mpiexec_params.c:580): error setting ppn&lt;/P&gt;
&lt;P&gt;...&lt;/P&gt;
&lt;P&gt;[mpiexec@...] main (../../../../src/pm/i_hydra/mpiexec/mpiexec.c:1749): error parsing patameters&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So, if I do the job without -ppn option, the following error occurs :&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;
&lt;P&gt;...&lt;/P&gt;
&lt;P&gt;Abort(821) on node 3200 (rank 3200 in comm 0): application called MPI_Abort(MPI_COMM_WORLD, 821) - process 3200&lt;/P&gt;
&lt;P&gt;...&lt;/P&gt;
&lt;P&gt;lsb_launch(): Failed while waiting for tasks to finish.&lt;/P&gt;
&lt;P&gt;[mpiexec@...] wait_proxies_to_terminate (../../../../src/rm/i_hydra/mpiexec/intel/i_mpiexec.c:538): downstream from host host3386 exited with status 255&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. apps1(=mpi) and apps2(=hybrid) should be submitted as jobs and, cores and threads are normally placed in the compute node only when the '-f machinefile -ppn' option are included.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;Please let me know so that the MPMD operation can be performed using this.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. If the intel mpi(=impi) module version is the problem, please let me know from what version of the intel mpi library supports MPMD, and if there are any versions that do not support it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You help will be appreciated.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Nov 2022 05:58:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1429340#M10023</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-11T05:58:57Z</dc:date>
    </item>
    <item>
      <title>Re:MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1429899#M10032</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for posting in the Intel forums.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Could you please provide us the following details to help us in further investigation of your issue?&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Operating System &amp;amp; its version&lt;/LI&gt;&lt;LI&gt;Sample reproducer code for app1(MPI code) &amp;amp; app2(Hybrid code) to reproduce the issue from our end.&lt;/LI&gt;&lt;LI&gt;What is the job scheduler used for launching MPI jobs on your cluster?&lt;/LI&gt;&lt;LI&gt;Have you tried with both Intel MPI 2018  update 4 &amp;amp; Intel MPI 2021.2? &lt;/LI&gt;&lt;LI&gt;What is the FI_PROVIDER you are using?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 14 Nov 2022 11:11:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1429899#M10032</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-11-14T11:11:00Z</dc:date>
    </item>
    <item>
      <title>Re: Re:MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1430470#M10045</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. CentOS Linux 8, rhel fedora&lt;/P&gt;
&lt;P&gt;2. sample reproducer code cannot be provided due to internal security policy..&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;(this HPC system has 76cores in a node.)&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;apps1 is a scalability model, and there is no limit on the number of cores, but it is set to 48core*48node for memory.&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;apps2 is a model that improves scalability by using threads. so, at least 2 threads must be used.&lt;/P&gt;
&lt;P&gt;3. We are using LSF scheduler (BSUB job).&lt;/P&gt;
&lt;P&gt;4. We tried intel mpi 2021.2 and 2019.9.304. (but, not working..)&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;we have intel mpi libraries 2018.4, 2019.5, 2019.9, 2021.2, 2021.3, and 2022.1.&lt;/P&gt;
&lt;P&gt;5. FI_PROVIDER is mlx.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;You help will be appreciated.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Nov 2022 01:40:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1430470#M10045</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-16T01:40:16Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431193#M10058</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for providing all the requested details.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could you please run the below commands on your cluster and provide us with the complete debug log?&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;mpiicc mpi-hello.c -o mpi-hello
mpiicc hybrid.c -o hybrid

I_MPI_DEBUG=10 mpirun -f nodefile1 -n 2 -ppn 1 ./mpi-hello :\
-f nodefile2 -n 2 -ppn 1 -env OMP_NUM_THREADS=4 ./hybrid&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please find the attached source code(mpi-helloc &amp;amp; hybrid.c).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Santosh&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 18 Nov 2022 04:17:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431193#M10058</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-11-18T04:17:45Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431650#M10063</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have attached the error log.&lt;/P&gt;
&lt;P&gt;And the submit shell is:&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;
&lt;P&gt;#!/bin/sh&lt;/P&gt;
&lt;P&gt;#BSUB -J test&lt;/P&gt;
&lt;P&gt;#BSUB -q normal&lt;/P&gt;
&lt;P&gt;#BSUB -n 4 --ptile 1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;export I_MPI_DEBUG=10&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;mpirun -n 2 -ppn 1 ./mpi-hello:\&lt;LI-EMOJI id="lia_confused-face" title=":confused_face:"&gt;&lt;/LI-EMOJI&gt;&lt;LI-EMOJI id="lia_confused-face" title=":confused_face:"&gt;&lt;/LI-EMOJI&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; -n 2 -ppn 1 -env OMP_NUM_THREADS=4 ./hybrid&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&lt;BR /&gt;please check.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 04:23:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431650#M10063</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-21T04:23:18Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431684#M10065</link>
      <description>&lt;P&gt;Additionally, the job was submitted with some modifications to the submit shell.&lt;/P&gt;
&lt;P&gt;It actually seemed to run well, but the node was not applied in MPMD mode.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;submit shell is:&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;
&lt;P&gt;#!/bin/sh&lt;/P&gt;
&lt;P&gt;#BSUB -J test&lt;/P&gt;
&lt;P&gt;#BSUB -q normal&lt;/P&gt;
&lt;P&gt;#BSUB -n 4 --ptile 1&lt;/P&gt;
&lt;P&gt;#BSUB -W 12:00&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;export I_MPI_DEBUG=10&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;rm -f&amp;nbsp;machinefile*&lt;BR /&gt;cat $LSB_DJOB_HOSTFILE | sort -u &amp;gt; machinefile.tmp&lt;/P&gt;
&lt;P&gt;####################################################&lt;BR /&gt;for node in $(head -n 2 machinefile.tmp | tail -n 2)&lt;BR /&gt;do&lt;BR /&gt;echo ${node}:1 &amp;gt;&amp;gt; machinefile1 # ptile=1&lt;BR /&gt;done&lt;/P&gt;
&lt;P&gt;for node in $(head -n 4 machinefile.tmp | tail -n 2)&lt;BR /&gt;do&lt;BR /&gt;echo ${node}:1 &amp;gt;&amp;gt; machinefile2 # ptile=1&lt;BR /&gt;done&lt;/P&gt;
&lt;P&gt;####################################################&lt;/P&gt;
&lt;P&gt;mpirun -f machinefile1 -n 2 ./mpi-hello:\&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; -f machinefile2 -n 2 -env OMP_NUM_THREADS=4 ./hybrid&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;machinefile1 is :&lt;/P&gt;
&lt;P&gt;host0001:1&lt;/P&gt;
&lt;P&gt;host0002:1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;machinefile2 is:&lt;/P&gt;
&lt;P&gt;host0003:1&lt;/P&gt;
&lt;P&gt;host0004:1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;'top' command result in node (host0001 &amp;amp; host0002):&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;3758274 root 20 0 276636 5608 3416 R 0.3 0.0 0:00.04 top&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;'top' command result in node (host0003 &amp;amp; host0004):&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;692423 syhong 20 0 2443120 149884 11596 R 100.0 0.1 62:43.33 mpi-hello&lt;BR /&gt;692424 syhong 20 0 2626560 151820 12012 R 100.0 0.1 62:45.40 hybrid&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think that the MPMD mode is not applied.., (with threads)&lt;/P&gt;
&lt;P&gt;and the actual job is run with only the host0003&amp;amp;host0004 nodes in machinefile2.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;&lt;SPAN class="sub_section_element_selectors"&gt;You help will be appreciated.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;&lt;SPAN class="sub_section_element_selectors"&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 06:50:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431684#M10065</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-21T06:50:38Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431770#M10069</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for running the sample &amp;amp; sharing the outcomes with us.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Now, could you please try running your applications(apps1 &amp;amp; apps2) using the "-hosts host1,host2....." option instead of "-f nodefile" option as shown in the below command?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;mpiexec -hosts host1001,...,host1048 -n 2304 -ppn 48 ./apps1 &amp;gt; apps1.log 2&amp;gt;&amp;amp;1:\&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;-hosts host1049,....,host1096 -n 900 -ppn -env OMP_NUM_THREADS=4 ./apps2 &amp;gt; apps2.log 2&amp;gt;&amp;amp;1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please let us know if this also causes the same behavior i.e launching only 1 job(apps1) but not a job with threads(aap2).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Santosh&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2022 13:20:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431770#M10069</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-11-21T13:20:57Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431964#M10072</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To use '-hosts' option, we need to set up same nodes in BSUB as well.&lt;/P&gt;
&lt;P&gt;We plan to use about 120 nodes over in the future, so if other jobs are using a specific node out of 120 nodes, our job can be in the 'pending' state indefinitely..&lt;/P&gt;
&lt;P&gt;So, the above method is not efficient.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please suggest another way.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also, tested using the '-f machinefile' option for a mpi job (not MPMD) was working.&lt;/P&gt;
&lt;P&gt;At this time, it was confirmed that both threads and cores were normally batched.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2022 01:06:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431964#M10072</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-22T01:06:24Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431975#M10073</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Additionally, using the -h hosts option results in the following error.&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;
&lt;P&gt;mpirun -hosts host0001 host0002 -n 2 ./mpi-hello:/&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; -hosts host0003 host0004 -n 2 -env OMP_NUM_THREADS=4 ./hybrid&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;TABLE border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="100%"&gt;
&lt;P&gt;[proxy:0:0@host0003] HYD_spawn (... /src/pm/i_hydra/libhydra/spawn/intel/hydra_spawn.c:145) execvp error on file host0002 (No such file or directory)&lt;/P&gt;
&lt;P&gt;[proxy:0:0@host0003] HYD_spawn (... /src/pm/i_hydra/libhydra/spawn/intel/hydra_spawn.c:145) execvp error on file host0004 (No such file or directory)&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;&lt;SPAN class="sub_section_element_selectors"&gt;You help will be appreciated.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;&lt;SPAN class="sub_section_element_selectors"&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2022 01:39:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1431975#M10073</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-22T01:39:11Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1432005#M10074</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#808080"&gt;&lt;EM&gt;&amp;gt;&amp;gt;&amp;gt;"[proxy:0:0@host0003] HYD_spawn (... /src/pm/i_hydra/libhydra/spawn/intel/hydra_spawn.c:145) execvp error on file host0002 (No such file or directory)"&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;We need to keep a comma(,) between the hostnames as shown in the below command:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;mpirun -hosts host0001,host0002 -n 2 ./mpi-hello :\
       -hosts host0003,host0004 -n 2 -env OMP_NUM_THREADS=4 ./hybrid&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#808080"&gt;&lt;EM&gt;&amp;gt;&amp;gt;&amp;gt;"mpirun -hosts host0001 host0002 -n 2 ./mpi-hello:/"&lt;/EM&gt;&lt;/FONT&gt;&lt;BR /&gt;In your previous post, I can see you are using ":/". Please make sure to keep &lt;STRONG&gt;":\"&lt;/STRONG&gt;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Santosh&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2022 06:16:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1432005#M10074</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-11-22T06:16:35Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1432057#M10075</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;":/" is a typing my mistake during typing. sorry..;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And, the error log file of the test for the proposed '-hosts' option is attached.&lt;BR /&gt;I think the '-hosts' option is disabled in the our LSF scheduler..&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;run script is:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;mpirun -hosts host0001,host0002 -n 2 -env OMP_NUM_THREADS=1 ./mpi-hello :\
       -hosts host0003,host0004 -n 2 -env OMP_NUM_THREADS=2 ./hybrid&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Tue, 22 Nov 2022 07:31:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1432057#M10075</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-11-22T07:31:06Z</dc:date>
    </item>
    <item>
      <title>Re:MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1434707#M10096</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are working on your issue and will get back to you soon.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 01 Dec 2022 07:08:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1434707#M10096</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-12-01T07:08:09Z</dc:date>
    </item>
    <item>
      <title>Re:MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1435744#M10103</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for reporting this issue. We were able to reproduce it and we have informed the development team about it. &lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Santosh&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 05 Dec 2022 11:03:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1435744#M10103</guid>
      <dc:creator>SantoshY_Intel</dc:creator>
      <dc:date>2022-12-05T11:03:23Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1436379#M10110</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you for your kind answer. we can wait! &lt;LI-EMOJI id="lia_slightly-smiling-face" title=":slightly_smiling_face:"&gt;&lt;/LI-EMOJI&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;S.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Dec 2022 06:45:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1436379#M10110</guid>
      <dc:creator>syhong</dc:creator>
      <dc:date>2022-12-07T06:45:55Z</dc:date>
    </item>
    <item>
      <title>Re: MPI MPMD jobs error</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1588373#M11651</link>
      <description>&lt;P&gt;Hello syhong,&lt;BR /&gt;&lt;BR /&gt;The reproducer in this thread with hybrid.c and mpi-hello.c are unfortunately not expected to work in MPMD mode. All participating binaries must have a common MPI_COMM_WORLD. Is this the design in your application? Are they expected to interact with each other?&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Best regards,&lt;BR /&gt;Amar&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Apr 2024 14:05:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/MPI-MPMD-jobs-error/m-p/1588373#M11651</guid>
      <dc:creator>DrAmarpal_K_Intel</dc:creator>
      <dc:date>2024-04-11T14:05:23Z</dc:date>
    </item>
  </channel>
</rss>

