<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:Asynchronous progress slows down my program in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1372934#M9370</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;As we have worked with you internally and your issue has been resolved, we are closing this thread. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Hemanth.&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Wed, 30 Mar 2022 11:00:11 GMT</pubDate>
    <dc:creator>HemanthCH_Intel</dc:creator>
    <dc:date>2022-03-30T11:00:11Z</dc:date>
    <item>
      <title>Asynchronous progress slows down my program</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1367494#M9284</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am running PETSc code on our institute's supercomputer Pravega. I have compiled my PETSc code after loading the following modules and sourcing mpivars.sh:&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;module load spack&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;. /home-ext/apps/spack/share/spack/setup-env.sh&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;spack load gcc&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;module load compiler/intel/2020.4.304&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;source /opt/ohpc/pub/compiler/intel/2020_4/compilers_and_libraries_2020.4.304/linux/mpi/intel64/bin/mpivars.sh release_mt&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;STRONG&gt;After compilation, I am running the code with the following script where export I_MPI_ASYNC_PROGRESS=0:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;#!/bin/bash&lt;BR /&gt;#SBATCH -N 2&lt;BR /&gt;#SBATCH --ntasks-per-node=48&lt;BR /&gt;#SBATCH --exclusive&lt;BR /&gt;#SBATCH --time=00:03:00&lt;BR /&gt;#SBATCH --job-name=ex2&lt;BR /&gt;#SBATCH --error=ex2.e%J&lt;BR /&gt;#SBATCH --output=ex2.o%J&lt;BR /&gt;##SBATCH --partition=standard&lt;/P&gt;
&lt;P&gt;module load spack&lt;BR /&gt;. /home-ext/apps/spack/share/spack/setup-env.sh&lt;BR /&gt;spack load gcc&lt;/P&gt;
&lt;P&gt;module load compiler/intel/2020.4.304&lt;BR /&gt;source /opt/ohpc/pub/compiler/intel/2020_4/compilers_and_libraries_2020.4.304/linux/mpi/intel64/bin/mpivars.sh release_mt&lt;/P&gt;
&lt;P&gt;# // Below are Intel MPI specific settings //&lt;BR /&gt;#export I_MPI_FABRICS=ofi:ofi &lt;BR /&gt;#export I_MPI_DEBUG=10&lt;BR /&gt;export I_MPI_ASYNC_PROGRESS=0&lt;BR /&gt;#export I_MPI_ASYNC_PROGRESS_THREADS=1&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;cd $SLURM_SUBMIT_DIR&lt;BR /&gt;nprocs=96&lt;/P&gt;
&lt;P&gt;ulimit -aH&lt;BR /&gt;ulimit -c unlimited&lt;BR /&gt;ulimit -s unlimited&lt;/P&gt;
&lt;P&gt;mpiexec.hydra -n $nprocs ./ex2 -m 80 -n 80 -ksp_monitor_short -ksp_type pipecg2 -pc_type jacobi&lt;/P&gt;
&lt;P&gt;*************end of script**************&lt;/P&gt;
&lt;P&gt;Time taken by the above job to run is&amp;nbsp;0.280063 sec.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;I am also running the code with the following script where export I_MPI_ASYNC_PROGRESS=1&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;#!/bin/bash&lt;BR /&gt;#SBATCH -N 2&lt;BR /&gt;#SBATCH --ntasks-per-node=48&lt;BR /&gt;#SBATCH --exclusive&lt;BR /&gt;#SBATCH --time=00:03:00&lt;BR /&gt;#SBATCH --job-name=ex2&lt;BR /&gt;#SBATCH --error=ex2.e%J&lt;BR /&gt;#SBATCH --output=ex2.o%J&lt;BR /&gt;##SBATCH --partition=standard&lt;/P&gt;
&lt;P&gt;module load spack&lt;BR /&gt;. /home-ext/apps/spack/share/spack/setup-env.sh&lt;BR /&gt;spack load gcc&lt;/P&gt;
&lt;P&gt;module load compiler/intel/2020.4.304&lt;BR /&gt;source /opt/ohpc/pub/compiler/intel/2020_4/compilers_and_libraries_2020.4.304/linux/mpi/intel64/bin/mpivars.sh release_mt&lt;/P&gt;
&lt;P&gt;# // Below are Intel MPI specific settings //&lt;BR /&gt;#export I_MPI_FABRICS=ofi:ofi &lt;BR /&gt;#export I_MPI_DEBUG=10&lt;BR /&gt;export I_MPI_ASYNC_PROGRESS=1&lt;BR /&gt;#export I_MPI_ASYNC_PROGRESS_THREADS=1&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;cd $SLURM_SUBMIT_DIR&lt;BR /&gt;nprocs=96&lt;/P&gt;
&lt;P&gt;ulimit -aH&lt;BR /&gt;ulimit -c unlimited&lt;BR /&gt;ulimit -s unlimited&lt;/P&gt;
&lt;P&gt;mpiexec.hydra -n $nprocs ./ex2 -m 80 -n 80 -ksp_monitor_short -ksp_type pipecg2 -pc_type jacobi&lt;/P&gt;
&lt;P&gt;*************end of script**************&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Time taken by the above job to run is 22.659463 sec.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I observe that my code takes much more time to run when I use I_MPI_ASYNC_PROGRESS=1 than when I use I_MPI_ASYNC_PROGRESS=0. Why is this happening? I read this blog (&lt;A href="https://techdecoded.intel.io/resources/hiding-communication-latency-using-mpi-3-non-blocking-collectives/?elq_cid=9262301_ts1646820353975&amp;amp;erpm_id=11789269_ts1646820353975#gs.seb071)" target="_blank"&gt;https://techdecoded.intel.io/resources/hiding-communication-latency-using-mpi-3-non-blocking-collectives/?elq_cid=9262301_ts1646820353975&amp;amp;erpm_id=11789269_ts1646820353975#gs.seb071)&lt;/A&gt;&amp;nbsp;but could not quite understand what needs to be done in order to bring performance improvement. Please help.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Manasi&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 10 Mar 2022 10:03:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1367494#M9284</guid>
      <dc:creator>manasi-t24</dc:creator>
      <dc:date>2022-03-10T10:03:17Z</dc:date>
    </item>
    <item>
      <title>Re: Asynchronous progress slows down my program</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1368536#M9304</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;" I have compiled my PETSc code after loading the following modules and sourcing mpivars.sh:"&lt;/EM&gt;&lt;BR /&gt;Could you please provide the steps that you have followed to compile the PETsc code?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;"mpiexec.hydra -n $nprocs ./ex2 -m 80 -n 80 -ksp_monitor_short -ksp_type pipecg2 -pc_type jacobi"&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;Could you please provide us the path to the "ex2" binary from the PETsc directory?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could you please provide the OS details?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Hemanth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 14 Mar 2022 12:13:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1368536#M9304</guid>
      <dc:creator>HemanthCH_Intel</dc:creator>
      <dc:date>2022-03-14T12:13:07Z</dc:date>
    </item>
    <item>
      <title>Re: Asynchronous progress slows down my program</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1368862#M9314</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I downloaded the PETSc repository from gitlab using:&lt;/P&gt;
&lt;P&gt;git clone -b release &lt;A href="https://gitlab.com/petsc/petsc.git" target="_blank"&gt;https://gitlab.com/petsc/petsc.git&lt;/A&gt; petsc-cpu-intel&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;After loading the following modules:&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;module load spack&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;. /home-ext/apps/spack/share/spack/setup-env.sh&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;spack load gcc&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;module load compiler/intel/2020.4.304&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="1"&gt;&lt;SPAN&gt;source /opt/ohpc/pub/compiler/intel/2020_4/compilers_and_libraries_2020.4.304/linux/mpi/intel64/bin/mpivars.sh release_mt&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;I configured the PETSc code with:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;./configure --with-cc=mpiicc --with-cxx=mpiicpc --with-fc=mpiifort --download-make --download-fblaslapack=1&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And then I compiled it with:&lt;/P&gt;
&lt;P&gt;make PETSC_DIR=/scratch/cdsmanas/petsc-cpu-intel PETSC_ARCH=arch-linux-c-debug all&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The directory for ex2 is:&lt;/P&gt;
&lt;P&gt;&amp;lt;petsc directory&amp;gt;/src/ksp/ksp/tutorials/ex2&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;OS and its version:&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;[cdsmanas@login10 tutorials]$ cat /etc/os-release&lt;BR /&gt;NAME="CentOS Linux"&lt;BR /&gt;VERSION="7 (Core)"&lt;BR /&gt;ID="centos"&lt;BR /&gt;ID_LIKE="rhel fedora"&lt;BR /&gt;VERSION_ID="7"&lt;BR /&gt;PRETTY_NAME="CentOS Linux 7 (Core)"&lt;BR /&gt;ANSI_COLOR="0;31"&lt;BR /&gt;CPE_NAME="cpe:/o:centos:centos:7"&lt;BR /&gt;HOME_URL="&lt;A class="sub_section_element_selectors" href="https://www.centos.org/" target="_blank" rel="nofollow noopener noreferrer"&gt;https://www.centos.org/&lt;/A&gt;"&lt;BR /&gt;BUG_REPORT_URL="&lt;A class="sub_section_element_selectors" href="https://bugs.centos.org/" target="_blank" rel="nofollow noopener noreferrer"&gt;https://bugs.centos.org/&lt;/A&gt;"&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;CENTOS_MANTISBT_PROJECT="CentOS-7"&lt;BR /&gt;CENTOS_MANTISBT_PROJECT_VERSION="7"&lt;BR /&gt;REDHAT_SUPPORT_PRODUCT="centos"&lt;BR /&gt;REDHAT_SUPPORT_PRODUCT_VERSION="7"&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;Regards,&lt;/P&gt;
&lt;P class="sub_section_element_selectors"&gt;Manasi&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 15 Mar 2022 07:02:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1368862#M9314</guid>
      <dc:creator>manasi-t24</dc:creator>
      <dc:date>2022-03-15T07:02:18Z</dc:date>
    </item>
    <item>
      <title>Re:Asynchronous progress slows down my program</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1372934#M9370</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;As we have worked with you internally and your issue has been resolved, we are closing this thread. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Hemanth.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 30 Mar 2022 11:00:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Asynchronous-progress-slows-down-my-program/m-p/1372934#M9370</guid>
      <dc:creator>HemanthCH_Intel</dc:creator>
      <dc:date>2022-03-30T11:00:11Z</dc:date>
    </item>
  </channel>
</rss>

