<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic integration problem between Torque 4 and Intel(R) MPI Library for Linux* OS, Version 2019 Update 1 in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/integration-problem-between-Torque-4-and-Intel-R-MPI-Library-for/m-p/1136510#M5750</link>
    <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;I have successfully compiled and linked a program with IntelMPI and if I run it interactively or in background it runs very fast and without any problems on our new server (ProLiant DL580 Gen10, 1 node with 4 processors with 18 cores each, total 72 cores, hyperthreading disabled). If I try to submit it by Torque (version 4) strange things happen, for example:&lt;/P&gt;&lt;P&gt;1) if I submit 2 jobs asking each 8 cores they are both fine&lt;/P&gt;&lt;P&gt;2) if I submit a third job (8 cores) it is 4 times slower becasue the 8 process runs on two cores!&lt;/P&gt;&lt;P&gt;3) if I submit a fourth job it runs properly, but if I qdel all the four jobs, all of them disappear from qstat -a but the fourth is keeping running!&lt;/P&gt;&lt;P&gt;From previous discussion I notice in this forum, I have the feeling it is an integration problem between intelmpi and torque, so I did the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp;export I_MPI_PIN=off&lt;BR /&gt;&amp;nbsp;export I_MPI_PIN_DOMAIN=socket&lt;/P&gt;&lt;P&gt;to run the program I did the following call of mpirun:&lt;/P&gt;&lt;P&gt;/opt/intel/compilers_and_libraries_2019.1.144/linux/mpi/intel64/bin/mpirun -d -rmk pbs -bootstrap pbsdsh .................&lt;/P&gt;&lt;P&gt;I have checked and PBS_ENVIRONMENT is properly set to PBS_BATCH&lt;/P&gt;&lt;P&gt;Also torque configuration is apparently correct, the file&lt;/P&gt;&lt;P&gt;/var/lib/torque/server_priv/nodes contains the following line:&lt;/P&gt;&lt;P&gt;dscfbeta1.units.it np=72 num_node_boards=1&lt;/P&gt;&lt;P&gt;This is a severe problem for me, since the machine is shared so we do need a scheduler like torque (pbs) to run jobs compiled and linked to intelmpi. Any help suggestion is welcome!&lt;/P&gt;&lt;P&gt;thank you in advance&lt;/P&gt;&lt;P&gt;Mauro&lt;/P&gt;</description>
    <pubDate>Sat, 19 Jan 2019 17:03:14 GMT</pubDate>
    <dc:creator>stener__mauro</dc:creator>
    <dc:date>2019-01-19T17:03:14Z</dc:date>
    <item>
      <title>integration problem between Torque 4 and Intel(R) MPI Library for Linux* OS, Version 2019 Update 1</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/integration-problem-between-Torque-4-and-Intel-R-MPI-Library-for/m-p/1136510#M5750</link>
      <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;I have successfully compiled and linked a program with IntelMPI and if I run it interactively or in background it runs very fast and without any problems on our new server (ProLiant DL580 Gen10, 1 node with 4 processors with 18 cores each, total 72 cores, hyperthreading disabled). If I try to submit it by Torque (version 4) strange things happen, for example:&lt;/P&gt;&lt;P&gt;1) if I submit 2 jobs asking each 8 cores they are both fine&lt;/P&gt;&lt;P&gt;2) if I submit a third job (8 cores) it is 4 times slower becasue the 8 process runs on two cores!&lt;/P&gt;&lt;P&gt;3) if I submit a fourth job it runs properly, but if I qdel all the four jobs, all of them disappear from qstat -a but the fourth is keeping running!&lt;/P&gt;&lt;P&gt;From previous discussion I notice in this forum, I have the feeling it is an integration problem between intelmpi and torque, so I did the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp;export I_MPI_PIN=off&lt;BR /&gt;&amp;nbsp;export I_MPI_PIN_DOMAIN=socket&lt;/P&gt;&lt;P&gt;to run the program I did the following call of mpirun:&lt;/P&gt;&lt;P&gt;/opt/intel/compilers_and_libraries_2019.1.144/linux/mpi/intel64/bin/mpirun -d -rmk pbs -bootstrap pbsdsh .................&lt;/P&gt;&lt;P&gt;I have checked and PBS_ENVIRONMENT is properly set to PBS_BATCH&lt;/P&gt;&lt;P&gt;Also torque configuration is apparently correct, the file&lt;/P&gt;&lt;P&gt;/var/lib/torque/server_priv/nodes contains the following line:&lt;/P&gt;&lt;P&gt;dscfbeta1.units.it np=72 num_node_boards=1&lt;/P&gt;&lt;P&gt;This is a severe problem for me, since the machine is shared so we do need a scheduler like torque (pbs) to run jobs compiled and linked to intelmpi. Any help suggestion is welcome!&lt;/P&gt;&lt;P&gt;thank you in advance&lt;/P&gt;&lt;P&gt;Mauro&lt;/P&gt;</description>
      <pubDate>Sat, 19 Jan 2019 17:03:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/integration-problem-between-Torque-4-and-Intel-R-MPI-Library-for/m-p/1136510#M5750</guid>
      <dc:creator>stener__mauro</dc:creator>
      <dc:date>2019-01-19T17:03:14Z</dc:date>
    </item>
  </channel>
</rss>

