<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Oh never mind, of course &amp;quot; in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/How-to-map-consecutive-ranks-to-same-node/m-p/1135621#M5742</link>
    <description>Oh never mind, of course "-hostfile" and "-machinefile" are completely separate options. Yikes.</description>
    <pubDate>Thu, 27 Sep 2018 20:40:50 GMT</pubDate>
    <dc:creator>4f0drlp7eyj3</dc:creator>
    <dc:date>2018-09-27T20:40:50Z</dc:date>
    <item>
      <title>How to map consecutive ranks to same node</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/How-to-map-consecutive-ranks-to-same-node/m-p/1135620#M5741</link>
      <description>Hi,

Intel Parallel Studio Cluster Edition, 2017 Update 5, on CentOS 7.3

I am trying to run a hybrid parallel NWChem job with 2 ranks per 24-core node, 12 threads per rank. The underlying ARMCI library seems to expect consecutive ranks to reside on the same node, i.e., ranks 0 and 1 on node 1, ranks 2 and 3 on node 2, etc. With the simple "mpirun ... -perhost 2", I see round-robin assignment, instead of the documented group-round-robin assignment (4 nodes):

[cchang@login1 03:38:06 /scratch/cchang/C6H6_CCSD_NWC]$ mpirun -h | grep perhost
    -perhost &lt;N&gt;                     place consecutive &lt;N&gt; processes on each host

[0] MPI startup(): Rank    Pid      Node name  Pin cpu
[0] MPI startup(): 0       16252    n1757      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 1       3900     n1756      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 2       28323    n1738      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 3       13358    n1733      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 4       16253    n1757      {6,7,8,9,10,11,18,19,20,21,22,23}
[0] MPI startup(): 5       3901     n1756      {6,7,8,9,10,11,18,19,20,21,22,23}
[0] MPI startup(): 6       28324    n1738      {6,7,8,9,10,11,18,19,20,21,22,23}
[0] MPI startup(): 7       13359    n1733      {6,7,8,9,10,11,18,19,20,21,22,23}

If I try I_MPI_PIN_DOMAIN, or using a hexmap in the nodefile, all ranks end up on the same node:

[cchang@login1 03:31:24 /scratch/cchang/C6H6_CCSD_NWC]$ cat nodefile
n2123:2 binding=map=[03F03F,FC0FC0]
n1942:2 binding=map=[03F03F,FC0FC0]
n1915:2 binding=map=[03F03F,FC0FC0]
n1876:2 binding=map=[03F03F,FC0FC0]
[cchang@login1 03:31:27 /scratch/cchang/C6H6_CCSD_NWC]$ head -20 proc8.log
[0] MPI startup(): Multi-threaded optimized library
[0] MPI startup(): shm data transfer mode
[1] MPI startup(): shm data transfer mode
[2] MPI startup(): shm data transfer mode
[3] MPI startup(): shm data transfer mode
[4] MPI startup(): shm data transfer mode
[5] MPI startup(): shm data transfer mode
[6] MPI startup(): shm data transfer mode
[7] MPI startup(): shm data transfer mode
[0] MPI startup(): Rank    Pid      Node name  Pin cpu
[0] MPI startup(): 0       8510     n2123      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 1       8511     n2123      {6,7,8,9,10,11,18,19,20,21,22,23}
[0] MPI startup(): 2       8512     n2123      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 3       8513     n2123      {6,7,8,9,10,11,18,19,20,21,22,23}
[0] MPI startup(): 4       8514     n2123      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 5       8515     n2123      {6,7,8,9,10,11,18,19,20,21,22,23}
[0] MPI startup(): 6       8516     n2123      {0,1,2,3,4,5,12,13,14,15,16,17}
[0] MPI startup(): 7       8517     n2123      {6,7,8,9,10,11,18,19,20,21,22,23}
...

What is Intel's preferred mechanism to achieve paired consecutive ranks, with each multi-threaded rank bound to a socket?

Thanks; Chris&lt;/N&gt;&lt;/N&gt;</description>
      <pubDate>Wed, 26 Sep 2018 21:39:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/How-to-map-consecutive-ranks-to-same-node/m-p/1135620#M5741</guid>
      <dc:creator>4f0drlp7eyj3</dc:creator>
      <dc:date>2018-09-26T21:39:39Z</dc:date>
    </item>
    <item>
      <title>Oh never mind, of course "</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/How-to-map-consecutive-ranks-to-same-node/m-p/1135621#M5742</link>
      <description>Oh never mind, of course "-hostfile" and "-machinefile" are completely separate options. Yikes.</description>
      <pubDate>Thu, 27 Sep 2018 20:40:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/How-to-map-consecutive-ranks-to-same-node/m-p/1135621#M5742</guid>
      <dc:creator>4f0drlp7eyj3</dc:creator>
      <dc:date>2018-09-27T20:40:50Z</dc:date>
    </item>
  </channel>
</rss>

