<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Performance issues with Omni Path in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132679#M5683</link>
    <description>&lt;P&gt;Hi all,&lt;/P&gt;

&lt;P&gt;I installed two Omni Path Fabric cards on two Xeon Servers.&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Following the instructions present in this web site: &lt;A href="https://software.intel.com/en-us/articles/using-intel-omni-path-architecture&amp;nbsp;" target="_blank"&gt;https://software.intel.com/en-us/articles/using-intel-omni-path-architecture&amp;nbsp;&lt;/A&gt;;&lt;/P&gt;

&lt;P&gt;The performance tests in this link shows that the network achieved 100 Gb/s - (4194304 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 10 &amp;nbsp; &amp;nbsp; &amp;nbsp; 360.39 &amp;nbsp; &amp;nbsp; &amp;nbsp; 360.39 &amp;nbsp; &amp;nbsp; &amp;nbsp; 360.39 &amp;nbsp; &amp;nbsp; 23276.25)&lt;/P&gt;

&lt;P&gt;I the network i deployed i achieved half of this performance (&lt;SPAN style="font-size: 13.008px;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;4194304 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 10 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; 12683.17&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em;"&gt;): &amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Is there some configuration needed to achieve 100 Gb/s using Omni Path?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Here is the complete output of benchmark execute:&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;mpirun -PSM2 -host 10.0.0.3 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv : -host 10.0.0.1 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv&lt;/P&gt;

&lt;P&gt;[silvio@phi03 ~]$ mpirun -PSM2 -host 10.0.0.3 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv : -host 10.0.0.1 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv&lt;BR /&gt;
	#------------------------------------------------------------&lt;BR /&gt;
	# &amp;nbsp; &amp;nbsp;Intel (R) MPI Benchmarks 2018 Update 1, MPI-1 part &amp;nbsp; &amp;nbsp;&lt;BR /&gt;
	#------------------------------------------------------------&lt;BR /&gt;
	# Date &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: Fri Feb &amp;nbsp;2 11:14:01 2018&lt;BR /&gt;
	# Machine &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : x86_64&lt;BR /&gt;
	# System &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: Linux&lt;BR /&gt;
	# Release &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 3.10.0-693.17.1.el7.x86_64&lt;BR /&gt;
	# Version &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : #1 SMP Thu Jan 25 20:13:58 UTC 2018&lt;BR /&gt;
	# MPI Version &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 3.1&lt;BR /&gt;
	# MPI Thread Environment:&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	# Calling sequence was:&amp;nbsp;&lt;/P&gt;

&lt;P&gt;# /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv&lt;/P&gt;

&lt;P&gt;# Minimum message length in bytes: &amp;nbsp; 0&lt;BR /&gt;
	# Maximum message length in bytes: &amp;nbsp; 4194304&lt;BR /&gt;
	#&lt;BR /&gt;
	# MPI_Datatype &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : &amp;nbsp; MPI_BYTE&amp;nbsp;&lt;BR /&gt;
	# MPI_Datatype for reductions &amp;nbsp; &amp;nbsp;: &amp;nbsp; MPI_FLOAT&lt;BR /&gt;
	# MPI_Op &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : &amp;nbsp; MPI_SUM &amp;nbsp;&lt;BR /&gt;
	#&lt;BR /&gt;
	#&lt;/P&gt;

&lt;P&gt;# List of Benchmarks to run:&lt;/P&gt;

&lt;P&gt;# Sendrecv&lt;/P&gt;

&lt;P&gt;#-----------------------------------------------------------------------------&lt;BR /&gt;
	# Benchmarking Sendrecv&amp;nbsp;&lt;BR /&gt;
	# #processes = 2&amp;nbsp;&lt;BR /&gt;
	#-----------------------------------------------------------------------------&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;#bytes #repetitions &amp;nbsp;t_min[usec] &amp;nbsp;t_max[usec] &amp;nbsp;t_avg[usec] &amp;nbsp; Mbytes/sec&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.92 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.92 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.92 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0.00&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.85 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.85 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.85 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.08&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.17&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.35&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 8 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.76 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.76 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.76 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 9.10&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;16 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;15.44&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.06 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;30.98&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;64 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.02 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.02 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.02 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;63.46&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 128 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; 123.26&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 256 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.11 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.11 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.11 &amp;nbsp; &amp;nbsp; &amp;nbsp; 242.41&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 512 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; 454.30&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1024 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3.56 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3.56 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3.56 &amp;nbsp; &amp;nbsp; &amp;nbsp; 575.46&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2048 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.19 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.19 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.19 &amp;nbsp; &amp;nbsp; &amp;nbsp; 976.91&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4096 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5.16 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5.16 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5.16 &amp;nbsp; &amp;nbsp; &amp;nbsp;1586.69&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;8192 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 7.15 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 7.15 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 7.15 &amp;nbsp; &amp;nbsp; &amp;nbsp;2290.80&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 16384 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;14.32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;14.32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;14.32 &amp;nbsp; &amp;nbsp; &amp;nbsp;2288.44&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 32768 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;20.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;20.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;20.77 &amp;nbsp; &amp;nbsp; &amp;nbsp;3154.69&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 65536 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;640 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;26.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;26.09 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;26.09 &amp;nbsp; &amp;nbsp; &amp;nbsp;5024.04&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;131072 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;320 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;34.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;34.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;34.77 &amp;nbsp; &amp;nbsp; &amp;nbsp;7538.32&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;262144 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;160 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;53.03 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;53.03 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;53.03 &amp;nbsp; &amp;nbsp; &amp;nbsp;9886.58&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;524288 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 80 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;93.55 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;93.55 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;93.55 &amp;nbsp; &amp;nbsp; 11208.78&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; 1048576 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 172.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; 172.28 &amp;nbsp; &amp;nbsp; &amp;nbsp; 172.26 &amp;nbsp; &amp;nbsp; 12173.26&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; 2097152 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 20 &amp;nbsp; &amp;nbsp; &amp;nbsp; 355.15 &amp;nbsp; &amp;nbsp; &amp;nbsp; 355.21 &amp;nbsp; &amp;nbsp; &amp;nbsp; 355.18 &amp;nbsp; &amp;nbsp; 11808.02&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; 4194304 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 10 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; 12683.17&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	# All processes entering MPI_Finalize&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Thanks in advance!&lt;/P&gt;

&lt;P&gt;Silvio&lt;/P&gt;</description>
    <pubDate>Fri, 02 Feb 2018 13:19:57 GMT</pubDate>
    <dc:creator>silvio_stanzani</dc:creator>
    <dc:date>2018-02-02T13:19:57Z</dc:date>
    <item>
      <title>Performance issues with Omni Path</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132679#M5683</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;

&lt;P&gt;I installed two Omni Path Fabric cards on two Xeon Servers.&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Following the instructions present in this web site: &lt;A href="https://software.intel.com/en-us/articles/using-intel-omni-path-architecture&amp;nbsp;" target="_blank"&gt;https://software.intel.com/en-us/articles/using-intel-omni-path-architecture&amp;nbsp;&lt;/A&gt;;&lt;/P&gt;

&lt;P&gt;The performance tests in this link shows that the network achieved 100 Gb/s - (4194304 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 10 &amp;nbsp; &amp;nbsp; &amp;nbsp; 360.39 &amp;nbsp; &amp;nbsp; &amp;nbsp; 360.39 &amp;nbsp; &amp;nbsp; &amp;nbsp; 360.39 &amp;nbsp; &amp;nbsp; 23276.25)&lt;/P&gt;

&lt;P&gt;I the network i deployed i achieved half of this performance (&lt;SPAN style="font-size: 13.008px;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;4194304 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 10 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; 12683.17&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em;"&gt;): &amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Is there some configuration needed to achieve 100 Gb/s using Omni Path?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Here is the complete output of benchmark execute:&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;mpirun -PSM2 -host 10.0.0.3 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv : -host 10.0.0.1 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv&lt;/P&gt;

&lt;P&gt;[silvio@phi03 ~]$ mpirun -PSM2 -host 10.0.0.3 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv : -host 10.0.0.1 -n 1 /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv&lt;BR /&gt;
	#------------------------------------------------------------&lt;BR /&gt;
	# &amp;nbsp; &amp;nbsp;Intel (R) MPI Benchmarks 2018 Update 1, MPI-1 part &amp;nbsp; &amp;nbsp;&lt;BR /&gt;
	#------------------------------------------------------------&lt;BR /&gt;
	# Date &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: Fri Feb &amp;nbsp;2 11:14:01 2018&lt;BR /&gt;
	# Machine &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : x86_64&lt;BR /&gt;
	# System &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: Linux&lt;BR /&gt;
	# Release &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 3.10.0-693.17.1.el7.x86_64&lt;BR /&gt;
	# Version &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : #1 SMP Thu Jan 25 20:13:58 UTC 2018&lt;BR /&gt;
	# MPI Version &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 3.1&lt;BR /&gt;
	# MPI Thread Environment:&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	# Calling sequence was:&amp;nbsp;&lt;/P&gt;

&lt;P&gt;# /opt/intel/impi/2018.1.163/bin64/IMB-MPI1 Sendrecv&lt;/P&gt;

&lt;P&gt;# Minimum message length in bytes: &amp;nbsp; 0&lt;BR /&gt;
	# Maximum message length in bytes: &amp;nbsp; 4194304&lt;BR /&gt;
	#&lt;BR /&gt;
	# MPI_Datatype &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : &amp;nbsp; MPI_BYTE&amp;nbsp;&lt;BR /&gt;
	# MPI_Datatype for reductions &amp;nbsp; &amp;nbsp;: &amp;nbsp; MPI_FLOAT&lt;BR /&gt;
	# MPI_Op &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : &amp;nbsp; MPI_SUM &amp;nbsp;&lt;BR /&gt;
	#&lt;BR /&gt;
	#&lt;/P&gt;

&lt;P&gt;# List of Benchmarks to run:&lt;/P&gt;

&lt;P&gt;# Sendrecv&lt;/P&gt;

&lt;P&gt;#-----------------------------------------------------------------------------&lt;BR /&gt;
	# Benchmarking Sendrecv&amp;nbsp;&lt;BR /&gt;
	# #processes = 2&amp;nbsp;&lt;BR /&gt;
	#-----------------------------------------------------------------------------&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;#bytes #repetitions &amp;nbsp;t_min[usec] &amp;nbsp;t_max[usec] &amp;nbsp;t_avg[usec] &amp;nbsp; Mbytes/sec&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.92 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.92 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.92 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0.00&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.85 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.85 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.85 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.08&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.17&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.84 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.35&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 8 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.76 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.76 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1.76 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 9.10&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;16 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;15.44&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.06 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.07 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;30.98&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;64 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.02 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.02 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.02 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;63.46&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 128 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; 123.26&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 256 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.11 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.11 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.11 &amp;nbsp; &amp;nbsp; &amp;nbsp; 242.41&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 512 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; 454.30&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1024 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3.56 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3.56 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3.56 &amp;nbsp; &amp;nbsp; &amp;nbsp; 575.46&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2048 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.19 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.19 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 4.19 &amp;nbsp; &amp;nbsp; &amp;nbsp; 976.91&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4096 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5.16 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5.16 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5.16 &amp;nbsp; &amp;nbsp; &amp;nbsp;1586.69&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;8192 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 7.15 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 7.15 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 7.15 &amp;nbsp; &amp;nbsp; &amp;nbsp;2290.80&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 16384 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;14.32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;14.32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;14.32 &amp;nbsp; &amp;nbsp; &amp;nbsp;2288.44&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 32768 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1000 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;20.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;20.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;20.77 &amp;nbsp; &amp;nbsp; &amp;nbsp;3154.69&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 65536 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;640 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;26.08 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;26.09 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;26.09 &amp;nbsp; &amp;nbsp; &amp;nbsp;5024.04&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;131072 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;320 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;34.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;34.77 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;34.77 &amp;nbsp; &amp;nbsp; &amp;nbsp;7538.32&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;262144 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;160 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;53.03 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;53.03 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;53.03 &amp;nbsp; &amp;nbsp; &amp;nbsp;9886.58&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;524288 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 80 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;93.55 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;93.55 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;93.55 &amp;nbsp; &amp;nbsp; 11208.78&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; 1048576 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 172.25 &amp;nbsp; &amp;nbsp; &amp;nbsp; 172.28 &amp;nbsp; &amp;nbsp; &amp;nbsp; 172.26 &amp;nbsp; &amp;nbsp; 12173.26&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; 2097152 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 20 &amp;nbsp; &amp;nbsp; &amp;nbsp; 355.15 &amp;nbsp; &amp;nbsp; &amp;nbsp; 355.21 &amp;nbsp; &amp;nbsp; &amp;nbsp; 355.18 &amp;nbsp; &amp;nbsp; 11808.02&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; 4194304 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 10 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; &amp;nbsp; 661.40 &amp;nbsp; &amp;nbsp; 12683.17&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	# All processes entering MPI_Finalize&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Thanks in advance!&lt;/P&gt;

&lt;P&gt;Silvio&lt;/P&gt;</description>
      <pubDate>Fri, 02 Feb 2018 13:19:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132679#M5683</guid>
      <dc:creator>silvio_stanzani</dc:creator>
      <dc:date>2018-02-02T13:19:57Z</dc:date>
    </item>
    <item>
      <title>Hello Silvio,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132680#M5684</link>
      <description>&lt;P&gt;Hello Silvio,&lt;/P&gt;

&lt;P&gt;From your results it looks like that you use Xeon Phi nodes but not Xeon.&lt;BR /&gt;
	In general to improve Omni-Path bandwidth numbers on&amp;nbsp;Xeon Phi you need to use more that 1 core or use BKMs described in OPA tuning guide (&lt;SPAN style="font-size: 13.008px;"&gt;&lt;A href="https://www.intel.com/content/dam/support/us/en/documents/network-and-i-o/fabric-products/Intel_OP_Performance_Tuning_UG_H93143_v10_0.pdf" target="_blank"&gt;https://www.intel.com/content/dam/support/us/en/documents/network-and-i-o/fabric-products/Intel_OP_Performance_Tuning_UG_H93143_v10_0.pdf&lt;/A&gt;, for example at section "9.1&amp;nbsp;&lt;/SPAN&gt;Mapping from MPI Processes to SDMA Engines")&lt;/P&gt;

&lt;P&gt;Sendrecv benchmark does isend and irecv on each iteration so it is bidirectional benchmark and bidirectional bandwidth limit for Omni-Path is 25 Gbytes/sec. To be closer to this number I would suggest to use new thread-split model which is available with IMPI 2019 (&lt;SPAN style="font-size: 13.008px;"&gt;&lt;A href="https://software.intel.com/en-us/articles/intel-mpi-library-2019-technical-preview" target="_blank"&gt;https://software.intel.com/en-us/articles/intel-mpi-library-2019-technical-preview&lt;/A&gt;&lt;/SPAN&gt;). To get more information about thread-split mode read section "&lt;SPAN class="fontstyle0"&gt;4. Multiple Endpoints Support&amp;nbsp;&lt;/SPAN&gt;" from Developer Reference (should be placed at &amp;lt;install_path&amp;gt;/&lt;SPAN style="font-size: 13.008px;"&gt;compilers_and_libraries_2018.1.163/linux/mpi_2019/doc/Developer_Reference.pdf&lt;/SPAN&gt;).&lt;/P&gt;

&lt;P&gt;Here is example of usage on IMB-MT (it is suite of multi-threaded benchmarks which can employ thread-split model):&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 13.008px;"&gt;source &lt;/SPAN&gt;&amp;lt;install_path&amp;gt;/compilers_and_libraries_2018.1.163/linux/mpi_2019&lt;SPAN style="font-size: 13.008px;"&gt;/intel64/bin/mpivars.sh release_mt&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 13.008px;"&gt;I_MPI_THREAD_RUNTIME=openmp OMP_NUM_THREADS=4 I_MPI_THREAD_SPLIT=1 mpiexec.hydra -n 2 -ppn 1 -hosts host1,host2 IMB-MT sendrecvmt -count 1000000 -thread_level multiple&lt;/SPAN&gt;&lt;/P&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;#-----------------------------------------------------------------------------&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;# Benchmarking SendRecvMT&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;# #processes = 2 (threads: 4)&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;#-----------------------------------------------------------------------------&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;#bytes #repetitions&amp;nbsp; t_min[usec]&amp;nbsp; t_max[usec]&amp;nbsp; t_avg[usec]&amp;nbsp; &amp;nbsp;Mbytes/sec&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;16000000&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1000&amp;nbsp; &amp;nbsp; &amp;nbsp; 1231.05&amp;nbsp; &amp;nbsp; &amp;nbsp; 1344.16&amp;nbsp; &amp;nbsp; &amp;nbsp; 1298.92&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;STRONG&gt;24635.81&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Tue, 13 Feb 2018 22:58:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132680#M5684</guid>
      <dc:creator>Mikhail_S_Intel</dc:creator>
      <dc:date>2018-02-13T22:58:00Z</dc:date>
    </item>
    <item>
      <title>dear Mikhail Shiryaev,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132681#M5685</link>
      <description>&lt;P&gt;dear&amp;nbsp;&lt;A href="https://software.intel.com/en-us/user/1121654" style="font-size: 11px; background-color: rgb(238, 238, 238);"&gt;Mikhail Shiryaev&lt;/A&gt;,&lt;/P&gt;

&lt;P&gt;Thanks a lot for answering this!&lt;/P&gt;

&lt;P&gt;Yes. I am performing tests using two xeon phi nodes.&lt;/P&gt;

&lt;P&gt;The execution of &lt;SPAN style="font-size: 12px;"&gt;IMB-MT finished with errors on my cluster.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;[silvio@phi05 mpi-benchmarks]$ source &amp;nbsp;/opt/intel/parallel_studio_xe_2018/compilers_and_libraries_2018/linux/mpi_2019/intel64/bin/mpivars.sh release_mt&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;[silvio@phi05 mpi-benchmarks]$ I_MPI_THREAD_RUNTIME=openmp OMP_NUM_THREADS=4 I_MPI_THREAD_SPLIT=1 mpiexec.hydra -n 2 -ppn 1 -hosts 10.0.0.5,10.0.0.6 IMB-MT sendrecvmt -count 1000000 -thread_level multiple&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	IMB-MT: /usr/lib64/libfabric.so.1: version `FABRIC_1.1' not found (required by /opt/intel/compilers_and_libraries_2018.1.163/linux/mpi_2019/intel64/lib/release_mt/libmpi.so.12)&lt;BR /&gt;
	[mpiexec@phi05] HYDU_sock_write (../../utils/sock/sock.c:418): write error (Bad file descriptor)&lt;BR /&gt;
	[mpiexec@phi05] HYD_pmcd_pmiserv_send_signal (../../pm/pmiserv/pmiserv_cb.c:253): unable to write data to proxy&lt;BR /&gt;
	IMB-MT: /usr/lib64/libfabric.so.1: &lt;STRONG&gt;version `FABRIC_1.1' not found&lt;/STRONG&gt; (required by /opt/intel/compilers_and_libraries_2018.1.163/linux/mpi_2019/intel64/lib/release_mt/libmpi.so.12)&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;The operation system i am using is centos 7 which provides libfabric version 4.&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;Do i need to perform downgrade of&amp;nbsp;&lt;SPAN style="font-size: 13.008px;"&gt;libfabric.so?&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;&lt;B&gt;I tried to install the version 1.1 but it misses infiniband/driver.h that i cound not find in any package&lt;/B&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 15 Feb 2018 17:55:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132681#M5685</guid>
      <dc:creator>silvio_stanzani</dc:creator>
      <dc:date>2018-02-15T17:55:05Z</dc:date>
    </item>
    <item>
      <title>Hi Silvio,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132682#M5686</link>
      <description>&lt;P&gt;Hi Silvio,&lt;/P&gt;

&lt;P&gt;You can check your libfabric version using "&lt;SPAN style="font-size: 13.008px;"&gt;fi_info --version&lt;/SPAN&gt;".&lt;BR /&gt;
	If you have old libfabric then you can download and build it manually (on node with OPA software stack, for example on worker node):&lt;/P&gt;

&lt;OL&gt;
	&lt;LI&gt;git clone&amp;nbsp;&lt;SPAN style="font-size: 13.008px;"&gt;&lt;A href="https://github.com/ofiwg/libfabric.git" target="_blank"&gt;https://github.com/ofiwg/libfabric.git&lt;/A&gt;&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;cd ./libfabric&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;./autogen.sh&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;./configure --prefix=&amp;lt;libfabric_install_path&amp;gt; --enable-psm2&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;make clean &amp;amp;&amp;amp; make all &amp;amp;&amp;amp; make install&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;source &amp;lt;...&amp;gt;/mpivars.sh release_mt&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;export LD_LIBRARY_PATH=&lt;/SPAN&gt;&amp;lt;libfabric_install_path&amp;gt;&lt;SPAN style="font-size: 13.008px;"&gt;/lib/:${LD_LIBRARY_PATH}&lt;/SPAN&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;SPAN style="font-size: 13.008px;"&gt;mpiexec.hydra ...&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Thu, 15 Feb 2018 18:28:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132682#M5686</guid>
      <dc:creator>Mikhail_S_Intel</dc:creator>
      <dc:date>2018-02-15T18:28:15Z</dc:date>
    </item>
    <item>
      <title>Hi Mikhail,</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132683#M5687</link>
      <description>&lt;P&gt;Hi Mikhail,&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp;Adding to this thread rather than starting over because it seems most germane to my issue. I am trying to enable Intel MPI 2019 on a system without a libfabric, and without any Omni-Path packages (since it's a Mellanox-based network). So, I built dependencies (because I get unresolved symbols from psm and psm2 at runtime despite disabling them during the libfabric build) from fresh clones of intel/psm and intel/opa-psm2, using the Centos 7.4&amp;nbsp;GCC (v4.8.5). I then built libfabric 1.8.0 with&lt;/P&gt;
&lt;PRE class="brush:plain; class-name:dark;"&gt;./configure --prefix=/nopt/nrel/apps/centos/7.4 --enable-psm=no --enable-psm2=no --enable-sockets=yes --enable-verbs=yes --enable-mlx=/opt/mellanox/mxm --enable-udp=yes --enable-tcp=yes --enable-rxm=no --enable-mrail=no --enable-rxd=no --enable-bgq=no --enable-shm=yes --enable-rstream=no --enable-perf=no&lt;/PRE&gt;

&lt;P&gt;Build goes fine, but then when I try a little MPI test, I get&lt;/P&gt;

&lt;PRE class="brush:plain; class-name:dark;"&gt;srun --nodes=2 --ntasks=4 --time=5:00 --account=hpcapps ./test

/home/cchang/tests/IMPI/./test: /nopt/nrel/apps/centos/7.4/lib64/libfabric.so.1: version `FABRIC_1.1' not found (required by /nopt/nrel/apps/compilers/2018-11-19/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2019.1-nn7isnm3kmcdwixuelyzaedqcyisum4j/impi/2019.1.144/intel64/lib/release/libmpi.so.12)

/home/cchang/tests/IMPI/./test: /nopt/nrel/apps/centos/7.4/lib64/libfabric.so.1: version `FABRIC_1.1' not found (required by /nopt/nrel/apps/compilers/2018-11-19/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2019.1-nn7isnm3kmcdwixuelyzaedqcyisum4j/impi/2019.1.144/intel64/lib/release/libmpi.so.12)

/home/cchang/tests/IMPI/./test: /nopt/nrel/apps/centos/7.4/lib64/libfabric.so.1: version `FABRIC_1.1' not found (required by /nopt/nrel/apps/compilers/2018-11-19/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2019.1-nn7isnm3kmcdwixuelyzaedqcyisum4j/impi/2019.1.144/intel64/lib/release/libmpi.so.12)

/home/cchang/tests/IMPI/./test: /nopt/nrel/apps/centos/7.4/lib64/libfabric.so.1: version `FABRIC_1.1' not found (required by /nopt/nrel/apps/compilers/2018-11-19/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2019.1-nn7isnm3kmcdwixuelyzaedqcyisum4j/impi/2019.1.144/intel64/lib/release/libmpi.so.12)

&lt;/PRE&gt;

&lt;P&gt;Sure&amp;nbsp;enough, I only see 1.0 in the libfabric binary:&lt;/P&gt;

&lt;PRE class="brush:plain; class-name:dark;"&gt;[cchang@el2 IMPI]$ readelf -a /nopt/nrel/apps/centos/7.4/lib64/libfabric.so.1.2.3 | grep FABRIC
   272: 0000000000011da0   330 FUNC    GLOBAL DEFAULT   12 fi_log@@FABRIC_1.0
   273: 0000000000011d60    62 FUNC    GLOBAL DEFAULT   12 fi_log_enabled@@FABRIC_1.0
   274: 0000000000012350   763 FUNC    GLOBAL DEFAULT   12 fi_param_get@@FABRIC_1.0
   275: 000000000000f070   166 FUNC    GLOBAL DEFAULT   12 fi_freeinfo@@FABRIC_1.0
   276: 000000000000f120  1031 FUNC    GLOBAL DEFAULT   12 fi_getinfo@@FABRIC_1.0
   277: 0000000000011f20   299 FUNC    GLOBAL DEFAULT   12 fi_getparams@@FABRIC_1.0
   278: 0000000000012110   563 FUNC    GLOBAL DEFAULT   12 fi_param_define@@FABRIC_1.0
   279: 000000000000fa20   127 FUNC    GLOBAL DEFAULT   12 fi_fabric@@FABRIC_1.0
   280: 00000000000111c0  2518 FUNC    GLOBAL DEFAULT   12 fi_tostr@@FABRIC_1.0
   281: 000000000000fab0    53 FUNC    GLOBAL DEFAULT   12 fi_strerror@@FABRIC_1.0
   282: 0000000000012050    83 FUNC    GLOBAL DEFAULT   12 fi_freeparams@@FABRIC_1.0
   283: 0000000000000000     0 OBJECT  GLOBAL DEFAULT  ABS FABRIC_1.0
   284: 000000000000f530  1254 FUNC    GLOBAL DEFAULT   12 fi_dupinfo@@FABRIC_1.0
   285: 000000000000faa0     6 FUNC    GLOBAL DEFAULT   12 fi_version@@FABRIC_1.0
  110:   2 (FABRIC_1.0)    2 (FABRIC_1.0)    2 (FABRIC_1.0)    2 (FABRIC_1.0) 
  114:   2 (FABRIC_1.0)    2 (FABRIC_1.0)    2 (FABRIC_1.0)    2 (FABRIC_1.0) 
  118:   2 (FABRIC_1.0)    2 (FABRIC_1.0)    2 (FABRIC_1.0)    2 (FABRIC_1.0) 
  11c:   2 (FABRIC_1.0)    2 (FABRIC_1.0) 
  0x001c: Rev: 1  Flags: none  Index: 2  Cnt: 1  Name: FABRIC_1.0
&lt;/PRE&gt;

&lt;P&gt;This is the latest libfabric release, so how would I go about getting symbols compatible with Intel MPI 2019?&lt;/P&gt;
&lt;P&gt;Thanks; Chris&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jul 2019 19:10:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132683#M5687</guid>
      <dc:creator>4f0drlp7eyj3</dc:creator>
      <dc:date>2019-07-01T19:10:48Z</dc:date>
    </item>
    <item>
      <title>OK, never mind, appears GCC 4</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132684#M5688</link>
      <description>&lt;P&gt;OK, never mind, I was looking at an older file installed from a Centos&amp;nbsp;RPM.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jul 2019 19:22:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Performance-issues-with-Omni-Path/m-p/1132684#M5688</guid>
      <dc:creator>4f0drlp7eyj3</dc:creator>
      <dc:date>2019-07-01T19:22:00Z</dc:date>
    </item>
  </channel>
</rss>

