<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hello Sunny, in Software Archive</title>
    <link>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087799#M64048</link>
    <description>&lt;P&gt;Hello Sunny,&lt;/P&gt;

&lt;P&gt;&amp;nbsp;I have increased the problem size to 45k then the hpl is running and performance&amp;nbsp; is 154Gf. More than 45k the hpl is terminating by throwing the following error. error in scifi_send 0 : success&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 15 Jan 2016 04:31:47 GMT</pubDate>
    <dc:creator>girish_b_</dc:creator>
    <dc:date>2016-01-15T04:31:47Z</dc:date>
    <item>
      <title>Less performance on mic</title>
      <link>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087797#M64046</link>
      <description>&lt;P&gt;&amp;nbsp;HPL benchmark performance obtained on a host + 1 MIC cards is coming only 154GFlops. The Host system has 102 GB memory. The theoretical peak is 1.2TF +&amp;nbsp; + 256GFLOPS = 1.4TF.&amp;nbsp; May I please&amp;nbsp; know how to optimize the hpl performance? I've used the OFFLOAD execution, with the executable xhpl_offload_intel64.When i run hpl benchmark on simple host i am able to achieve 92 % performance. I am attaching all the files that i am using. Awaiting your quick reply.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jan 2016 16:07:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087797#M64046</guid>
      <dc:creator>girish_b_</dc:creator>
      <dc:date>2016-01-14T16:07:06Z</dc:date>
    </item>
    <item>
      <title>Hello Girish,</title>
      <link>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087798#M64047</link>
      <description>&lt;P&gt;Hello Girish,&lt;/P&gt;

&lt;P&gt;Your HPL benchmark optimized performance will depend on lot of parameters including the problem size (Ns). The problem size you have in your compressed folder has it set to 4000. In order to investigate the issue further can you please let me know what change do you see when you update that number to something like 16K or 64K.&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jan 2016 19:56:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087798#M64047</guid>
      <dc:creator>Sunny_G_Intel</dc:creator>
      <dc:date>2016-01-14T19:56:48Z</dc:date>
    </item>
    <item>
      <title>Hello Sunny,</title>
      <link>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087799#M64048</link>
      <description>&lt;P&gt;Hello Sunny,&lt;/P&gt;

&lt;P&gt;&amp;nbsp;I have increased the problem size to 45k then the hpl is running and performance&amp;nbsp; is 154Gf. More than 45k the hpl is terminating by throwing the following error. error in scifi_send 0 : success&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 15 Jan 2016 04:31:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087799#M64048</guid>
      <dc:creator>girish_b_</dc:creator>
      <dc:date>2016-01-15T04:31:47Z</dc:date>
    </item>
    <item>
      <title>Hi Girish,</title>
      <link>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087800#M64049</link>
      <description>&lt;P style="box-sizing: border-box; margin-bottom: 1.06667em; line-height: 1.4; max-width: 700px; color: rgb(85, 85, 85); font-family: 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 15px;"&gt;Hi Girish,&lt;/P&gt;

&lt;P style="box-sizing: border-box; margin-bottom: 1.06667em; line-height: 1.4; max-width: 700px; color: rgb(85, 85, 85); font-family: 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 15px;"&gt;Sorry for the delayed reply. I was out of office on Monday.&lt;/P&gt;

&lt;P style="box-sizing: border-box; margin-bottom: 1.06667em; line-height: 1.4; max-width: 700px; color: rgb(85, 85, 85); font-family: 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 15px;"&gt;Regarding the SCIF error you are getting can you please ensure that the host is able to reach the coprocessor. What do you see in your HPL output for "Number of Intel(R) Xeon Phi(TM) coprocessors : ". &amp;nbsp;If you see anything less than 1, then I suggest you restart the MPSS service on your host and verify if the host can reach the coprocessor. MPSS service can be restarted as follows"&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;sudo service mpss restart&lt;/PRE&gt;

&lt;P style="box-sizing: border-box; margin-bottom: 1.06667em; line-height: 1.4; max-width: 700px; color: rgb(85, 85, 85); font-family: 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 15px;"&gt;&lt;SPAN style="line-height: 1.4;"&gt;I see that in the HPL_Offload.dat file you have, P and Q is set to 4,4. Would it possible to try different decompostion like 1,1 and 1,2 and correspondingly set number of &lt;/SPAN&gt;MPI_PROC_NUM&amp;nbsp;to PxQ&lt;SPAN style="line-height: 1.4;"&gt;? Currently you have PxQ = 16 which might not be the optimized setting for the configuration you have. Also, I see you have&amp;nbsp;&lt;/SPAN&gt;MPI_PER_NODE&amp;nbsp;set to 2 which should correspond to the number of sockets on your host for better performance.&amp;nbsp;&lt;/P&gt;

&lt;P style="box-sizing: border-box; margin-bottom: 1.06667em; line-height: 1.4; max-width: 700px; color: rgb(85, 85, 85); font-family: 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 15px;"&gt;Let me know if this works.&lt;/P&gt;

&lt;P style="box-sizing: border-box; margin-bottom: 1.06667em; line-height: 1.4; max-width: 700px; color: rgb(85, 85, 85); font-family: 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 15px;"&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Tue, 19 Jan 2016 19:45:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087800#M64049</guid>
      <dc:creator>Sunny_G_Intel</dc:creator>
      <dc:date>2016-01-19T19:45:04Z</dc:date>
    </item>
    <item>
      <title>Hi Sunny,</title>
      <link>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087801#M64050</link>
      <description>&lt;P&gt;Hi Sunny,&lt;/P&gt;

&lt;P&gt;I am able to run the HPL as specified by you.&lt;/P&gt;

&lt;P&gt;problem size 65536 ,block size 256 ,p*q is 1*2 but the performance is 519.2GF.&lt;/P&gt;

&lt;P&gt;With P*Q values like 1,1 the performance is low and there are two sockets on the board.&lt;/P&gt;

&lt;P&gt;Kindly suggest me for optimization and please let me know the optimized performance of MIC card that you have achieved.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2016 10:59:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Less-performance-on-mic/m-p/1087801#M64050</guid>
      <dc:creator>girish_b_</dc:creator>
      <dc:date>2016-01-20T10:59:53Z</dc:date>
    </item>
  </channel>
</rss>

