<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Parallel two medium size GEMM? in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Parallel-two-medium-size-GEMM/m-p/1155921#M27540</link>
    <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;

&lt;P&gt;i have a special use case which needs to compute two independent GEMMs.&lt;/P&gt;

&lt;P&gt;each one with a MNK in the range of [20~4000], on Xeon skylake 8180, only reaching 600~700 GFlops/sec.&lt;/P&gt;

&lt;P&gt;from the algorithm level, the two GEMMs has no denpendency, so they can be launched in parallel.&lt;/P&gt;

&lt;P&gt;how can i parallel these two GEMMs? say one socket for each one, perhaps. i suppose i can't use batch GEMM for this.&lt;/P&gt;</description>
    <pubDate>Sun, 15 Jul 2018 03:58:34 GMT</pubDate>
    <dc:creator>Mingfei_M_Intel</dc:creator>
    <dc:date>2018-07-15T03:58:34Z</dc:date>
    <item>
      <title>Parallel two medium size GEMM?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Parallel-two-medium-size-GEMM/m-p/1155921#M27540</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;

&lt;P&gt;i have a special use case which needs to compute two independent GEMMs.&lt;/P&gt;

&lt;P&gt;each one with a MNK in the range of [20~4000], on Xeon skylake 8180, only reaching 600~700 GFlops/sec.&lt;/P&gt;

&lt;P&gt;from the algorithm level, the two GEMMs has no denpendency, so they can be launched in parallel.&lt;/P&gt;

&lt;P&gt;how can i parallel these two GEMMs? say one socket for each one, perhaps. i suppose i can't use batch GEMM for this.&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jul 2018 03:58:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Parallel-two-medium-size-GEMM/m-p/1155921#M27540</guid>
      <dc:creator>Mingfei_M_Intel</dc:creator>
      <dc:date>2018-07-15T03:58:34Z</dc:date>
    </item>
    <item>
      <title>Hi, in that case you may try</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Parallel-two-medium-size-GEMM/m-p/1155922#M27541</link>
      <description>&lt;P&gt;Hi, in that case you may try to explicitly set MKL_NUM_THREADS=#of physical treads/2 and try to call gemm at the same time. You need also properly set the affinity mask to avoid threads migration:&amp;nbsp;&lt;SPAN style="font-size: 13.008px;"&gt;export KMP_AFFINITY=compact,1,0,granularity=fine&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 Jul 2018 02:29:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Parallel-two-medium-size-GEMM/m-p/1155922#M27541</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2018-07-16T02:29:34Z</dc:date>
    </item>
  </channel>
</rss>

