<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MKL 7.0  (zgemm) performance with small matrices in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-7-0-zgemm-performance-with-small-matrices/m-p/932593#M13848</link>
    <description>I reported this issue with MKL 6.1. Hopefully it is on the enhancement list for MKL 7.X.&lt;BR /&gt;I have a general purpose MATRIX library ( Rogue Wave Math.h++) that was easy to retrofit to use MKL. My application uses a wide variety of complex matrix sizes from 6x6 to 500x500.&lt;BR /&gt;For small matrices, I took a big performance hit in using MKL due to some obvious overhead in MKL calls.&lt;BR /&gt;I ended up having to derive special small matrix classes to call an inline zgemm.&lt;BR /&gt;Using Rational Quantify shows the bottleneck to clearly be zgemm in MKL.</description>
    <pubDate>Wed, 20 Oct 2004 03:33:16 GMT</pubDate>
    <dc:creator>AndrewC</dc:creator>
    <dc:date>2004-10-20T03:33:16Z</dc:date>
    <item>
      <title>MKL 7.0  (zgemm) performance with small matrices</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-7-0-zgemm-performance-with-small-matrices/m-p/932593#M13848</link>
      <description>I reported this issue with MKL 6.1. Hopefully it is on the enhancement list for MKL 7.X.&lt;BR /&gt;I have a general purpose MATRIX library ( Rogue Wave Math.h++) that was easy to retrofit to use MKL. My application uses a wide variety of complex matrix sizes from 6x6 to 500x500.&lt;BR /&gt;For small matrices, I took a big performance hit in using MKL due to some obvious overhead in MKL calls.&lt;BR /&gt;I ended up having to derive special small matrix classes to call an inline zgemm.&lt;BR /&gt;Using Rational Quantify shows the bottleneck to clearly be zgemm in MKL.</description>
      <pubDate>Wed, 20 Oct 2004 03:33:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-7-0-zgemm-performance-with-small-matrices/m-p/932593#M13848</guid>
      <dc:creator>AndrewC</dc:creator>
      <dc:date>2004-10-20T03:33:16Z</dc:date>
    </item>
  </channel>
</rss>

