<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Let me answer my question in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Packed-versus-compact-versus-normal-routines-versus-jit/m-p/1158456#M27758</link>
    <description>&lt;P&gt;Let me answer my question myself. It seems all but packed matrices are for very small matrices.&lt;/P&gt;&lt;P&gt;My computational test shows using packed matrices&amp;nbsp;reduces run time by&amp;nbsp;20% in the best case. For smallish matrices there are no benefit.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 06 Jun 2019 11:20:00 GMT</pubDate>
    <dc:creator>erling_andersen</dc:creator>
    <dc:date>2019-06-06T11:20:00Z</dc:date>
    <item>
      <title>Packed versus compact versus normal routines versus jit</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Packed-versus-compact-versus-normal-routines-versus-jit/m-p/1158455#M27757</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;I need to do a tons of&lt;/P&gt;&lt;P&gt;C = A*B^T&lt;/P&gt;&lt;P&gt;Where all matrices n times n big. n is typical 256 but could be smaller or bigger. Both A and B are used multiple times. Moreover the C is used in later multiplications i.e. C replace A or B.&amp;nbsp;&lt;/P&gt;&lt;P&gt;NOTE I am only interested in the sequential case. I do not want MKL to parallelize anything.&lt;/P&gt;&lt;P&gt;It seems the matrices are too large for the compact type. In any case compact seems to be for multiple matrices i.e. liked batched.&lt;/P&gt;&lt;P&gt;In the packed type the C will not be packed so I have to pack it.&amp;nbsp;&lt;/P&gt;&lt;P&gt;There is also the mkl_jit_create* routines.&lt;/P&gt;&lt;P&gt;Now my question is what I should go for among the possible matrix multiplication methods?&lt;/P&gt;&lt;P&gt;PS: An interesting alternative is to use BLASFEO(https://github.com/giaf/blasfeo) which course you cannot say anything about but give an idea about my use case.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 03 Jun 2019 12:42:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Packed-versus-compact-versus-normal-routines-versus-jit/m-p/1158455#M27757</guid>
      <dc:creator>erling_andersen</dc:creator>
      <dc:date>2019-06-03T12:42:45Z</dc:date>
    </item>
    <item>
      <title>Let me answer my question</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Packed-versus-compact-versus-normal-routines-versus-jit/m-p/1158456#M27758</link>
      <description>&lt;P&gt;Let me answer my question myself. It seems all but packed matrices are for very small matrices.&lt;/P&gt;&lt;P&gt;My computational test shows using packed matrices&amp;nbsp;reduces run time by&amp;nbsp;20% in the best case. For smallish matrices there are no benefit.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 06 Jun 2019 11:20:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Packed-versus-compact-versus-normal-routines-versus-jit/m-p/1158456#M27758</guid>
      <dc:creator>erling_andersen</dc:creator>
      <dc:date>2019-06-06T11:20:00Z</dc:date>
    </item>
  </channel>
</rss>

