<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: modification of MKL source code in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/modification-of-MKL-source-code/m-p/910214#M12086</link>
    <description>If you compiled netlib DGEMM with a vectorizing compiler, it ought to perform well on problems which don't need cache blocking. By removing the if tests which skip loops involving 0. operations, you would give ifort -O3 more freedom to optimize.&lt;BR /&gt;</description>
    <pubDate>Fri, 14 Sep 2007 13:13:45 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2007-09-14T13:13:45Z</dc:date>
    <item>
      <title>modification of MKL source code</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/modification-of-MKL-source-code/m-p/910213#M12085</link>
      <description>hello,&lt;BR /&gt;i would like to perform the operation A*D*B, where D is diagonal, and A,B are appropriate dimensions using BLAS. there are no routines specific to diagonal matrices, so i was able to make a one-line modification to the DGEMM code available on netlib to implement this. it runs much slower than the BLAS in MKL (of course). is the source code available to make such a modification?&lt;BR /&gt;</description>
      <pubDate>Tue, 11 Sep 2007 23:29:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/modification-of-MKL-source-code/m-p/910213#M12085</guid>
      <dc:creator>xilin1</dc:creator>
      <dc:date>2007-09-11T23:29:38Z</dc:date>
    </item>
    <item>
      <title>Re: modification of MKL source code</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/modification-of-MKL-source-code/m-p/910214#M12086</link>
      <description>If you compiled netlib DGEMM with a vectorizing compiler, it ought to perform well on problems which don't need cache blocking. By removing the if tests which skip loops involving 0. operations, you would give ifort -O3 more freedom to optimize.&lt;BR /&gt;</description>
      <pubDate>Fri, 14 Sep 2007 13:13:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/modification-of-MKL-source-code/m-p/910214#M12086</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2007-09-14T13:13:45Z</dc:date>
    </item>
  </channel>
</rss>

