<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:  cblas_dgemm slows down a lot for Linux on Pentium 4 machin in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875297#M8861</link>
    <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
Hi xearthl,&lt;BR /&gt;&lt;BR /&gt;I compared MKL 7.2 and MKL 10.2 Update 2 and obtained about 15% increase in preformance with MKL 10.2 Update 2.&lt;BR /&gt;&lt;BR /&gt;I performed multiplication of 10000 random matrices of size 32*32 using the code like this:&lt;BR /&gt;&lt;BR /&gt;start_time = dsecnd();&lt;BR /&gt;&lt;BR /&gt;for(i = 0; i &amp;lt; 10000; i++)&lt;BR /&gt;{&lt;BR /&gt;cblas_dgemm(...);&lt;BR /&gt;}&lt;BR /&gt;&lt;BR /&gt;end_time = dsecnd();&lt;BR /&gt;&lt;BR /&gt;printf("Execution time in seconds: %fn", tend - tstart);&lt;BR /&gt;&lt;BR /&gt;Could you please provide an example of code that causes the performance slow down?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Art&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Thu, 17 Dec 2009 09:21:33 GMT</pubDate>
    <dc:creator>Artem_V_Intel</dc:creator>
    <dc:date>2009-12-17T09:21:33Z</dc:date>
    <item>
      <title>cblas_dgemm slows down a lot for Linux on Pentium 4 machine</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875293#M8857</link>
      <description>Hi,&lt;BR /&gt;I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.&lt;BR /&gt;Is this a known issue? Has it been fixed?&lt;BR /&gt;Thanks a lot!</description>
      <pubDate>Fri, 27 Nov 2009 06:29:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875293#M8857</guid>
      <dc:creator>xearthl</dc:creator>
      <dc:date>2009-11-27T06:29:26Z</dc:date>
    </item>
    <item>
      <title>Re:  cblas_dgemm slows down a lot for Linux on Pentium 4 machin</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875294#M8858</link>
      <description>&lt;DIV style="margin:0px;"&gt;
&lt;DIV id="quote_reply" style="width: 100%; margin-top: 5px;"&gt;
&lt;DIV style="margin-left:2px;margin-right:2px;"&gt;Quoting - &lt;A href="https://community.intel.com/en-us/profile/451032"&gt;xearthl&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"&gt;&lt;EM&gt;&lt;BR /&gt;I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.&lt;BR /&gt;Is this a known issue? Has it been fixed?&lt;BR /&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
Did MKL 7.2 have a library specifically optimized for P4? It might not be entirely surprising that a recent MKL was not optimized specifically for an out of production CPU. Of course, P4 covered a fairly wide range from the original 32-bit one to the later 64-bit version. "small" might be in the eyes of the beholder; I doubt there was ever an effort to optimize MKL for cases such as 6x6 or less, where MKL never could compete with Fortran MATMUL.&lt;BR /&gt;</description>
      <pubDate>Fri, 27 Nov 2009 08:50:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875294#M8858</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2009-11-27T08:50:45Z</dc:date>
    </item>
    <item>
      <title>Re:  cblas_dgemm slows down a lot for Linux on Pentium 4 machin</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875295#M8859</link>
      <description>&lt;DIV style="margin:0px;"&gt;
&lt;DIV id="quote_reply" style="width: 100%; margin-top: 5px;"&gt;
&lt;DIV style="margin-left:2px;margin-right:2px;"&gt;Quoting - &lt;A href="https://community.intel.com/en-us/profile/451032"&gt;xearthl&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"&gt;&lt;EM&gt;Hi,&lt;BR /&gt;I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.&lt;BR /&gt;Is this a known issue? Has it been fixed?&lt;BR /&gt;Thanks a lot!&lt;/EM&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BR /&gt;Hello,&lt;BR /&gt;&lt;BR /&gt;First of all the newest version of MKL 10.2 Update 2 is available now at the &lt;A title="https://registrationcenter.intel.com/regcenter/register.aspx" href="https://registrationcenter.intel.com/regcenter/register.aspx" target="_blank"&gt;Registration Center&lt;/A&gt;.&lt;BR /&gt;&lt;BR /&gt;Could you tell please what is the size of your matrix?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Art&lt;BR /&gt;</description>
      <pubDate>Mon, 30 Nov 2009 06:36:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875295#M8859</guid>
      <dc:creator>Artem_V_Intel</dc:creator>
      <dc:date>2009-11-30T06:36:46Z</dc:date>
    </item>
    <item>
      <title>Re:  cblas_dgemm slows down a lot for Linux on Pentium 4 machin</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875296#M8860</link>
      <description>&lt;DIV style="margin:0px;"&gt;
&lt;DIV id="quote_reply" style="margin-top: 5px; width: 100%;"&gt;
&lt;DIV style="margin-left:2px;margin-right:2px;"&gt;Quoting - &lt;A href="https://community.intel.com/en-us/profile/438571"&gt;Artem Vorobiev (Intel)&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"&gt;&lt;EM&gt;&lt;BR /&gt;Hello,&lt;BR /&gt;&lt;BR /&gt;First of all the newest version of MKL 10.2 Update 2 is available now at the &lt;A title="https://registrationcenter.intel.com/regcenter/register.aspx" href="https://registrationcenter.intel.com/regcenter/register.aspx" target="_blank"&gt;Registration Center&lt;/A&gt;.&lt;BR /&gt;&lt;BR /&gt;Could you tell please what is the size of your matrix?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Art&lt;BR /&gt;&lt;/EM&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BR /&gt;Hi,&lt;BR /&gt;The execution time doubles when the matrix is 32*32 in my benchmark.&lt;BR /&gt;Thanks,&lt;BR /&gt;xearthl</description>
      <pubDate>Wed, 02 Dec 2009 09:42:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875296#M8860</guid>
      <dc:creator>xearthl</dc:creator>
      <dc:date>2009-12-02T09:42:02Z</dc:date>
    </item>
    <item>
      <title>Re:  cblas_dgemm slows down a lot for Linux on Pentium 4 machin</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875297#M8861</link>
      <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
Hi xearthl,&lt;BR /&gt;&lt;BR /&gt;I compared MKL 7.2 and MKL 10.2 Update 2 and obtained about 15% increase in preformance with MKL 10.2 Update 2.&lt;BR /&gt;&lt;BR /&gt;I performed multiplication of 10000 random matrices of size 32*32 using the code like this:&lt;BR /&gt;&lt;BR /&gt;start_time = dsecnd();&lt;BR /&gt;&lt;BR /&gt;for(i = 0; i &amp;lt; 10000; i++)&lt;BR /&gt;{&lt;BR /&gt;cblas_dgemm(...);&lt;BR /&gt;}&lt;BR /&gt;&lt;BR /&gt;end_time = dsecnd();&lt;BR /&gt;&lt;BR /&gt;printf("Execution time in seconds: %fn", tend - tstart);&lt;BR /&gt;&lt;BR /&gt;Could you please provide an example of code that causes the performance slow down?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Art&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 17 Dec 2009 09:21:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875297#M8861</guid>
      <dc:creator>Artem_V_Intel</dc:creator>
      <dc:date>2009-12-17T09:21:33Z</dc:date>
    </item>
    <item>
      <title>Re:  cblas_dgemm slows down a lot for Linux on Pentium 4 machin</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875298#M8862</link>
      <description>&lt;P&gt;Hi Art,&lt;/P&gt;
&lt;P&gt;Thanks a lot for your great effort!&lt;/P&gt;
&lt;P&gt;I haven't try the latest version. But it used to only slow down when &lt;STRONG&gt;beta&lt;/STRONG&gt; is set to &lt;STRONG&gt;0.&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;xearthl&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jan 2010 09:45:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/cblas-dgemm-slows-down-a-lot-for-Linux-on-Pentium-4-machine/m-p/875298#M8862</guid>
      <dc:creator>xearthl</dc:creator>
      <dc:date>2010-01-14T09:45:13Z</dc:date>
    </item>
  </channel>
</rss>

