<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic mkl_dcsrgemv doesn't yield any significant speedup in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906451#M11748</link>
    <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Is it expected that mkl_dcsrgemv() doesn't outperform a hand-coded routine compiled with VS8?&lt;BR /&gt;On my matrix, MKL takes 45.011 msecs, while the hand-coded routine takes 46.04 msecs.&lt;BR /&gt;&lt;BR /&gt;Another observation is that two threads are only 10% faster than a single thread.&lt;BR /&gt;&lt;BR /&gt;The matrix size is 2Mx2M, nnz=15M.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Agnonchik.</description>
    <pubDate>Thu, 30 Jul 2009 13:46:33 GMT</pubDate>
    <dc:creator>agnonchik</dc:creator>
    <dc:date>2009-07-30T13:46:33Z</dc:date>
    <item>
      <title>mkl_dcsrgemv doesn't yield any significant speedup</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906451#M11748</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Is it expected that mkl_dcsrgemv() doesn't outperform a hand-coded routine compiled with VS8?&lt;BR /&gt;On my matrix, MKL takes 45.011 msecs, while the hand-coded routine takes 46.04 msecs.&lt;BR /&gt;&lt;BR /&gt;Another observation is that two threads are only 10% faster than a single thread.&lt;BR /&gt;&lt;BR /&gt;The matrix size is 2Mx2M, nnz=15M.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Agnonchik.</description>
      <pubDate>Thu, 30 Jul 2009 13:46:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906451#M11748</guid>
      <dc:creator>agnonchik</dc:creator>
      <dc:date>2009-07-30T13:46:33Z</dc:date>
    </item>
    <item>
      <title>Re: mkl_dcsrgemv doesn't yield any significant speedup</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906452#M11749</link>
      <description>&lt;DIV style="margin:0px;"&gt;
&lt;DIV id="quote_reply" style="margin-top: 5px; width: 100%;"&gt;
&lt;DIV style="margin-left:2px;margin-right:2px;"&gt;Quoting - &lt;A href="https://community.intel.com/en-us/profile/436876"&gt;agnonchik&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"&gt;&lt;EM&gt;Hi, &lt;BR /&gt;&lt;BR /&gt;Is it expected that mkl_dcsrgemv() doesn't outperform hand-coded routine compiled with VS8?&lt;BR /&gt;On my matrix, MKL yields 45.011 msecs, while hand-coded routne 46.04 msecs.&lt;BR /&gt;&lt;BR /&gt;Another observation is that two-threads are only 10% faster than a single thread.&lt;BR /&gt;&lt;BR /&gt;The matrix size is 2Mx2M, nnz=15M.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Agnonchik.&lt;/EM&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BR /&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;The routine you mentioned is a Level 2 routine. The performance behavior you describe is typical for dense Level 2 BLAS and Level 2 Sparse BLAS routines when the size of the data exceeds the size of cache memory. For large data sizes, the performance of dense Level 2 BLAS as well as Level 2 Sparse BLAS depends mainly on memory bandwidth, because of the small number of arithmetic operations relative to the amount of data moved. So changing the algorithm to use Level 3 routines can help to improve performance, because Level 3 routines can reuse data in cache.&lt;BR /&gt;&lt;BR /&gt;Unlike dense BLAS, the performance of Sparse BLAS also depends on the matrix structure.&lt;BR /&gt;&lt;BR /&gt;All the best&lt;BR /&gt;Sergey</description>
      <pubDate>Fri, 31 Jul 2009 05:00:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906452#M11749</guid>
      <dc:creator>Sergey_K_Intel1</dc:creator>
      <dc:date>2009-07-31T05:00:06Z</dc:date>
    </item>
    <item>
      <title>Re: mkl_dcsrgemv doesn't yield any significant speedup</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906453#M11750</link>
      <description>Thanks for your explanation!&lt;BR /&gt;Agnonchik.</description>
      <pubDate>Fri, 31 Jul 2009 09:05:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-dcsrgemv-doesn-t-yield-any-significant-speedup/m-p/906453#M11750</guid>
      <dc:creator>agnonchik</dc:creator>
      <dc:date>2009-07-31T09:05:14Z</dc:date>
    </item>
  </channel>
</rss>

