<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic mkl_scscmm performance problem in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-scscmm-performance-problem/m-p/1115778#M24641</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;

&lt;P&gt;I am building the attached program in OLCF's RHEA supercomputer&amp;nbsp;(https://www.olcf.ornl.gov/computing-resources/rhea/)&amp;nbsp;&lt;SPAN style="font-size: 13.008px; line-height: 19.512px;"&gt;&amp;nbsp;with Intel compiler&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="s1" style="font-size: 13.008px; line-height: 19.512px;"&gt;icc (ICC) 14.0.4 20140805. Armadillo has a naive implementation of cscmm. I run the program with MKL_NUM_THREADS=1 in rhea to multiply the sparse matrix of size&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="color: rgb(34, 34, 34); font-family: arial, sans-serif; font-size: 12.8px; line-height: normal;"&gt;83328x124992 with a dense matrix of size&amp;nbsp;124992x50. The following is the output.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;./a.out 83328 124992 50 0.00001&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;The output of the test code&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;::A::83328x124992&lt;BR /&gt;
	nnz::104153&lt;BR /&gt;
	::B::124992x50&lt;BR /&gt;
	mkl cscmm::162.13&lt;BR /&gt;
	::C::50x83328&lt;BR /&gt;
	arma ::0.06&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(34, 34, 34); font-family: arial, sans-serif; font-size: 12.8px; line-height: normal;"&gt;I am seeing MKL_CSCMM to be really slow over armadillo naive implementation. Kindly let me know what am I doing wrong here.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Ramki&lt;/P&gt;</description>
    <pubDate>Fri, 01 Jul 2016 20:06:40 GMT</pubDate>
    <dc:creator>Ramakrishnan_K_</dc:creator>
    <dc:date>2016-07-01T20:06:40Z</dc:date>
    <item>
      <title>mkl_scscmm performance problem</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-scscmm-performance-problem/m-p/1115778#M24641</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;

&lt;P&gt;I am building the attached program in OLCF's RHEA supercomputer&amp;nbsp;(https://www.olcf.ornl.gov/computing-resources/rhea/)&amp;nbsp;&lt;SPAN style="font-size: 13.008px; line-height: 19.512px;"&gt;&amp;nbsp;with Intel compiler&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="s1" style="font-size: 13.008px; line-height: 19.512px;"&gt;icc (ICC) 14.0.4 20140805. Armadillo has a naive implementation of cscmm. I run the program with MKL_NUM_THREADS=1 in rhea to multiply the sparse matrix of size&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="color: rgb(34, 34, 34); font-family: arial, sans-serif; font-size: 12.8px; line-height: normal;"&gt;83328x124992 with a dense matrix of size&amp;nbsp;124992x50. The following is the output.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;./a.out 83328 124992 50 0.00001&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;The output of the test code&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;::A::83328x124992&lt;BR /&gt;
	nnz::104153&lt;BR /&gt;
	::B::124992x50&lt;BR /&gt;
	mkl cscmm::162.13&lt;BR /&gt;
	::C::50x83328&lt;BR /&gt;
	arma ::0.06&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(34, 34, 34); font-family: arial, sans-serif; font-size: 12.8px; line-height: normal;"&gt;I am seeing MKL_CSCMM to be really slow over armadillo naive implementation. Kindly let me know what am I doing wrong here.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Ramki&lt;/P&gt;</description>
      <pubDate>Fri, 01 Jul 2016 20:06:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-scscmm-performance-problem/m-p/1115778#M24641</guid>
      <dc:creator>Ramakrishnan_K_</dc:creator>
      <dc:date>2016-07-01T20:06:40Z</dc:date>
    </item>
    <item>
      <title>may you check if this gap</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-scscmm-performance-problem/m-p/1115779#M24642</link>
      <description>&lt;P&gt;may you check if this gap exists with the latest mkl 11.3.update 3? you may take evaluation package of intel complier v.16. &amp;nbsp;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;May you also check CSR ( &lt;/SPAN&gt;&lt;SPAN style="color: rgb(51, 51, 51); font-family: &amp;quot;Courier New&amp;quot;, Courier, monospace; font-size: 13.3333px; line-height: 20px;"&gt;mkl_dcsrmm&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;) &amp;nbsp;case?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 03 Jul 2016 05:14:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/mkl-scscmm-performance-problem/m-p/1115779#M24642</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-07-03T05:14:40Z</dc:date>
    </item>
  </channel>
</rss>

