<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Multithreading in MKL in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827574#M5225</link>
    <description>Not every program can display speedup using compiler switches to enable parallelism. Nor do all routines of MKL have inherent capability to use SMP parallelism. &lt;BR /&gt;&lt;BR /&gt;See the section &lt;I&gt;Parallelism&lt;/I&gt; in the &lt;I&gt;Overview&lt;/I&gt; chapter of the &lt;A href="https://community.intel.com/../../sites/products/documentation/hpc/composerxe/en-us/mklxe/mkl_manual_win_mac/frontmatter/mkl_title.htm" target="right" class="treeSelected"&gt;Intel Math Kernel Library Reference Manual for guidance on the ways to take advantage of parallelism.&lt;BR /&gt;&lt;/A&gt;</description>
    <pubDate>Sat, 08 Jan 2011 13:11:03 GMT</pubDate>
    <dc:creator>mecej4</dc:creator>
    <dc:date>2011-01-08T13:11:03Z</dc:date>
    <item>
      <title>Multithreading in MKL</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827573#M5224</link>
      <description>Hello,&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;I want to do a simplemulti-threadedmatrix using the routine mkl_dcoomm()&lt;/DIV&gt;&lt;DIV&gt;in MKL.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;While compiling, I use these threaded libraries-lmkl_gnu_thread -lmkl_core -lmkl_intel_lp64&lt;/DIV&gt;&lt;DIV&gt;the program compiles, but I do not see any speedup, I have tried a range of matrix sizes, however&lt;/DIV&gt;&lt;DIV&gt;I use this routine for matrix-vector product, since matrix-vector is notmulti threadedin mkl sparse blas.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt; While running the program, I set the number of threads by : set MKL_NUM_THREADS=4&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt; I do not see any significant speedup with varying the number of cores!&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Am I missing something that is required to enable parallelism ?&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Thanks,&lt;/DIV&gt;&lt;DIV&gt;Pawan&lt;/DIV&gt;</description>
      <pubDate>Sat, 08 Jan 2011 01:50:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827573#M5224</guid>
      <dc:creator>pawanlri</dc:creator>
      <dc:date>2011-01-08T01:50:21Z</dc:date>
    </item>
    <item>
      <title>Multithreading in MKL</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827574#M5225</link>
      <description>Not every program can display speedup using compiler switches to enable parallelism. Nor do all routines of MKL have inherent capability to use SMP parallelism. &lt;BR /&gt;&lt;BR /&gt;See the section &lt;I&gt;Parallelism&lt;/I&gt; in the &lt;I&gt;Overview&lt;/I&gt; chapter of the &lt;A href="https://community.intel.com/../../sites/products/documentation/hpc/composerxe/en-us/mklxe/mkl_manual_win_mac/frontmatter/mkl_title.htm" target="right" class="treeSelected"&gt;Intel Math Kernel Library Reference Manual for guidance on the ways to take advantage of parallelism.&lt;BR /&gt;&lt;/A&gt;</description>
      <pubDate>Sat, 08 Jan 2011 13:11:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827574#M5225</guid>
      <dc:creator>mecej4</dc:creator>
      <dc:date>2011-01-08T13:11:03Z</dc:date>
    </item>
    <item>
      <title>Multithreading in MKL</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827575#M5226</link>
      <description>&lt;P&gt;Pawan,&lt;/P&gt;

&lt;P&gt;&lt;QUOTE&gt;: I do not see any significant
speedup with varying the number of cores!&lt;/QUOTE&gt;&lt;/P&gt;

&lt;P&gt;The performance of any spare matrix
operations is much lower that the dense BLAS because the memory access patterns
are irregular and the ration of float point operations is lower than in some dense
operations. So thats the reason why you dont see any significant speedup.&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;

&lt;P&gt;So, if the matrix sizes are fit with the
RAM, when it would be more efficient to use dense BLAS calculations.&lt;/P&gt;

&lt;P&gt;In such cases It may be make a sense to
convert from sparse to dense, then use m-v calculation for dense routines&lt;/P&gt;--Gennady</description>
      <pubDate>Sun, 09 Jan 2011 17:52:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Multithreading-in-MKL/m-p/827575#M5226</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2011-01-09T17:52:35Z</dc:date>
    </item>
  </channel>
</rss>

