<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MKL routines within OMP parallel loop in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061433#M21695</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;How many threads will an MKL routine (e.g. cblas_sgemm) launches when that routine is included within an OMP parallel loop? Is it the same in both Xeon and Xeon Phi?&lt;/P&gt;

&lt;P&gt;Thanks!&lt;/P&gt;</description>
    <pubDate>Mon, 02 Feb 2015 15:16:41 GMT</pubDate>
    <dc:creator>YW</dc:creator>
    <dc:date>2015-02-02T15:16:41Z</dc:date>
    <item>
      <title>MKL routines within OMP parallel loop</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061433#M21695</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;How many threads will an MKL routine (e.g. cblas_sgemm) launches when that routine is included within an OMP parallel loop? Is it the same in both Xeon and Xeon Phi?&lt;/P&gt;

&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 02 Feb 2015 15:16:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061433#M21695</guid>
      <dc:creator>YW</dc:creator>
      <dc:date>2015-02-02T15:16:41Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061434#M21696</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;By default, only 1 thread will be created for MKL, if it's an openmp parallel region. If you want MKL to use multiple threads, you can set MKL_DYNAMIC=false.&lt;/P&gt;

&lt;P&gt;--Vipin&lt;/P&gt;</description>
      <pubDate>Tue, 03 Feb 2015 04:13:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061434#M21696</guid>
      <dc:creator>VipinKumar_E_Intel</dc:creator>
      <dc:date>2015-02-03T04:13:03Z</dc:date>
    </item>
    <item>
      <title>Quote:Vipin Kumar E K (Intel)</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061435#M21697</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Vipin Kumar E K (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;By default, only 1 thread will be created for MKL, if it's an openmp parallel region. If you want MKL to use multiple threads, you can set MKL_DYNAMIC=false.&lt;/P&gt;

&lt;P&gt;--Vipin&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Thanks! It seems that the number of running threads is easy to go wild if MKL_NYNAMIC is set to be false, right? Is there a way to control the number of threads (&amp;gt;1 but smaller than a certain number) an MKL routines could launch?&lt;/P&gt;</description>
      <pubDate>Tue, 03 Feb 2015 16:10:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061435#M21697</guid>
      <dc:creator>YW</dc:creator>
      <dc:date>2015-02-03T16:10:00Z</dc:date>
    </item>
    <item>
      <title> Please refer https:/</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061436#M21698</link>
      <description>&lt;P&gt;&amp;nbsp;Please refer &lt;A href="https://software.intel.com/en-us/node/528380"&gt;https://software.intel.com/en-us/node/528380&lt;/A&gt; for more details on various api functions and env. variables to set for calling MKL in a nested region.&lt;/P&gt;

&lt;P&gt;--Vipin&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Feb 2015 06:14:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061436#M21698</guid>
      <dc:creator>VipinKumar_E_Intel</dc:creator>
      <dc:date>2015-02-06T06:14:11Z</dc:date>
    </item>
    <item>
      <title>You can also control the</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061437#M21699</link>
      <description>&lt;P&gt;You can also control the threads by setting these. Assuming that the user openmp regions use 60 threads and 4 threads should run dgemm (any mkl function), set these variables at the run time.&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;#enable nested OpenMP
export OMP_NESTED=TRUE
export OMP_NUM_THREADS=60,4

#OpenMP 4 placement: 4 threads per core do dgemm
export OMP_PLACES=threads
export OMP_PROC_BIND=spread,close

#Enable HOT TEAMS: Intel compiler 15 update 1
export KMP_HOT_TEAMS_MAX_LEVEL=2
export KMP_HOT_TEAMS_MODE=1&lt;/PRE&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Feb 2015 23:38:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-routines-within-OMP-parallel-loop/m-p/1061437#M21699</guid>
      <dc:creator>Jeongnim_K_Intel1</dc:creator>
      <dc:date>2015-02-26T23:38:21Z</dc:date>
    </item>
  </channel>
</rss>

