<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Understanding OMP_NUM_THREADS in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Understanding-OMP-NUM-THREADS/m-p/913680#M12349</link>
    <description>I am able to answer my own question to some extent. I am running both 32-bit and 64-bit versions of our app on XP 64.
&lt;BR /&gt;The 32-bit version show this odd behaviour. The 64 bit version behaves as expected.
&lt;BR /&gt;
&lt;BR /&gt;For example , on another benchmark
&lt;BR /&gt;
&lt;BR /&gt;64 - bit version
&lt;BR /&gt;OMP_NUM_THREADS=2
&lt;BR /&gt;Process time : 180
&lt;BR /&gt;Wall Clock Time : 134
&lt;BR /&gt;
&lt;BR /&gt;OMP_NUM_THREADS=1
&lt;BR /&gt;Process time : 180
&lt;BR /&gt;Wall Clock Time : 180
&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;32 - bit version
&lt;BR /&gt;OMP_NUM_THREADS=2
&lt;BR /&gt;Process time : 264
&lt;BR /&gt;Wall Clock time : 134
&lt;BR /&gt;
&lt;BR /&gt;OMP_NUM_THREADS=1
&lt;BR /&gt;Process time : 180
&lt;BR /&gt;Wall Clock time : 180
&lt;BR /&gt;
&lt;BR /&gt;So it seems that a 32 - bit app running under XP64 reports excessive process time when multi-threaded.
&lt;BR /&gt;</description>
    <pubDate>Fri, 18 Aug 2006 05:35:17 GMT</pubDate>
    <dc:creator>AndrewC</dc:creator>
    <dc:date>2006-08-18T05:35:17Z</dc:date>
    <item>
      <title>Understanding OMP_NUM_THREADS</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Understanding-OMP-NUM-THREADS/m-p/913679#M12348</link>
      <description>I am running our application on a dual-core intel EM64T and am trying to understand why I am not seeing any benefits from multi-threading of MKL.
&lt;BR /&gt;
&lt;BR /&gt;I set OMP_NUM_THREADS to 1, and run my benchmark.
&lt;BR /&gt;
&lt;BR /&gt;CPU usage pegged at 50% for my process in Task Manager
&lt;BR /&gt;Process Time: 30s
&lt;BR /&gt;Wall Clock Time: 30s
&lt;BR /&gt;
&lt;BR /&gt;I set OMP_NUM_THREADS to 2, and run my benchmark
&lt;BR /&gt;
&lt;BR /&gt;CPU usage pegged at 90-99% for my process in Task Manager, both CPU's apparently maxed-out.
&lt;BR /&gt;But...
&lt;BR /&gt;Process Time: 58s
&lt;BR /&gt;Wall Clock Time: 30s
&lt;BR /&gt;So how is this posssible?  I would have expected that in the OMP_NUM_THREADS=2 case, that process time would go up somewhat , but that wall clock time would drop.
&lt;BR /&gt;I did not expect CPU time to double, but wall clock stay the same. MKL seems to be using two CPUs, but accomplishing nothing.
&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;</description>
      <pubDate>Fri, 18 Aug 2006 03:29:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Understanding-OMP-NUM-THREADS/m-p/913679#M12348</guid>
      <dc:creator>AndrewC</dc:creator>
      <dc:date>2006-08-18T03:29:32Z</dc:date>
    </item>
    <item>
      <title>Re: Understanding OMP_NUM_THREADS</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Understanding-OMP-NUM-THREADS/m-p/913680#M12349</link>
      <description>I am able to answer my own question to some extent. I am running both 32-bit and 64-bit versions of our app on XP 64.
&lt;BR /&gt;The 32-bit version show this odd behaviour. The 64 bit version behaves as expected.
&lt;BR /&gt;
&lt;BR /&gt;For example , on another benchmark
&lt;BR /&gt;
&lt;BR /&gt;64 - bit version
&lt;BR /&gt;OMP_NUM_THREADS=2
&lt;BR /&gt;Process time : 180
&lt;BR /&gt;Wall Clock Time : 134
&lt;BR /&gt;
&lt;BR /&gt;OMP_NUM_THREADS=1
&lt;BR /&gt;Process time : 180
&lt;BR /&gt;Wall Clock Time : 180
&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;32 - bit version
&lt;BR /&gt;OMP_NUM_THREADS=2
&lt;BR /&gt;Process time : 264
&lt;BR /&gt;Wall Clock time : 134
&lt;BR /&gt;
&lt;BR /&gt;OMP_NUM_THREADS=1
&lt;BR /&gt;Process time : 180
&lt;BR /&gt;Wall Clock time : 180
&lt;BR /&gt;
&lt;BR /&gt;So it seems that a 32 - bit app running under XP64 reports excessive process time when multi-threaded.
&lt;BR /&gt;</description>
      <pubDate>Fri, 18 Aug 2006 05:35:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Understanding-OMP-NUM-THREADS/m-p/913680#M12349</guid>
      <dc:creator>AndrewC</dc:creator>
      <dc:date>2006-08-18T05:35:17Z</dc:date>
    </item>
  </channel>
</rss>

