<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Performance gap between OpenMP nested parallelism under Linux and Windows in Intel® oneAPI DPC++/C++ Compiler</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1348334#M1798</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We haven't heard back from you. Could you please let us know whether you have any issues with Intel oneAPI products? If not, could you please let us know whether we can close this thread from our end?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Hemanth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 04 Jan 2022 11:03:10 GMT</pubDate>
    <dc:creator>HemanthCH_Intel</dc:creator>
    <dc:date>2022-01-04T11:03:10Z</dc:date>
    <item>
      <title>Performance gap between OpenMP nested parallelism under Linux and Windows</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1346600#M1779</link>
      <description>&lt;P&gt;&lt;SPAN&gt;My program uses nested OpenMP parallelism. Built with Microsoft's MSVC compiler on Windows, it performs well. Built with GCC on Linux, however, nested parallelism performs noticeably worse: the completion time is about 40% longer than on Windows. My server has two physical CPUs (48 cores, 96 threads), and the outer level consists of six parallel sections. The most obvious symptom is that on Linux, performance drops sharply as soon as I use more than 4 threads in the nested parallel region, which does not happen on Windows. The Linux kernel version is 3.10 (CentOS 7), and the GCC version is 4.8.1. Is something wrong with my settings, or is this caused by a difference between GCC and MSVC, or between Linux and Windows? How can I solve this problem? My code is as follows:&lt;/SPAN&gt;&lt;/P&gt;
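&lt;PRE class="lang-c s-code-block"&gt;&lt;CODE class="hljs language-c"&gt;&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;include&lt;/SPAN&gt; &lt;SPAN class="hljs-string"&gt;&amp;lt;omp.h&amp;gt;&lt;/SPAN&gt;&lt;/SPAN&gt;

&lt;SPAN class="hljs-type"&gt;void&lt;/SPAN&gt; &lt;SPAN class="hljs-title function_"&gt;func&lt;/SPAN&gt;&lt;SPAN class="hljs-params"&gt;(&lt;SPAN class="hljs-type"&gt;void&lt;/SPAN&gt;)&lt;/SPAN&gt;;

&lt;SPAN class="hljs-type"&gt;int&lt;/SPAN&gt; &lt;SPAN class="hljs-title function_"&gt;main&lt;/SPAN&gt;&lt;SPAN class="hljs-params"&gt;(&lt;SPAN class="hljs-type"&gt;void&lt;/SPAN&gt;)&lt;/SPAN&gt;
{
    omp_set_dynamic(&lt;SPAN class="hljs-number"&gt;1&lt;/SPAN&gt;); &lt;SPAN class="hljs-comment"&gt;/* allow the runtime to adjust team sizes */&lt;/SPAN&gt;
    omp_set_nested(&lt;SPAN class="hljs-number"&gt;1&lt;/SPAN&gt;);  &lt;SPAN class="hljs-comment"&gt;/* enable nested parallel regions */&lt;/SPAN&gt;
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp parallel sections&lt;/SPAN&gt;
    {
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp section&lt;/SPAN&gt;
        {
            func();
        }
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp section&lt;/SPAN&gt;
        {
            func();
        }
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp section&lt;/SPAN&gt;
        {
            func();
        }
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp section&lt;/SPAN&gt;
        {
            func();
        }
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp section&lt;/SPAN&gt;
        {
            func();
        }
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp section&lt;/SPAN&gt;
        {
            func();
        }
    }
    &lt;SPAN class="hljs-keyword"&gt;return&lt;/SPAN&gt; &lt;SPAN class="hljs-number"&gt;0&lt;/SPAN&gt;;
}

&lt;SPAN class="hljs-comment"&gt;/* Inner level: each section opens its own team of 4 threads. */&lt;/SPAN&gt;
&lt;SPAN class="hljs-type"&gt;void&lt;/SPAN&gt; &lt;SPAN class="hljs-title function_"&gt;func&lt;/SPAN&gt;&lt;SPAN class="hljs-params"&gt;(&lt;SPAN class="hljs-type"&gt;void&lt;/SPAN&gt;)&lt;/SPAN&gt;
{
&lt;SPAN class="hljs-meta"&gt;#&lt;SPAN class="hljs-keyword"&gt;pragma&lt;/SPAN&gt; omp parallel for schedule(static) num_threads(4)&lt;/SPAN&gt;
    &lt;SPAN class="hljs-keyword"&gt;for&lt;/SPAN&gt; (&lt;SPAN class="hljs-type"&gt;int&lt;/SPAN&gt; k = &lt;SPAN class="hljs-number"&gt;0&lt;/SPAN&gt;; k &amp;lt; &lt;SPAN class="hljs-number"&gt;1280000000&lt;/SPAN&gt;; k++)
    {
    }
}&lt;/CODE&gt;&lt;/PRE&gt;</description>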
      <pubDate>Sat, 25 Dec 2021 09:13:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1346600#M1779</guid>
      <dc:creator>dtzAlex</dc:creator>
      <dc:date>2021-12-25T09:13:10Z</dc:date>
    </item>
    <item>
      <title>Re: Performance gap between OpenMP nested parallelism under Linux and Windows</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1346779#M1783</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We see that you are comparing the GCC compiler on Linux with the MSVC compiler on Windows. This forum is intended to support queries related to Intel oneAPI DPC++ and other Intel oneAPI products. Could you please let us know whether you have any issues with Intel oneAPI products?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Hemanth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Feb 2022 10:32:28 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1346779#M1783</guid>
      <dc:creator>HemanthCH_Intel</dc:creator>
      <dc:date>2022-02-21T10:32:28Z</dc:date>
    </item>
    <item>
      <title>Re: Performance gap between OpenMP nested parallelism under Linux and Windows</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1348334#M1798</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We haven't heard back from you. Could you please let us know whether you have any issues with Intel oneAPI products? If not, could you please let us know whether we can close this thread from our end?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Hemanth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 04 Jan 2022 11:03:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1348334#M1798</guid>
      <dc:creator>HemanthCH_Intel</dc:creator>
      <dc:date>2022-01-04T11:03:10Z</dc:date>
    </item>
    <item>
      <title>Re: Performance gap between OpenMP nested parallelism under Linux and Windows</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1350412#M1823</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We haven't heard back from you. This thread will no longer be monitored by Intel. If you need further assistance, please post a new question.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Hemanth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jan 2022 06:21:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Performance-gap-between-oepnmp-nested-parallelism-under-Linux/m-p/1350412#M1823</guid>
      <dc:creator>HemanthCH_Intel</dc:creator>
      <dc:date>2022-01-12T06:21:42Z</dc:date>
    </item>
  </channel>
</rss>

