<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:dpcpp program performance gets drop by adding c... in Intel® oneAPI DPC++/C++ Compiler</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1188403#M527</link>
    <description>&lt;P&gt;Hi Jim,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Could you please let us know if your issue is resolved. &lt;/P&gt;&lt;P&gt;If not do let us know. So that we will be able to help you regarding the same. &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;--Goutham&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Tue, 30 Jun 2020 13:20:55 GMT</pubDate>
    <dc:creator>GouthamK_Intel</dc:creator>
    <dc:date>2020-06-30T13:20:55Z</dc:date>
    <item>
      <title>dpcpp program performance gets drop by adding compile option '-g'</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1186517#M488</link>
      <description>&lt;P&gt;Hi all&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I found&amp;nbsp;dpcpp program performance gets drop by adding compile option '-g'.&lt;/P&gt;&lt;P&gt;And I also test the related cuda program, nvcc does not have this problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The program source code has been uploaded.&lt;/P&gt;&lt;P&gt;The compile cmdline is :&lt;!--StartFragment --&gt;&lt;/P&gt;
&lt;PRE class="brush:bash; class-name:dark;"&gt;dpcpp -o a -O2 ./convSep_nocg.dp.cpp

dpcpp -o a_g -g -O2 ./convSep_nocg.dp.cpp&lt;/PRE&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Program 'a_g' spend near 2x time vs 'a'.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;

&lt;PRE class="brush:bash; class-name:dark;"&gt;# ./a 10240 10240 1000
[./a] - Starting...
Image Width x Height = 10240 x 10240

Allocating and initializing host arrays...
Allocating and initializing CUDA arrays...
Running GPU convolution (1000 identical iterations)...

convolutionSeparable, Throughput = 1321.5942 MPixels/sec, Time = 0.07934 s, Size = 104857600 Pixels, NumDevsUsed = 1, Workgroup = 0

Reading back GPU results...

Checking the results...
 ...running convolutionRowCPU()
 ...running convolutionColumnCPU()
 ...comparing the results
 ...Relative L2 norm: 0.000000E+00

Shutting down...


# ./a_g 10240 10240 1000
[./a_g] - Starting...
Image Width x Height = 10240 x 10240

Allocating and initializing host arrays...
Allocating and initializing CUDA arrays...
Running GPU convolution (1000 identical iterations)...

convolutionSeparable, Throughput = 773.0526 MPixels/sec, Time = 0.13564 s, Size = 104857600 Pixels, NumDevsUsed = 1, Workgroup = 0

Reading back GPU results...

Checking the results...
 ...running convolutionRowCPU()
 ...running convolutionColumnCPU()
 ...comparing the results
 ...Relative L2 norm: 0.000000E+00

Shutting down...&lt;/PRE&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;OS Version: &lt;!--StartFragment --&gt;Ubuntu 18.04.3 LTS&lt;/P&gt;
&lt;P&gt;linux-kernel: 4.15.18&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;
&lt;P&gt;&lt;!--EndFragment --&gt;&lt;/P&gt;
&lt;P&gt;oneAPI Basekit Version: 2021.1-beta06&lt;/P&gt;
&lt;P&gt;CPU:&amp;nbsp;&lt;!--StartFragment --&gt;Intel(R) Xeon(R) CPU E3-1585 v5 @ 3.50GHz&lt;/P&gt;
&lt;P&gt;GPU:&amp;nbsp;&lt;!--StartFragment --&gt;Intel Corporation Iris Pro Graphics P580&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2020 12:10:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1186517#M488</guid>
      <dc:creator>Jim</dc:creator>
      <dc:date>2020-06-17T12:10:39Z</dc:date>
    </item>
    <item>
      <title>Hi Jim,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1186518#M489</link>
      <description>&lt;P&gt;Hi Jim,&lt;/P&gt;&lt;P&gt;When we use -g debug flag in dpcpp, it generates&amp;nbsp;debug information for both host as well as device part of the code.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Enabling -g option creates another section called debug section. So this will, in turn, create overhead during compilation, hence there could be a considerable increase during run time.&lt;/P&gt;&lt;P&gt;For more information, you refer to this link.&lt;/P&gt;&lt;P&gt;&lt;A href="https://software.intel.com/content/www/us/en/develop/documentation/get-started-with-debugging-dpcpp-linux/top.html"&gt;https://software.intel.com/content/www/us/en/develop/documentation/get-started-with-debugging-dpcpp-linux/top.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;However, when -g flag is passed to nvcc compiler it generates&amp;nbsp;debug information only for the host. To generate debug&amp;nbsp;information for the device there is a different flag that needs to be passed to the nvcc compiler.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jun 2020 11:55:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1186518#M489</guid>
      <dc:creator>PrasanthD_intel</dc:creator>
      <dc:date>2020-06-18T11:55:36Z</dc:date>
    </item>
    <item>
      <title>Re:dpcpp program performance gets drop by adding c...</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1188403#M527</link>
      <description>&lt;P&gt;Hi Jim,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Could you please let us know if your issue is resolved. &lt;/P&gt;&lt;P&gt;If not do let us know. So that we will be able to help you regarding the same. &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;--Goutham&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 30 Jun 2020 13:20:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1188403#M527</guid>
      <dc:creator>GouthamK_Intel</dc:creator>
      <dc:date>2020-06-30T13:20:55Z</dc:date>
    </item>
    <item>
      <title>Re:dpcpp program performance gets drop by adding c...</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1190548#M553</link>
      <description>&lt;P&gt;Hi Jim,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;This issue has been resolved and we will no longer respond to this thread.&amp;nbsp;If you require additional assistance from Intel, please start a new thread.&amp;nbsp;Any further interaction in this thread will be considered community only&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 08 Jul 2020 07:24:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/dpcpp-program-performance-gets-drop-by-adding-compile-option-g/m-p/1190548#M553</guid>
      <dc:creator>PrasanthD_intel</dc:creator>
      <dc:date>2020-07-08T07:24:22Z</dc:date>
    </item>
  </channel>
</rss>

