<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Intel GPU OpenCL with g++ -O3 flag. in GPU Compute Software</title>
    <link>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512354#M988</link>
    <description>&lt;P&gt;With -O2 happens the same.&lt;/P&gt;&lt;P&gt;I try to put the variables "static" but did not worked as well.&lt;/P&gt;&lt;P&gt;And It also does not accept "volatile" because of incompability of types.&amp;nbsp;&lt;/P&gt;&lt;P&gt;g++ (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0&lt;/P&gt;</description>
    <pubDate>Wed, 09 Aug 2023 08:57:02 GMT</pubDate>
    <dc:creator>luisgo007</dc:creator>
    <dc:date>2023-08-09T08:57:02Z</dc:date>
    <item>
      <title>Intel GPU OpenCL with g++ -O3 flag.</title>
      <link>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512054#M985</link>
      <description>&lt;P&gt;Dear All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a dynamic library (XXX.so) with a OpenCL kernel.&lt;/P&gt;&lt;P&gt;When I compile the library without -O3 flag the kernel works Ok.&lt;/P&gt;&lt;P&gt;When I compile with the -O3 flag I obtain the follow error from the run of the kernel.&lt;/P&gt;&lt;P&gt;-54 CL_INVALID_WORK_GROUP_SIZE&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;g++ -c -march=native -O3 -fPIC AoA.cpp -o AoA.o&lt;BR /&gt;g++ -shared -march=native  -o AoA.so AoA.o -L/usr/lib -llapack -lblas -lOpenCL&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;Is this normal?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Luís Gonçalves&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 08 Aug 2023 12:58:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512054#M985</guid>
      <dc:creator>luisgo007</dc:creator>
      <dc:date>2023-08-08T12:58:40Z</dc:date>
    </item>
    <item>
      <title>Re: Intel GPU OpenCL with g++ -O3 flag.</title>
      <link>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512130#M986</link>
      <description>&lt;P&gt;Can you see what the arguments are to the call to clEnqueueNDRangeKernel that is generating the error when your program is compiled with -O3?&amp;nbsp; Are they different than when your program is compiled with -O2?&lt;/P&gt;&lt;P&gt;You might be able to do this with the debugger,&amp;nbsp; or you can use a tool like the OpenCL Intercept Layer to generate an API "call log":&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/intel/opencl-intercept-layer" target="_blank"&gt;https://github.com/intel/opencl-intercept-layer&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/intel/opencl-intercept-layer/blob/main/docs/controls.md#calllogging-bool" target="_blank"&gt;https://github.com/intel/opencl-intercept-layer/blob/main/docs/controls.md#calllogging-bool&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I'm guessing that aggressive compiler optimizations are causing the local_work_size to be something other than what you are expecting, which is causing the OpenCL driver to return an error.&lt;/P&gt;</description>
      <pubDate>Tue, 08 Aug 2023 16:11:51 GMT</pubDate>
      <guid>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512130#M986</guid>
      <dc:creator>Ben_A_Intel</dc:creator>
      <dc:date>2023-08-08T16:11:51Z</dc:date>
    </item>
    <item>
      <title>Re: Intel GPU OpenCL with g++ -O3 flag.</title>
      <link>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512354#M988</link>
      <description>&lt;P&gt;With -O2 happens the same.&lt;/P&gt;&lt;P&gt;I try to put the variables "static" but did not worked as well.&lt;/P&gt;&lt;P&gt;And It also does not accept "volatile" because of incompability of types.&amp;nbsp;&lt;/P&gt;&lt;P&gt;g++ (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0&lt;/P&gt;</description>
      <pubDate>Wed, 09 Aug 2023 08:57:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512354#M988</guid>
      <dc:creator>luisgo007</dc:creator>
      <dc:date>2023-08-09T08:57:02Z</dc:date>
    </item>
    <item>
      <title>Re: Intel GPU OpenCL with g++ -O3 flag.</title>
      <link>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512393#M989</link>
      <description>&lt;P&gt;I had a bug. Compiles ok with -O3. Sorry.&lt;/P&gt;</description>
      <pubDate>Wed, 09 Aug 2023 09:54:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/GPU-Compute-Software/Intel-GPU-OpenCL-with-g-O3-flag/m-p/1512393#M989</guid>
      <dc:creator>luisgo007</dc:creator>
      <dc:date>2023-08-09T09:54:13Z</dc:date>
    </item>
  </channel>
</rss>

