<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:Debug/Profile tool to inspect events/queues in Analyzers</title>
    <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1195889#M19151</link>
    <description>&lt;P&gt;Hey RN1,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Have you tried out the solution provided. Does that resolve your issue? Please let us know in case you need any other information.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &lt;/P&gt;&lt;P&gt;Arun&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Thu, 30 Jul 2020 03:53:11 GMT</pubDate>
    <dc:creator>ArunJ_Intel</dc:creator>
    <dc:date>2020-07-30T03:53:11Z</dc:date>
    <item>
      <title>Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1194450#M19099</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;Does oneAPI provide any tool to inspect the command queues and SYCL/OpenCL operations performed?&lt;/P&gt;
&lt;P&gt;In other platforms, such as AMD, we have CodeXL to inspect the OpenCL of CPU/GPU devices.&lt;BR /&gt;In the past (1-2y ago) I tried to use VTune to profile/inspect OpenCL calls in Intel devices, but it only worked for GPUs.&lt;BR /&gt;&lt;BR /&gt;I want to see the parallelization and bottlenecks, if I am doing something wrong with the buffers/memory, since I am running matmul with oneAPI and I get in CPU 25s, in iGPU 5s, but when using CPU+GPU (workload splitted accordingly, iGPU with 95% CPU with 5% aprox), I get around 7s. I did many tests and never below 5s, so, something must be wrong.. but I would like to know how can I inspect the real OpenCL operations performed and if they are executed in parallel (I don't mind if is GUI or CLI).&lt;BR /&gt;&lt;BR /&gt;Doing manual profiling (chrono + profiling queues) I can see this:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;selecting
cpu selector
selecting
gpu selector
0.275022 gpu submit
Running on: Intel(R) Gen9 size: 2304 offset: 0
0.43391 cpu submit
Running on: Intel(R) Core(TM) i5-6200U CPU @ 2.30GHz size: 256 offset: 2304
6.95277 gpu callback function
gpu queue 6220.89 ms
6.97933 exit gpu
7.50185 cpu callback function
cpu queue 6248.3 ms
7.53788 exit cpu
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 24 Jul 2020 09:54:31 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1194450#M19099</guid>
      <dc:creator>RN1</dc:creator>
      <dc:date>2020-07-24T09:54:31Z</dc:date>
    </item>
    <item>
      <title>Re:Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1194921#M19101</link>
      <description>&lt;P&gt;Hey&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Vtune has 2 reports for analysis of&amp;nbsp;cpu/gpu OpenCL kernels.&lt;/P&gt;
&lt;P&gt;1)GPU Offload analysis&lt;/P&gt;
&lt;P&gt;2)GPU compute media hotspot analysis&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You could use vtunes&amp;nbsp;"GPU Offload analysis" to analyze&amp;nbsp;CPU-based workloads together with GPU-based workloads within a unified time domain. In this analysis type in the GUI there is an option to&amp;nbsp;Collect CPU-side stacks&amp;nbsp;this option&amp;nbsp;can be used to analyze call stacks executed on the CPU and identify critical paths.&lt;/P&gt;
&lt;P&gt;You could use the time line pane in the graphics pane to compare how effectively your program uses OpenCL kernels and further analysis on this can be done with GPU Compute/Media Hotspots analysis.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In "GPU compute media hotspot analysis" report the graphics window displays CPU and GPU usage data per thread and provides an extended list of GPU hardware metrics that help analyze accesses to different types of GPU memory. You could also analyse the cpu hotspots in the same report by navigating to cpu hotspots view in vtune report. Please find attached a screen print to list and navigate to multiple views in vtune.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="views.PNG" style="width: 999px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/11423i3ED8C9083480831D/image-size/large?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="views.PNG" alt="views.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Hope this helps&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;Arun&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 27 Jul 2020 09:07:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1194921#M19101</guid>
      <dc:creator>ArunJ_Intel</dc:creator>
      <dc:date>2020-07-27T09:07:04Z</dc:date>
    </item>
    <item>
      <title>Re:Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1195312#M19120</link>
      <description>&lt;P&gt;Hey &lt;a href="https://community.intel.com/t5/user/viewprofilepage/user-id/78339"&gt;@RN1&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To add on to my answer if you are looking particularly for cpu gpu concurrency. You can select the GPU rendering view in the vtune report.&lt;/P&gt;
&lt;P&gt;In "GPU rendering" view in the platform pane you can see the gpu and cpu utilisation in a time series line plot. It also gives the cpu-gpu concurrency.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please find attached a screen print for reference.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="gpu-rendering.png" style="width: 999px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/11471i8FFD8598E6B921E7/image-size/large?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="gpu-rendering.png" alt="gpu-rendering.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;Arun Jose&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jul 2020 13:00:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1195312#M19120</guid>
      <dc:creator>ArunJ_Intel</dc:creator>
      <dc:date>2020-07-28T13:00:21Z</dc:date>
    </item>
    <item>
      <title>Re:Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1195889#M19151</link>
      <description>&lt;P&gt;Hey RN1,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Have you tried out the solution provided. Does that resolve your issue? Please let us know in case you need any other information.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks &lt;/P&gt;&lt;P&gt;Arun&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 30 Jul 2020 03:53:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1195889#M19151</guid>
      <dc:creator>ArunJ_Intel</dc:creator>
      <dc:date>2020-07-30T03:53:11Z</dc:date>
    </item>
    <item>
      <title>Re:Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1197223#M19206</link>
      <description>&lt;P&gt;Hey RN1,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;An edit/update to the information I had&amp;nbsp;provided. To visualise cpu/gpu concurrency in latest vesions of vtune. You must do some additional configuration. Please find below the steps&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1- Launch VTune as admin(some options do not show up if vtune is not launched with admin privilege)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2- Create a custom copy of GPU offload analysis with advanced options of How pane.(To create custom copy, you just need to click the highlighted button in below snap)&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="create custom copy.PNG" style="width: 806px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/11611i1B8F84466BCBC3A3/image-size/large?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="create custom copy.PNG" alt="create custom copy.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;3- Check analyse system wide context switches option&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="analyse system wide.PNG" style="width: 999px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/11612i3C7FB065B27AB0BF/image-size/large?v=v2&amp;amp;px=999&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="analyse system wide.PNG" alt="analyse system wide.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Let us know if you would prefer &lt;SPAN style="font-family: Calibri, sans-serif; font-size: 11pt;"&gt; this graph being collected without creating a custom copy. We would be happy to take this forward with engineering as a feedback from your side.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 04 Aug 2020 09:52:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1197223#M19206</guid>
      <dc:creator>ArunJ_Intel</dc:creator>
      <dc:date>2020-08-04T09:52:17Z</dc:date>
    </item>
    <item>
      <title>Re:Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1199608#M19271</link>
      <description>&lt;P&gt;Hey RN1,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have you tried out the solution provided to obtain cpu/gpu concurrency plot. Could you please confirm whether the solution provided helps.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;Arun&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Aug 2020 05:40:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1199608#M19271</guid>
      <dc:creator>ArunJ_Intel</dc:creator>
      <dc:date>2020-08-12T05:40:18Z</dc:date>
    </item>
    <item>
      <title>Re:Debug/Profile tool to inspect events/queues</title>
      <link>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1200694#M19294</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are assuming the solution provided resolves your issue as we have not heard back from you for sometime. We wouldn't be monitoring this thread anymore. Please feel free to raise a new thread in case of further issues.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Arun&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 17 Aug 2020 04:31:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Debug-Profile-tool-to-inspect-events-queues/m-p/1200694#M19294</guid>
      <dc:creator>ArunJ_Intel</dc:creator>
      <dc:date>2020-08-17T04:31:56Z</dc:date>
    </item>
  </channel>
</rss>

