<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Intel VTune - Estimate data offload to GPU in Analyzers</title>
    <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425894#M22650</link>
    <description>&lt;P&gt;Thanks for the clarification! Before marking the thread solved, I would like to be sure that Intel VTune has no way to estimate data transfer with reuse for GPU offload analysis, as Intel Advisor does. VTune only performs real measurements, is that correct?&lt;/P&gt;
&lt;P&gt;I also saw that the "Memory Access" analysis reports the number of accesses as loads and stores, so if I know the values are floats I need to multiply by 4 bytes to get the number of bytes read and written, correct?&lt;BR /&gt;&lt;BR /&gt;Thanks again.&lt;/P&gt;</description>
    <pubDate>Fri, 28 Oct 2022 12:45:20 GMT</pubDate>
    <dc:creator>HPCAnalisys</dc:creator>
    <dc:date>2022-10-28T12:45:20Z</dc:date>
    <item>
      <title>Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1424520#M22636</link>
      <description>&lt;P&gt;Hi, I'm interested in estimating the data transfer, in bytes, of an algorithm or function to be executed on a GPU using Intel VTune. For example, if my algorithm computes a multiplication between two vectors of 10 float elements each, offloading it would send 10+10 float elements to the GPU and send 1 element, the result, back, for 84 bytes in total (21*4). Keep in mind that I'm interested in an estimate, not an actual measurement on a GPU, since I don't have one available.&lt;/P&gt;
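&lt;P&gt;The arithmetic above can be written as a minimal Python sketch. It assumes every element crosses to or from the GPU exactly once (no reuse) and that a float is 4 bytes; the function name estimate_transfer_bytes is hypothetical and is not part of VTune or Advisor.&lt;/P&gt;
&lt;PRE&gt;
```python
FLOAT_BYTES = 4  # assumed size of a single-precision float


def estimate_transfer_bytes(elements_to_gpu, elements_from_gpu, element_bytes=FLOAT_BYTES):
    """Bytes moved if each element is transferred exactly once (hypothetical helper)."""
    return (elements_to_gpu + elements_from_gpu) * element_bytes


# Two input vectors of 10 floats each are sent, one float comes back.
total = estimate_transfer_bytes(elements_to_gpu=10 + 10, elements_from_gpu=1)
print(total)  # 84
```
&lt;/PRE&gt;
&lt;P&gt;Passing element_bytes=8 would give the corresponding figure for doubles.&lt;/P&gt;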
&lt;DIV class="postcell post-layout--right"&gt;
&lt;DIV class="s-prose js-post-body"&gt;
&lt;P&gt;With Intel Advisor this is possible; the metric is called "Estimated data transfer with reuse", as shown in the following screenshot:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screenshot from 2022-10-24 16-49-17.png" style="width: 200px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/34489i573EE55468DAC7F1/image-size/small/is-moderation-mode/true?v=v2&amp;amp;px=200&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="Screenshot from 2022-10-24 16-49-17.png" alt="Screenshot from 2022-10-24 16-49-17.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In Intel VTune the only way I found is the "Memory Access" analysis, but it expresses the result as a number of loads and stores, probably taken from hardware counters. So if a huge data structure causes repeated reads from main memory, those reads are all counted, and the analysis does not return the number of bytes.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screenshot from 2022-10-24 16-56-17.png" style="width: 200px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/34490iC49DAA262FD40D40/image-size/small/is-moderation-mode/true?v=v2&amp;amp;px=200&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="Screenshot from 2022-10-24 16-56-17.png" alt="Screenshot from 2022-10-24 16-56-17.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Is there a way to perform a similar analysis with Intel VTune? Thanks&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Mon, 24 Oct 2022 16:29:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1424520#M22636</guid>
      <dc:creator>HPCAnalisys</dc:creator>
      <dc:date>2022-10-24T16:29:32Z</dc:date>
    </item>
    <item>
      <title>Re: Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425149#M22639</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good day to you.&lt;/P&gt;
&lt;P&gt;Thank you for posting in the Intel communities.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you have an Intel Core CPU in your system, it will typically include integrated UHD Graphics. When you run the GPU Offload analysis in Intel VTune Profiler, you can see GPU memory access (read and write) metrics in GB/sec. I have attached a screenshot for your reference. To try the GPU Offload analysis, you need an application that runs on the GPU as well as a system with an Intel GPU.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="JaideepK_Intel_0-1666790595751.png" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/34545i696A7932A9523062/image-size/medium?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="JaideepK_Intel_0-1666790595751.png" alt="JaideepK_Intel_0-1666790595751.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If this resolves your issue, make sure to accept this as a solution. This would help others with a similar issue. Thank you!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Jaideep&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Oct 2022 13:24:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425149#M22639</guid>
      <dc:creator>JaideepK_Intel</dc:creator>
      <dc:date>2022-10-26T13:24:09Z</dc:date>
    </item>
    <item>
      <title>Re: Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425191#M22640</link>
      <description>&lt;P&gt;Thanks for your answer! So the analysis you are proposing is not an estimate of the amount of data to be offloaded to a GPU, but a real measurement that requires a graphics card in order to run and collect the data? Is the type of the variable taken into account (float, double...) or only the number of reads and writes?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Oct 2022 15:18:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425191#M22640</guid>
      <dc:creator>HPCAnalisys</dc:creator>
      <dc:date>2022-10-26T15:18:18Z</dc:date>
    </item>
    <item>
      <title>Re: Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425805#M22648</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good day to you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;&amp;gt;&amp;gt;&lt;/EM&gt;&lt;EM style="font-size: 16px; font-family: intel-clear;"&gt;So, the analysis you are proposing is not an estimate of the amount of data to be offloaded to a GPU, but a real measurement that requires a graphics card in order to run and collect the data?&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Yes, the data we get is a real measurement, not an estimate.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;&amp;gt;&amp;gt;&lt;/EM&gt;&lt;EM style="font-size: 16px; font-family: intel-clear;"&gt;Is the type of the variable taken into account (float, double...) or only the number of reads and writes?&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;It is only the number of reads and writes.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If this resolves your issue, make sure to accept this as a solution. This would help others with a similar issue. Thank you!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Jaideep&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2022 07:30:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425805#M22648</guid>
      <dc:creator>JaideepK_Intel</dc:creator>
      <dc:date>2022-10-28T07:30:38Z</dc:date>
    </item>
    <item>
      <title>Re: Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425894#M22650</link>
      <description>&lt;P&gt;Thanks for the clarification! Before marking the thread solved, I would like to be sure that Intel VTune has no way to estimate data transfer with reuse for GPU offload analysis, as Intel Advisor does. VTune only performs real measurements, is that correct?&lt;/P&gt;
&lt;P&gt;I also saw that the "Memory Access" analysis reports the number of accesses as loads and stores, so if I know the values are floats I need to multiply by 4 bytes to get the number of bytes read and written, correct?&lt;BR /&gt;&lt;BR /&gt;Thanks again.&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2022 12:45:20 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1425894#M22650</guid>
      <dc:creator>HPCAnalisys</dc:creator>
      <dc:date>2022-10-28T12:45:20Z</dc:date>
    </item>
    <item>
      <title>Re: Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1433049#M22741</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good day to you.&lt;/P&gt;
&lt;P&gt;Sorry for the delay.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;I would like to be sure that there is no way to estimate data transfer with reuse, like Intel Advisor does, for GPU offloading analysis with Intel VTune. It only performs real measurements, is that correct?&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;Yes, VTune only gives real measurements.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;I also saw that the "Memory Access" analysis reports the number of accesses as loads and stores, so if I know the values are floats I need to multiply by 4 bytes to get the number of bytes read and written, correct?&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;If each access is a float, you can multiply the count by 4 bytes.&lt;/P&gt;
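&lt;P&gt;As a rough Python sketch of that conversion: the helper accesses_to_bytes and the counts below are hypothetical placeholders, not VTune output. The result only holds if every counted load or store moves exactly one 4-byte float. Vectorized code can move several floats per access, and the counts may also include accesses to other data, so treat the figure as an approximation.&lt;/P&gt;
&lt;PRE&gt;
```python
def accesses_to_bytes(loads, stores, bytes_per_access=4):
    """Convert access counts to byte counts, assuming a fixed size per access."""
    bytes_read = loads * bytes_per_access
    bytes_written = stores * bytes_per_access
    return bytes_read, bytes_written


# Placeholder counts chosen for illustration; they are not VTune output.
bytes_read, bytes_written = accesses_to_bytes(loads=1000, stores=250)
print(bytes_read, bytes_written)  # 4000 1000
```
&lt;/PRE&gt;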
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If this resolves your issue, make sure to accept this as a solution. This would help others with a similar issue. Thank you!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Jaideep&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 25 Nov 2022 05:03:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1433049#M22741</guid>
      <dc:creator>JaideepK_Intel</dc:creator>
      <dc:date>2022-11-25T05:03:15Z</dc:date>
    </item>
    <item>
      <title>Re:Intel VTune - Estimate data offload to GPU</title>
      <link>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1437612#M22822</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We have not heard back from you. This thread will no longer be monitored by Intel. If you need further assistance, please post a new question.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Jaideep&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 12 Dec 2022 06:11:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Analyzers/Intel-VTune-Estimate-data-offload-to-GPU/m-p/1437612#M22822</guid>
      <dc:creator>JaideepK_Intel</dc:creator>
      <dc:date>2022-12-12T06:11:56Z</dc:date>
    </item>
  </channel>
</rss>

