<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Michael, in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951608#M2222</link>
    <description>Michael,

thanks. There are a few things that need more detail.

The incoming QPI &lt;B&gt;data&lt;/B&gt; traffic only includes data payload but not the non-data overhead (snoops, coherency requests). The "data + &lt;B&gt;non-data&lt;/B&gt;" QPI metric is supported on more recent processors. 

According to the definition the "QPI data traffic/Memory controller traffic" should be interpreted as follows:
If all memory accesses are local then QPI data traffic should be negligible (zero). Then the metric is close to 0.
If all memory accesses are remote then they must go through QPI, in this case QPI data traffic could be even &amp;gt;= memory controller traffic. The metric is then &amp;gt;= 1. The metric is an indicator for the NUMA-awareness of the applications running on the system. NUMA-optimized applications should have close to 0 metric value.

--
Roman</description>
    <pubDate>Fri, 16 Nov 2012 18:31:01 GMT</pubDate>
    <dc:creator>Roman_D_Intel</dc:creator>
    <dc:date>2012-11-16T18:31:01Z</dc:date>
    <item>
      <title>Intel Performance Counter(pcm): Total QPI incoming data traffic ?</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951606#M2220</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I'm running PCM tool on the machine with&amp;nbsp;Intel(R) Xeon(R) CPU X5690 @ 3.47GHz "Intel(r) microarchitecture codename Westmere-EP". The output of the PCM tool has the following information on QPI. Can you please share details on the below in the output, what does it actually mean ?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Total QPI incoming data traffic: &amp;nbsp; 65 M &amp;nbsp; &amp;nbsp; QPI data traffic/Memory controller traffic: 0.84&lt;/P&gt;
&lt;P&gt;And also please let us know how data cache misses affect QPI output value.&lt;/P&gt;
&lt;P&gt;Thanks,&amp;nbsp;Prabhu&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Nov 2012 07:07:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951606#M2220</guid>
      <dc:creator>Prabhu_T_</dc:creator>
      <dc:date>2012-11-16T07:07:25Z</dc:date>
    </item>
    <item>
      <title>Hi Prabhu,</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951607#M2221</link>
      <description>Hi Prabhu,

The QuickPath Interconnect (QPI) is the link between your two Xeon sockets.
Your incoming QPI traffic means that 65 MB of data were transferred from the remote socket to the local socket. This data includes remote main memory (DRAM) access, as well as synchronization handshakes like the cache-coherency protocol. 
The QPI / Memory controller traffic ratio, shows you that about 80 % of your memory access was local. If you want to get more control of where memory is accessed, you might have a look at the numactl linux utility.
Cache misses can affect the QPI, when the resulting memory access is remote or when the cache coherency protocol is involved.

Regards,
Michael Steyer</description>
      <pubDate>Fri, 16 Nov 2012 13:56:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951607#M2221</guid>
      <dc:creator>Michael_Intel</dc:creator>
      <dc:date>2012-11-16T13:56:12Z</dc:date>
    </item>
    <item>
      <title>Michael,</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951608#M2222</link>
      <description>Michael,

thanks. There are a few things that need more detail.

The incoming QPI &lt;B&gt;data&lt;/B&gt; traffic only includes data payload but not the non-data overhead (snoops, coherency requests). The "data + &lt;B&gt;non-data&lt;/B&gt;" QPI metric is supported on more recent processors. 

According to the definition the "QPI data traffic/Memory controller traffic" should be interpreted as follows:
If all memory accesses are local then QPI data traffic should be negligible (zero). Then the metric is close to 0.
If all memory accesses are remote then they must go through QPI, in this case QPI data traffic could be even &amp;gt;= memory controller traffic. The metric is then &amp;gt;= 1. The metric is an indicator for the NUMA-awareness of the applications running on the system. NUMA-optimized applications should have close to 0 metric value.

--
Roman</description>
      <pubDate>Fri, 16 Nov 2012 18:31:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Intel-Performance-Counter-pcm-Total-QPI-incoming-data-traffic/m-p/951608#M2222</guid>
      <dc:creator>Roman_D_Intel</dc:creator>
      <dc:date>2012-11-16T18:31:01Z</dc:date>
    </item>
  </channel>
</rss>

