<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic On Intel or any architecture. in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946512#M2060</link>
    <description>&lt;P&gt;On Intel or any architecture... I would propose looking at the front end "uops per clock" provided while the front end is busy. &amp;nbsp;So count the clks that the front end is actually doing something, that includes the DSB / MS and ILD, and then compare that with the execution core's upc ( uops per clock ). &amp;nbsp;If the front end upc while it's busy == that of the execution core, then you might be front end limited. &amp;nbsp;I only mention this since you're focusing on activity.. and thought you might think there's some limitation in the front of the machine. &amp;nbsp;In my inspections on many applications, Intel's rarely limited in the front end of their pipeline and their DSB provides much greater throughput than the execution core can &amp;nbsp;chew. &amp;nbsp;You also might want to generate a distribution of the throughput of the various front end and execution resources to see how often nothing is done... it's a large % of the time.&lt;/P&gt;
&lt;P&gt;perfwise&lt;/P&gt;</description>
    <pubDate>Mon, 18 Feb 2013 13:02:26 GMT</pubDate>
    <dc:creator>perfwise</dc:creator>
    <dc:date>2013-02-18T13:02:26Z</dc:date>
    <item>
      <title>Performance Counters on SandyBridge</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946501#M2049</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;
&lt;P&gt;I've just started doing some profiling work on SandyBridge recently, so the following questions might be stupid.&lt;/P&gt;
&lt;P&gt;I've checked the Intel SDM and found CYCLE_ACTIVITY should be very useful for my work. But when I actually tried to get that counter but found it seems that only IvyBridge has it. It that right?&lt;/P&gt;
&lt;P&gt;In other words, my goal is to find how many cycles are stalled on data for a certain application. How can I do that on a SandyBridge machine (or IveyBridge).&lt;/P&gt;
&lt;P&gt;Thanks a lot!&lt;/P&gt;</description>
      <pubDate>Fri, 15 Feb 2013 21:45:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946501#M2049</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-15T21:45:14Z</dc:date>
    </item>
    <item>
      <title>You can use VTune for</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946502#M2050</link>
      <description>&lt;P&gt;You can use VTune for gathering CPU related activity(Count of retired uops and etc...).&lt;/P&gt;</description>
      <pubDate>Sat, 16 Feb 2013 07:11:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946502#M2050</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-02-16T07:11:44Z</dc:date>
    </item>
    <item>
      <title>I was wondering if there is</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946503#M2051</link>
      <description>&lt;P&gt;I was wondering if there is suppor for CYCLE_ACTIVITY on Sandy Bridge. Because the optimization manual said so but I couldn't find it on the Software Develop's Manual. Thanks!&lt;/P&gt;</description>
      <pubDate>Sat, 16 Feb 2013 22:22:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946503#M2051</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-16T22:22:57Z</dc:date>
    </item>
    <item>
      <title>Quote:Yunqi Z. wrote:</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946504#M2052</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Yunqi Z. wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I was wondering if there is suppor for CYCLE_ACTIVITY on Sandy Bridge. Because the optimization manual said so but I couldn't find it on the Software Develop's Manual. Thanks!&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Where in the SDM did you try to find it?You need to refer to Volume 3 System Programming Manual chapters 18 and 19.&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 05:48:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946504#M2052</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-02-17T05:48:27Z</dc:date>
    </item>
    <item>
      <title>In the 64 IA-32 Architectures</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946505#M2053</link>
      <description>&lt;P&gt;In the 64 IA-32 Architectures Optimization Manual, Appendix B.3.2.3, there mentioned to use&amp;nbsp;CYCLE_ACTIVITY.STALLS_LDM_PENDING and other CYCLE_ACTIVITY counters to characterize memory subsystem. But in SDM Volume 3 19.4 for Sandy Bridge, there aren't these counters.&lt;/P&gt;
&lt;P&gt;And I also found a link (http://software.intel.com/en-us/forums/topic/277820) saying there should be these counters on Sandy Bridge.&lt;/P&gt;
&lt;P&gt;Thanks a lot!&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 06:16:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946505#M2053</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-17T06:16:25Z</dc:date>
    </item>
    <item>
      <title>In the 64 IA-32 Architectures</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946506#M2054</link>
      <description>&lt;P&gt;In the 64 IA-32 Architectures Optimization Manual, Appendix B.3.2.3, there mentioned to use&amp;nbsp;CYCLE_ACTIVITY.STALLS_LDM_PENDING and other CYCLE_ACTIVITY counters to characterize memory subsystem. But in SDM Volume 3 19.4 for Sandy Bridge, there aren't these counters.&lt;/P&gt;
&lt;P&gt;And I also found a link (the title is "Ivy Bridge performance monitoring events CYCLE_ACTIVITY.*?") saying there should be these counters on Sandy Bridge.&lt;/P&gt;
&lt;P&gt;Thanks a lot!&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 06:20:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946506#M2054</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-17T06:20:53Z</dc:date>
    </item>
    <item>
      <title>Are you refering to this link</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946507#M2055</link>
      <description>&lt;P&gt;Are you refering to this link :&lt;A href="http://software.intel.com/en-us/forums/topic/277820"&gt;http://software.intel.com/en-us/forums/topic/277820&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;I went through the all posts in that thread and it was clearly stated by one of the Intel engineer that futute editions of SDM will include information about the counters on Sandy Bridge.&lt;/P&gt;
&lt;P&gt;What SDM revision do you use?&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 06:32:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946507#M2055</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-02-17T06:32:12Z</dc:date>
    </item>
    <item>
      <title>Btw. You have a nice avatar</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946508#M2056</link>
      <description>&lt;P&gt;Btw. You have a nice avatar.IIRC this is J.B Fourier.&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 09:41:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946508#M2056</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-02-17T09:41:18Z</dc:date>
    </item>
    <item>
      <title>Aha, that's right! Thanks</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946509#M2057</link>
      <description>&lt;P&gt;In the 64 IA-32 Architectures Optimization Manual, Appendix B.3.2.3, there mentioned to use&amp;nbsp;CYCLE_ACTIVITY.STALLS_LDM_PENDING and other CYCLE_ACTIVITY counters to characterize memory subsystem. But in SDM Volume 3 19.4 for Sandy Bridge, there aren't these counters.&lt;/P&gt;
&lt;P&gt;And I also found a link (the title is "Ivy Bridge performance monitoring events CYCLE_ACTIVITY.*?") saying there should be these counters on Sandy Bridge.&lt;/P&gt;
&lt;P&gt;Thanks a lot!&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 23:20:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946509#M2057</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-17T23:20:00Z</dc:date>
    </item>
    <item>
      <title>Aha, that's right! Thanks</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946510#M2058</link>
      <description>&lt;P&gt;Aha, that's right! Thanks&lt;/P&gt;</description>
      <pubDate>Sun, 17 Feb 2013 23:21:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946510#M2058</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-17T23:21:03Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;&gt;And I also found a link</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946511#M2059</link>
      <description>&lt;P&gt;&amp;gt;&amp;gt;&amp;gt;And I also found a link (the title is "Ivy Bridge performance monitoring events CYCLE_ACTIVITY.*?") saying there should be these counters on Sandy Bridge&amp;gt;&amp;gt;&amp;gt;&lt;/P&gt;
&lt;P&gt;Did you mention this link :And I also found a link :&lt;A href="http://software.intel.com/en-us/forums/topic/277820"&gt;http://software.intel.com/en-us/forums/topic/277820&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;There is a respone from one of the Intel engineers he clearly states that future revision of SDM will include those counters mentioned by you.&lt;/P&gt;</description>
      <pubDate>Mon, 18 Feb 2013 05:10:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946511#M2059</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-02-18T05:10:53Z</dc:date>
    </item>
    <item>
      <title>On Intel or any architecture.</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946512#M2060</link>
      <description>&lt;P&gt;On Intel or any architecture... I would propose looking at the front end "uops per clock" provided while the front end is busy. &amp;nbsp;So count the clks that the front end is actually doing something, that includes the DSB / MS and ILD, and then compare that with the execution core's upc ( uops per clock ). &amp;nbsp;If the front end upc while it's busy == that of the execution core, then you might be front end limited. &amp;nbsp;I only mention this since you're focusing on activity.. and thought you might think there's some limitation in the front of the machine. &amp;nbsp;In my inspections on many applications, Intel's rarely limited in the front end of their pipeline and their DSB provides much greater throughput than the execution core can &amp;nbsp;chew. &amp;nbsp;You also might want to generate a distribution of the throughput of the various front end and execution resources to see how often nothing is done... it's a large % of the time.&lt;/P&gt;
&lt;P&gt;perfwise&lt;/P&gt;</description>
      <pubDate>Mon, 18 Feb 2013 13:02:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946512#M2060</guid>
      <dc:creator>perfwise</dc:creator>
      <dc:date>2013-02-18T13:02:26Z</dc:date>
    </item>
    <item>
      <title>Thanks a lot perfwise. :)</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946513#M2061</link>
      <description>&lt;P&gt;Thanks a lot perfwise. :)&lt;/P&gt;</description>
      <pubDate>Mon, 18 Feb 2013 19:35:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Performance-Counters-on-SandyBridge/m-p/946513#M2061</guid>
      <dc:creator>Yunqi_Z_</dc:creator>
      <dc:date>2013-02-18T19:35:07Z</dc:date>
    </item>
  </channel>
</rss>

