<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic L3 caches in Nehalem in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797798#M747</link>
    <description>Thanks for the reply. First of all, should I move this thread to VTune Performance Analyzer forum then?&lt;BR /&gt;&lt;BR /&gt;The L3_LAT_CACHE.REFERENCE and L3_LAT_CACHE.MISS are indeed the performance counters I would need to look at. I understand that I can read the values of these counters using RDMSR. Is this correct? And is there any interrrupt or exception generated when these counters overflow?</description>
    <pubDate>Thu, 17 Jun 2010 10:01:06 GMT</pubDate>
    <dc:creator>Aastha_Mehta</dc:creator>
    <dc:date>2010-06-17T10:01:06Z</dc:date>
    <item>
      <title>L3 caches in Nehalem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797795#M744</link>
      <description>Hello,&lt;BR /&gt;&lt;BR /&gt;I would like to know if any presently available tool can
be used to get the L3 cache misses numbers from the performance
counters of Nehalem based architecture? Also, is it possible to
identify the exact point in the execution of a program when this L3
cache miss occurs?&lt;BR /&gt;&lt;BR /&gt;Thanks in advance,&lt;BR /&gt;&lt;BR /&gt;Aastha.</description>
      <pubDate>Tue, 15 Jun 2010 20:40:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797795#M744</guid>
      <dc:creator>Aastha_Mehta</dc:creator>
      <dc:date>2010-06-15T20:40:47Z</dc:date>
    </item>
    <item>
      <title>L3 caches in Nehalem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797796#M745</link>
      <description>Vtune (make sure you have the latest version)&lt;BR /&gt;&lt;BR /&gt;Not sure but I think these event couters are what you need:&lt;BR /&gt;LONGEST_LAT_CACHE.MISS and LONGEST_LAT_CACHE.REFERENCE&lt;BR /&gt;&lt;BR /&gt;Otherwise have a look in OFFCORE_RESPONSE_0 but the help is not realy helpfull to understand all the different counters you can use.&lt;BR /&gt;</description>
      <pubDate>Wed, 16 Jun 2010 13:32:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797796#M745</guid>
      <dc:creator>gert_massa</dc:creator>
      <dc:date>2010-06-16T13:32:01Z</dc:date>
    </item>
    <item>
      <title>L3 caches in Nehalem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797797#M746</link>
      <description>These should also do the job according to the &lt;A href="http://software.intel.com/sites/products/collateral/hpc/vtune/performance_analysis_guide.pdf"&gt;Perfromance Analysis Guide&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Mem_inst_retired.latency_above_threshold_32&lt;BR /&gt;Mem_inst_retired.latency_above_threshold_128&lt;BR /&gt;&lt;BR /&gt;P.S. This in a bit offtopic in the part of the forum. You should go to the &lt;A href="https://community.intel.com/intel-vtune-performance-analyzer/"&gt;Intel
 VTune Performance Analyzer &lt;/A&gt;&lt;B&gt;&lt;A href="https://community.intel.com/intel-vtune-performance-analyzer/"&gt;&lt;B&gt;&lt;BR /&gt;&lt;/B&gt;&lt;/A&gt;&lt;/B&gt;</description>
      <pubDate>Wed, 16 Jun 2010 13:51:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797797#M746</guid>
      <dc:creator>gert_massa</dc:creator>
      <dc:date>2010-06-16T13:51:30Z</dc:date>
    </item>
    <item>
      <title>L3 caches in Nehalem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797798#M747</link>
      <description>Thanks for the reply. First of all, should I move this thread to VTune Performance Analyzer forum then?&lt;BR /&gt;&lt;BR /&gt;The L3_LAT_CACHE.REFERENCE and L3_LAT_CACHE.MISS are indeed the performance counters I would need to look at. I understand that I can read the values of these counters using RDMSR. Is this correct? And is there any interrrupt or exception generated when these counters overflow?</description>
      <pubDate>Thu, 17 Jun 2010 10:01:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797798#M747</guid>
      <dc:creator>Aastha_Mehta</dc:creator>
      <dc:date>2010-06-17T10:01:06Z</dc:date>
    </item>
    <item>
      <title>L3 caches in Nehalem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797799#M748</link>
      <description>Just use VTune. It is worth every cent ;)</description>
      <pubDate>Thu, 17 Jun 2010 13:06:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797799#M748</guid>
      <dc:creator>gert_massa</dc:creator>
      <dc:date>2010-06-17T13:06:33Z</dc:date>
    </item>
    <item>
      <title>L3 caches in Nehalem</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797800#M749</link>
      <description>ok :) &lt;BR /&gt;but, i do need to go into the architectural details, so i am pursuing this.&lt;BR /&gt;&lt;BR /&gt;i need to know what type of interrupts are generated on overflow of the 
performance counters and what data is available to the exception 
handlers for these. i figured out something like, there is a INT bit 
(bit 22) in the IA32_PERFEVTSELx MSR that you are using for monitoring 
your event, which is set when the corresponding PMC overflows and a 
local APIC interrupt is generated.&lt;BR /&gt;
&lt;BR /&gt;
and if a performance counter is configured for PEBS, overflow condition 
in the counter generates a performance monitoring interrupt signalling a
 PEBS event.&lt;BR /&gt;
&lt;BR /&gt;are these two same things?&lt;BR /&gt;
&lt;BR /&gt;and i think L3_LAT_CACHE.MISS and L3_LAT_CACHE.REFERENCE are the events which have to be attached to IA32_PERFEVTSELx MSR and its associated IA32_PMCx MSR. you do not have separate counters for these events, right?&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 18 Jun 2010 09:40:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/L3-caches-in-Nehalem/m-p/797800#M749</guid>
      <dc:creator>Aastha_Mehta</dc:creator>
      <dc:date>2010-06-18T09:40:05Z</dc:date>
    </item>
  </channel>
</rss>

