<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic HP Integrated Lights-Out can in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924160#M1398</link>
    <description>HP Integrated Lights-Out can report ECC memory errors.
Link to HP whitepaper :http://h20000.www2.hp.com/bc/docs/support/SupportManual/c02878598/c02878598.pdf</description>
    <pubDate>Tue, 18 Jun 2013 05:52:32 GMT</pubDate>
    <dc:creator>Bernard</dc:creator>
    <dc:date>2013-06-18T05:52:32Z</dc:date>
    <item>
      <title>Montor ECC memory status?</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924159#M1397</link>
      <description>Hello there,

Does anyone know a tool for monitoring number of errors detected by ECC memory/controller?

Thanks,
-- dd</description>
      <pubDate>Mon, 17 Jun 2013 17:07:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924159#M1397</guid>
      <dc:creator>ddbug1</dc:creator>
      <dc:date>2013-06-17T17:07:35Z</dc:date>
    </item>
    <item>
      <title>HP Integrated Lights-Out can</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924160#M1398</link>
      <description>HP Integrated Lights-Out can report ECC memory errors.
Link to HP whitepaper :http://h20000.www2.hp.com/bc/docs/support/SupportManual/c02878598/c02878598.pdf</description>
      <pubDate>Tue, 18 Jun 2013 05:52:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924160#M1398</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-06-18T05:52:32Z</dc:date>
    </item>
    <item>
      <title>Thank you.</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924161#M1399</link>
      <description>Thank you.
I wanted to get an ECC enabled machine to see how often DRAM errors occur in my environment.
(Interesting: you cannot understand whether you need ECC, unless you already have it?)

But, after reading the xeon-e5-2600-uncore-guide, this HP paper and MS WHEA docum, the whole ECC topic looks too intimidating.
I'll surrender for now...

- dd</description>
      <pubDate>Tue, 18 Jun 2013 12:21:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924161#M1399</guid>
      <dc:creator>ddbug1</dc:creator>
      <dc:date>2013-06-18T12:21:09Z</dc:date>
    </item>
    <item>
      <title>Hi dd,</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924162#M1400</link>
      <description>&lt;P&gt;Hi dd,&lt;/P&gt;
&lt;P&gt;Please look at this &lt;A href="http://www.intel.com/content/dam/www/public/us/en/documents/performance-briefs/xeon-e7-family-uncore-performance-programming-guide.pdf"&gt;manual for Intel Xeon E7&lt;/A&gt;&amp;nbsp;processors.&amp;nbsp;FVC events can be configured to count&amp;nbsp;memory ECC errors (see page 2-126 for example). They can also count corrected/uncorrected memory request responses.&lt;/P&gt;
&lt;P&gt;Best regards,&lt;/P&gt;
&lt;P&gt;Roman&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2013 16:08:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924162#M1400</guid>
      <dc:creator>Roman_D_Intel</dc:creator>
      <dc:date>2013-06-18T16:08:00Z</dc:date>
    </item>
    <item>
      <title>Low level details of hardware</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924163#M1401</link>
      <description>&lt;P&gt;Low level details of hardware and/or its programming interface are not an easy thing to grasp very quickly:)&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2013 16:09:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924163#M1401</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-06-18T16:09:38Z</dc:date>
    </item>
    <item>
      <title>Thanks guys. I see your point</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924164#M1402</link>
      <description>&lt;P&gt;Thanks guys. I see your point, Ilya...&amp;nbsp; There's an anecdote about senior and junior toilet cleaners... ;)&lt;/P&gt;
&lt;P&gt;My goal is to measure how often RAM errors occur on my machines and whether I want ECC.&lt;/P&gt;
&lt;P&gt;But the DRAM controller of Xeons (and the ECC RAM itself of course) looks much more complex than on "normal" non-ECC mobos, there are more parts that may fail.&amp;nbsp; Do you think that measurement of RAM errors rate on ECC enabled machine can be extrapolated to a simpler non-ECC sandy/ivy bridge system?&lt;/P&gt;
&lt;P&gt;Building the PCM to get the counters is not a problem.&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;-- dd&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2013 13:49:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924164#M1402</guid>
      <dc:creator>ddbug1</dc:creator>
      <dc:date>2013-06-19T13:49:11Z</dc:date>
    </item>
    <item>
      <title>Does PCM measure ECC errors?</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924165#M1403</link>
      <description>&lt;P&gt;Does PCM measure ECC errors?&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2013 16:20:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924165#M1403</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-06-19T16:20:24Z</dc:date>
    </item>
    <item>
      <title>Hello ddbug,</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924166#M1404</link>
      <description>&lt;P&gt;Hello ddbug,&lt;/P&gt;
&lt;P&gt;So... is ECC worth the extra money... that is a good question.&lt;/P&gt;
&lt;P&gt;My first response is, how much does it matter whether you can catch memory errors?&lt;/P&gt;
&lt;P&gt;If you are doing something where you don't mind rebooting then you probably don't need ECC memory.&lt;/P&gt;
&lt;P&gt;For mission critical applications where you absolutely need to know whether there are memory issues (yes, DIMMs do go bad) then ECC is a requirement. This is why servers always have ECC support.&lt;/P&gt;
&lt;P&gt;I think you can monitor ECC errors on windows in the system event log in the event viewer (eventvwr.msc).&lt;/P&gt;
&lt;P&gt;Pat&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2013 17:17:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924166#M1404</guid>
      <dc:creator>Patrick_F_Intel1</dc:creator>
      <dc:date>2013-06-19T17:17:34Z</dc:date>
    </item>
    <item>
      <title>&gt; Does PCM measure ECC errors</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924167#M1405</link>
      <description>&amp;gt; Does PCM measure ECC errors?

I have not checked this yet. Even if not, the docum explains how to get these counters.

&amp;gt; So... is ECC worth the extra money... that is a good question.

The ECC RAM modules cost not much more, it is a whole new machine of a higher class that is expensive...

Finally we've got approval for a Dell server. The exact model and h/w details not known yet.

thanks,

-- dd</description>
      <pubDate>Fri, 21 Jun 2013 13:13:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924167#M1405</guid>
      <dc:creator>ddbug1</dc:creator>
      <dc:date>2013-06-21T13:13:11Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;&gt;I think you can monitor</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924168#M1406</link>
      <description>&lt;P&gt;&amp;gt;&amp;gt;&amp;gt;I think you can monitor ECC errors on windows in the system event log in the event viewer (eventvwr.msc).&amp;gt;&amp;gt;&amp;gt;&lt;/P&gt;
&lt;P&gt;This is implemented by WHEA architecture.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jun 2013 14:17:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924168#M1406</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-06-21T14:17:34Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;But, after reading the xeon</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924169#M1407</link>
      <description>&amp;gt;&amp;gt;But, after reading the xeon-e5-2600-uncore-guide, this HP paper and MS WHEA docum, the whole ECC topic
&amp;gt;&amp;gt;looks too intimidating. I'll surrender for now...

In 2012 I saw some Intel equipment and I remember it allowed to simulate some memory errors for server platforms. Honestly, I didn't dare to ask how much it is...</description>
      <pubDate>Sat, 22 Jun 2013 05:12:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/Montor-ECC-memory-status/m-p/924169#M1407</guid>
      <dc:creator>SergeyKostrov</dc:creator>
      <dc:date>2013-06-22T05:12:08Z</dc:date>
    </item>
  </channel>
</rss>

